Turn any image link into clean JSON you can use right away. It helps teams capture details from IDs, invoices, and receipts without manual typing. Great for operations that need fast and accurate data entry.
A public webhook receives a request with an image URL and a list of fields to extract. The flow downloads the image, converts it to base64, and sends it to the Gemini API Flash Lite model for OCR and understanding. The model returns text that is shaped into a simple JSON object with only the fields you asked for. The response is sent back to the caller in real time, so it can plug into forms, back office tools, or custom apps.
You only need a working n8n instance and a Gemini API key. Expect to cut data entry time from minutes to seconds for each document and reduce copy paste mistakes. Common uses include ID verification, invoice data capture, receipt logging, and business card parsing for quick intake in your system.