Turn images in Google Drive into clear written notes in Google Docs. Great for teams that need fast, consistent image summaries for listings, audits, or reports. Runs local vision models so content stays on your machine.
Here is how it works. You start the run manually. The workflow pulls a Google Drive file using its file ID, converts the image to base64, and builds a list of vision models to test. It then loops through each model, sends a structured prompt and the image to your local model server, collects the results, and writes the descriptions into a Google Doc. Two prompt presets are included, one general and one for spreadsheet images, so you can pick the style you need.
To set it up, you need Google Drive and Google Docs credentials in n8n and a local model server running at the default address. Add your Google Drive image file ID and your Google Doc ID once, then run a test to confirm it writes the output. Expect faster documentation for real estate listings, marketing briefs, and research notes. Most teams see big time savings when comparing multiple models in one pass.