Create ready-to-publish images by auto-generating a caption from any photo and placing it on the image in a clean overlay. Great for social posts, product shots, and quick editorial visuals where you need text on the image fast.
The flow starts on a manual run, downloads an image from a URL, and resizes it for a vision model. A Google Gemini model then analyzes the image and returns a title and caption in a structured format. The workflow reads the image dimensions, calculates a safe font size and position, draws a semi-transparent bar, and adds the text. Merge nodes keep the model output and the image data aligned, while a Code node handles the placement math.
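To make the placement step concrete, here is a minimal sketch of the kind of math a Code node could run once the image width and height are known. It is not the workflow's actual code. The function name `compute_placement`, the ratios (4% margin, 5% font size, 0.6 average character width), and the single-line, bottom-aligned bar are all illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass
class Placement:
    font_size: int   # text height in pixels
    bar_top: int     # y coordinate where the overlay bar starts
    bar_height: int  # height of the semi-transparent bar
    text_x: int      # left edge of the caption text
    text_y: int      # top edge of the caption text


def compute_placement(
    width: int,
    height: int,
    caption: str,
    margin_ratio: float = 0.04,      # assumed: side margin as a share of width
    font_ratio: float = 0.05,        # assumed: font size as a share of the short side
    min_font: int = 12,              # assumed: smallest size considered readable
    char_width_ratio: float = 0.6,   # assumed: average glyph width relative to font size
) -> Placement:
    """Estimate a font size and a bottom bar position for a one-line caption."""
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")

    margin = round(width * margin_ratio)
    usable_width = width - 2 * margin

    # Start from a size proportional to the shorter side of the image.
    font_size = max(min_font, round(min(width, height) * font_ratio))

    # Shrink the font if the estimated text width would overflow the usable width.
    if caption:
        estimated_width = len(caption) * font_size * char_width_ratio
        if estimated_width > usable_width:
            fitted = int(usable_width / (len(caption) * char_width_ratio))
            font_size = max(min_font, fitted)

    # The bar is twice the font height and sits flush with the bottom edge.
    bar_height = font_size * 2
    bar_top = height - bar_height

    # Center the text vertically inside the bar.
    text_y = bar_top + (bar_height - font_size) // 2

    return Placement(font_size, bar_top, bar_height, margin, text_y)
```

Deriving every value from the image dimensions keeps the overlay proportional across photo sizes. The character-width estimate is only a rough stand-in for real text measurement, and a very long caption can still overflow once the font reaches the minimum size.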
Setup is simple: you only need a Google Gemini API key and an image source URL. Expect to cut caption work from minutes to under a minute per image while keeping a consistent style across your posts. Use it for social banners, blog headers, and on-brand watermarks that stay readable on any photo.