Turn any image into a ready-to-post asset. The workflow writes a title and caption, then places the text neatly on the picture. It suits social posts, blogs, and quick creative needs when no designer is available.
Here is how it works. A manual trigger starts the run, and an HTTP Request node downloads an image. The image is resized to 512 by 512 pixels for fast AI processing, and its basic details (such as width and height) are read. A Gemini vision model receives the image and returns a structured title and caption through a built-in output parser. A Code node calculates where the text should go based on the image size. Merge nodes combine the image, the caption, and the positions. Finally, the Edit Image node draws a subtle background box and overlays the title and caption on the image.
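The positioning step can be sketched as follows. This is only an illustration of the idea, not the workflow's actual Code node: the function name compute_text_layout, the TextLayout fields, and every proportion (5% margin, a box one quarter of the image height, font sizes at 6% and 4% of the height) are assumptions chosen for the example. It is written in Python for readability; the same arithmetic would carry over to whatever language the Code node uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextLayout:
    """Pixel coordinates for the background box and the two text lines."""
    box_x: int
    box_y: int
    box_width: int
    box_height: int
    title_x: int
    title_y: int
    title_font_size: int
    caption_x: int
    caption_y: int
    caption_font_size: int


def compute_text_layout(width: int, height: int) -> TextLayout:
    """Place a caption box along the bottom edge, scaled to the image size.

    All ratios below are hypothetical defaults, not values from the workflow.
    """
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")

    margin = width // 20          # 5% of the width on each side
    box_height = height // 4      # box covers the bottom quarter
    box_x = margin
    box_y = height - box_height - margin
    box_width = width - 2 * margin

    # Font sizes scale with height, with a floor so small images stay legible.
    title_font_size = max(12, height * 6 // 100)
    caption_font_size = max(10, height * 4 // 100)

    # Inner padding between the box edge and the text.
    padding = box_height // 8
    title_x = box_x + padding
    title_y = box_y + padding + title_font_size      # text baseline
    caption_x = title_x
    caption_y = title_y + padding + caption_font_size

    return TextLayout(
        box_x, box_y, box_width, box_height,
        title_x, title_y, title_font_size,
        caption_x, caption_y, caption_font_size,
    )


if __name__ == "__main__":
    print(compute_text_layout(512, 512))
```

For the 512 by 512 image used in this workflow, the sketch places the box at (25, 359) with a size of 462 by 128, the title baseline at y = 405, and the caption baseline at y = 441. Deriving every value from the image dimensions means the same layout rule keeps working if the resize step is changed.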
Setup is straightforward. You need a Google Gemini API key and n8n Cloud credentials for the Gemini node. Expect faster production of branded graphics, less editing time, and clear text placement. Use it for auto-captioned hero images, watermarked visuals, and quick social content. With small changes, it can batch-process many images and keep your style consistent.