Use prompts to find items inside images and draw boxes around them. Teams use it to review photos faster, check product shots, and spot issues in visual content. It is helpful for catalog teams, operations leads, and anyone who needs quick visual checks.
You start it on demand. The flow downloads an image, reads its width and height, and sends the file with a clear prompt to Google Gemini. Gemini returns bounding boxes in a simple JSON format with normalized values. The workflow then scales those values to real pixels and draws colored boxes on the original picture. Because the prompt controls what is found, you can switch from rabbits to people, cars, or store displays without code. This cuts manual clicking and reduces mistakes from eyeballing image details.
You will need a Google Gemini API key and an image link that is publicly reachable. Expect review time to drop from minutes to seconds, with consistent results across many photos. Common uses include ecommerce image quality checks, safety checks on parking or events, and fast tagging for media libraries. Edit the prompt to match your goal and reuse the same steps for many image types.