Turn long videos into clear text descriptions without watching them. Built for marketing and content teams that need fast tags, summaries, and compliance checks at scale. Helpful for social clips, ad reviews, and media libraries.
The run starts with a manual test. A Set node stores the video URL. An HTTP Request node downloads the file as binary. Another HTTP Request uploads that binary to Google Gemini using the upload endpoint with required headers, which returns a file link. A Wait step gives the upload time to finish. The next call uses the Gemini generate content endpoint with the file link and a clear prompt. A final Set node extracts the first candidate text into a field named videoDescription for easy use in later steps.
Use a Gemini API key through an environment variable named GeminiKey or with credentials. Expect review time to drop from many minutes to seconds per video while keeping wording consistent across assets. Common uses include content moderation notes, SEO friendly blurbs, and accessibility helper text. You can adjust the prompt to focus on brand safety, product scenes, speaker changes, or on screen text. The output is ready for tagging, catalog search, or sending to a CMS in a follow up workflow.