Turn long audio into ready content in minutes. The flow transcribes an MP3, writes a short summary, translates it to German, and drafts a cover image idea with matching visuals. It suits podcast teams, marketers, and social managers who need fast turnaround.
After you click run, a file reader loads an MP3 and Whisper turns speech into text. That text moves through ChatGPT nodes to produce a clear tl dr and a German version. A code node builds a message array and an HTTP Request node calls the chat API directly for more control. Another branch asks ChatGPT to write an image prompt and then DALLE 2 returns four cover images. A set node and a chat node shape structured output, and an HTML node builds a clean block with inline SVG for web or email.
You only need an OpenAI account and API key. Add credentials in n8n and test one branch at a time to avoid long runs. Teams can cut content prep from hours to minutes, reuse the HTML block on sites, and ship multilingual posts for each episode or webinar. It is a good fit for podcast recaps, webinar highlights, and short news briefings.