Turn voice questions into fast, helpful voice answers. Great for support teams that want a simple voice assistant for FAQs, status updates, or guided help. The flow accepts audio, understands the message with AI, writes a clear reply, and returns a natural voice response.
Here is how it works end to end. A webhook receives an audio file from your app or form. OpenAI Speech to Text turns the audio into text. The system pulls past messages with Get Chat, Aggregate, and a Window Buffer Memory to keep context, so replies stay on topic. Google Gemini writes the answer through a Basic LLM Chain. The new messages are saved back with Insert Chat. A Limit node passes one clean item to ElevenLabs, which generates the reply as audio using an HTTP Request. The response returns as a binary audio file through Respond to Webhook.
To set it up, you need API keys for OpenAI, Google Gemini, and ElevenLabs, plus a Voice ID in ElevenLabs. Expect faster replies, less typing, and consistent tone. Use it for helpdesk portals, voice widgets, or internal tools that need clear spoken answers. Map your session key strategy for multi user conversations and choose a voice that matches your brand.