Turn any text into ready to use audio through a simple API endpoint. Ideal for marketers, video editors, and content teams who need fast, consistent voiceovers without booking talent or recording time.
A public webhook receives a POST request with two fields voice_id and text. An If node checks that both inputs exist. When valid, an HTTP Request node calls the ElevenLabs text to speech API using your API key and returns the audio file directly through the response node. If inputs are missing, the flow sends a clear JSON error. Each request can pick a different voice by passing a new voice_id, which makes this useful for many brands or languages.
Setup is simple. Create an ElevenLabs API key and add it in n8n as a custom HTTP header credential. Point your app or form to the webhook URL and send voice_id and text. Expect audio back in seconds. Teams save editing time, speed up content production, and keep tone consistent across videos, explainers, and social clips. Most users see minutes of work reduced to seconds for each script.