Text-to-Speech
Stream Speech
Streams the audio bytes with our ultra-fast text-in, audio-out API.
POST
Convert text to speech and receive audio bytes in real-time.
This endpoint supports two models:
- Play 3.0 Mini: Our fast and efficient model for single-voice text-to-speech.
- Dialog 1.0: Our flagship model with best quality and multi-turn dialogue capabilities.
We also offer Dialog 1.0 Turbo which is a faster version of Dialog 1.0 from a separate endpoint.
For more information, see Models.
Check out the How to use Dialog 1.0 Text-to-Speech API guide for a step-by-step approach to using the Dialog 1.0 API to convert text into natural human-like sounding audio.
Make sure to see the Create a Multi-Turn Scripted Conversation with the Dialog 1.0 API guide for examples on how to create a multi-turn scripted conversation between two distinct speakers.
Authorizations
Body
application/json
Response
200
audio/mpeg
A stream of audio bytes.
The response is of type file
.