Text-to-Speech
Introduction
Create lifelike speech via the API
The PlayAI Text-to-Speech API enables you to convert written text into natural-sounding speech. Our API provides high-quality voice synthesis with multiple voices, languages, and customization options to suit your needs.
Models
We offer three models:
- Dialog 1.0: Our flagship model, with the best quality and multi-turn dialogue capabilities.
- Dialog 1.0 Turbo: A faster version of Dialog 1.0, available exclusively via the Dialog 1.0 Turbo endpoint.
- Play 3.0 Mini: Our fast and efficient model for single-voice text-to-speech.
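As a rough illustration of how the three models differ, the sketch below maps two requirements (multi-turn dialogue and a preference for speed) to a model name. The `Model` enum and `choose_model` helper are hypothetical and not part of the PlayAI API; the sketch also assumes that Dialog 1.0 Turbo keeps the dialogue capabilities of Dialog 1.0, which the list above does not state explicitly.

```python
from enum import Enum


class Model(Enum):
    """Display names of the models listed above (not API identifiers)."""
    DIALOG = "Dialog 1.0"
    DIALOG_TURBO = "Dialog 1.0 Turbo"
    PLAY_MINI = "Play 3.0 Mini"


def choose_model(multi_turn: bool, prefer_speed: bool) -> Model:
    """Pick a model from two coarse requirements (hypothetical helper)."""
    if multi_turn:
        # Assumption: Turbo, as a faster version of Dialog 1.0,
        # also handles multi-turn dialogue.
        return Model.DIALOG_TURBO if prefer_speed else Model.DIALOG
    # Single-voice speech: favor the efficient model when speed matters,
    # otherwise the flagship model for best quality.
    return Model.PLAY_MINI if prefer_speed else Model.DIALOG


if __name__ == "__main__":
    print(choose_model(multi_turn=False, prefer_speed=True).value)
```

Remember that Dialog 1.0 Turbo is served from its own endpoint, so selecting it also changes which endpoint the request goes to.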
Cloned voices are language-specific: if you clone a voice in one language and then use it to generate speech in a different language, the output will be highly unreliable. For best results, use a voice whose language matches the language of the text you want to convert.
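One way to avoid a mismatch is to check the languages on the client before sending a request. The following is a minimal sketch, assuming you record each cloned voice's language yourself as a tag such as "en" or "en-US"; the `ClonedVoice` class and `check_language_match` function are hypothetical names for illustration and are not part of the PlayAI API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClonedVoice:
    """Client-side record of a cloned voice (hypothetical structure)."""
    voice_id: str
    language: str  # language the voice was cloned in, e.g. "en" or "en-US"


def primary_subtag(tag: str) -> str:
    """Return the primary language part of a tag: 'en-US' -> 'en'."""
    return tag.strip().lower().replace("_", "-").split("-")[0]


def check_language_match(voice: ClonedVoice, text_language: str) -> None:
    """Raise ValueError if the voice and text languages differ."""
    if primary_subtag(voice.language) != primary_subtag(text_language):
        raise ValueError(
            f"Voice {voice.voice_id!r} was cloned in {voice.language!r} "
            f"but the text is in {text_language!r}; output may be unreliable."
        )


if __name__ == "__main__":
    voice = ClonedVoice(voice_id="my-voice", language="en-US")
    check_language_match(voice, "en")  # same primary language, passes
```

Comparing only the primary subtag treats regional variants such as "en-US" and "en-GB" as a match; tighten the comparison if dialect matters for your use case.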
Features
- Multiple voice options with different accents and styles
- Support for various languages and dialects
- Adjustable speech parameters (speed, pitch, volume)
- Real-time streaming capabilities
- High-quality audio output in multiple formats