Introduction - PlayAI Docs

The PlayAI Text-to-Speech API enables you to convert written text into natural-sounding speech. Our API provides high-quality voice synthesis with multiple voices, languages, and customization options to suit your needs.

Models

We offer three models:

Dialog 1.0: Our flagship model with best quality and multi-turn dialogue capabilities.
Dialog 1.0 Turbo: A faster version of Dialog 1.0, available exclusively via the Dialog 1.0 Turbo endpoint.
Play 3.0 Mini: Our fast and efficient model for single-voice text-to-speech.

If you clone a voice in one language and then use that cloned voice to generate speech in a different language, the output will be highly unreliable. For best results, ensure that the voice you use to generate speech matches the language of the text you want to generate speech for.

Features

Multiple voice options with different accents and styles
Support for various languages and dialects
Adjustable speech parameters (speed, pitch, volume)
Real-time streaming capabilities
High-quality audio output in multiple formats

Stream SpeechStreams the audio bytes with our ultra-fast text-in, audio-out API.

Documentation Index

​Models

​Features

Models

Features