Speech synthesis (TTS)

How can I voice long texts?

To voice a large text, break it into parts in any way convenient for you. The maximum size of speech synthesis requests is limited to 5,000 characters.

How do I configure stress and pronunciation?

To edit the pronunciation of individual words and the text in general, use the SSML or TTS markup.

How do I add a pause in text?

To add a pause to your text, use the TTS markup. Specify the pause duration in milliseconds in parentheses. A pause will be added where you place the tag. For example: Start sil<[3000]> continue in 3 seconds. The maximum value is 7,000.

Note that the indicated pause is treated as a recommendation. SpeechKit is designed for natural speech synthesis.

A cURL request does not work in Windows PowerShell

In the Windows PowerShell terminal, the curl command is an alias for the Invoke-WebRequest system call.

The Yandex Cloud guides provide examples of API calls using the Bash shell syntax. You can run them as is in the Linux console, macOS terminal, or WSL in Windows 10 or higher. To run the examples in Windows PowerShell, you will have to modify them yourself. For more information on Bash and PowerShell command equivalents, as well as tips, see Working with the Yandex Cloud CLI and API in Microsoft Windows.

What goes into the cost of synthesis?

For examples of calculating the usage cost, pricing rules, and effective prices, see the SpeechKit pricing policy.