Training
Module
Create your first Azure AI services text to speech application - Training
In this module, you'll learn how to use Azure AI services to create a text to speech application.
This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
Speech Synthesis Markup Language (SSML) is an XML-based markup language that you can use to fine-tune your text to speech output attributes such as pitch, pronunciation, speaking rate, volume, and more. It gives you more control and flexibility than plain text input.
Tip
You can hear voices in different styles and pitches reading example text by using the Voice Gallery.
SSML is designed to give you flexibility in how you want your speech output to sound, and it provides different properties for how you can customize that output. You can use SSML to:
SSML functionality is available in various tools that might fit your use case.
Important
You're billed for each character that's converted to speech, including punctuation. Although the SSML document itself isn't billable, the service counts optional elements that you use to adjust how the text is converted to speech, like phonemes and pitch, as billable characters. For more information, see the pricing note.
You can use SSML in the following ways:
inputs
property.spx synthesize --ssml SSML
command line argument.Training
Module
Create your first Azure AI services text to speech application - Training
In this module, you'll learn how to use Azure AI services to create a text to speech application.