Speech to text documentation
Speech to text, a feature of the Speech service also known as speech recognition, transcribes audio streams into text in real time or in batch. When you also provide reference text, it supports real-time pronunciation assessment, giving speakers feedback on the accuracy and fluency of their speech.
Develop with speech to text
How-to guides
- Use the fast transcription API
- Create a custom speech project
- Train a model for custom speech
- Use compressed audio input formats