Events
Mar 17, 9 PM - Mar 21, 10 AM
Join the meetup series to build scalable AI solutions based on real-world use cases with fellow developers and experts.
Register nowThis browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
Speech to text REST API is used for batch transcription and custom speech.
Important
Speech to text REST API version 2024-11-15
is the latest version that's generally available.
2024-05-15-preview
will be retired on a date to be announced.v3.0
, v3.1
, v3.2
, 3.2-preview.1
, and 3.2-preview.2
will be retired on April 1st, 2026.For more information about upgrading, see the Speech to text REST API v3.0 to v3.1, v3.1 to v3.2, and v3.2 to 2024-11-15 migration guides.
Use Speech to text REST API to:
Speech to text REST API includes such features as:
The following operation groups are applicable for batch transcription.
Operation group | Description |
---|---|
Models | Use base models or custom models to transcribe audio files. You can use models with custom speech and batch transcription. For example, you can use a model trained with a specific dataset to transcribe audio files. See Train a model and custom speech model lifecycle for examples of how to train and manage custom speech models. |
Transcriptions | Use transcriptions to transcribe a large amount of audio in storage. When you use batch transcription you send multiple files per request or point to an Azure Blob Storage container with the audio files to transcribe. See Create a transcription for examples of how to create a transcription from multiple audio files. |
Web hooks | Use web hooks to receive notifications about creation, processing, completion, and deletion events. You can use web hooks with custom speech and batch transcription. Web hooks apply to datasets, endpoints, evaluations, models, and transcriptions. |
The following operation groups are applicable for custom speech.
Operation group | Description |
---|---|
Datasets | Use datasets to train and test custom speech models. For example, you can compare the performance of a custom speech trained with a specific dataset to the performance of a base model or custom speech model trained with a different dataset. See Upload training and testing datasets for examples of how to upload datasets. |
Endpoints | Deploy custom speech models to endpoints. You must deploy a custom endpoint to use a custom speech model. See Deploy a model for examples of how to manage deployment endpoints. |
Evaluations | Use evaluations to compare the performance of different models. For example, you can compare the performance of a custom speech model trained with a specific dataset to the performance of a base model or a custom model trained with a different dataset. See test recognition quality and test accuracy for examples of how to test and evaluate custom speech models. |
Models | Use base models or custom models to transcribe audio files. You can use models with custom speech and batch transcription. For example, you can use a model trained with a specific dataset to transcribe audio files. See Train a model and custom speech model lifecycle for examples of how to train and manage custom speech models. |
Projects | Use projects to manage custom speech models, training and testing datasets, and deployment endpoints. Custom speech projects contain models, training and testing datasets, and deployment endpoints. Each project is specific to a locale. For example, you might create a project for English in the United States. See Create a project for examples of how to create projects. |
Web hooks | Use web hooks to receive notifications about creation, processing, completion, and deletion events. You can use web hooks with custom speech and batch transcription. Web hooks apply to datasets, endpoints, evaluations, models, and transcriptions. |
Service health provides insights about the overall health of the service and subcomponents. See Service Health for more information.
Events
Mar 17, 9 PM - Mar 21, 10 AM
Join the meetup series to build scalable AI solutions based on real-world use cases with fellow developers and experts.
Register nowTraining
Module
Create speech-enabled apps with Azure AI services - Training
Create speech-enabled apps with Azure AI services.
Certification
Microsoft Certified: Azure AI Engineer Associate - Certifications
Design and implement an Azure AI solution using Azure AI services, Azure AI Search, and Azure Open AI.