Speech containers overview
Containers let you run a subset of the Speech service features in your own environment. With Speech containers, you can build a speech application architecture that combines robust cloud capabilities with edge locality. Containers are well suited to scenarios with specific security and data governance requirements.
Available Speech containers
The following table lists the Speech containers available in the Microsoft Container Registry (MCR). The table also lists the features supported by each container and the latest version of the container.
Container | Features | Supported versions and locales |
---|---|---|
Speech to text | Transcribes continuous real-time speech or batch audio recordings with intermediate results. | Latest: 4.10.0. For all supported versions and locales, see the Microsoft Container Registry (MCR) and JSON tags. |
Custom speech to text | Using a custom model from the custom speech portal, transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.8.0. For all supported versions and locales, see the Microsoft Container Registry (MCR) and JSON tags. |
Speech language identification ¹ ² | Detects the language spoken in audio files. | Latest: 1.16.0. For all supported versions and locales, see the Microsoft Container Registry (MCR) and JSON tags. |
Neural text to speech | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.5.0. For all supported versions and locales, see the Microsoft Container Registry (MCR) and JSON tags. |
¹ The container is available in public preview. Containers in preview are still under development and don't meet Microsoft's stability and support requirements.
² Not available as a disconnected container.
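The "JSON tags" listing mentioned in the table can be filtered programmatically to find the tags published for a given version. The following is a minimal sketch, not an official tool. It assumes that MCR exposes the standard Docker Registry v2 `tags/list` endpoint, that the Speech to text repository path is `azure-cognitive-services/speechservices/speech-to-text`, and that tags begin with the version number followed by a hyphen. The helper names and the sample tags are hypothetical.

```python
import json

# Assumed MCR repository path for the Speech to text container.
SPEECH_TO_TEXT_REPO = "azure-cognitive-services/speechservices/speech-to-text"


def tags_list_url(repository: str) -> str:
    """Build the Docker Registry v2 tags/list URL for an MCR repository."""
    return f"https://mcr.microsoft.com/v2/{repository}/tags/list"


def tags_for_version(payload: str, version: str) -> list[str]:
    """Return the tags in a tags/list JSON payload that belong to `version`.

    The payload is expected to look like {"name": "...", "tags": ["...", ...]}.
    A tag matches if it equals the version or starts with "<version>-".
    """
    data = json.loads(payload)
    prefix = version + "-"
    return sorted(
        tag for tag in data.get("tags", [])
        if tag == version or tag.startswith(prefix)
    )


if __name__ == "__main__":
    # Illustrative payload only; real tag names may differ.
    sample = json.dumps({
        "name": SPEECH_TO_TEXT_REPO,
        "tags": ["4.9.0-amd64-en-us", "4.10.0-amd64-en-us", "latest"],
    })
    print(tags_list_url(SPEECH_TO_TEXT_REPO))
    print(tags_for_version(sample, "4.10.0"))
```

To use it against the live registry, fetch the URL returned by `tags_list_url` with any HTTP client and pass the response body to `tags_for_version`.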
Request approval to run containers disconnected from the internet
To use the Speech containers in environments that are disconnected from the internet, you must submit a request form and wait for approval. For more information about applying and purchasing a commitment plan to use containers in disconnected environments, see Use containers in disconnected environments in the Azure AI services documentation.
The form requests information about you, your company, and the user scenario for which you use the container.
- On the form, you must use an email address associated with an Azure subscription ID.
- The Azure resource you use to run the container must be created with the approved Azure subscription ID.
- Check your email for updates on the status of your application from Microsoft.
After you submit the form, the Azure AI services team reviews it and emails you with a decision within 10 business days.
Billing
The Speech containers send billing information to Azure by using a Speech resource on your Azure account.
Note
Connected and disconnected container pricing and commitment tiers vary. For more information, see Speech service pricing.
Speech containers aren't licensed to run without being connected to Azure for metering. You must configure your container to always send billing information to the metering service. For more information, see billing arguments.
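As an illustration of the billing arguments, the following sketch assembles a `docker run` command for a connected container. It assumes the `Eula`, `Billing`, and `ApiKey` arguments that Azure AI containers accept, where `Billing` is the endpoint URI of your Speech resource and `ApiKey` is one of its keys. The function name, image reference, endpoint, and key shown here are hypothetical placeholders, and the sketch only builds the command without running it.

```python
import shlex


def build_docker_run_args(image: str, endpoint_uri: str, api_key: str,
                          host_port: int = 5000) -> list[str]:
    """Return the argument list for running a connected Speech container.

    All three billing-related arguments are required; the function refuses
    to build a command when the endpoint or key is missing.
    """
    if not endpoint_uri or not api_key:
        raise ValueError("Both the billing endpoint URI and the API key are required.")
    return [
        "docker", "run", "--rm", "-it",
        "-p", f"{host_port}:5000",      # map a host port to the container's port 5000
        image,
        "Eula=accept",                   # accept the license terms
        f"Billing={endpoint_uri}",       # Speech resource endpoint used for metering
        f"ApiKey={api_key}",             # key for the same Speech resource
    ]


if __name__ == "__main__":
    args = build_docker_run_args(
        # Placeholder values; substitute your own image tag, endpoint, and key.
        "mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:latest",
        "https://<your-resource>.cognitiveservices.azure.com/",
        "<your-api-key>",
    )
    # Print a shell-quoted command line that could be reviewed before running.
    print(shlex.join(args))
```

Building the command as a list keeps each argument separate, so it can be passed to `subprocess.run` without extra shell quoting. Resource limits such as `--memory` and `--cpus` are omitted here and depend on the container you run.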
Container recipes and other container services
You can use container recipes to create containers that can be reused. A container image can be built with some or all configuration settings already applied, so that you don't need to supply those settings when the container is started. For container recipes, see the following Azure AI services articles:
- Create containers for reuse
- Deploy and run container on Azure Container Instance
- Deploy a language detection container to Azure Kubernetes Service
- Use Docker Compose to deploy multiple containers
For information about other container services, see the following Azure AI services articles:
- Tutorial: Create a container image for deployment to Azure Container Instances
- Quickstart: Create a private container registry using the Azure CLI
- Tutorial: Prepare an application for Azure Kubernetes Service (AKS)