Custom Voice API reference - Azure Cognitive Services
This article demonstrates how to call a REST API endpoint for Custom Voice, a service in Azure Cognitive Services suite. Custom voice is a text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With custom voice, you can build a highly natural-sounding voice for your brand or characters by providing human speech samples as training data. There are two scenarios for custom voice: professional and personal. To gain an understanding of each scenario and its data requirements, please read the documents on professional and personal voice.
Request access
Both professional voice and personal voice access are limited based on eligibility and usage criteria. Please request access before using REST API.
Build professional voice
Call REST API with following steps.
- Create a project for professional voice.
- Add voice talent consent.
- Add a professional voice training dataset.
- Train your professional voice model.
- Deploy your professional voice model as an endpoint.
- Synthesis with speech SDK. (It's not a part of this API. Please read document.)
Build personal voice
Call REST API with following steps.
- Create a project for personal voice.
- Add user consent.
- Create a personal voice and get speaker profile ID.
- Synthesis with speech SDK. (It's not a part of this API. Please read document.)
Samples
See also
- Custom Voice API swagger
- To create an Azure Cognitive Service resource, see Get started with Azure Cognitive Service.