Custom Voice API reference - Azure Cognitive Services

This article demonstrates how to call a REST API endpoint for Custom Voice, a service in Azure Cognitive Services suite. Custom voice is a text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With custom voice, you can build a highly natural-sounding voice for your brand or characters by providing human speech samples as training data. There are two scenarios for custom voice: professional and personal. To gain an understanding of each scenario and its data requirements, please read the documents on professional and personal voice.

Request access

Both professional voice and personal voice access are limited based on eligibility and usage criteria. Please request access before using REST API.

Build professional voice

Call REST API with following steps.

  1. Create a project for professional voice.
  2. Add voice talent consent.
  3. Add a professional voice training dataset.
  4. Train your professional voice model.
  5. Deploy your professional voice model as an endpoint.
  6. Synthesis with speech SDK. (It's not a part of this API. Please read document.)

Build personal voice

Call REST API with following steps.

  1. Create a project for personal voice.
  2. Add user consent.
  3. Create a personal voice and get speaker profile ID.
  4. Synthesis with speech SDK. (It's not a part of this API. Please read document.)

Samples

Custom Voice API sample code

See also