Questions about Using Microsoft's Custom Text-to-Speech Avatar Service

Amr Shaarawy 0 Reputation points
2024-02-24T10:14:36.3766667+00:00

Dear Microsoft Community,

I am interested in using your Custom Text-to-Speech Avatars service for my company. We are looking to create more engaging video content using AI-generated avatars that can speak text with realistic voices and animations. I have some questions I'm hoping you can help me understand better: Pricing and Costs:

  • What are the pricing options for using this service (pay-per-use, monthly subscriptions, etc.)?
  • Roughly how much does it cost to generate a minute of avatar video speech?
  • Are there any minimum usage requirements or upfront costs?

Technical Requirements:

  • What kind of data or inputs are required to create a custom avatar voice (audio recordings, text transcripts, etc.)?
  • How much data is typically needed for good quality results?
  • Are there recommendations for the recording equipment needed?

Getting Started:

  • How do I go about requesting access to use this service?
  • What is the typical timeline for onboarding, creating an avatar model, and deployment?
  • Is there documentation or guides available I can review?

My company creates training videos, marketing content, virtual assistants and more. Being able to use AI avatars would really enhance our content creation capabilities. Please let me know any other details you need from me. I'd be happy to discuss our use case further.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,061 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,080 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,598 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Charlie Wei 3,335 Reputation points
    2024-02-26T02:41:51.09+00:00

    Hello @Amr Shaarawy ,

    Regarding your question, there isn't a detailed answer currently available in the official documentation. The Custom Text-to-Speech Avatars service requires the submission of an application form and manual review before it can be used. However, I noticed that the form mentions a fee of $2,048. I believe you should be able to obtain more detailed information after completing the application process.

    Best regards,
    Charlie


    If you find my response helpful, please consider accepting this answer and voting 'yes' to support the community. Thank you!

    1 person found this answer helpful.
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.