What model architecture is used in Microsoft’s Neural TTS service accessed by the edge-tts library?

Question

What model architecture is used in Microsoft’s Neural TTS service accessed by the edge-tts library?

I am using the edge-tts library in my project, which connects to Microsoft’s Neural Text-to-Speech (TTS) service. I need detailed information about the model architecture used by this service for my project documentation.

Specifically:

What is the underlying model architecture (e.g., FastSpeech, AdaSpeech, or other)?

This information is crucial for my academic research and documentation. Thank you for your help!

santoshkc 15,355 Reputation points Microsoft External Staff Moderator

2024-12-10T13:07:57.5866667+00:00

Hi @مريم عبدالله الماجد,

Following up to see if the below answer was helpful. If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Accepted answer

0 additional answers

Your answer

santoshkc 15,355 Reputation points Microsoft External Staff Moderator

2024-12-10T13:07:57.5866667+00:00

Hi @مريم عبدالله الماجد,

Following up to see if the below answer was helpful. If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Answer 1

Hi @مريم عبدالله الماجد,

Thank you for reaching out to Microsoft Q&A forum!

Microsoft's Neural TTS service utilizes cutting-edge neural network architectures to deliver high-quality, natural-sounding speech synthesis. While specific technical details about the exact architecture used in the service may not be publicly disclosed, but Microsoft uses models like FastSpeech, etc.

For academic purposes, you can reference the general approach taken by Microsoft, which focuses on end-to-end speech synthesis using parallelized models for fast and accurate text-to-speech conversion.

For more specific or unpublished information about the exact architecture used in the Neural TTS service, consider contacting Microsoft support or exploring their official research publications.

I hope you understand. And, if you have any further query do let us know.

If this answers your query, do click Accept Answer and Yes for was this answer helpful.

Share via

What model architecture is used in Microsoft’s Neural TTS service accessed by the edge-tts library?

0 additional answers

Your answer