How to increase audio quality of TTS? (Node.js)

Question

How to increase audio quality of TTS? (Node.js)

Raul 25

Hiya,

I'm playing around with the Text-to-Speech service, using Node.js with code from the quickstart guide, but I can't seem to increase the bitrate of the generated audio? The audio I'm getting doesn't sound nearly as good as that from the demo page, and I'm using the same SSML code it provides.

The speechConfig class says to modify the SpeechSynthesisOutputFormat attribute, but setting it to many different values in the list of available audio formats (tried different RIFFs, RAWs, WebMs, Audios) yields no change at all in the audio quality (same bitrate, filesize).

Would love to hear from someone that's worked with the SDK in Node.js and could confirm or refute that speechConfig.SpeechSynthesisOutputFormat = "Audio24Khz160KBitRateMonoMp3" is the right approach for increasing audio quality.

Thanks

YutongTie-MSFT 53,971 Reputation points Moderator

2023-03-07T23:54:30.8833333+00:00

Hello Raul Thanks for reaching out to us, may I know which language you are working on and which voice you chose?
Raul 25 Reputation points

2023-03-08T08:48:03.5233333+00:00

@YutongTie-MSFT The voice is US English - Aria Neural

@romungi-MSFT That did the trick! Thank you very much!
romungi-MSFT 48,911 Reputation points Microsoft Employee Moderator

2023-03-08T09:11:26.15+00:00

That's great!! I have moved my comment to answer so it could be helpful to others if you accept the same. Thanks!!

Accepted answer

0 additional answers

Your answer

YutongTie-MSFT 53,971 Reputation points Moderator

2023-03-07T23:54:30.8833333+00:00

Hello Raul Thanks for reaching out to us, may I know which language you are working on and which voice you chose?
Raul 25 Reputation points

2023-03-08T08:48:03.5233333+00:00

@YutongTie-MSFT The voice is US English - Aria Neural

@romungi-MSFT That did the trick! Thank you very much!
romungi-MSFT 48,911 Reputation points Microsoft Employee Moderator

2023-03-08T09:11:26.15+00:00

That's great!! I have moved my comment to answer so it could be helpful to others if you accept the same. Thanks!!

Answer 1

romungi-MSFT 48,911 Microsoft Employee Moderator

Raul That is the correct setting to use for changing the format. Could you try something like below instead:

speechConfig.speechSynthesisOutputFormat = sdk.SpeechSynthesisOutputFormat.Audio24Khz160KBitRateMonoMp3;

If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Share via

How to increase audio quality of TTS? (Node.js)

0 additional answers

Your answer