An Azure service that integrates speech processing into apps and services.
This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
I am looking for a was to control the default duration of silence added to the start and end of each generated audio file in Azure Text-To-Speech
I am using Rest API.
Thanks in advance!
SSML has elements for that. This should get you started: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-synthesis-markup?tabs=csharp#add-silence
I am trying hard but I am not able to make the mstts:silence tag work for me. When I try in the Azure TTS test console, it fails with an error (https://azure.microsoft.com/en-us/services/cognitive-services/text-to-speech/#features)
<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">
<mstts:silence type="Leading" value="0"/>Hello<mstts:silence type="Tailing" value="0"/>
What is the secret trick? :-)
just define the namespace ("xmlns:mstts") to the wrapper element as follows. The other namespace definition "xmlns:emo" is not needed in this case, but if you you use emo tags in the future.