Content
What is the best approach to create audio files providing a list of links using Azure Audio Content Creation?
I know how to manually create MP3 audio files from an article using Azure Audio Content Creation tools (text-to-speech). Is there a way to bulk create audio files providing a list of article links using this same tool? It can be using the command line.…


Minimal network requirements for speech-to-text
Hi, I am using Cognitive Services speech-to-text service and want to know if there is a minimal network bandwidth requirement. We want to use on some pc's that may have a poor internet connection, how would this influence the speech-to-text…


text-to-speech Python SDK SSML Not effective
text-to-speech Python SDK SSML voice-sentenceboundarysilence-exact Not effective


Batch API | External lexicon for de-DE ignored.
My example returns a nice sounding audio but the lexicon is ignored. Any hints welcome. :) Region is germanywestcentral My lexicon file: My lexicon uri: https://shelf.ir-media-tec.com/azure/tts/lexicon_deDE_ipa.xml My request JSON: { …


TTS Custom Lexicon is not applied
I am trying to apply a custom lexicon to the Text-to-Speech service. Here is my lexicon file: <?xml version="1.0" encoding="UTF-8"?> <lexicon version="1.0" …


Speech to Text
Good morning, I am exploring your Speech-To-Text with the aim of creating a service that makes calls, asks a question, analyzes the response, and records it. For each call, an instance is created to isolate the transcription. The service streams audio to…


Using Azure speech to text with nodejs converting mulaw to pcm
I'm using twilio <stream> to send mulaw audio via websocket to my nodejs app and trying to convert mulaw to pcm for azure speech to text. my code is below. the app connects and sends audio but the speech to text results are empty or nomatch. …


Synthesizer error with paragraph starts with "<p><prosody ...>"
With some sentences that start with <p><prosody ...>, the text is synthesized incorrectly. If something is between, everything is fine: <p>Hi!<prosody ...>. See the example below. Suggestions <p>Hello. My name is Matthew,…


How to set STT results to show only one language
Hello, I am currently using a custom stt for korean and have trained it with about 100,000 korean text datasets. For the recognition part, I'm using…


How to customize silence time between sentence groups in Azure Text to Speech
I am trying to reduce the gap between two sentence groups in Text to Speech in Azure console. The gap between sentences can be reduced by specifying sentenceboundarysilence i.e. sentenceboundarysilence="5ms", but I don't know how to customize…


Error with Azure STT (Speech to Text) - Internal server error
Hello everyone, I am new to Azure Speech to Text (STT) and I am facing an issue while trying to test my model. Every time I try to test it, I get the following error message: Internal server error. Please recreate the task in a while. If the failure…
ERROR: Failed to detect voice's [en-USGuyNeural] endpoint for synthesizing.
I am running into an error when changing voice using the Speech Studio. If I change voices in the SSML file, the synthesizer outputs an error. Below is a script that generates the error. <speak xmlns="http://www.w3.org/2001/10/synthesis"…


Azure TTS Problem - Inconsistent SSML Tag Functionality
I have an issue when using the 'leading-exact' and 'tailing-exact' tags through the TTS API. In the speech studio web interface, voice clips synthesized using these tags work basically as expected. However, when sending a request through the Azure API,…
Lexicon file- multilingual Jenny
Hello, I´m currently producing audio files with the multilingual Jenny on MS Speech Studio (Text-to-Speech- function). Is there a chance to create a lexicon file for Jenny multilingual? Thanks!


Lexicon not applied to text
I created a lexicon file in the 'My files' section of Speech Studio and linked that file a voice in the SSML file. Saving then reopening the SSML file shows a link to the Lexicon file, but it is not applied when playing or exporting the audio file.…


Text to Speech pricing - when are the characters counted?
I see that the text to speech for neural voices is $16 per 1M characters. When are the characters counted? Are they only counted when the Export function is triggered or are they also counted during the tuning steps?


Azure Speech studio Export failed
When exporting a text to speech audio file I receive the message Error: You're sending frequent requests to server. Retry the operation later or switch to a standard paid tier. This worked fine without issue several weeks back. Did something change and…


Speech Cognitive services Authentication error (401) in Unity C# even with correct subscription key
I was trying to implement the speech to text feature in Unity, but after I start startTranscription(string audiofilepath) then it comes to errors. I also used the correct key. Is there anything I am missing there? That's the script: using…


Speech SDK trowing error: Exception with an error code: 0xe (SPXERR_MIC_NOT_AVAILABLE)
I have a small script in python: def listen(language): speech_config.speech_recognition_language=language audio_config = speechsdk.audio.AudioConfig(use_default_microphone=True) speech_recognizer =…


Can I use Azure Text-to-Speech for Commercial Usage?
Hi @YutongTie-MSFT Can I use the Azure Text-to-Speech for Freelancing (Voice-over)? on Upwork and Fiverr. for such kinds of videos; Company introduction video, Factory introduction video, Online Marketplace (Amazon, Alibaba), and more about…

