I would like to know if there are any other avatars besides the Asian figures for text to speech ? How do I access them?
I would like to have the option to selfct differen avatar figures besides the Asian one shown.
How to fix Exception with an error code: 0xe (SPXERR_MIC_NOT_AVAILABLE)
I have built a chatbot bot framework and am now looking to integrate speech functionality for the bot. I am trying to run the below code from ms learn quickstart for speech sdk using python. …
Realtime Recognizer not utilising with Semantic Segmentation
Hi all! I'm using the Azure speechsdk.SpeechRecognizer for transcribing streamed real-time audio. While the transcription works, continuous talking will result in large paragraphs being outputted rather than sentence by sentence. I included the…
microsoft tts reads english wierd when its set to hebrew.
when the hebrew tts reads a sentence with seperated english letters it reads it weird. for example : "השם שלך הוא P A Z" doesn't say the letter "P", "A", "Z", instead it pronounces the sound of the letter. how can…
[nnnn].word.json file not found in results wordBoundaryEnabled: true
I am using Azure AI Batch Syntheses something like this …
Pronunciation Assessment fails to recognize several individual words
Several single words fail to process when using the Pronunciation Assessment service in our code or in the portal tool. Many do work but we're not able to determine why some words work and others don't. We use this service in a classroom setting to…
Audio to Audio translation
All of the information shows how to speech to text OR text to speech. Supposedly Microsoft Azure can do speech to speech and generate an AI voice that sounds natural to the person delivering the message but delivering it in different language. Where is…
Project Collaborator Cannot Access Voices in Speech Playground Voice Gallery
An AI studio resource and project have been created to experiment with text-to-speech functionality. While I have access to all sample voices in the voice gallery, my collaborator assigned as an Owner on both the resource and the project cannot see any…
Bug Report: Mispronunciation of Isolated Hungarian Words in Azure Neural TTS (hu-HU-NoemiNeural), but not in context
Description: The Azure Neural TTS system is mispronouncing specific Hungarian words when using the hu-HU-NoemiNeural voice. The issue affects more than half of the vocabulary words in a recent production run of words (full SSML shared at bottom of this…
AI text-to-speech is misreading a word in Catalan (tomàquets) but it reads perfectly its singular form (tomàquet), can you fix it?
Hello, I am using the text-to-speech service with Catalan. The word tomáquets in plural form is not read properly whereas tomàquet in singular is. The accent is misplaced. What can I do to get that fixed? Thank you,
Using Managed Identity to connect to Speech Service with NPM in Angular - Example needed
What are the steps to utilize managed identity in an Angular application to bypass API keys while connecting to the Speech Service? Examples implementation would be really helpfull
Why does Azure Speech-to-Text detect French accurately in a standalone Python script but perform poorly in a real-time video call integration?
I'm working on a real-time translation project using Azure Speech Services. When I run my translation code in a standalone Python script, it accurately recognizes and translates French and English speech. However, when the same Speech-to-Text…
Inquiry Regarding Azure Speech Service Endpoint Formats
Dear Support Team, I am currently utilizing Azure Speech Service and have encountered issues related to endpoint formats. I would like to seek your assistance on the following matters: Current Situation: Upon creating a new Speech Service resource, the…
Is the Azure Cognitive Speech Studio , Nuget package supports only .WAV file as input?
I was trying to use the Microsoft.CognitiveServices.Speech nuget package to transcribe an audio file into text, but when working I could only convert an .WAV file and no other formats such as MP3, MPEG etc, Does the package only supports .WAV files? or…
Word level emphasis in synthesis SSML
I was trying to use word level emphasis, but noticed that the emphasis is only working for en-US-GuyNeural, en-US-DavisNeural, and en-US-JaneNeural. Are there plans to extend this to other languages? Are there any alternatives that i could use to…
Bing speech Text to speech is not working
Azure bing speech from text to speech is not working. Error 404 Failed to load resoursce : The server responded with a status of 400: https://speech.platform.bing.com/synthesize The above error occured
How to identify filler words in Azure AI Speech
Hi team. Is there any feature in Azure Speech that can help us identify filler words? Please point me to the right documentation if there is any. Thanks, Sai Vishnu Soudri
How to change "number format" in Azure text to speech?
Our style of saying numbers out is different than what Azure has: 257 - is spoken as "two hundred and fifty seven" by Azure we need it to say "two hundred fifty seven" without the "and". What we currently do is change…
Few questions on the limitations of phrase list feature in Azure AI Speech
Hi. I was looking at the phrase-list feature in speech-text streaming flow and I have a few questions: Is there a limit on the number of words we can provide as part of phraseListGrammar? Is there a limit on the number of characters in each word? Is…
Inconsistencies in IPA Pronunciation in Text to Speech
Hi, I'm using SSML to ensure specific pronunciation, however, I'm experiencing some inconsistencies. For example, here's the word 'would': <speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' xml:lang='en-US'> <voice…