Inquiry Regarding Azure AI Speech Error
Dear Azure Support Team, I recently encountered an issue while using the Azure AI Speech service with recordings from the VoiceMemo app on iPhone. Specifically, when attempting to process recordings of approximately 30 minutes in length, I received the…
Multilingual voice mispronounces Ukrainian as Russian
How can I resolve the issue of multilingual voices pronouncing Ukrainian as Russian when using Text to Speech with the Microsoft.CognitiveServices.Speech package in C#? Explicitly specifying the language in the code through the SpeechSynthesisLanguage…
Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS
Subject: Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS Description: The Azure Neural TTS system is mispronouncing the Welsh contraction "i’w." Instead of producing the correct pronunciation…
Phonemes are not available for pronunciation recognition in French
In the pronunciation recognition result, if we set the language to "en-US", we get all the results for the phonemes spoken/matched, as below. "Phonemes": [ { "Phoneme":…
Markdown to SSML?
Does anyone know of a basic "preparer-converter" that takes a markdown (.md) file and converts it into an SSML file?
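For readers wondering what such a "preparer-converter" might look like, here is a minimal sketch in Python using only the standard library. It is not an existing tool or an official Microsoft utility. The function name `markdown_to_ssml` is hypothetical, the voice name `en-US-JennyNeural` is only a placeholder default, and the Markdown handling is deliberately crude: headings become a paragraph followed by a pause, link labels are kept, and emphasis markers are stripped (which would also strip underscores inside words). Lists, tables, and code blocks are not handled.

```python
import re
from xml.sax.saxutils import escape, quoteattr


def markdown_to_ssml(markdown_text, voice="en-US-JennyNeural", lang="en-US"):
    """Convert simple Markdown into an SSML document (illustrative sketch).

    Blocks are separated by blank lines. The voice and language defaults
    are placeholders, not recommendations.
    """
    blocks = re.split(r"\n\s*\n", markdown_text.strip())
    parts = []
    for block in blocks:
        # Join hard-wrapped lines of one block into a single line.
        text = " ".join(line.strip() for line in block.splitlines())
        is_heading = text.startswith("#")
        text = re.sub(r"^#+\s*", "", text)                      # heading markers
        text = re.sub(r"!?\[([^\]]*)\]\([^)]*\)", r"\1", text)  # links/images -> label
        text = re.sub(r"[*_`]+", "", text)                      # emphasis/code markers
        if not text:
            continue
        parts.append(f"<p>{escape(text)}</p>")
        if is_heading:
            # Short pause after a heading so it does not run into the body.
            parts.append('<break time="500ms"/>')
    body = "\n".join(parts)
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>\n"
        f"<voice name={quoteattr(voice)}>\n{body}\n</voice>\n</speak>"
    )
```

Calling `markdown_to_ssml(open("post.md", encoding="utf-8").read())` (with a hypothetical `post.md`) returns a string that could be saved as an `.xml` or `.ssml` file. Anything beyond plain paragraphs and headings would probably call for a real Markdown parser rather than regular expressions.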
Speech Studio Audio Content Creation (x) Content Format and Audio Export Fail
I discovered https://speech.microsoft.com/portal, audio creation tile. (I think it should be the first one and described as "interactive batch TTS web interface.") I uploaded a file named test.txt, which has two paragraphs. For decades now,…
Batch TTS with REST: YourSynthesisId and other intro questions
I got the REST API to work on macOS. Yeah!!! I could hear the output from the sample code. Alas, now I would like to submit a longer document I wrote to batch TTS and post it as my podcast. I am taking the example right off the webpage, and just…
Cognitive services pronunciation assessment always gives 100% score, even with badly pronounced words
I built a Svelte (JavaScript) application that uses the Microsoft Speech SDK (v1.36), and I am using it to evaluate pronunciation in 3 languages: English, German and French. Initially I was using RecognizeOnceAsync() which waits for silence at the end of…
Azure pronunciation assessment time limit
I am using Azure pronunciation assessment to assess an audio file, but the assessment covers only the first minute of the speech and does not assess the rest of the audio. This is my code: const sdk =…
Can you add a phrase list to the CallMediaRecognizeSpeechOptions class when using speech-to-text cognitive services from Azure Communication Services?
I am using ACS to access a multi-service Cognitive Services endpoint and doing recognition of speech input in real time via ACS/telephone. I am using the default model provided by Microsoft. This is sufficient in most cases but I have some place names…
Is it possible to specify in Speech SDK to always use "lbs" instead of "£" when "pounds" is recognized?
Hi, is it possible to configure the Speech SDK so that when the word "pound" is detected it is always rendered as lbs, not £? For example, when I say "99 pounds" it is detected as "99 lbs", but if I said, "100…
I cannot find the step "To create a custom avatar endpoint, follow these steps: Sign in to Speech Studio. Navigate to Custom Avatar > Your project name > Train model."
I cannot find the custom avatar key after signing in to Speech Studio.
How to use a Microsoft Entra ID to authenticate with the Speech to text REST API (for batch transcription)
It looks like you can only authenticate to the "Speech to text REST API" with an API key (Ocp-Apim-Subscription-Key). What we would like is to authenticate with a Microsoft Entra ID. Why? Our application is running on AKS and all our containers…
Issue with speech-to-text service
While converting the given wave file with Microsoft's Speech-to-Text service, it does not detect "No" at the 57th second of the file, but it does detect it at 1:12 min and in other places. The recognised speech is as follows: RECOGNIZED:…
How to output transcription at the word level
With the provided callback function, the text is outputted as described by you, either after a short pause or after a maximum of 15 seconds. Is it possible to output word by word so that the text can be seen while speaking? def…
Set sound threshold for Microsoft speech-to-text
Hi, is it possible to set a volume threshold for the speech that gets transcribed, such that speech below a certain threshold would not get transcribed? I am using the Speech SDK. Br, Daniel
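One possible workaround, independent of whatever the SDK itself offers, would be to gate the audio before it reaches the recognizer. The sketch below is only that idea in plain Python, not a Speech SDK feature: it computes the RMS level of a chunk of raw 16-bit mono PCM and compares it with a threshold. The helper names `rms_int16` and `is_loud_enough` and the default threshold of 500 are hypothetical, and the code assumes a little-endian host so that the native `array` type code matches WAV sample order. Feeding the accepted chunks to the recognizer is left out.

```python
import array
import math


def rms_int16(pcm_bytes):
    """Root-mean-square level of raw 16-bit PCM samples (native byte order)."""
    samples = array.array("h")
    # Drop a trailing odd byte so the buffer is a whole number of samples.
    samples.frombytes(pcm_bytes[: len(pcm_bytes) // 2 * 2])
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def is_loud_enough(pcm_bytes, threshold=500.0):
    """True if the chunk's RMS level reaches the (assumed) threshold."""
    return rms_int16(pcm_bytes) >= threshold
```

A caller would read the audio in short chunks (for example 100 ms each), forward only the chunks where `is_loud_enough` returns true, and tune the threshold against real recordings, since a suitable value depends on the microphone and its gain.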
macOS CLI starter guide
I am trying to play around with Azure text to speech on macOS. The instructions are woefully incomplete. I start with…
Azure AI - Speech Studio - Error Message
Hi there, I received this error message today: "为资源 xiaoshuoyuedu1 分配的角色尚未生效。 请让资源管理员配置__自定义子域__并启用 VNet 以使你的角色正常工作。" "The role assigned to resource xiaoshuoyuedu1 has not taken effect yet. Please have the resource administrator configure…
Why has my TTS suddenly become bad? Speed & punctuation aren't working properly.
This morning I tried to work on my TTS file using Brian's voice. But once I listened to the speech, the punctuation & speed weren't working properly. Also, it seems that his voice became monotone. I've tried with an already-finished project to see if…
No module named 'azure' when using azure.cognitiveservices.speech
Hello, I have a problem with importing azure.cognitiveservices.speech. I installed the package with pip, but when importing it I get this error: ModuleNotFoundError: No module named 'azure'
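A common cause of this kind of error is that pip installed the package into a different Python environment than the one running the script. The snippet below is a small diagnostic sketch, not a guaranteed fix: it prints which interpreter is running and checks whether the module can be found from it. The helper name `module_available` is hypothetical. The PyPI package name for the Speech SDK is `azure-cognitiveservices-speech`.

```python
import importlib.util
import sys


def module_available(name):
    """Return True if `name` can be located by the running interpreter."""
    try:
        return importlib.util.find_spec(name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (here: 'azure') is itself missing.
        return False


if __name__ == "__main__":
    print("interpreter:", sys.executable)
    if module_available("azure.cognitiveservices.speech"):
        print("Speech SDK module found.")
    else:
        print("Speech SDK module not found for this interpreter.")
        print("Try:", sys.executable, "-m pip install azure-cognitiveservices-speech")
```

Running the install through `python -m pip` with the same interpreter that prints here should keep the package and the script in the same environment.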