No voice when I click the "play" button to create speech from text
There is no voice when I click the "play" button to create speech from text, even though my laptop's sound is already turned on.
How to get sentence word timestamp results for real-time speech recognition?
I am using the Golang SDK. This is my Golang code: func (m *microsoft) Do(ctx context.Context, path string) (string, error) { defer os.Remove(path) accessKeyConfig := AccessKeyList[rand.Intn(len(AccessKeyList))] subscription := accessKeyConfig.Key region…
Azure AI Speech content filter
Hey everyone, I am using the Azure AI Speech API for real-time transcription of conversations. The problem I am facing is that the content filter recognizes words such as the German 'dick' as offensive. This might be true in English, however in German…
Ingesting a webpage URL for the OpenAI web app in Azure
Hi there. In Azure OpenAI Studio, there is an option for defining a webpage URL when you add data for the app, but based on the requirements on the Microsoft website, it can only extract text from up to 20 sublinks, and I can only put one URL in it. …
Create a basic voice-interactive dashboard
Hello Team, I need to create a basic voice-interactive dashboard using Azure Cognitive Services such as the Speech service, CLU (Conversational Language Understanding), and Power BI. Please also suggest any other way to achieve this. It would be really helpful.
How can I use Whisper on Azure AI Speech?
Hi, I recently switched from using the Whisper model via Azure OpenAI to using Azure AI Speech. However, I noticed that the quality of some transcriptions is worse on Azure AI Speech. The page below says that it is possible to use the Whisper model…
Failed to get HTTP platform singleton instance. Error: 27
Hello! I'm working with the Azure Speech Services SDK via Python. The code worked well until I started getting blank responses. Basically, my request got cancelled; when I checked the reason, I got this: #…
I use speech to text and want to transcribe the corresponding text, but it keeps timing out without successful recognition. Why is this happening?
This is my file; you can download it here: https://feedback.meitudata.com/public/file/yASWSTNPh2RE3Ncv.wav
Is each voice in the voice gallery based on a clone of one specific natural person or is it synthetic?
I would like to understand whether: Each voice in the voice gallery is based on a clone of one specific natural person? Voices are synthetic (similar to those from 11Labs Voice Design) that cannot be traced back to an individual person? Thank you!
SpeechSynthesizer sometimes plays speech depending on SpeechSynthesisOutputFormat
In a C# WPF application, I call this function to convert text to speech: SpeechSynthesisResult speechSynthesisResult = await speechSynthesizer.SpeakSsmlAsync(strSsml); The audio data is returned OK, but the function also sometimes plays the speech as…
Azure Speech speaker differentiation
Hi, I would like to use Azure Speech to transcribe a meeting; however, I want it to differentiate between anonymous speakers, e.g., Speaker A, Speaker B. Is it possible to do that? Are there any samples and tutorials out there that I can just take and use?…
Is there a way to make speech service transcription faster (diarization with speakers differentiated)?
Currently the speed seems to be half the audio duration for WAV and a 1:1 ratio for MP4 with GStreamer. From this post, it seems half the time for a WAV file is the…
Microsoft: fix captioning by Speech Studio
The captioning functionality in the Speech Studio is an utter failure. This is typical output: I encourage Microsoft to implement the functionality that allows the user to specify the number of lines of text (typically one or two), and the maximum…
No audio when using the Speech SDK in a PCF control (canvas app)
I made a PCF control which uses the Speech SDK to synthesize text to speech. This works when I run "npm start watch" to test it. When I publish this PCF control and use it in a canvas Power App, I cannot hear the synthesized text. What can…
Azure Cognitive Services Speech: Unable to get Custom Translator model results from speech translation code
In test C# code that I created based on the speech translation code in the following sample (“Using custom translation in speech translation”), I’m having trouble getting Custom Translator model translation results. The code just returns a cancellation…
Request to Increase Whisper Model Quota Limit
Hi Azure Community, I hope everyone is doing well. I am currently working on a project that requires a higher capacity of the Whisper model than my current Azure quota allows. I am seeking guidance on how to increase my Whisper model quota…
SpeakSsmlAsync is cancelled, but SpeakTextAsync is successful
I am trying out the Azure AI service to convert text to speech from a C# WPF application. My calls through SpeakTextAsync are successful, but my calls through SpeakSsmlAsync are returned with the Reason = Cancelled. I am on the free tier for South…
Azure Text to Speech Docker container throws an exception with viseme
I'm using the Azure Text to Speech Docker image (mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.3.0-amd64-en-us-jennyneural). I'm passing it SSML through the .NET SDK. When asking for viseme (via <mstts:viseme…
How to assign operation permissions to a resource
Hello, I am new to Azure and I want to use it to convert text to speech. When I create the resource, enter Speech Studio, and try to start the service, the system raises an error saying "You don't have operation permissions to [New],…
Random Words Detected by Azure Speech Recognizer in Silence
Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…