Azure TTS Error 404
I get error 404 when trying to fetch the mp3 file via fetch. I am using Node.js in the backend. More details: I created a functionality in my app that creates an XML document containing all SSML tags as specified by Microsoft Azure. Is it possible some…
Issue with Continuous Language Identification in Azure Speech SDK for Angular Application
We are currently using the "microsoft-cognitiveservices-speech-sdk" in our Angular application (version 14) for speech transcription and translation. The transcription and translation functionality is working as expected. However, we are…
Speech service SDK usage and issues
I am trying to connect the Azure Speech with my Azure OpenAI so that I have the option to use Azure OpenAI to ask queries either by text or voice method. Currently, I have issues with connecting the Azure AI Speech with my backend which is node.js. I am…
speech to text twilio telugu transcript is not coming empty transcript and intitally system is not responding
async def receive_json(self, text_data): try: event = text_data.get('event') if event == 'connected': logger.info("WebSocket connected event received") elif event == 'start': …
Azure Speech Studio Andrew Multilingual voice sounds glitchy
I'm having some issues with the Andrew Multilingual (en-US-AndrewMultilingualNeural) voice in the Azure Speech Studio. There's a few instances in which the voice sounds raspy and really kind of glitchy. It seems to have a lot of trouble with the word…
SpeakSsmlAsync Result always Canceled
Hello, I am building a project using Azure's SpeechSynthesizer. SpeechLog.txt I am running into the following problem: when calling SpeakSsmlAsync(ssmlText), the result always has a canceled state, and I am having a hard time understanding why. When I…
How to disable the default "Disfluency Removal" of filler words after STT transcription in Azure AI Speech?
Azure AI Speech Services defaults to removing many filler words (uh, eh, etc.) via post-transcription "Disfluency Removal". My use case includes presentation analysis for filler words, which requires a verbatim transcript. Is there a…
I need to know wether this API "Post-call transcription and analytics" can work with nodejs?
I need to know wether this API "Post-call transcription and analytics" can work with nodejs? If it is not, where I can get a proper Conversation converstion API with multi user and multi language dedection and retrun a text with given…
When using batch speech transscription the ITN feature only applies to the first option of the nBest results.
When using batch transscription the ITN feature only applies to the first option of the nBest results, whitch is not necessarily the one with the highest confidence. The batch transscription service returns a json result with the following structure…
Getting error code 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when deployed to cloud.
I am working on an interactive real time communication that uses both speech synthesizer and recognizer. In development it is working fine but when I deployed through azure web app in a Linux server it was giving the error. I don't want to process any…
Stopping Audio Playback Mid-Stream with Microsoft Neural TTS Service and Speech SDK
I'm working with the Microsoft Neural Text-to-Speech (TTS) service using the Speech SDK. I've successfully implemented audio playback, but I'm facing a challenge with controlling the playback mid-stream. My question is: How can I implement a feature to…
Random Words Detected by Azure Speech Recognizer in Silence
Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…
Azure speaker recognition model
Hello, I have created a text-independent profile for the Azure speaker recognition model via REST API. I have encoded the.wav file and tried calling the create enrollment API for which I am getting the below error response for any data. { …
Thai text has pause before IPA phoneme, Vietnamese doesn't - why?
I found different behaviors when using IPA phonemes in text-to-speech: Vietnamese: "không phải [May] xin lỗi" Flows naturally without pauses <speak xmlns="http://www.w3.org/2001/10/synthesis"…
Language data import failed: Invalid or empty textproc out file!.
I am trying to train a custom speech to text model for Malayalam. When I am trying to upload the text data, I am getting the below error. Can someone explain to me what the error is?
TTS is not working in IOS
We have been using Azure TTS for our webapp . But the TTS is not being supported on IOS while it works well on android. Also the Latency of TTS on android is very high .
Having problem with InitialSilenceTimeout for SpeechAssessment API
I'm using the SpeechAssessment API for an English Education App to rate students speech. Now I'm trying the API and been having trouble with the audios generated by the mobile App. It is like the API isn't recognizing the audio and setting the following…
Azure AI Speech Studio TextToSpeech Error 400 Synthesis failed
I'm trying to generate many multilingual voice and for the majority of the multilingual I get this message : Response status code does not indicate success: 400 (Synthesis failed. StatusCode: NotFound, Details: service does not exist: service endpoint…
Is there a problem with creating Custom Keyword models on Azure Speech Studio?
Hi, I'm trying to create custom keyword models on Azure Speech Studio but they don't appear to finish processing. I have a model that has been stuck processing for almost 12 hours now. Both basic and advanced models are getting stuck. It seems to affect…
Can Azure Speech to Text support raw Opus ?
Hi, Currently the Speech to text supports OPUS/OGG meaning I have to wrap my Opus packets in an ogg container. Is it possible to support raw…