Cloud Speech To Text stopped working on old Speech SDK version
This issue is really urgent. Cloud Speech To Text stopped working on old Speech SDK version in devices that are currently in customers. The devices are running Android applications that use Microsoft Speech SDK. We've confirmed that upgrading the…
Azure AI Speech

Problem creating SpeechRecognizer with audio stream input using node.js Speech SDK
Using Speech SDK for JavaScript v1.44.0, and following the STT in-memory streaming example, but using the fromEndpoint API to create Recognizer, as recommended in the Release Notes for that SDK version. Node.js is v22 LTS, running in Azure Cloud as an…
Azure AI Speech
Where is data stored when using the fast transcription API?
Dear Microsoft, for a project, we are using the fast transcription API from Azure as a component to transcribe text from audio. We are working with sensitive data and would like to be sure to have control or at least insight into where our data is stored…
Azure AI Speech
I am using Fast Transcription API to get transcripts for my media file, I am getting "Too Many Request" error in first try itself for free tier
I am using a Free resource of Fast Transcription API for testing purpose under Azure AI speech service. I am getting "Too Many Request" error on trying for a first time it self. And this has happen many times to different users when they…
Azure AI Speech
Tried to create a resource for a class instruction but it says it is disallowed by policy
I am taking a course (about Speech to Text) on my own and i need help: the class says to create a resource in a sandbox but when I do it says Resource 'learn-account-14303' was disallowed by policy. I have no idea how to resolve this. Can someone help…
Azure AI Speech
How to enable Real-Time Avatar Streaming API in Azure Speech service?
I would like to use the Azure Neural TTS Real-Time Avatar Streaming API I already have a Speech resource created in a supported region (e.g., East US), but I understand that access to avatar streaming is currently gated and requires approval. Could you…
Azure AI Speech

Issue with Azure Speech-to-Text API – Delayed Hindi Transcriptions
Hello Team, We have been using the Azure Speech-to-Text API for a long time, but we encountered an issue from last 4 to 5 days around 7PM to 10 PM IST while generating Hindi subtitles. The transcription jobs for Hindi were running for an unusually long…
Azure AI Speech
Reporting actual costs by usage on Azure TTS/Speech
I am assessing Azure AI Speech as a replacement for another TTS service under the 30-day free trial. I need to be able to report on actual usage and associated costs within my timezone (PDT) and am not finding that at all simple to do. Data points I need…
Azure AI Speech
Azure Speech service transcription running forver
We've been using Azure Speech Service with a managed identity for batch audio transcriptions. The service performed flawlessly until June 20th. However, as of June 23rd, our batch transcription jobs are stuck in a 'running' state indefinitely and are not…
Azure AI Speech

Text to Speech Voice Changes from Female to Male in Azure Speech Service
The Azure Speech service is being used for a bot, and there is a recurring issue where the text-to-speech voice changes from female to male. This problem has been occurring frequently over the last 2-3 weeks, whereas it was not an issue previously.
Azure AI Speech

Best way to deploy a Python FastAPI Application with Azure TTS, STT, and OpenAI LLM for a Scalable Voice AI Agent
Hey all! We are developing a voice ai agent and are looking for the best way to deploy it. It is a python fastapi backend, that calls the different API endpoints and exposes an endpoint for twilio. Requirements: low latency (we use stt, tts…
Azure AI Speech
Issues with Azure Speech Services: Incorrect transcription of "draft" as "draught" and "£" as "lbs" in UK English
I'm using Azure Speech Services with the language set to UK English, and I've noticed two recurring transcription issues: When I dictate the word "draft", it consistently transcribes as "draught", even when the context clearly favors…
Azure AI Speech

Generate phoneme from speech
I'm building an app that generate speech from text (mainly person name). For some rare name, AI pronounce it wrong. I've been using the recognize by speaking tool in Audio content creation to get the correct phoneme by saying the name properly. Then use…
Azure AI Speech
SpeechRecognizer api issue
What is the correct way to implement audio sources to avoid the this.privAudioSource.id is not a function error in SpeechRecognizer and ConversationAPI ? Are there recommended configurations for improving multi-language conversation accuracy? Should…
Azure AI Speech
Azure speech to text batch stucked on "Running" status and no percentage
this is the request: "azureRequest": { "displayName": "job_title...", "description": "job_title...", "locale": "it-it", "contentUrls": [ "{url of a wave…
Azure AI Speech
Unexpectedly high TTS character count in Azure Speech Service during live app test
Hi team, We are running a production-ready church translation app using Azure Translator and Azure Speech Services (STT & TTS, neural voice). During a 30 minute live test involving 8 user devices (all using Spanish), we observed the…
Azure AI Speech

Azure AI Speech with Whisper leaves jobs stuck in NotStarted
We've been running Azure AI Speech with Whisper (version 3.2 of the API), in batch transcription mode, for a while with no problems. Last Friday we started getting a lot of jobs stuck in NotStarted state across all our deployments. The problem has…
Azure AI Speech
Incomplete transcript when the recording has long pause before conversation resume again
I am having problem to have complete transcription when my recordings have trends like below: There a long silent in between the conversation, i.e., there is conversation from minute 1-2 then a long pause from min2-min10 then there is conversation…
Azure AI Speech
How to connect AI Foundry Models to Speech to Text and Text to Speech
Hi Team, I have my doubt! I have my FnO Flow connected to Management Center in AI Foundry. How can I develop Voice solution to frontend. Can I get some good documentations? Is it possible to deploy through deployment in AI Foundry itself or do I need…
Azure AI Speech
How to Deploy Voice Live API Models
Hi Everyone, I have a doubt! How can we deploy the Voice Live API Models (E.g. Customer Service Feature in AI Foundry Playground) to production, so that everyone can use. I want to deploy it for production! Thanks in Advance!
Azure AI Speech
