An Azure service that integrates speech processing into apps and services.
Azure Cognitive Services TTS fails to generate audio for certain Unicode characters (e.g., em dash, arrow symbols)
I'm using Azure Cognitive Services Text-to-Speech (via SSML) to generate audio from text content. Some Unicode characters cause the TTS engine to fail or produce no audio output, while others in the same Unicode block work correctly. Characters that…
Azure AI Speech
Significant Regression in Code-Switching (EN-ZH) recognition for zh-HK after April 2026 Engine Update
Problem Description We are reporting a critical regression in the Cantonese (zh-HK) Speech-to-Text service that started occurring after the March 31, 2026 API retirement and the subsequent rollout of the latest engine (MAI-Transcribe-1). The Issue:…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure Speech Fast Transcription concurrency and autoscaling question
Hello Azure Support, We are preparing to test Azure Speech Fast Transcription and want to understand the service limits so we can configure our KEDA scaler and Azure Service Bus concurrency correctly. Could you please clarify: What is the actual…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
The v1 real-time STT endpoint returns a constant Confidence value of 0.039347406 for all NBest results globally.
The v1 real-time Speech-to-Text endpoint (/speech/recognition/{mode}/cognitiveservices/v1) returns a fixed Confidence value of 0.039347406 in the NBest array for every recognition result, regardless of audio quality, region, resource type, or SDK…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Is it expected that add_target_language() silently drops translations when language codes overlap (e.g. en + en-US, fil + fi)?
Is it expected that add_target_language() silently drops translations when language codes overlap (e.g. en + en-US, fil + fi)? I've encountered an issue where certain combinations of target languages in TranslationRecognizer cause translations to…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Random Words Detected by Azure Speech Recognizer in Silence
Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…
Azure AI Bot Service
An Azure service that provides an integrated environment for bot development.
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure | Azure Startups
Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
Interactive/Live Avatar not Showing Hand Movements
I recently trained and deployed a Custom Avatar using Azure Speech Studio. I had trained the avatar using videos in which there was significant hand movements. To test the avatar, I first checked it in the Text To Speech Avatar section where I gave a…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Unnable to create speech service (Azure for students subscription)
im getting the error about regions from a policy 'Resource 's...' was disallowed by Azure: This policy maintains a set of best available regions where your subscription can deploy resources. The objective of this policy is to ensure that your…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure Speech Ingestion client not processing audio input
Hello, I deployed the speech ingestion client via ARM template, when uploading the first wav file into the audio-input directory I received an error saying that the service wasn't valid for the free tier, so I updated my subscription and the batch…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
How to estimate pricing (training + compute) for a custom tts avatar?
We want to create a custom avatar in Speech Studio. Of course, we need to estimate the costs for this project. We need the custom avatar + custom voice. Azure Pricing (https://azure.microsoft.com/en-us/pricing/details/speech/) is a bit confusing. For the…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Specifying a model in TranscriptionOptions of TranscriptionClient
Hi I am using a code similar to the one found in: link However when I specify a model in TranscriptionOptions options = TranscriptionOptions( locales=["en-US"], enhanced_mode=enhanced_mode, models={ "en-US":…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
How to use azure tts by websocket api rather than sdk?
I would like to use the Azure TTS input text streaming capability, as described in this documentation:…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Microsoft Foundry Speech Playground shows "Connection error to Realtime service: Invalid response status"
I had been using the "gpt-4o-realtime-preview" speech-to-speech model in the past months without issues. However since a few days it stopped working, not just in my python project but even in the Microsoft Foundry > Speech Playground: When I…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
It seems that there is a gc bug in the azure speech python sdk, how to solve it?
I would like to use the Azure TTS input text streaming capability, as described in this documentation:…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure Open AI Realtime GA Model not working
I have developed a conversational AI agent using Azure OpenAI Realtime Preview model which is getting depracted in April. All the audio functions were working properly. When I moved this to Realtime GA model when ever I start speaking its terminating.…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Speech to Text Accuracy Improvement
Hi, Looking for support to improve Speech to Text Accuracy for Language identification - Currently When 2 speakers switch language, the first line is always in an incorrect language Speaker diarization - Most times 2 speakers are talking but service…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
I am using Azure voice live avatar for my app with GPT realtime ,I am seeing ICE disconnected message at that time transcript works but audio and video are lost .How to check this issue
I am using Azure voice live avatar for my app with GPT realtime ,I am seeing ICE disconnected message .at that time transcript works but audio and video are lost .How to check this issue . This issue happens after 4 Minutes or at 15 Minutes of the…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure Custom Avatar Model Deployment through API
I recently trained and developed a Custom Avatar using Azure Speech Service (Speech Studio). Once the Training was done, I deployed the model from the Speech Service Portal Itself. Since my usage is less, I want to optimize my costs by only deploying…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Custom Avatar Quality not Upto the Mark
I recently create my own custom avatar using Azure Speech Studio. Even with the training video recordings being done in a studio with a professional camera and a green screen background, when an Avatar video (batch) is generated, the hands often become…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
It seems that there is a gc bug in the azure speech python sdk. How to solve it?
I would like to use the Azure TTS input text streaming capability, as described in this documentation:…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.