Azure AI Speech

2 answers

How to Deploy AI Foundry Models to Frontend

Hi, I have a question. How can I integrate my Azure AI Foundry models into a frontend deployment so that the final product is usable after model development? Suppose my backend is a Power Automate flow (developed as a workflow), how can…

asked

Ashwath Bala S 20

commented

Ashwath Bala S 20

1 answer

Urgent question about Custom text to speech avatar & Custom Speech

Hello, I know this is available on request only, and I have the links to submit a request. A customer of mine claims she already has access to both custom voice and custom text to speech avatar. So I am guessing that if she goes to Speech Studio and to the new AI…

asked

It is VMS 100

commented

Pavankumar Purilla 8,745 Microsoft External Staff Moderator

1 answer

Azure batch transcription runs forever when using a custom model

Our Azure Batch Transcription jobs using a newly trained custom English model are consistently getting stuck in a 'running' state and never completing. This custom model was built upon base models acc05d98-300c-48fb-abe4-a57a5fc925d2 and…

asked

Ulhas Hulyal, Nilesh 35

commented

santoshkc 15,435 Microsoft External Staff Moderator

1 answer

Custom Speech for Two Speakers

I'm working on a project that requires an Azure custom speech model for audio that contains multiple speakers. However, I'm not sure how I should provide the training transcript to identify the different speakers...

asked

Hind AlMarzooqi 20

commented

Hind AlMarzooqi 20

2 answers

Cognitive Services Speech to Text Does Not Work in Deployment

Hello! I have an application in .NET 9 MVC that uses Azure AI Speech with a Text to Speech function. This function works perfectly locally and in a development environment, but when I publish the app to Azure or to another hosting provider, the Text…

asked

Andres Orozco Jaramillo 0

commented

Saideep Anchuri 9,500 Moderator

2 answers

What is the maximum audio output limit for the text to speech API endpoint?

I am using the text to speech service API endpoint (https://region.tts.speech.microsoft.com/cognitiveservices/v1) to convert the text of my SRT file to speech. I am not sure what the maximum output limit is, in minutes. It is mentioned that it is…

asked

Nikita Khandare 60

accepted

Nikita Khandare 60

2 answers

Speech to Text API does not return word timestamps for Japanese

When I submit a request to the Speech to Text API for transcription of Japanese audio, I don't get the word timestamps. I have set wordLevelTimestampsEnabled to True. I get them for other languages with the same request template. Is this not…

asked

Angel Naydenov 20

accepted

Angel Naydenov 20

1 answer

Speech SDK is throwing an error

Hi, I am trying to use the Speech SDK as described at https://learn.microsoft.com/en-us/azure/ai-services/speech-service/get-started-stt-diarization?tabs=macos&pivots=programming-language-python Initially I got the error: 2025-06-08 -…

asked

Abdulla Rasfan 0

answered

Pavankumar Purilla 8,745 Microsoft External Staff Moderator

1 answer

Limitation on Text-to-Speech Audio Length in Azure Cognitive Services

How can I generate audio files longer than 10 minutes using Azure Cognitive Services' Text-to-Speech API? PS - Based on common issues that we have seen from customers and other sources, we are posting these questions to help the Azure community.

asked

santoshkc 15,435 Microsoft External Staff Moderator

commented

Nikita Khandare 60

2 answers

Introducing interpretation in Microsoft Teams using Azure AI Speech. But when and how?

Hello, a few weeks ago I saw the following Microsoft Azure video, in which a call was translated in real time: https://www.youtube.com/watch?v=r8gzes7aA7s It would be good to test this and be part of the beta test groups. Where can I find more information about…

asked

Jose Lopez Moreno-ADM 10

commented

RG 0

1 answer

At times Speech to Text, fast transcription, is suddenly slow!

Hi. Sometimes, for the same audio file, the response is a lot slower. And I am not talking about the "waking up" issue mentioned at https://learn.microsoft.com/en-us/answers/questions/2260261/speech-to-text-s0-error-429-on-first-call Is this…

asked

It is VMS 100

accepted

It is VMS 100

4 answers

How to increase parallel job processing quota in speech services speech to text batch transcription

Hello Azure Support, I’m using the Speech-to-Text v3.2 batch transcription API to process long-form audio recordings. Per Microsoft documentation, the maximum supported length for batch transcription is now 240 minutes per audio file and a 100 concurrent…

asked

Austin Chase 0

commented

Pavankumar Purilla 8,745 Microsoft External Staff Moderator

1 answer

Multiple locales error, REST, Speech to Text, fast transcription API

Here's the issue: I use multiple locales as described at https://learn.microsoft.com/en-us/azure/ai-services/speech-service/fast-transcription-create?tabs=multilingual-transcription-on#request-configuration-options The locales given were "hi-IN,…

asked

It is VMS 100

accepted

It is VMS 100

1 answer

Incorrect pronunciation of Swedish word “reservation” in Azure TTS voices (Sofie, Mattias, Hillevi)

I am using the Azure Text-to-Speech service with Swedish voices (Sofie, Mattias, and Hillevi) to pronounce Swedish words. However, the pronunciation of the word “reservation” is clearly incorrect in all of these voices. Expected pronunciation (IPA):…

asked

Dmitrii Antonov 20

edited a comment

Dmitrii Antonov 20

1 answer

How to play audio from Azure Speech Service in an outbound call using Azure Communication Service?

I have downloaded the Call Automation Outbound Calling sample project from Azure and am running it locally. The call connects to the target phone number, but the audio does not play. The code fails with the error: "Action failed due to a bad request…

asked

Ashley 60

accepted

Ashley 60

2 answers

Fast Transcription API for Azure AI Foundry randomly returns 429 server too busy :-(

Currently, I rarely do fast audio transcription: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/fast-transcription-create For example, I made a request for the first time in 3 or 4 days and immediately got 429 server too busy (am…

asked

It is VMS 100

answered

It is VMS 100

1 answer

The result of Japanese pronunciation phonemes is empty

I am trying Azure Speech STT Japanese Pronunciation Assessment in Speech Studio. However, the JSON output shows that the phonemes of each Japanese character are empty strings, even though each syllable has an Accuracy Score. "Syllables": [ …

asked

jlst_dev 0

commented

santoshkc 15,435 Microsoft External Staff Moderator

1 answer

Can we get a custom response from the GPT 4o Real-time Preview model, or can we customize the response?

I have checked the GPT real-time audio reference, but couldn't find anything about custom responses. https://learn.microsoft.com/en-us/azure/ai-services/openai/realtime-audio-reference

asked

Ali Abbas Baloch 0

edited a comment

Ali Abbas Baloch 0

1 answer

Semantic Segmentation Property Error

Speech to Text Accuracy for Continuous Streaming: I get the following error when I try to use the Semantic Segmentation property in the Speech Cognitive Service: Recognition canceled: CancellationReason.Error Error details: Connection was closed by the…

asked

Moiz Kapasi 0

commented

Manas Mohanty 6,530 Microsoft External Staff Moderator

1 answer

Unable to locate downloaded mp3 file from Speech Studio

Hi! Every time I download my mp3 file from Speech Studio, I get a message that the download is complete, but I have lost the link that opens the folder where the file was downloaded. Please see the attached screenshot. Could someone please help?

asked

Turek, Belinda 0

commented

SriLakshmi C 6,250 Microsoft External Staff Moderator

2,080 questions with Azure AI Speech tags
