2,080 questions with Azure AI Speech tags

Sort by: Updated
2 answers One of the answers was accepted by the question author.

How to Deploy AI Foundry Models to Frontend

Hi, I have a doubt! How can I integrate my Azure AI Foundry Models to the frontend deployment so that the output/it's final product is usable after model development. If Suppose I have my backend in Power Automate flow (As a Workflow Developed), how can…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-05T16:07:21.5333333+00:00
Ashwath Bala S 20 Reputation points
commented 2025-06-12T10:06:45.3733333+00:00
Ashwath Bala S 20 Reputation points
1 answer One of the answers was accepted by the question author.

Urgent question about Custom text to speech avatar & Custom Speech

Hello I know this is on request only & have the links to submit a request. A customer of mine claims she has access to both Custom voice and custom text to speech avatar already. So, I am guessing that if she goes to speech studio and to the new AI…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-11T05:36:04.65+00:00
It is VMS 100 Reputation points
commented 2025-06-12T04:56:05.4866667+00:00
Pavankumar Purilla 8,745 Reputation points Microsoft External Staff Moderator
1 answer

Azure batch transcription is running forever when used custom model

Our Azure Batch Transcription jobs using a newly trained custom English model are consistently getting stuck in a 'running' state and never completing. This custom model was built upon base models acc05d98-300c-48fb-abe4-a57a5fc925d2 and…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-03T07:50:43.2733333+00:00
Ulhas Hulyal, Nilesh 35 Reputation points
commented 2025-06-10T11:35:51.8733333+00:00
santoshkc 15,435 Reputation points Microsoft External Staff Moderator
1 answer One of the answers was accepted by the question author.

Custom Speech for Two Speakers

I'm working on a project that requires a custom speech azure model on audios that contains multiple speakers. However, I'm not sure how should i provide the training transcript to identify the different speakers...

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-10T06:00:03.9933333+00:00
Hind AlMarzooqi 20 Reputation points
commented 2025-06-10T11:11:17.5566667+00:00
Hind AlMarzooqi 20 Reputation points
2 answers

Cognitive Services Speech to Text Not Works in Deployment

Hello! I have a application in .Net 9 MVC, that uses Azure AI Speech and uses a Text to Speech function, and this functions works perfectly in local or in development scenery, but when I'll publish the app in Azure or in other hosting supplier, the Text…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-04T14:14:08.9233333+00:00
Andres Orozco Jaramillo 0 Reputation points
commented 2025-06-10T04:08:24.3966667+00:00
Saideep Anchuri 9,500 Reputation points Moderator
2 answers One of the answers was accepted by the question author.

What is a maximum audio limit output for text to speech to api endpoint?

I am using text to speech service api endpoint to convert my srt file text to speech https://region.tts.speech.microsoft.com/cognitiveservices/v1 I am not sure about what is maximum output limit for this as in minutes. It is mentioned that it is…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-09T04:13:57.8566667+00:00
Nikita Khandare 60 Reputation points
accepted 2025-06-09T11:37:21.0133333+00:00
Nikita Khandare 60 Reputation points
2 answers One of the answers was accepted by the question author.

Speech to Text API do not return word timestamps for Japanese

When I submit a request to the Speech to Text API for transcription of Japanese audio I don't get the word timestamps. I have set the wordLevelTimestampsEnabled to True. I get those for other languages with the same request template. Is this not…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-03T11:35:02+00:00
Angel Naydenov 20 Reputation points
accepted 2025-06-09T07:18:31.6133333+00:00
Angel Naydenov 20 Reputation points
1 answer

speech SDK is throwing error

hi, i am trying to use the speech SDK as mentioned in the URL: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/get-started-stt-diarization?tabs=macos&pivots=programming-language-python initially i got the error : 2025-06-08 -…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-08T08:29:30.8633333+00:00
Abdulla Rasfan 0 Reputation points
answered 2025-06-09T05:11:07.8566667+00:00
Pavankumar Purilla 8,745 Reputation points Microsoft External Staff Moderator
1 answer

Limitation on Text-to-Speech Audio Length in Azure Cognitive Services

How can I generate audio files longer than 10 minutes using Azure Cognitive Services' Text-to-Speech API? PS - Based on common issues that we have seen from customers and other sources, we are posting these questions to help the Azure community.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2024-07-31T06:44:42.8933333+00:00
santoshkc 15,435 Reputation points Microsoft External Staff Moderator
commented 2025-06-09T04:16:37.39+00:00
Nikita Khandare 60 Reputation points
2 answers

Introducing interpretation in Microsoft Teams using Azure AI Speech. But when and how?

Hello, I saw a few weeks ago the following Microsoft Azure Video where a call was translated in realtime. https://www.youtube.com/watch?v=r8gzes7aA7s Will be good to test this and be part of the BETA Testgroups. Where can I find more information about…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
Microsoft Teams | Microsoft Teams for business | Teams on mobile devices
asked 2025-02-04T13:45:52.1666667+00:00
Jose Lopez Moreno-ADM 10 Reputation points
commented 2025-06-05T18:55:11.9466667+00:00
RG 0 Reputation points
1 answer One of the answers was accepted by the question author.

At times Speech to Text, fast transcription, is suddenly slow!

Hi Sometimes, for the same audio file, the response is a lot more slower. & i am not talking of the "waking up" issue mentioned at https://learn.microsoft.com/en-us/answers/questions/2260261/speech-to-text-s0-error-429-on-first-call is this…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-03T09:05:35.0366667+00:00
It is VMS 100 Reputation points
accepted 2025-06-05T08:37:52.3633333+00:00
It is VMS 100 Reputation points
4 answers

How to increase parallel job processing quota in speech services speech to text batch transcription

Hello Azure Support, I’m using the Speech-to-Text v3.2 batch transcription API to process long-form audio recordings. Per Microsoft documentation, the maximum supported length for batch transcription is now 240 minutes per audio file and a 100 concurrent…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-03T22:32:38.7433333+00:00
Austin Chase 0 Reputation points
commented 2025-06-05T01:31:58.62+00:00
Pavankumar Purilla 8,745 Reputation points Microsoft External Staff Moderator
1 answer One of the answers was accepted by the question author.

Multiple locales error, REST, Speech to Text, fast transcription API

Here's the issue I use multiple locales as described at https://learn.microsoft.com/en-us/azure/ai-services/speech-service/fast-transcription-create?tabs=multilingual-transcription-on#request-configuration-options Locales given were "hi-IN,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-03T07:44:39.9433333+00:00
It is VMS 100 Reputation points
accepted 2025-06-03T09:03:35.2833333+00:00
It is VMS 100 Reputation points
1 answer One of the answers was accepted by the question author.

Incorrect pronunciation of Swedish word “reservation” in Azure TTS voices (Sofie, Mattias, Hillevi)

I am using the Azure Text-to-Speech service with Swedish voices (Sofie, Mattias, and Hillevi) to pronounce Swedish words. However, the pronunciation of the word “reservation” is clearly incorrect in all of these voices. Expected pronunciation (IPA):…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-05-25T09:50:13.81+00:00
Dmitrii Antonov 20 Reputation points
edited a comment 2025-06-02T16:23:09.0166667+00:00
Dmitrii Antonov 20 Reputation points
1 answer One of the answers was accepted by the question author.

How to play audio from Azure Speech Service in an outbound call using Azure Communication Service?

I have downloaded the Call Automation Outbound Calling sample project from Azure and am running it locally. The call connects to the target phone number, but the audio does not play. The code fails with the error: "Action failed due to a bad request…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-04-23T21:58:07.31+00:00
Ashley 60 Reputation points
accepted 2025-06-02T11:37:52.99+00:00
Ashley 60 Reputation points
2 answers One of the answers was accepted by the question author.

Fast Transcription API for Azure AI Foundry randomly returns 429 server too busy :-(

Currently, I rarely do fast audio transcription : https://learn.microsoft.com/en-us/azure/ai-services/speech-service/fast-transcription-create For example, I did a request for the 1st time in 3 or 4 days, & immediately get 429 server too busy (am…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-06-01T13:55:11.83+00:00
It is VMS 100 Reputation points
answered 2025-06-02T03:56:10.72+00:00
It is VMS 100 Reputation points
1 answer

The result of Japanese pronunciation phonemes is empty

I am trying Azure Speech STT Japanese Pronunciation Assessment in Speech Studio. However, the JSON output shows that the phonemes of each Japanese character are empty strings. However, each syllable has an Accuracy Score. "Syllables": [ …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-05-26T09:21:49.95+00:00
jlst_dev 0 Reputation points
commented 2025-05-30T14:55:19.0166667+00:00
santoshkc 15,435 Reputation points Microsoft External Staff Moderator
1 answer

Can we get custom response from GPT 4o Real-time Preview model. or can we customize the response?

I have checked the GPT real-time audio reference, but couldn't get anything for custom response. https://learn.microsoft.com/en-us/azure/ai-services/openai/realtime-audio-reference

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-05-27T10:38:15.8833333+00:00
Ali Abbas Baloch 0 Reputation points
edited a comment 2025-05-30T14:41:13.9433333+00:00
Ali Abbas Baloch 0 Reputation points
1 answer

Semantic Segmentation Property Error

Speech to Text Accuracy for Continuous Streaming I get the following error when I try to use Semantic Segmentation Property in Speech Cognitative Service: Recognition canceled: CancellationReason.Error Error details: Connection was closed by the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-05-25T16:07:54.5133333+00:00
Moiz Kapasi 0 Reputation points
commented 2025-05-30T08:30:29.23+00:00
Manas Mohanty 6,530 Reputation points Microsoft External Staff Moderator
1 answer

Unable to locate downloaded mp3 file from Speech Studio

Hi! Every time I download my mp3 file from Speech Studio, I get a message that the download is complete but I lost the folder connection to open the folder where the file has been downloaded. Pls see attached screenshot. Could someone please help?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,080 questions
asked 2025-05-22T09:21:54+00:00
Turek, Belinda 0 Reputation points
commented 2025-05-29T10:33:47.2766667+00:00
SriLakshmi C 6,250 Reputation points Microsoft External Staff Moderator