Azure Speech in Foundry Tools

2 answers

Documentation question: Does Microsoft Stream transcription on SharePoint use Azure AI Speech?

Hello, I'm working on a research project that requires me to accurately describe the technology stack used for transcription within Microsoft 365, and I'm hoping you can help me clarify a technical question that I have not been able to confirm through…

asked

Catherine M. Stockdale 0

commented

Catherine M. Stockdale 0

0 answers

What dialect of Irish are the Speech Studio text-to-speech models (Colm and Orla) modeled after?

I am interested in learning more about Speech Studio's text to speech models for the Irish language. Currently, there are two available voices on the platform (Colm and Orla) which are labelled under Irish (Ireland). The Irish language does not have an…

asked

Sean McGoff 0

commented

SRILAKSHMI C 18,035 Microsoft External Staff Moderator

1 answer

Is the next practical step an Azure Support request for gated Speaker Recognition access for the subscription?

One blocker remains, and it is on Azure’s side: the live Speaker Recognition smoke test returns 401 Unauthorized saying the subscription is not approved for the gated Speaker Recognition service. The resource/key are valid enough to exist and be mounted,…

asked

Lucy Noble 0

answered

Anshika Varshney 9,985 Microsoft External Staff Moderator

1 answer

Joanne, the Australian female voice

Hi there, My name is Jacqui, and I use an eyegaze system with voice output due to living with severe Cerebral Palsy, a physical disability, and I’m non-verbal. I’ve been searching for an updated Australian female voice for over two years now. I’ve had…

asked

Jacqui Rogers 20

commented

Georgia Kate Haege 0

1 answer

What is the Azure equivalent of input_audio_noise_reduction?

We have moved from an OpenAI-deployed Realtime 4o Mini to an Azure-deployed Realtime 4o Mini. In OpenAI we had the input_audio_noise_reduction property, but this property is not available in the Azure deployment. What is the Azure OpenAI equivalent of this property?

asked

Divjot Singh 20

commented

Anshika Varshney 9,985 Microsoft External Staff Moderator

0 answers

The v1 real-time STT endpoint returns a constant Confidence value of 0.039347406 for all NBest results globally.

The v1 real-time Speech-to-Text endpoint (/speech/recognition/{mode}/cognitiveservices/v1) returns a fixed Confidence value of 0.039347406 in the NBest array for every recognition result, regardless of audio quality, region, resource type, or SDK…

asked

OneReachAdmin-8533 0

commented

Anshika Varshney 9,985 Microsoft External Staff Moderator
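
For readers who want to check whether their own results show the constant Confidence value described in the question above, here is a minimal sketch. It assumes each recognition response is a JSON document with an `NBest` array whose entries carry a `Confidence` field, as the question describes; the helper names `nbest_confidences` and `looks_constant` and the sample payloads are hypothetical and not part of any SDK.

```python
import json


def nbest_confidences(response_text):
    """Extract every Confidence value from the NBest array of one response."""
    payload = json.loads(response_text)
    return [alt["Confidence"] for alt in payload.get("NBest", []) if "Confidence" in alt]


def looks_constant(values, tol=1e-9):
    """True when more than one value was collected and all are equal within tol."""
    return len(values) > 1 and max(values) - min(values) <= tol


# Hypothetical sample responses; only the number 0.039347406 comes from the question.
samples = [
    '{"RecognitionStatus": "Success", "NBest": [{"Confidence": 0.039347406, "Display": "Hello."}]}',
    '{"RecognitionStatus": "Success", "NBest": [{"Confidence": 0.039347406, "Display": "Goodbye."}]}',
]

collected = []
for text in samples:
    collected.extend(nbest_confidences(text))

print(looks_constant(collected))
```

Collecting values across several different audio files, rather than from a single response, is what makes the check meaningful.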

1 answer

Test: Azure Speech Service Not Available in Foundry Tools

Question: I’m attempting to use Azure Speech services through Foundry tools, but the option does not show up or fails during setup. Details: the Azure Speech resource is active; using the Foundry Tools interface; no clear error message, just missing or…

asked

Abhinava Maddha 105 Microsoft Employee

accepted

Abhinava Maddha 105 Microsoft Employee

1 answer

Azure Cognitive Services TTS fails to generate audio for certain Unicode characters (e.g., em dash, arrow symbols)

I'm using Azure Cognitive Services Text-to-Speech (via SSML) to generate audio from text content. Some Unicode characters cause the TTS engine to fail or produce no audio output, while others in the same Unicode block work correctly. Characters that…

asked

Pawel Pelka 20

accepted

Pawel Pelka 20
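
One possible workaround for the Unicode issue described in the question above is to normalize problem characters before building the SSML. The sketch below is an illustration only: the question's full list of failing characters is truncated, so the `REPLACEMENTS` table covers just the em dash and a right arrow named in the title, and the spoken substitutes, the helper name `to_ssml`, and the placeholder voice name are assumptions.

```python
from xml.sax.saxutils import escape

# Hypothetical substitutions; extend with whichever characters fail for you.
REPLACEMENTS = {
    "\u2014": ", ",    # em dash, replaced with a pause-like comma
    "\u2192": " to ",  # rightwards arrow, replaced with a spoken word
}


def to_ssml(text, voice="en-US-JennyNeural", lang="en-US"):
    """Replace known problem characters, XML-escape the text, and wrap it in SSML."""
    for char, spoken in REPLACEMENTS.items():
        text = text.replace(char, spoken)
    body = escape(text)  # escapes &, <, > so the text cannot break the XML
    return (
        f'<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f'xml:lang="{lang}"><voice name="{voice}">{body}</voice></speak>'
    )


print(to_ssml("Step 1 \u2192 Step 2 \u2014 done & dusted"))
```

Whether substitution is acceptable depends on the content; it changes what is spoken, so it is a stopgap rather than a fix for the underlying behavior.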

1 answer

Significant Regression in Code-Switching (EN-ZH) recognition for zh-HK after April 2026 Engine Update

Problem Description: We are reporting a critical regression in the Cantonese (zh-HK) Speech-to-Text service that started occurring after the March 31, 2026 API retirement and the subsequent rollout of the latest engine (MAI-Transcribe-1). The Issue:…

asked

Peter Ng 0

commented

Karnam Venkata Rajeswari 2,390 Microsoft External Staff Moderator

2 answers

Microsoft Foundry Speech Playground shows "Connection error to Realtime service: Invalid response status"

I had been using the "gpt-4o-realtime-preview" speech-to-speech model over the past months without issues. However, a few days ago it stopped working, not just in my Python project but even in the Microsoft Foundry > Speech Playground: When I…

asked

Phil Nylund 0

commented

SRILAKSHMI C 18,035 Microsoft External Staff Moderator

2 answers

Specifying a model in TranscriptionOptions of TranscriptionClient

Hi, I am using code similar to the one found in: link. However, when I specify a model in TranscriptionOptions: options = TranscriptionOptions( locales=["en-US"], enhanced_mode=enhanced_mode, models={ "en-US":…

asked

Roy Aad 0

answered

SRILAKSHMI C 18,035 Microsoft External Staff Moderator

2 answers

Azure Speech Fast Transcription concurrency and autoscaling question

Hello Azure Support, We are preparing to test Azure Speech Fast Transcription and want to understand the service limits so we can configure our KEDA scaler and Azure Service Bus concurrency correctly. Could you please clarify: What is the actual…

asked

Morales, Roberto (TR Product) 0

answered

SRILAKSHMI C 18,035 Microsoft External Staff Moderator

2 answers

Is it expected that add_target_language() silently drops translations when language codes overlap (e.g. en + en-US, fil + fi)?

Is it expected that add_target_language() silently drops translations when language codes overlap (e.g. en + en-US, fil + fi)? I've encountered an issue where certain combinations of target languages in TranslationRecognizer cause translations to…

asked

Chi Hsun Wang 0

commented

SRILAKSHMI C 18,035 Microsoft External Staff Moderator
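
A quick way to spot the kind of overlap the question above describes, before passing codes to `add_target_language()`, is to flag pairs where one code is a string prefix of the other. This is a sketch under an assumption: the question does not confirm that prefix matching is the cause, only that the `en` + `en-US` and `fil` + `fi` combinations are affected. The helper name `overlapping_codes` is hypothetical.

```python
from itertools import combinations


def overlapping_codes(codes):
    """Return pairs where one code is a case-insensitive string prefix of the other."""
    pairs = []
    for a, b in combinations(codes, 2):
        la, lb = a.lower(), b.lower()
        if la.startswith(lb) or lb.startswith(la):
            pairs.append((a, b))
    return pairs


targets = ["en", "en-US", "fil", "fi", "de"]
for a, b in overlapping_codes(targets):
    print(f"possible overlap: {a} / {b}")
```

Running the check up front lets an application warn about a risky combination instead of discovering missing translations at runtime.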

1 answer

Random Words Detected by Azure Speech Recognizer in Silence

Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…

asked

Abdul Subhan 10

commented

Greg Woods 46

1 answer

Interactive/Live Avatar not Showing Hand Movements

I recently trained and deployed a Custom Avatar using Azure Speech Studio. I had trained the avatar using videos in which there were significant hand movements. To test the avatar, I first checked it in the Text To Speech Avatar section where I gave a…

asked

DARSHIL SHAH7 80

accepted

DARSHIL SHAH7 80

0 answers

Azure Speech Ingestion client not processing audio input

Hello, I deployed the speech ingestion client via an ARM template. When uploading the first WAV file into the audio-input directory, I received an error saying that the service wasn't valid for the free tier, so I updated my subscription and the batch…

asked

PCS Dev 0

commented

SRILAKSHMI C 18,035 Microsoft External Staff Moderator

2 answers

How to estimate pricing (training + compute) for a custom tts avatar?

We want to create a custom avatar in Speech Studio. Of course, we need to estimate the costs for this project. We need the custom avatar + custom voice. Azure Pricing (https://azure.microsoft.com/en-us/pricing/details/speech/) is a bit confusing. For the…

asked

Manuel Tospann 336

commented

SRILAKSHMI C 18,035 Microsoft External Staff Moderator

1 answer

Unable to create speech service (Azure for Students subscription)

I'm getting the error about regions from a policy: 'Resource 's...' was disallowed by Azure: This policy maintains a set of best available regions where your subscription can deploy resources. The objective of this policy is to ensure that your…

asked

Isaac Mukiri 0

answered

SRILAKSHMI C 18,035 Microsoft External Staff Moderator

1 answer

How to use Azure TTS via the WebSocket API rather than the SDK?

I would like to use the Azure TTS input text streaming capability, as described in this documentation:…

asked

datou ai 0

commented

Pavankumar Purilla 11,495 Microsoft External Staff Moderator

1 answer

There seems to be a GC bug in the Azure Speech Python SDK. How can it be solved?

I would like to use the Azure TTS input text streaming capability, as described in this documentation:…

asked

datou ai 0

commented

Karnam Venkata Rajeswari 2,390 Microsoft External Staff Moderator

2,327 questions with Azure Speech in Foundry Tools tags