1,743 questions with Azure AI Speech tags

0 answers

How to disable the default "Disfluency Removal" of filler words after STT transcription in Azure AI Speech?

Azure AI Speech Services defaults to removing many filler words (uh, eh, etc.) via post-transcription "Disfluency Removal". My use case includes presentation analysis for filler words, which requires a verbatim transcript. Is there a…

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,743 questions
asked 2024-10-19T02:25:07.9633333+00:00
Dennis 0 Reputation points
0 answers

How to disable the default "Disfluency Removal" of filler words after STT transcription in Azure AI Speech?

My use case includes analyzing presentations for filler words and to do so I need verbatim transcripts. Azure STT defaults to "Disfluency Removal" (removing filler words) on the transcription before returning the JSON transcript file. Is…

Azure AI Speech
asked 2024-10-19T01:55:10.4433333+00:00
Dennis 0 Reputation points
0 answers

Never-ending "Loading resources and quota..."

When trying to deploy this model I'm met with the never-ending loading wheel. How to fix?

Azure AI Speech
asked 2024-10-17T08:58:02.44+00:00
Hilton Sewell 0 Reputation points
commented 2024-10-18T15:12:43.87+00:00
kothapally Snigdha (Quadrant Resource LLC) 20 Reputation points Microsoft Vendor
1 answer

How to run an Azure avatar in code

Good afternoon. I am writing code in Python FastAPI in which the bot asks questions and the user answers. Please point me to documentation for adding an Azure avatar.

Azure AI Speech
asked 2024-10-17T12:37:09.68+00:00
сергей туренко 20 Reputation points
answered 2024-10-17T20:56:21.1033333+00:00
YutongTie-MSFT 52,091 Reputation points
2 answers

Issue with Pronunciation Assessment in Speech Recognition API Always Returning PronScore 100

I am using the https://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1 API with the POST method for speech-to-text conversion. Here are the details of my implementation: Programming Language: JavaScript …
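For readers hitting the same endpoint: the REST speech-to-text API takes its pronunciation assessment settings from a `Pronunciation-Assessment` request header holding base64-encoded JSON. The sketch below only builds that header value. It is in Python rather than the asker's JavaScript, the helper name `build_pron_assessment_header` is hypothetical, and nothing here is confirmed as the cause of the constant score of 100, since the question text is truncated.

```python
import base64
import json


def build_pron_assessment_header(reference_text: str) -> str:
    """Return a base64 value for the Pronunciation-Assessment header.

    Hypothetical helper: the parameter values are illustrative choices,
    not the asker's actual configuration.
    """
    params = {
        "ReferenceText": reference_text,   # the script the speaker is expected to read
        "GradingSystem": "HundredMark",    # score on a 0-100 scale
        "Granularity": "Word",             # per-word scores
        "Dimension": "Comprehensive",      # accuracy, fluency, completeness
        "EnableMiscue": True,              # flag omitted or inserted words
    }
    raw = json.dumps(params).encode("utf-8")
    return base64.b64encode(raw).decode("ascii")


header_value = build_pron_assessment_header("Good morning.")
print({"Pronunciation-Assessment": header_value})
```

Comparing the decoded header that the application actually sends against the intended reference text is one inexpensive check to make before looking elsewhere.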

Azure AI Speech
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,866 questions
asked 2024-08-08T06:22:44.7766667+00:00
Heba Ghazaly 5 Reputation points
answered 2024-10-17T20:34:10.2666667+00:00
Sina Salam 11,206 Reputation points
0 answers

Why Am I Getting Connection Failed Error on Android build via Unity?

I get an error message on an Android build via Unity: Connection failed (no connection to the remote host). Internal error: 1. Error details: Failed with error: WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED. Speech SDK log taken from a run that exhibits the…

Azure AI Speech
asked 2024-10-17T11:07:36.4+00:00
Christian Jay Aligaga 0 Reputation points
edited the question 2024-10-17T15:26:37.23+00:00
VarunTha 8,935 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Azure AI Speech Studio TextToSpeech with voice "AlloyTurboMultilingual" shows "Error 400 Synthesis failed. StatusCode: NotFound"

Inside of Azure AI Speech Studio when trying to generate speech with the voice "AlloyTurboMultilingual" or "NovaTurboMultilingual" the following error occurs: Response status code does not indicate success: 400 (Synthesis failed.…

Azure AI Speech
asked 2024-10-10T11:29:28.04+00:00
NSM 20 Reputation points
accepted 2024-10-17T14:56:33.3833333+00:00
NSM 20 Reputation points
0 answers

Issue with Continuous Language Identification in Azure Speech SDK for Angular Application

We are currently using the "microsoft-cognitiveservices-speech-sdk" in our Angular application (version 14) for speech transcription and translation. The transcription and translation functionality is working as expected. However, we are…

Azure AI Speech
asked 2024-10-14T04:41:10.32+00:00
sanjay.bisht 0 Reputation points
commented 2024-10-17T06:09:48.5233333+00:00
romungi-MSFT 46,476 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Confused by the custom neural voice docs

Hi all, I'd like to implement custom text to speech voices and I was pleased to see that Azure offers different solutions, but I'm confused about the types of services. In the docs page I saw 3 types of custom speech methods: pro, lite and personal but in…

Azure AI Speech
asked 2024-10-14T16:18:01.1266667+00:00
Talkkit 20 Reputation points
accepted 2024-10-16T15:13:55.9066667+00:00
Talkkit 20 Reputation points
0 answers

Delay in Transcription (Multi-Device Conversation) Cognitive Speech

The documentation says that multi-device conversation is real time, but when I run the sample code there is a delay of about 12-15 seconds in transcription. How can I make it real time?

Azure AI Speech
asked 2024-10-14T09:27:08.78+00:00
Basil Ali Khan 0 Reputation points
commented 2024-10-16T13:04:55.45+00:00
santoshkc 8,955 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Pitch parameter in Speech service text to speech

Hi all, I am new to the Azure Speech service. In text to speech I am using the Conrad voice. How can we increase or decrease the pitch for a specific word?
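One common approach to per-word pitch is to wrap just that word in an SSML `<prosody pitch="…">` element. Below is a minimal sketch that builds such SSML as a string. The helper name `emphasize_pitch` is hypothetical, and the voice name `de-DE-ConradNeural` is an assumption about which voice the question means.

```python
from xml.sax.saxutils import escape, quoteattr


def emphasize_pitch(sentence: str, word: str, pitch: str = "+20%",
                    voice: str = "de-DE-ConradNeural",  # assumed voice name
                    lang: str = "de-DE") -> str:
    """Return SSML in which the first occurrence of `word` has its pitch changed."""
    before, found, after = sentence.partition(word)
    if not found:
        raise ValueError(f"{word!r} does not occur in the sentence")
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        f'xml:lang={quoteattr(lang)}>'
        f'<voice name={quoteattr(voice)}>'
        f'{escape(before)}'
        # Only the target word sits inside <prosody>; the rest keeps the default pitch.
        f'<prosody pitch={quoteattr(pitch)}>{escape(word)}</prosody>'
        f'{escape(after)}'
        '</voice></speak>'
    )


ssml = emphasize_pitch("Guten Morgen zusammen", "Morgen", "+20%")
print(ssml)
```

The resulting string can then be passed to the Speech SDK's SSML synthesis call (in Python, `SpeechSynthesizer.speak_ssml_async`). A negative value such as `-10%` lowers the pitch instead.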

Azure AI Speech
asked 2024-10-16T06:26:50.0166667+00:00
raji 40 Reputation points
accepted 2024-10-16T11:39:05.8+00:00
raji 40 Reputation points
0 answers

Azure Speech Studio custom keyword page always returns an API exception

I have trained a keyword. When I try to test the model on the page, it always returns: "API exception. This request is not authorized to perform this operation using this permission." And I can't download the model. There's no response on the…

Azure AI Speech
asked 2024-09-25T11:39:57.97+00:00
Liang HAN 5 Reputation points
commented 2024-10-16T08:58:43.4633333+00:00
santoshkc 8,955 Reputation points Microsoft Vendor
1 answer

Creating Custom Keywords Models are stuck "processing"

When creating a new model in the Speech Studio under 'Create a custom keyword for your virtual assistant', the model creation process never completes. I tried it with multiple models and keywords, even the examples like "hey computer" never…

Azure AI Speech
asked 2024-09-09T11:49:36.2466667+00:00
Ziq 15 Reputation points
commented 2024-10-16T07:55:10.8933333+00:00
Stefan Teufl 0 Reputation points
1 answer

[Multi-Device Conversation][DotNet] Cannot set display name/nickname when joining a conversation.

Hi, I tried to implement the code according to the example at the link: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/quickstart/csharp/dotnet/multi-device-conversation/helloworld/Program.cs. But when I set the display name…

Azure AI Speech
asked 2024-09-26T03:49:53.6466667+00:00
Văn Chương Mai 0 Reputation points
answered 2024-10-16T01:19:20.3933333+00:00
Văn Chương Mai 0 Reputation points
0 answers

Speech API fails where Speech Studio succeeds?

I am using the Standard Tier, with a couple of paragraphs of text and only a few SSML tags. The SSML pasted into Speech Studio renders correctly, and even exports to an audio file correctly. The same SSML rendered through the Python API causes the error below.…

Azure AI Speech
asked 2024-10-10T18:09:20.84+00:00
Jory 0 Reputation points
commented 2024-10-16T00:32:54.43+00:00
Jory 0 Reputation points
0 answers

My Speech Cognitive Basic Model cannot finish training

I want to train a cognitive model to help me do speech keyword recognition. I chose the basic model, but it has run for over 10 hours and still has not produced any result. The doc says it only takes a few minutes. The model is still processing without any result.

Azure AI Speech
asked 2024-09-22T00:50:36.7966667+00:00
xinyu du 0 Reputation points
commented 2024-10-15T09:38:02.3333333+00:00
SriLakshmi C (Quadrant Resource LLC) 345 Reputation points Microsoft Vendor
1 answer

Can I set a maximum number of participants for real-time diarization?

Hi, I followed the document below and succeeded in distinguishing speakers in streamed audio with the ConversationTranscriber class. (I don't use voice signatures, so it shows Guest-1,…

Azure AI Speech
asked 2024-10-10T08:27:10.33+00:00
RES 0 Reputation points
commented 2024-10-15T09:34:19.4033333+00:00
santoshkc 8,955 Reputation points Microsoft Vendor
1 answer

[Pronunciation Assessment] Is there a way to improve the results using a custom model?

I've been experimenting with the pronunciation assessment service. My use case involves scripted assessment in Canadian French (i.e., fr-CA). So far, I've had the most success with the configuration where enableMiscue is set to false, as I have found the…

Azure AI Speech
asked 2024-10-03T16:48:35.49+00:00
Francis W 0 Reputation points
commented 2024-10-15T09:21:53.73+00:00
SriLakshmi C (Quadrant Resource LLC) 345 Reputation points Microsoft Vendor
0 answers

Is there gRPC support for Speech to Text in the Azure Speech SDK in Java?

Hi, is there gRPC support for the Azure Speech SDK? We are looking for this support for the real-time Speech to Text feature. Is that support available in Java? If there is no gRPC support, what is the underlying architecture, and how is the voice streamed to…

Azure AI Speech
Azure AI services
asked 2024-10-10T12:56:31.7733333+00:00
Sai Vishnu Soudri 0 Reputation points
commented 2024-10-15T08:55:29.67+00:00
kothapally Snigdha (Quadrant Resource LLC) 20 Reputation points Microsoft Vendor
1 answer

Is there any API support for estimating the cost of audio translation?

Is there any API support for estimating the cost of translating audio from one language to another, taking inputs such as audio duration, source language, target language, etc.? If there is any such API support in Java, please help me. Thanks…

Azure AI Speech
asked 2024-10-09T04:28:42.14+00:00
Ganesh P 0 Reputation points
edited an answer 2024-10-15T06:05:05.9133333+00:00
romungi-MSFT 46,476 Reputation points Microsoft Employee