Azure AI Speech

1 answer

Azure pronunciation assessment async assessment

I am using the Azure Speech recognizer SDK to do pronunciation assessment of an audio file. The problem is that when the speech is in French, the results are always low and not expressive. const language = await detectSingleSpeechLanguage(text) …

asked

Iheb Jandoubi 5

answered

romungi-MSFT 42,761 Microsoft Employee

0 answers

SSML: Using <lang xml:lang=""> within a multilingual voice sounds incorrect / unlike when used with the language-specific voice

I am developing a TTS application that pronounces "nonsense words" with specific language pronunciations. For example, I am using Polish language voices to pronounce non-Polish words. If I use a Polish-specific language, I hear what I expect…

asked

mkb13 11

commented

dupammi 7,140 Microsoft Vendor

1 answer

Retirement Announcement - Upgrade to Text-to-Speech Neural Voice on 31 August 2024

Text-to-Speech currently supports both standard and neural voices. However, since the neural voices provide more natural sounding speech output, and thus, a better end-user experience, we are retiring the standard voices on 31st August 2024 and they will…

asked

romungi-MSFT 42,761 Microsoft Employee

commented

Vinod Mankare 0

0 answers

Is there any way to dub audio while maintaining its original intonation, breaks, and speed?

I have a voice recording with many deeper and higher tones, plus some breaks and "word emphasis" at specific moments, but when using the "Speech Translation" functionality, this audio loses all of its life (all this complexity),…

asked

Lucas 0

commented

santoshkc 4,925 Microsoft Vendor

0 answers

Azure Text to Speech F0 (Free) Tier Limits

Hi, I have the F0 (Free) Tier. I send a request to the TTS service and get the blendshape data and voice. When I make requests, the first 4 get a response. The 5th one no longer returns a response. If I restart my server, I can make another 4 request…

asked

Rob Enriquez 0

commented

Rob Enriquez 0

1 answer

Speech Recognition live transcription not detecting any language other than English

Hi, I am using a Speech Recognition resource in my application for live transcription. It works perfectly with English, but when I try speaking in Hindi it does not detect anything. I want to create my application for multiple languages used in…

asked

Jagwant singh 0

commented

Jagwant singh 0

0 answers

zh-CN-XiaochenNeural Abnormal timbre

zh-CN-XiaochenNeural has an abnormal timbre. The same problem occurred in October last year: https://learn.microsoft.com/en-us/answers/questions/1431823/the-timbre-of-the-voice-of-zh-cn-xiaochenneural-ha How long will it take to recover…

asked

斌周 0

commented

斌周 0

0 answers

Is it possible to stream Groq LLM responses into Azure TTS as and when I get them?

Hi! I'm trying to build a real-time LLM conversation bot, and need it to be as low-latency as possible. I have successfully set up TTS audio output streaming…

asked

arunnair 0

commented

dupammi 7,140 Microsoft Vendor

1 answer

How to have multiple mstts:audioduration in a single <speak>?

I'm trying to adjust the duration of individual phrases so that the synthesized voice matches with the voice in the original audio. It's working perfectly when done like this: <speak xmlns="http://www.w3.org/2001/10/synthesis"…

asked

Lucas 0

answered

dupammi 7,140 Microsoft Vendor

1 answer

Do Text to Speech (TTS) containers provide visemes and blendshapes like the API?

I'm currently using the Speech API and consuming the visemes and blendshapes that are returned. In an effort to reduce latency I would like to run the speech services locally via the text to speech container. Does the response of the container STT…

asked

Matt Ma 0

answered

Matt Ma 0

0 answers

OpenSSL Issue When Running Azure Speech to Text on Docker

Hey folks, I'm trying to run speech-to-text using Python in a Docker container, but I'm getting an SSL error. I have tried following the steps mentioned here for SSL setup and also installed the required dependencies as mentioned here. However, I'm still…

asked

Ayush Kumar 0

commented

romungi-MSFT 42,761 Microsoft Employee

0 answers

Exception while running Azure Speech to Text SDK with jar file (UnsatisfiedLinkError, setTempDirectory)

Hi Team, I'm getting errors while running my Java jar on Windows and CentOS 7. However, the same code runs fine in my Eclipse IDE. The issue occurs when I build the jar and run it in the environment. The error details are below: 2024-05-01 15:50:10…

asked

Ayush Kumar 0

edited a comment

VasaviLankipalle-MSFT 14,831

0 answers

Why am I getting a quota error?

I'm using Azure TTS and getting the following quota error: "You have reached the quota with your free-tier (F0) Speech resource. To continue to create audios with neural voices, switch to a standard paid resource, or upgrade your free-tier…

asked

Rich Hawksworth 0

commented

binarystar 0

2 answers

Low Confidence level of Language Identification

Hi, I was testing this file, which is in English, and somehow language identification returned a Low confidence level for the en-US locale. I used both the continuous and recognize-once options. Are there options I can set, to always ensure…

asked

Amper, Charwin (Contractor) 65

answered

santoshkc 4,925 Microsoft Vendor

1 answer

How can I use AzureTextToSpeech in PowerApps?

I selected the connector and put a button on the canvas. In OnSelect I placed the following code: Set( _myOutput, AzureTexttospeech.ConvertTextToSpeech("en-US-JennyNeural", 'AddressInput.Language'.'en-US', TextOut.Text)); The…

asked

Joël Simons 0

answered

navba-MSFT 17,395 Microsoft Employee

2 answers

Android app uses TTS SDK and 3 errors occur

Hello, the Android version of our app uses Microsoft's TTS SDK "com.microsoft.cognitiveservices.speech:client-sdk:1.34.0", but 3 errors appear frequently: Error 1: {CancellationReason:Error ErrorCode: ServiceTimeout ErrorDetails:USP error: timeout…

asked

newsay 25

commented

newsay 25

2 answers

I am happy with the results in "Speech Studio" for a sample wav file. How do I scale this up to longer files?

I have run a 1-minute wav file through the Speech Studio sample process and am pleased with the result. I can't figure out how to move forward in the system to process larger speech files. One branch seems to take me into a training setting where I…

asked

John Woolley 0

answered

YutongTie-MSFT 46,996

1 answer

Azure Pronunciation Assessment recognition offset lag

I'm using the Pronunciation Assessment with the recognizeOnceAsync method. We are presenting a word for assessment and measuring the response time. Sometimes the offset returned with the recognition corresponds closely with the time reported from the…

asked

Andrew Pasquale 20

accepted

Andrew Pasquale 20

0 answers

Speech synthesis language Hebrew not working

Hey, I am reaching out about an issue I have encountered with the speech synthesis language (microsoft.cognitiveservices.speech.sdk) functionality in JavaScript. I have noticed that when attempting to use the Hebrew language code (he-IL) for…

asked

Dorin Ben Haim 0

commented

romungi-MSFT 42,761 Microsoft Employee

0 answers

Azure Speech service not working in Firefox

Firefox can’t establish a connection to the server at wss://centralindia.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?language=en-US&format=simple&Ocp-Apim-Subscription-

asked

Fauzan Ahmad 0

edited a comment

VasaviLankipalle-MSFT 14,831

1,435 questions with Azure AI Speech tags