1,391 questions with Azure AI Speech tags

Sort by: Updated
2 answers

Speech_SegmentationSilenceTimeoutMs and speech segmentation

Dear Azure Technical Support, I'm using the Azure Speech Service for continuous speech recognition and I've encountered a behavior that I'd like to clarify. Historically, when using the continuous recognition mode, the service segmented the audio into…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2023-10-19T15:24:23.02+00:00
Domenico Zurlo 1 Reputation point
commented 2024-04-24T15:18:54.6733333+00:00
Sam Byng 0 Reputation points Microsoft Employee
2 answers

How can I fix WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED?

I have a FastAPI project which uses uvicorn server to run my application. speechsdk is used for Speech-to-Text operations, the endpoint I am using is…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2023-10-24T02:15:40.52+00:00
CRAZY INDIAN 0 Reputation points
answered 2024-04-24T12:31:43.1133333+00:00
洪斌 唐 0 Reputation points
0 answers

Do I have to be on GovCloud in order to connect/use Azure Speech Services hosted on GovCloud US Virginia?

Hi. I am working with a cloud providers solution that is located in Amazon us-east2 region. I am hoping you can help confirm if the Azure Cognitive STT and TTS integration will/should work with Azure Speech Services hosted on GovCloud US Virginia? …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-23T16:04:37.75+00:00
PeterJD 0 Reputation points
commented 2024-04-24T12:00:32.8033333+00:00
romungi-MSFT 41,961 Reputation points Microsoft Employee
0 answers

How to increase the time for which the Microsoft Speech Service SDK listens in a single go?

I am using MS speech service sdk for speech to text conversion. When I speak, my speech is converted to text after 60 seconds even if I haven't stopped speaking. It basically considers it one chunk and starts processing it. What can I do to increase this…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-03-19T09:56:24.93+00:00
Abdullah Nadeem 0 Reputation points
commented 2024-04-24T07:11:44.39+00:00
Abdul Moiz Tauqir 0 Reputation points
1 answer One of the answers was accepted by the question author.

Include custom audio files for keyword recognition training process

I am leveraging Azure Keyword Recognition service, it works pretty nice except some false wakeup. We've collected a bunch of false waking up audio files, and I was wondering whether there is some approach that we can include these false audio files into…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-23T10:16:00.5666667+00:00
Liu Wenbin (Lofty Team) 0 Reputation points
accepted 2024-04-24T06:08:38.0933333+00:00
Liu Wenbin (Lofty Team) 0 Reputation points
1 answer One of the answers was accepted by the question author.

Enabling Voice Interaction for Azure Health Bot website

I took my azure health bot and deployed it to a custom website and used the health bot container sample they have on GitHub. It says for Google Chrome, voice interaction should be enabled but the little microphone within the chat does not pop up (even if…

Azure AI Bot Service
Azure AI Bot Service
An Azure service that provides an integrated environment for bot development.
745 questions
Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-01-11T08:12:03.2833333+00:00
Ahad Anjum 60 Reputation points
accepted 2024-04-24T06:01:38.8833333+00:00
Ahad Anjum 60 Reputation points
0 answers

Pronunciation Assessment Results in Japanese Have Phonemes Empty

I'm experimenting with Azure Speech STT pronunciation assessment for Japanese, in Speech Studio. However, the JSON output shows that the phonemes for every Japanese character are empty strings "". Yet, each syllable has an AccuracyScore. But…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-23T04:09:14.57+00:00
Glen Wang 20 Reputation points
edited a comment 2024-04-24T04:48:27.5133333+00:00
navba-MSFT 16,940 Reputation points Microsoft Employee
0 answers

Is it possible to specify in Speech SDK to always use "lbs" instead of "£" when "pounds" is recognized?

Hi, is it possible somehow to configure speech sdk in a way when word "pound" is detected that it is always meant to be lbs, not £, for example when I say, "99 pounds" it is detected as "99 lbs", but if I said, "100…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-23T08:38:47.7566667+00:00
Faris Lemes 20 Reputation points
commented 2024-04-24T04:32:31.4966667+00:00
Faris Lemes 20 Reputation points
1 answer

Azure Pronuciation Assessment recognition offset lag

I'm using the Pronunciation Assessment with the recognizeOnceAsync method. We are presenting a word for assessment and measuring the response time. Sometimes the offset returned with the recognition corresponds closely with the time reported from the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-22T16:43:22.98+00:00
Andrew Pasquale 0 Reputation points
commented 2024-04-24T01:25:09.6933333+00:00
dupammi 6,150 Reputation points Microsoft Vendor
1 answer

I am happy with the results in "Speech Studio" for a sample wav file. How do I scale this up to longer files?

I have run a 1-minute wav file through the Speech Studio sample process and am pleased with the result. I can't figure out how to move forward in the system to process larger speech files. One branch seems to take me into a training setting where I…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-19T21:50:59.1133333+00:00
John Woolley 0 Reputation points
commented 2024-04-23T18:40:32.4466667+00:00
John Woolley 0 Reputation points
0 answers

Personal Voice : error 403

Hi, I have acces to the preview of Personal Voice I have test the demo I'm trying to create a real voice to use it in my application. I'm able to create the project :…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-03-25T12:47:08.0966667+00:00
Machiavello Benoit 0 Reputation points
commented 2024-04-23T12:55:10.68+00:00
Machiavello Benoit 0 Reputation points
2 answers

Android uses TTS SDK and 3 errors occur

Hello, our App Android version has used Microsoft's TTS SDK "com.microsoft.cognitiveservices.speech:client-sdk:1.34.0" But 3 errors appear frequently: Error 1: {CancellationReason:Error ErrorCode: ServiceTimeout ErrorDetails:USP error: timeout…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-16T01:53:41.8933333+00:00
newsay 20 Reputation points
commented 2024-04-23T09:03:50.9666667+00:00
newsay 20 Reputation points
1 answer

Speech service with custom endpoints.

When we were using Public Endpoints previously, we were able to start up to 80 concurrent connections per subscription key, and have not experienced any issues. However, when we start using Custom DNS Public Endpoints with whitelisted IP addresses, we…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-03T14:45:36.1433333+00:00
Mallu Swetha (MINDTREE LIMITED) 80 Reputation points Microsoft Vendor
commented 2024-04-23T07:36:55.84+00:00
YutongTie-MSFT 46,406 Reputation points
0 answers

zh-CN-XiaochenNeural Abnormal timbre

zh-CN-XiaochenNeural, abnormal timbre. The same problem occurred in October last year. https://learn.microsoft.com/en-us/answers/questions/1431823/the-timbre-of-the-voice-of-zh-cn-xiaochenneural-ha —————————————————————— How long will it take to recover…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-11T11:39:56.2733333+00:00
斌 周 0 Reputation points
commented 2024-04-23T07:00:00.0766667+00:00
YutongTie-MSFT 46,406 Reputation points
0 answers

I am receiving "Internal Server Error" on all batch speech to text requests

The Azure batch speech to text service has been working for us for some time, but today all of our requests started receiving "internal server error" responses. { "properties": { .... "error": { …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-18T04:01:09+00:00
Dustin Harmon 0 Reputation points
commented 2024-04-23T05:03:06.8433333+00:00
navba-MSFT 16,940 Reputation points Microsoft Employee
0 answers

Connection to azure cognitive service failed with Firefox

based on Firefox 84, we have a voice-assistant app, it works fine until last Saturday(4.11)。 Azure API from https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-javascript the api…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-16T10:14:58.88+00:00
Yan AI 10 Reputation points
commented 2024-04-23T03:27:45.5233333+00:00
Yan AI 10 Reputation points
0 answers

zh-CN-XiaochenNeural Abnormal timbre

zh-CN-XiaochenNeural, abnormal timbre. The same problem occurred in October last year. https://learn.microsoft.com/en-us/answers/questions/1431823/the-timbre-of-the-voice-of-zh-cn-xiaochenneural-ha ———————————————————— This situation has been ongoing for…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-22T09:07:52.45+00:00
斌 周 0 Reputation points
commented 2024-04-22T21:34:41.9233333+00:00
YutongTie-MSFT 46,406 Reputation points
0 answers

Do Text to Speech containers TTS provide visemes and blendshapes like the API?

I'm currently using the Speech API and consuming the visemes and blendshapes that are returned. In an effort to reduce latency I would like to run the speech services locally via the text to speech container. Does the response of the container STT…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-22T05:35:00.3933333+00:00
Matt Ma 0 Reputation points
commented 2024-04-22T18:12:39.0566667+00:00
VasaviLankipalle-MSFT 14,101 Reputation points
1 answer

Help Needed: Microsoft Cognitive Services Speech to Text Transcription Failed

Hi Everyone, I'm encountering difficulties with a transcription job using the Microsoft Cognitive Services Speech to Text API by creating a flow in Logic app, and I'm reaching out to the community for assistance in resolving this issue. Any insights,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-03T10:57:53.3933333+00:00
Keerthanaa Ilangovan 5 Reputation points
edited the question 2024-04-22T16:38:42.3866667+00:00
VasaviLankipalle-MSFT 14,101 Reputation points
1 answer

Concerns about speech services while using Custom End Points

I am writing to inquire about some concerns and questions we have regarding the configuration and performance of the Azure Cognitive Service, specifically the Speech service with custom domain name, which we are utilizing for our court application.   Our…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
asked 2024-04-02T08:02:59.5133333+00:00
Paluri Krishnaji (MINDTREE LIMITED) 100 Reputation points Microsoft Vendor
commented 2024-04-22T07:12:22.41+00:00
YutongTie-MSFT 46,406 Reputation points