1,763 questions with Azure AI Speech tags

Sort by: Updated
1 answer

SpeakSsmlAsync Result always Canceled

Hello, I am building a project using Azure's SpeechSynthesizer. SpeechLog.txt I am running into the following problem: when calling SpeakSsmlAsync(ssmlText), the result always has a canceled state, and I am having a hard time understanding why. When I…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,893 questions
asked 2024-11-02T21:12:50.3466667+00:00
Sven Wind 0 Reputation points
answered 2024-11-03T11:13:52.02+00:00
Sven Wind 0 Reputation points
0 answers

How to disable the default "Disfluency Removal" of filler words after STT transcription in Azure AI Speech?

Azure AI Speech Services defaults to removing many filler words (uh, eh, etc.) via post-transcription "Disfluency Removal". My use case includes presentation analysis for filler words, which requires a verbatim transcript. Is there a…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-19T02:25:07.9633333+00:00
Dennis 0 Reputation points
commented 2024-11-03T05:11:42.3266667+00:00
navba-MSFT 24,890 Reputation points Microsoft Employee
0 answers

I need to know wether this API "Post-call transcription and analytics" can work with nodejs?

I need to know wether this API "Post-call transcription and analytics" can work with nodejs? If it is not, where I can get a proper Conversation converstion API with multi user and multi language dedection and retrun a text with given…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-30T05:40:45.9666667+00:00
Nidoos Solutions 0 Reputation points
commented 2024-11-02T13:04:24.58+00:00
santoshkc 9,150 Reputation points Microsoft Vendor
0 answers

Azure TTS Error 404

I get error 404 when trying to fetch the mp3 file via fetch. I am using Node.js in the backend. More details: I created a functionality in my app that creates an XML document containing all SSML tags as specified by Microsoft Azure. Is it possible some…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-30T10:59:10.9833333+00:00
Migs Marbella 0 Reputation points
commented 2024-11-02T12:47:51.0233333+00:00
santoshkc 9,150 Reputation points Microsoft Vendor
1 answer

When using batch speech transscription the ITN feature only applies to the first option of the nBest results.

When using batch transscription the ITN feature only applies to the first option of the nBest results, whitch is not necessarily the one with the highest confidence. The batch transscription service returns a json result with the following structure…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-28T19:17:58.12+00:00
Julian Kopka Heerup 0 Reputation points
commented 2024-11-01T09:39:00.3233333+00:00
Avinash Devarakonda 155 Reputation points Microsoft Vendor
1 answer

Getting error code 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when deployed to cloud.

I am working on an interactive real time communication that uses both speech synthesizer and recognizer. In development it is working fine but when I deployed through azure web app in a Linux server it was giving the error. I don't want to process any…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-30T11:40:14.46+00:00
Sajib Hossain 0 Reputation points
answered 2024-10-31T00:21:30.1166667+00:00
YutongTie-MSFT 52,596 Reputation points
1 answer

Stopping Audio Playback Mid-Stream with Microsoft Neural TTS Service and Speech SDK

I'm working with the Microsoft Neural Text-to-Speech (TTS) service using the Speech SDK. I've successfully implemented audio playback, but I'm facing a challenge with controlling the playback mid-stream. My question is: How can I implement a feature to…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-27T06:32:10+00:00
MD SHAKIL KHAN 0 Reputation points
commented 2024-10-30T15:03:32.6666667+00:00
Saideep Anchuri 160 Reputation points Microsoft Vendor
2 answers

Random Words Detected by Azure Speech Recognizer in Silence

Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…

Azure AI Bot Service
Azure AI Bot Service
An Azure service that provides an integrated environment for bot development.
840 questions
Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
370 questions
asked 2024-06-24T07:44:54.3933333+00:00
Abdul Subhan 5 Reputation points
answered 2024-10-30T13:19:41.52+00:00
Tiago Ribeiro 0 Reputation points
1 answer

Azure speaker recognition model

Hello, I have created a text-independent profile for the Azure speaker recognition model via REST API. I have encoded the.wav file and tried calling the create enrollment API for which I am getting the below error response for any data. { …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2021-07-01T06:54:14.833+00:00
EmergingTech PowerApps 1 Reputation point
edited a comment 2024-10-30T11:01:39.8733333+00:00
Rodrigo Micael Lopes Almeida 0 Reputation points
1 answer One of the answers was accepted by the question author.

Thai text has pause before IPA phoneme, Vietnamese doesn't - why?

I found different behaviors when using IPA phonemes in text-to-speech: Vietnamese: "không phải [May] xin lỗi" Flows naturally without pauses <speak xmlns="http://www.w3.org/2001/10/synthesis"…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-30T06:45:12.82+00:00
i'm MariOhn 81 Reputation points
accepted 2024-10-30T10:04:06.43+00:00
i'm MariOhn 81 Reputation points
1 answer

Language data import failed: Invalid or empty textproc out file!.

I am trying to train a custom speech to text model for Malayalam. When I am trying to upload the text data, I am getting the below error. Can someone explain to me what the error is?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-27T18:48:28.78+00:00
Kesavaraj V 0 Reputation points
commented 2024-10-30T09:41:59.81+00:00
kothapally Snigdha (Quadrant Resource LLC) 260 Reputation points Microsoft Vendor
0 answers

TTS is not working in IOS

We have been using Azure TTS for our webapp . But the TTS is not being supported on IOS while it works well on android. Also the Latency of TTS on android is very high .

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
370 questions
asked 2024-10-23T18:12:51.21+00:00
Sumit Goyal 0 Reputation points
commented 2024-10-29T17:50:52.35+00:00
Saideep Anchuri 160 Reputation points Microsoft Vendor
1 answer

Having problem with InitialSilenceTimeout for SpeechAssessment API

I'm using the SpeechAssessment API for an English Education App to rate students speech. Now I'm trying the API and been having trouble with the audios generated by the mobile App. It is like the API isn't recognizing the audio and setting the following…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-24T14:51:29.4866667+00:00
VINICIUS FELIZATTI JACINTO 0 Reputation points
commented 2024-10-29T14:15:12.7666667+00:00
Pavankumar Purilla 675 Reputation points Microsoft Vendor
1 answer

Azure AI Speech Studio TextToSpeech Error 400 Synthesis failed

I'm trying to generate many multilingual voice and for the majority of the multilingual I get this message : Response status code does not indicate success: 400 (Synthesis failed. StatusCode: NotFound, Details: service does not exist: service endpoint…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-23T19:49:27.4+00:00
Kack Cédric 0 Reputation points
commented 2024-10-29T13:44:22.6733333+00:00
Kack Cédric 0 Reputation points
0 answers

Is there a problem with creating Custom Keyword models on Azure Speech Studio?

Hi, I'm trying to create custom keyword models on Azure Speech Studio but they don't appear to finish processing. I have a model that has been stuck processing for almost 12 hours now. Both basic and advanced models are getting stuck. It seems to affect…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-27T18:12:58.44+00:00
Mark Hingston 0 Reputation points
commented 2024-10-29T09:57:22.0833333+00:00
SriLakshmi C 505 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Can Azure Speech to Text support raw Opus ?

Hi, Currently the Speech to text supports OPUS/OGG meaning I have to wrap my Opus packets in an ogg container. Is it possible to support raw…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-27T09:23:02.31+00:00
Regis Duflaut-Averty 20 Reputation points
accepted 2024-10-29T09:13:08.51+00:00
Regis Duflaut-Averty 20 Reputation points
2 answers

Is Azure Speech-to-Text API Charging Based on Audio Length or Duration When No Audio Data is Sent During Pauses?

I am using WebSocket to capture audio input from the user's microphone. This audio data is then sent for continuous recognition (speech-to-text). My application has the functionality to pause the microphone. When paused, since Azure Speech-to-Text…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-25T09:14:23.9133333+00:00
Hieu Phan 0 Reputation points
commented 2024-10-29T08:50:06.6333333+00:00
kothapally Snigdha (Quadrant Resource LLC) 260 Reputation points Microsoft Vendor
1 answer

some Azure TextToSpeech/edge tts voices are down

the voice "fr-FR-VivienneMultilingualNeural" returns no audio anymore while the other voices work fine, ive read other have their voices down too…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-25T16:02:43.9666667+00:00
madicke mbaye 0 Reputation points
commented 2024-10-28T16:46:22.7566667+00:00
santoshkc 9,150 Reputation points Microsoft Vendor
0 answers

Azure Speech Studio Andrew Multilingual voice sounds glitchy

I'm having some issues with the Andrew Multilingual (en-US-AndrewMultilingualNeural) voice in the Azure Speech Studio. There's a few instances in which the voice sounds raspy and really kind of glitchy. It seems to have a lot of trouble with the word…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
asked 2024-10-23T14:32:22.36+00:00
Rene Lems 0 Reputation points
commented 2024-10-28T16:17:02.53+00:00
Avinash Devarakonda 155 Reputation points Microsoft Vendor
1 answer

Is the pronounciation library feature available in Azure AI Services (Text to Speech)?

We are looking for customization of pronunciations using pronunciation library. Is this available in Azure AI services (Text to Speech)? if yes, please share the information and related cost of the same. Thanks in advance.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,763 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,893 questions
asked 2024-10-25T06:11:12.01+00:00
Radha Priya 0 Reputation points
commented 2024-10-28T15:49:48.3666667+00:00
kothapally Snigdha (Quadrant Resource LLC) 260 Reputation points Microsoft Vendor