1,499 questions with Azure AI Speech tags

0 answers

Azure Text To Speech docker container throws an exception with viseme

I'm using the Azure Text to Speech docker image (mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.3.0-amd64-en-us-jennyneural). I'm passing it SSML through the dotnet SDK. When asking for viseme (via <mstts:viseme…

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,499 questions
asked 2024-06-30T01:37:32.95+00:00
Jon Peterson 26 Reputation points
0 answers

How can I set permissions on the resources?

Hello, I want to upload a text file to Speech Studio, but the system raised an error. Can anyone help me fix this and assign myself the proper role? I have already set my role to Cognitive Services User.

Azure AI Speech
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
1,074 questions
asked 2024-06-29T02:47:13.3533333+00:00
Jingxiong Wang 0 Reputation points
commented 2024-06-30T00:55:14.18+00:00
YutongTie-MSFT 47,916 Reputation points
0 answers

Speech-to-Text batch transcribe API in germanycentralwest doesn't work

Last Friday (May 31 2024) we started getting the following errors on all transcripts sent to the batch transcription API on our speech resource in…

Azure AI Speech
asked 2024-06-02T20:47:26.57+00:00
Matej the Mete 20 Reputation points
commented 2024-06-29T22:09:59.64+00:00
Matej the Mete 20 Reputation points
1 answer

Azure Cognitive Services Speech: Unable to get Custom Translator model results from speech translation code

In test C# code that I created based on the speech translation code in the following sample (“Using custom translation in speech translation”), I’m having trouble getting Custom Translator model translation results. The code just returns a cancellation…

Azure AI Speech
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
359 questions
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,559 questions
asked 2024-06-28T21:35:09.9366667+00:00
Hirai, Tetu 0 Reputation points
answered 2024-06-29T15:54:48.1466667+00:00
Sina Salam 6,341 Reputation points
2 answers

Why does Viseme Callback no longer deliver blendshapes?

Hi there, A few months ago, I developed a project based on Speech Synthesis and Viseme Blendshapes. At that time, I had no problems receiving the blendshapes. Recently I realized that my code was no longer working. I receive Viseme with the corresponding…

Azure AI Speech
asked 2023-12-02T11:58:31.9033333+00:00
Ebertowski, Rene 5 Reputation points
answered 2024-06-29T14:30:25.9866667+00:00
俊杰魏 0 Reputation points
1 answer

"504.0 GatewayTimeout" On Deployed Flask Web Application's do_task() function

My Flask app is currently hosted on Azure and was deployed through docker hub. Once a form is submitted the do_task() function is called. This function takes a long time as it is using WhisperX. The program runs without problem locally but Azure's 240s…

Azure AI Speech
Azure AI services
Azure Static Web Apps
An Azure service that provides streamlined full-stack web app development.
824 questions
asked 2024-06-27T13:03:26.35+00:00
Henrik Vlijter 0 Reputation points
answered 2024-06-29T09:38:30.2666667+00:00
YutongTie-MSFT 47,916 Reputation points
1 answer

How to create a dataset for Azure custom speech using spx (speechCLI)

I am using the following command for creating a custom speech dataset in my Azure Speech service: spx csr dataset create --api-version v3.1 --kind "Acoustic" --name "My Custom Speech" --description "My Acoustic Dataset…

Azure AI Speech
asked 2024-06-28T09:12:20.18+00:00
Mikel Broström Zalba 20 Reputation points
edited an answer 2024-06-29T08:34:55.51+00:00
Mikel Broström Zalba 20 Reputation points
0 answers

Is there a way for speech diarization to run across multiple files while keeping the speaker IDs the same for each speaker?

Let's say I have 5 large wav files of the same 4 speakers. The files are too large to concatenate into one wav file. Is there a way I can run diarization on these 5 files and keep the same speaker number for the respective voice across all files?

Azure AI Speech
asked 2024-06-26T23:08:15.51+00:00
Annie 0 Reputation points
commented 2024-06-29T04:09:03.0733333+00:00
navba-MSFT 19,480 Reputation points Microsoft Employee
0 answers

SpeakSsmlAsync is cancelled, but SpeakTextAsync is successful

I am trying out the Azure AI service to convert text to speech from a C# WPF application. My calls through SpeakTextAsync are successful, but my calls through SpeakSsmlAsync return with Reason = Cancelled. I am on the free tier for South…

Azure AI Speech
Azure
asked 2024-06-28T11:57:49.3633333+00:00
One More Henry 0 Reputation points
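
Editor's sketch: the question above is truncated, so the actual cause of the cancellation is unknown. One thing that can be ruled out offline is the SSML document itself, since a plain-text call does not involve SSML parsing at all. The check below assumes the documented requirements for the root element (`version="1.0"`, the W3C synthesis namespace, `xml:lang`) plus at least one `<voice>` element. The helper name `check_ssml` is hypothetical and is not part of the Speech SDK.

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def check_ssml(ssml):
    """Return a list of problems found in an SSML string (empty if none).

    Hypothetical helper for illustration; it only checks basic structure
    and does not contact the Speech service.
    """
    try:
        root = ET.fromstring(ssml)
    except ET.ParseError as exc:
        # Unclosed tags, bad quoting, stray '&' and similar XML errors.
        return [f"not well-formed XML: {exc}"]

    problems = []
    if root.tag != f"{{{SSML_NS}}}speak":
        problems.append("root element must be <speak> in the SSML namespace")
    if root.get("version") != "1.0":
        problems.append('missing version="1.0" on <speak>')
    if root.get(f"{{{XML_NS}}}lang") is None:
        problems.append("missing xml:lang on <speak>")
    if root.find(f"{{{SSML_NS}}}voice") is None:
        problems.append("no <voice> element under <speak>")
    return problems


if __name__ == "__main__":
    sample = (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xml:lang="en-US"><voice name="en-US-JennyNeural">Hello</voice></speak>'
    )
    print(check_ssml(sample))
```

If the list comes back empty, the document structure is probably not the problem, and the error details on the cancelled result are the next place to look.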
0 answers

Having issues when uploading dataset for custom STT model: Status: 400. The value of dataset.Locale is an invalid culture

I am uploading a dataset in Speech Studio to train a custom STT model. I have a .zip file that contains audio files in WAV format and their transcripts. After uploading, I receive this message: "Failed to upload data uch-ildiz. Error: Status: 400. The value of…

Azure AI Speech
asked 2024-06-27T07:05:58.88+00:00
Mutolaa 0 Reputation points
commented 2024-06-28T10:49:47.84+00:00
santoshkc 6,140 Reputation points Microsoft Vendor
0 answers

What are the HW or sound limitations for the echo cancellation algorithm in the Speech SDK?

Hi, I'm having some issues with echo cancellation on my device while using the Speech SDK. When I analyzed the sound recorded with the microphone, it seems that higher harmonics are present which are 24 dB lower than the primary…

Azure AI Speech
asked 2024-06-28T07:30:28.45+00:00
Faris Lemes 40 Reputation points
commented 2024-06-28T09:55:13.8466667+00:00
navba-MSFT 19,480 Reputation points Microsoft Employee
1 answer

Create a basic voice-interactive dashboard

Hello Team, I need to create a basic voice-interactive dashboard using Azure Cognitive Services such as the Speech service, CLU (Conversational Language Understanding), and Power BI. Please also suggest any other way to achieve this. It would be really helpful.

Azure AI Speech
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
379 questions
asked 2024-06-27T17:16:48.97+00:00
Vijayakumar Elumalai 105 Reputation points
commented 2024-06-28T02:53:55.7666667+00:00
Vijayakumar Elumalai 105 Reputation points
0 answers

Speech delay: length of audio created from SSML doesn't match my SSML file

Hello, I am using the Azure Cognitive Services API to transform SSML into audio, which works well except that my sentences don't start at the right time. For example, my last sentence, "Do hrnca dáme bravčovú masť?", should start at 35290ms but…

Azure AI Speech
asked 2024-06-25T11:57:47.4833333+00:00
Jakub Chudiak 0 Reputation points
commented 2024-06-26T15:17:43.4166667+00:00
VasaviLankipalle-MSFT 15,836 Reputation points
1 answer

What is the expected date for the AUDIO_INPUT_PROCESSING_ENABLE_V2 feature on Linux systems?

I'm having some issues with the current echo cancellation algorithm on a Linux system, and I've noticed that there is an enhanced version. Is there a date for when this feature will be available on Linux systems? …

Azure AI Speech
asked 2024-06-25T05:18:01.2766667+00:00
Faris Lemes 40 Reputation points
commented 2024-06-26T12:50:28.4533333+00:00
santoshkc 6,140 Reputation points Microsoft Vendor
1 answer

Currently, I know that STT is supported on mobile devices when the network is good. I would like to know if STT can still be used without a network connection.

Currently, I know that STT is supported on mobile devices when the network is good. I would like to know if STT can still be used without a network connection.

Azure AI Speech
asked 2024-06-19T09:00:35.2666667+00:00
Iris Zhang 0 Reputation points
commented 2024-06-26T09:43:08.5633333+00:00
AnuragSingh-MSFT 21,076 Reputation points
0 answers

Transcription result returns Access denied due to invalid subscription key or wrong API endpoint?

I'm trying to run a batch transcription based on the example here. I have extracted my subscription key via the screenshot attached: and am trying to run the code via: import requests import json subscription_key = "<KEY VIA…

Azure AI Speech
Azure AI services
asked 2024-06-25T21:41:35.3366667+00:00
Annie 0 Reputation points
commented 2024-06-26T09:01:13.62+00:00
Annie 0 Reputation points
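
Editor's sketch: the code in the question above is cut off, so the real cause of the "Access denied" response is not known. The snippet below only shows how a batch transcription creation request is usually put together, so that the two things named in the error (key and endpoint) can be inspected before anything is sent. It assumes the v3.1 REST path and the `Ocp-Apim-Subscription-Key` header; the function name, the display name, and the placeholder values are hypothetical. The region in the host name has to be the region of the Speech resource that issued the key.

```python
import json
import urllib.request


def build_transcription_request(region, subscription_key, content_urls, locale="en-US"):
    """Build (but do not send) a batch transcription creation request.

    Hypothetical helper for illustration only.
    """
    # Host name is derived from the resource region, for example "westeurope".
    url = f"https://{region}.api.cognitive.microsoft.com/speechtotext/v3.1/transcriptions"
    body = {
        "contentUrls": list(content_urls),
        "locale": locale,
        "displayName": "example transcription",  # hypothetical name
    }
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": subscription_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_transcription_request(
        "westeurope", "<YOUR_KEY>", ["https://example.com/audio.wav"]
    )
    # Inspect the target before sending; urllib.request.urlopen(req) would send it.
    print(req.get_method(), req.full_url)
```

Printing the URL first makes a region mismatch or a pasted key from a different resource easy to spot.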
0 answers

How to get audio file with the method recognizeOnceAsync of SpeechSDK.SpeechRecognizer?

I'm using the demo code from https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/quickstart/javascript/browser/from-microphone/index.html, as shown below: recognizer.recognizing = (s, e) => { console.log(RECOGNIZING:…

Azure AI Speech
Azure
asked 2023-07-07T10:09:45.7133333+00:00
MarsOnly 5 Reputation points
commented 2024-06-26T01:42:48.15+00:00
Cesar Bonadio 0 Reputation points
0 answers

Random Words Detected by Azure Speech Recognizer in Silence

Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…

Azure AI Bot Service
An Azure service that provides an integrated environment for bot development.
774 questions
Azure AI Speech
asked 2024-06-24T07:44:54.3933333+00:00
Abdul Subhan 5 Reputation points
commented 2024-06-25T04:18:44.3133333+00:00
dupammi 7,950 Reputation points Microsoft Vendor
3 answers

Azure Cognitive Services Speech is repeating the audio twice

While I am using Azure Cognitive Services Speech, the audio is repeated twice.

Azure AI Speech
asked 2024-06-18T07:02:02.35+00:00
Vijay Sheth 0 Reputation points
commented 2024-06-24T12:25:39.59+00:00
santoshkc 6,140 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

The Cognitive Services Speech SDK has no sound on iPhone's Safari, but can play successfully on Mac's Safari. How should this be handled?

The Cognitive Services Speech SDK has no sound on iPhone's Safari, but can play successfully on Mac's Safari. How should this be handled? (in react) const initializeSynthesizer = () => { const speechConfig =…

Azure AI Speech
asked 2024-06-11T11:25:35.4866667+00:00
jessebo 20 Reputation points
commented 2024-06-24T10:27:30.2333333+00:00
navba-MSFT 19,480 Reputation points Microsoft Employee