1,561 questions with Azure AI Speech tags

0 answers

How to eliminate audio interference from the speakers on the microphone when both are in use at the same time for speech recognition and synthesis

Hi, I'm creating a real-time voice chatbot. For speech recognition and synthesis I am using Azure Speech. What I do is recognize the voice, then send to an LLM to get a response, and then synthesize the response into audio in real time. My goal is that…

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,561 questions
asked 2024-07-30T17:57:22.45+00:00
Gerardo Arias 0 Reputation points
0 answers

How to read English words aloud syllable by syllable with text-to-speech? The purpose is to make videos for memorizing English words.

It feels like these voices are optimized for reading complete sentences, but they can't read individual words syllable by syllable.

Azure AI Speech
asked 2024-07-09T05:27:41.88+00:00
sxmud 0 Reputation points
commented 2024-07-30T08:15:11.8+00:00
d k 0 Reputation points
0 answers

About speaker separation in "fast-transcription-api"

Dear Azure Support Team, https://learn.microsoft.com/en-us/rest/api/speechtotext/transcriptions/transcribe?view=rest-speechtotext-2024-05-15-preview&tabs=HTTP The details of the TranscribeDefinition class are not described anywhere, so how should I do…

Azure AI Speech
asked 2024-07-29T03:39:03.9533333+00:00
y.ashibe 25 Reputation points
commented 2024-07-30T04:37:39.68+00:00
y.ashibe 25 Reputation points
0 answers

Who can provide assistance? The time required for speech-to-text processing of the same file varies greatly, by up to around 40%. Is this normal for Azure's performance?

As shown in the attachments, the first test took approximately 8.5 seconds (first_test_log.txt), but the second test took only approximately 5 seconds (Second_test_log.txt).

Azure AI Speech
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,655 questions
asked 2024-07-30T01:59:25.77+00:00
连博10335043 0 Reputation points
0 answers

Do you have any suggestions for using the speech-to-text function to recognize homophones that may cause errors?

For example, the Chinese "枯 (Ku)" is recognized as "哭 (Ku)". It cannot take context into account. This is just a probabilistic issue.

Azure AI Speech
Azure AI services
asked 2024-07-29T09:14:20.0033333+00:00
连博10335043 0 Reputation points
commented 2024-07-29T11:34:59.2366667+00:00
连博10335043 0 Reputation points
0 answers

My speech to text doesn't recognize multiple channels

When I upload my own file in the ingestionClient, it works, but when I use the sample code on GitHub it doesn't work and only gives me a single channel, so it doesn't pick up the multiple speakers. It works here:…

Azure AI Speech
asked 2024-07-28T07:31:46.7466667+00:00
Danial Bakhsheshi 20 Reputation points
commented 2024-07-29T04:56:03.69+00:00
navba-MSFT 20,975 Reputation points Microsoft Employee
1 answer

Error: com.microsoft.cognitiveservices.speech.SpeechConfig.setTempDirectory(Ljava/lang/String;)V when running the Java demo on Windows 10

I get the error com.microsoft.cognitiveservices.speech.SpeechConfig.setTempDirectory(Ljava/lang/String;)V when running the Java demo on Windows 10. How do I resolve this issue? I have installed Microsoft Visual C++ Redistributable for Visual Studio 2015, 2017, 2019, and 2022. The…

Azure AI Speech
asked 2024-07-29T02:52:32.6133333+00:00
guoxiaoqiang1 0 Reputation points
commented 2024-07-29T03:29:10.1166667+00:00
AshokPeddakotla-MSFT 30,516 Reputation points
0 answers

How to fix this error: Speech synthesis canceled: CancellationReason.Error Error details: Connection failed (no connection to the remote host). Internal error: 1. Error details: Failed with error: WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED wss://westeurope.tts.

I would like to use Azure Text to Speech on a Raspberry Pi 4 with Python, but it doesn't work. I get the following error: Speech synthesis canceled: CancellationReason.Error Error details: Connection failed (no connection to the remote host). Internal error:…

Azure AI Speech
asked 2024-07-25T20:44:07.1066667+00:00
ahmed.rhiat 0 Reputation points
edited the question 2024-07-29T03:25:51.8566667+00:00
AmaranS 4,190 Reputation points Microsoft Vendor
0 answers

Azure pronunciation assessment

In Azure pronunciation assessment for scripted speech, when I insert a word in my speech that does not exist in the script, why don't I get that word reported as an insertion in the pronunciation assessment result?

Azure AI Speech
Azure AI services
asked 2024-07-17T19:31:17.9933333+00:00
Iheb Jandoubi 25 Reputation points
commented 2024-07-28T00:39:06.03+00:00
YutongTie-MSFT 48,746 Reputation points
1 answer One of the answers was accepted by the question author.

Azure Real Time Speech To Text fails to take input from Blob URL

I have implemented Azure Real Time Speech to Text using the Speech SDK in Python for pre-recorded audio files. It works fine when the input audio is on my machine, but it fails when I give a Blob URL containing the audio as the input. Please help!

Azure AI Speech
Azure AI services
asked 2024-02-23T06:12:46.4266667+00:00
Indira Priyadarshini 60 Reputation points
commented 2024-07-25T06:28:23.8933333+00:00
Veerla, Tirupati Raju 0 Reputation points
0 answers

Internal Error for Custom Model for Italian language Project

I have a query regarding an issue we are facing while creating a custom model in the Azure Speech portal for the Italian language. It is throwing an internal error. The following is the list of items we have used. However, when we used the same…

Azure AI Speech
asked 2024-07-22T15:44:10.1633333+00:00
Ulhas Hulyal, Nilesh 0 Reputation points
commented 2024-07-24T19:33:00.7766667+00:00
VasaviLankipalle-MSFT 15,956 Reputation points
0 answers

When will fast transcription be GA?

I want to use fast transcription in production. When will it be GA?

Azure AI Speech
asked 2024-07-22T04:07:51.5033333+00:00
Quill Zhou 25 Reputation points
commented 2024-07-24T12:02:05.3466667+00:00
santoshkc 6,955 Reputation points Microsoft Vendor
0 answers

Reading from a Blob container instead of a public URI for the Azure Speech API

I can download the file through the code below from my blob storage: public async Task<Stream> ReadFileFromBlobStorageAsync(string blobName) { try { var containerClient =…

Azure AI Speech
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,644 questions
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
393 questions
asked 2024-07-23T13:34:29.9133333+00:00
Danial Bakhsheshi 20 Reputation points
commented 2024-07-24T08:01:10.7333333+00:00
Danial Bakhsheshi 20 Reputation points
0 answers

Pronunciation assessment SDK is getting stuck

I'm trying to integrate the pronunciation assessment speech services Python SDK - specifically a web front-end will upload an audio file to a FastAPI backend, which will then utilise Whisper to transcribe and then send the transcription together with the…

Azure AI Speech
Azure AI services
asked 2024-07-23T14:24:57.8833333+00:00
Dan Tang 0 Reputation points
commented 2024-07-24T04:20:34.3666667+00:00
VasaviLankipalle-MSFT 15,956 Reputation points
1 answer

Create or join a resource group

I would like to have or create a resource group to create a Speech Resource and have access to the text-to-speech tool. It is key for HP University.

Azure AI Speech
asked 2024-07-23T10:17:10.5333333+00:00
Rafa Martín 0 Reputation points
edited an answer 2024-07-23T17:27:00.6833333+00:00
YutongTie-MSFT 48,746 Reputation points
1 answer

Custom external lexicon does not work when calling TTS speech synthesis service using Java SDK

We don't want the * sign to sound, so we set up a custom lexicon, but the synthesized speech doesn't seem to be affected by the lexicon. <speak xmlns="http://www.w3.org/2001/10/synthesis" …

Azure AI Speech
asked 2024-07-19T03:11:48.4666667+00:00
xin chen 0 Reputation points
answered 2024-07-23T09:32:06.5266667+00:00
navba-MSFT 20,975 Reputation points Microsoft Employee
1 answer

How to synchronize real world events happening while speech recognition is happening with individual spoken words

I am trying to synchronize real world events that are occurring during live streaming of speech to Azure speech recognition services (e.g., eye gaze shifts, hardware device interactions, etc.). I note the time when I start speech recognition and record…

Azure AI Speech
asked 2024-07-01T11:45:38.4166667+00:00
Mark Miller (DevExpress) 0 Reputation points
commented 2024-07-23T05:32:05.5566667+00:00
YutongTie-MSFT 48,746 Reputation points
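
For the timing question above, here is a minimal sketch of the arithmetic involved. It is not a confirmed solution from the thread. It assumes each recognized word comes with an offset and duration in 100-nanosecond ticks measured from the start of the audio stream, in a dict shaped like the `Word`/`Offset`/`Duration` entries of detailed recognition output. The helper names and the 500 ms matching window are hypothetical. Capture and network latency between the recorded start time and the first audio sample are not modelled and would need to be measured separately.

```python
from datetime import datetime, timedelta, timezone

# Assumption: offsets and durations are expressed in 100-nanosecond ticks.
TICKS_PER_MICROSECOND = 10


def ticks_to_timedelta(ticks: int) -> timedelta:
    """Convert a tick count to a timedelta (microsecond resolution)."""
    return timedelta(microseconds=ticks // TICKS_PER_MICROSECOND)


def word_wall_clock(recognition_start: datetime, offset_ticks: int) -> datetime:
    """Estimate when a word began, given the wall-clock time audio started."""
    return recognition_start + ticks_to_timedelta(offset_ticks)


def words_near_event(words, recognition_start, event_time,
                     window=timedelta(milliseconds=500)):
    """Return the words whose spoken interval overlaps event_time +/- window.

    `words` is a list of dicts with hypothetical keys "Word", "Offset",
    and "Duration" (the latter two in ticks).
    """
    lo, hi = event_time - window, event_time + window
    matched = []
    for w in words:
        start = word_wall_clock(recognition_start, w["Offset"])
        end = start + ticks_to_timedelta(w["Duration"])
        if start <= hi and end >= lo:  # the two intervals overlap
            matched.append(w["Word"])
    return matched


if __name__ == "__main__":
    start = datetime(2024, 7, 1, 12, 0, 0, tzinfo=timezone.utc)
    words = [
        {"Word": "hello", "Offset": 5_000_000, "Duration": 3_000_000},
        {"Word": "world", "Offset": 20_000_000, "Duration": 4_000_000},
    ]
    gaze_shift = start + timedelta(seconds=2.1)
    print(words_near_event(words, start, gaze_shift))
```

Using timezone-aware `datetime` values for both the recognition start and the external events keeps the comparison unambiguous when the events are logged by a different device.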
0 answers

SpeechSynthesizer sometimes plays speech depending on SpeechSynthesisOutputFormat

In a C# WPF application, I call this function to convert text to speech: SpeechSynthesisResult speechSynthesisResult = await speechSynthesizer.SpeakSsmlAsync(strSsml); The audio data is returned ok. BUT the function also sometimes plays the speech as…

Azure AI Speech
asked 2024-07-03T14:10:30.4733333+00:00
One More Henry 20 Reputation points
edited the question 2024-07-23T03:29:49.1366667+00:00
VasaviLankipalle-MSFT 15,956 Reputation points
1 answer One of the answers was accepted by the question author.

Pronunciation Assessment: Inconsistent Results

Hi, I'm experiencing very inconsistent results with the pronunciation assessment SDK for the same audio file when using different regions. I have tested the swedencentral and the westeurope regions. I tested them in different languages, and the results…

Azure AI Speech
asked 2024-07-14T17:21:32.7233333+00:00
Jordan C 20 Reputation points
commented 2024-07-23T03:09:02.95+00:00
navba-MSFT 20,975 Reputation points Microsoft Employee
0 answers

Internal Server Error when running evaluation on Custom Speech

Trained a Speech to Text Model on Azure, tried running an evaluation on a test set and I'm getting "Token error rate results are not applicable for some old tests." on Speech Studio. A few weeks ago the same test wasn't giving any issues.

Azure AI Speech
asked 2024-07-19T01:21:36.6233333+00:00
MavRedSea 0 Reputation points
commented 2024-07-22T07:05:02.11+00:00
dupammi 8,035 Reputation points Microsoft Vendor