Microsoft Q&A

Azure Speech

769 questions

An Azure service that integrates speech processing into apps and services.

769 questions with Azure Speech tags

0 answers

An error occurred in the text-to-speech preview area

An error occurred in the text-to-speech preview area. I set Southeast Asia as the region of the Speech service resource because it supports the preview speaker, but when I select the Chinese model "zh-CN-YunjianNeural", it has an…

asked 2023-02-08T03:42:28.36+00:00
selina 0 Reputation points
commented 2023-02-08T12:30:54.7333333+00:00
selina 0 Reputation points
0 answers

Docker container fails to run with a model trained on a structured text dataset

I'm testing and using a model trained with a specific dataset using a custom-speech-to-text container. Below is the command line I used. I only change the modelId for this command. docker run --name stt --rm -it -p 5500:5000 \ -v…

asked 2023-02-08T11:02:15.15+00:00
조선민 0 Reputation points
edited the question 2023-02-08T11:06:29.4066667+00:00
조선민 0 Reputation points
1 answer

How to poll asynchronous speech synthesis for status in Python

Hello, I have an object of type speechsdk.SpeechSynthesizer on which I run asynchronous speech synthesis with speech_synthesizer.speak_ssml_async(), and I want to be able to tell when the synthesis has completed (i.e. how to poll it for an…

Not Monitored
Tag not monitored by Microsoft.
24,025 questions
asked 2023-02-07T12:04:47.82+00:00
yme 0 Reputation points
answered 2023-02-08T10:47:11.8733333+00:00
yme 0 Reputation points
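
For the polling question above, one possible approach is to let the synthesizer's own events set a flag that can be checked without blocking. The sketch below is a minimal illustration, not the accepted answer from the thread. It assumes the object passed in behaves like speechsdk.SpeechSynthesizer, exposing synthesis_completed and synthesis_canceled signals with a connect(callback) method alongside speak_ssml_async(). The helper name start_synthesis_with_done_flag is hypothetical.

```python
import threading


def start_synthesis_with_done_flag(synthesizer, ssml):
    """Start asynchronous synthesis and return (future, done_event).

    `synthesizer` is assumed to behave like speechsdk.SpeechSynthesizer:
    it has `synthesis_completed` and `synthesis_canceled` signals with a
    `connect(callback)` method, plus `speak_ssml_async(ssml)`.
    """
    done = threading.Event()
    # Completion and cancellation both end the request, so either sets the flag.
    synthesizer.synthesis_completed.connect(lambda evt: done.set())
    synthesizer.synthesis_canceled.connect(lambda evt: done.set())
    # Handlers are connected before the request starts so no event is missed.
    future = synthesizer.speak_ssml_async(ssml)
    return future, done
```

With this in place, done.is_set() can be polled from a loop, done.wait(timeout=0.1) waits briefly between other work, and future.get() remains available as the blocking call that returns the synthesis result.
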
0 answers

How to transcribe an interview with two speakers from a single audio file, similar to Word 365, using the spx recognize CLI

Hello everyone, I have a series of interviews recorded as MP3 files and I would like to use the Azure Speech CLI to transcribe them in a format similar to that of the integrated Word 365 transcription feature, which is: I would like to use the Azure Speech service,…

Azure Cognitive Services
A group of Azure artificial intelligence services and cognitive APIs that help build intelligent apps.
950 questions
asked 2023-02-08T07:48:57.5733333+00:00
Nikolay Bogoychev 41 Reputation points
0 answers

There is something wrong with the Chinese voice model Yunye.

I am using Text-to-Speech in Microsoft Azure with the Chinese model "Yunye". On playback I hear a buzzing electrical sound (as in the following link): https://1drv.ms/v/s!AtIg22Hya6zakk7cF2N5LyUTRM4s?e=cX6rwy The same problem doesn't happen when I use…

asked 2023-02-07T16:14:31.9866667+00:00
tenthfive 35 Reputation points
commented 2023-02-08T07:03:29.05+00:00
romungi-MSFT 27,356 Reputation points Microsoft Employee
0 answers

Speech-to-text: Disfluency Removal configuration

I am using the speech-to-text REST API (Python) to do some research regarding fillers, pauses, and backtracking in Japanese (ja-JP). Can I configure disfluency removal while using the speech-to-text service? I need to have true text with all the fillers…

asked 2022-12-16T12:26:45.407+00:00
KEN KIM 11 Reputation points
edited the question 2023-02-07T21:36:53.3933333+00:00
YutongTie-MSFT 24,981 Reputation points
6 answers

Mac M1: CLI SPX Command not found

On my MacBook M1 (20219, macOS 12.4) I have successfully installed dotnet-sdk-6.0.301-osx-arm64 and the Speech CLI via dotnet tool install --global Microsoft.CognitiveServices.Speech.CLI but when I type spx help I get zsh:…

asked 2022-06-27T10:46:06.86+00:00
hendryman 1 Reputation point
edited the question 2023-02-07T19:04:12.2033333+00:00
YutongTie-MSFT 24,981 Reputation points
0 answers

Why is there a buzzing sound during playback when I use the text-to-speech tool?

Hi there, I am using Text-to-Speech in Microsoft Azure. When I select the Chinese language and a voice such as "Yunye" and play the audio, I hear a buzzing electrical sound (see attachment). When I choose other voices, they are all normal. Only this one voice has a…

Azure Cognitive Services
asked 2023-01-31T05:21:09.5133333+00:00
Rolando Chen 40 Reputation points
commented 2023-02-07T11:56:03.8533333+00:00
AYAKI 肉球票猎人 5 Reputation points
0 answers

Unable to delete audio wav file after stop_continuous_recognition

Hi, I am using the Azure start/stop_continuous_recognition functions for continuous transcription of large WAV audio files. After transcription I need to delete the files from local storage so that my server does not run out of space after transcribing many files. It…

asked 2023-02-04T07:01:39.33+00:00
Deepti Rajput 0 Reputation points
commented 2023-02-06T08:03:40.6866667+00:00
romungi-MSFT 27,356 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Speech Service Cost

I am confused about the free products and the free trial. There are some free products like Speech, but I was told that my usage has exceeded the quota. If I move to pay-as-you-go, is Speech still free? Is that because my free trial has ended? I am new to Azure, apologies for…

asked 2023-01-31T17:09:33.29+00:00
Haans 40 Reputation points
accepted 2023-02-06T00:35:42.7+00:00
Haans 40 Reputation points
1 answer One of the answers was accepted by the question author.

SpeechSynthesisWordBoundaryEventArgs Class

In the Speech SDK documentation I found that the SpeechSynthesisWordBoundaryEventArgs class is a possible solution for us. Unlike the REST API documentation, there is no sample code showing how to use it. Is SSML required for this part? How do we locate words?

asked 2023-01-31T21:28:56.3266667+00:00
Nick 45 Reputation points
accepted 2023-02-05T11:36:33.1+00:00
Nick 45 Reputation points
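
For the word boundary question above, a handler can record where each word sits in the input text and in the audio. The sketch below is only an illustration under stated assumptions: it uses the attribute names of the Python SDK's SpeechSynthesisWordBoundaryEventArgs as understood here (text, text_offset, word_length, and audio_offset in 100-nanosecond ticks), and make_word_boundary_collector is a hypothetical helper name. Other SDK languages name these properties differently.

```python
TICKS_PER_MS = 10_000  # audio_offset is assumed to be in 100-nanosecond ticks


def make_word_boundary_collector():
    """Return (handler, words); the handler appends one dict per event."""
    words = []

    def on_word_boundary(evt):
        words.append({
            "text": evt.text,                # the word being spoken
            "text_offset": evt.text_offset,  # character offset in the input
            "word_length": evt.word_length,  # length of the word in characters
            "audio_ms": evt.audio_offset / TICKS_PER_MS,  # position in the audio
        })

    return on_word_boundary, words
```

In the Python SDK the handler would be attached with synthesizer.synthesis_word_boundary.connect(handler) before starting synthesis. Afterwards, words holds one entry per boundary event, which can be used to highlight text in step with playback.
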
1 answer

How to train Custom Speech-To-Text Model to recognize a word and capitalize the first letter of the phrase.

Hi, I have created a custom speech-to-text model that recognizes a phrase. It's recognizing it perfectly, but I want the resulting text in capitalized form. For example, the phrase is "Terminator" and the resulting text is "terminator"…

asked 2023-01-19T15:09:37.89+00:00
akshay chaturvedi 0 Reputation points
answered 2023-02-04T23:06:13.8766667+00:00
akshay chaturvedi 0 Reputation points
1 answer

Unable to delete audio file

Hi, I am using the Azure speech-to-text service. I start with a video file and extract the audio file using ffmpeg. import azure.cognitiveservices.speech as speechsdk speech_config = speechsdk.SpeechConfig(subscription=key, endpoint=endpoint) …

asked 2022-04-08T09:18:19.767+00:00
Pooja Kamra 6 Reputation points
commented 2023-02-04T06:43:14.46+00:00
Deepti Rajput 0 Reputation points
2 answers

Can't view custom speech model data

I am trying to create and test a custom speech model. I'm able to go through all of the steps to upload data, train the model, and test the model. However, I can't view the contents of files after I upload them (for example, a plain text file has a…

asked 2022-10-06T16:12:28.1+00:00
Victoria 11 Reputation points
answered 2023-02-03T16:21:57.9866667+00:00
Victoria 11 Reputation points
2 answers

Waiting on Microsoft Azure Ashley to Unlock speaking styles

Hello Microsoft Q&A community, I have been trying to use the speaking style selection feature on Microsoft Azure's TTS service to make Ashley's TTS voice sound more human and emote. However, I have noticed that the speaking style is stuck on…

asked 2023-01-23T01:52:07.8066667+00:00
Jacob Bender 0 Reputation points
answered 2023-02-03T13:05:33.01+00:00
Oxueillirep 131 Reputation points
1 answer

How to use more than one voice in a TTS JavaScript snippet

The TTS JavaScript project I am currently working on needs to use two voices with the ability to switch between them. However, as far as I can see, I can only configure the synthesizer engine with one voice using…

Cognitive Service for Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
171 questions
asked 2023-01-13T18:23:23.7933333+00:00
PerryHS 10 Reputation points
commented 2023-02-03T05:10:26.81+00:00
romungi-MSFT 27,356 Reputation points Microsoft Employee
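
For the two-voice question above, one commonly used option is to put several voice elements in a single SSML document rather than reconfiguring the synthesizer. The question concerns JavaScript, but the SSML itself is language-neutral, so the sketch below builds it in Python. The helper name build_multi_voice_ssml is hypothetical, and the voice names in the usage note are examples only, not taken from the thread.

```python
from xml.sax.saxutils import escape, quoteattr


def build_multi_voice_ssml(segments, lang="en-US"):
    """Build one SSML document from (voice_name, text) pairs.

    Each pair becomes its own <voice> element, so the speaker can change
    from one segment to the next within a single synthesis request.
    """
    voices = "".join(
        # quoteattr/escape keep the XML valid if names or text contain &, <, or quotes.
        f"<voice name={quoteattr(voice)}>{escape(text)}</voice>"
        for voice, text in segments
    )
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>{voices}</speak>"
    )
```

For example, build_multi_voice_ssml([("en-US-JennyNeural", "Hello."), ("en-US-GuyNeural", "Hi there.")]) yields a string with two voice elements. In the JavaScript SDK, a string like this would be passed to speakSsmlAsync instead of speakTextAsync.
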
0 answers

Can't play Custom Neural Voice

Hello, I can't play a custom voice because of this error: Unsupported voice CustomVoiceNeural. websocket error code: 1007. Code for fetching: ` async function synthesizeSpeech(responseText) { const speechConfig =…

asked 2023-01-26T23:22:17.8666667+00:00
Anastasia Germanova 0 Reputation points
commented 2023-02-02T10:29:39.2966667+00:00
romungi-MSFT 27,356 Reputation points Microsoft Employee
0 answers

How To Fix Error 4429 "exceeded the concurrent request limit"

I can't synthesise any speech anymore. I get this error: Speech synthesis canceled:  Connection was closed by the remote host. Error code: 4429.  The request is throttled because you have exceeded the concurrent request limit allowed for your sub USP…

asked 2023-02-01T20:46:50.6166667+00:00
bozolino 0 Reputation points
commented 2023-02-02T06:02:14.5333333+00:00
romungi-MSFT 27,356 Reputation points Microsoft Employee
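
For the throttling question above, a client-side mitigation that is often tried is to cap the number of requests in flight and retry throttled ones after a growing delay. The sketch below is a generic pattern, not a confirmed fix for error 4429. ThrottledCaller, request, and is_throttled are hypothetical names, and the default limits are placeholders rather than actual service quotas.

```python
import random
import threading
import time


class ThrottledCaller:
    """Cap in-flight calls and retry throttled ones with exponential backoff."""

    def __init__(self, max_concurrent=2, max_retries=4, base_delay=0.5,
                 sleep=time.sleep):
        self._slots = threading.BoundedSemaphore(max_concurrent)
        self._max_retries = max_retries
        self._base_delay = base_delay
        self._sleep = sleep

    def call(self, request, is_throttled):
        """Run request(); retry while is_throttled(result) is true."""
        for attempt in range(self._max_retries + 1):
            with self._slots:  # at most max_concurrent requests run at once
                result = request()
            if not is_throttled(result) or attempt == self._max_retries:
                return result
            # Back off with jitter; the slot is already released while waiting.
            delay = self._base_delay * (2 ** attempt)
            self._sleep(delay + random.uniform(0, delay / 2))
```

Here request would wrap one synthesis call and is_throttled would inspect its result for the throttling error. If the last attempt is still throttled, that result is returned so the caller can report it.
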
0 answers

How to configure disfluency removal using the REST API

I am using the speech-to-text REST API (Python) to do some research regarding fillers, pauses, and backtracking in Japanese (ja-JP). Can I configure disfluency removal while using the speech-to-text service? I need to have true text with all the fillers in…

asked 2023-01-28T11:49:33.1266667+00:00
KEN KIM 11 Reputation points
commented 2023-02-01T11:13:34.1266667+00:00
romungi-MSFT 27,356 Reputation points Microsoft Employee
2 answers

Finding a Specific Azure Text-To-Speech Voice

Can anyone tell me which text-to-speech voice is used in this YouTube Shorts video: https://www.youtube.com/shorts/zS2mgrbPGSk I've been looking for days and would really appreciate it if someone could help out. I am 99.3% sure it's on Azure.

asked 2023-01-30T05:02:35.3266667+00:00
Jimmy John 0 Reputation points
answered 2023-01-31T23:19:47.54+00:00
VasaviLankipalle-MSFT 566 Reputation points Microsoft Employee