Azure AI Speech

1 answer

Internal server error creating a Test on Custom Speech Portal

Hello, I am using Speech Studio to test some audio data with human-labeled transcripts for Word Error Rate. Every test ends with "failed", and I have tried multiple audio files. I get the following error: Error message: Internal server…

asked

stt-training-user 1

answered

YutongTie-MSFT 46,976

1 answer

Visualize real-time voice data with stt

I want the user's real-time voice to be visualized when the app automatically moves on to the next page. I know that the Web Audio API requires a user gesture. Can STT get real-time voice without user gestures? If there's any other way to get user…

asked

seoyeon lee 1

answered

GiftA-MSFT 11,151

1 answer

Text to speech service does not support el-GR-AthinaNeural.

Hello, I am trying to use the text to speech service through PowerShell. Although I can successfully convert my Greek text to speech with the male voice, when I try the same conversion with the female voice (AthinaNeural) I get the error below: CANCELED:…

asked

dropdown 116

accepted

dropdown 116

1 answer

No module (_speech_py_impl) found error in python azure functions, could not install

1. Logged in using SSH, navigated to /home/site/wwwroot/functionname, and installed the modules, but got the same error. 2. Logged in using bash, navigated to /home/site/wwwroot/ -…

asked

Raphael Titus 1

commented

锐杨 1

1 answer

Speech to Text Rest API to accept authorization token for RecordingsUrl

Hi, my company has been using Azure's speech to text service and are happy with the results. However, we've hit a snag. For the RecordingsUrl parameter, I understand we will be passing in a blob uri that's public facing (no auth required), but our…

asked

tes432 61

accepted

tes432 61

1 answer

Text to Speech does not work using Local CMD/Powershell

Hello, I am trying to convert text to speech through Azure's Speech Service, using my local PC's CLI. After I successfully logged in through the relevant cmdlet (Connect-AzAccount), I ran the following commands: .\spx synthesize --nodefaults --region…

asked

dropdown 116

commented

GiftA-MSFT 11,151

1 answer

What servers allow Long Audio API for text-to-speech with Aria vocaloid on Azure?

asked

DDS 1

answered

GiftA-MSFT 11,151

0 answers

Why can't I resolve SPXSpeechConfiguration.framework when I do 'pod install' with the Podfile from helloworld from MSFT Cognitive Services SDK for Swift for TTS?

For more information: https://learn.microsoft.com/en-us/objectivec/cognitive-services/speech/spxspeechconfiguration I cannot find the SPXSpeechConfiguration when I use IntelliSense for Xcode. What is missing?

asked

DDS 1

commented

romungi-MSFT 42,206 Microsoft Employee

1 answer

Custom voice in DUTCH

Hello, I am trying to use the Speech service to create a custom voice in Dutch. Is it possible to create a custom voice in Dutch?

asked

Aman Sharma 6

answered

GiftA-MSFT 11,151

1 answer

Using text to speech for video narration (using speech studio)

I hope someone can help. I create local public sector health videos and would like to use the text to speech mp3 downloads to narrate videos. (They are ideal). I'm seeking clarification on whether this is permitted / allowed? I only intend using the…

asked

Simon Lightwood 1

commented

Simon Lightwood 1

1 answer

Data stored by Azure Speech

Hi, I use Azure Speech for speech-to-text and text-to-speech, and I would like to know more about data storage when using this service. Which of the following elements are stored on Microsoft servers? Audio: user's voice when using…

asked

Morpheus 1

commented

Ramr-msft 17,616

1 answer

Do you have any examples with Long Audio API using Objective-C or Swift 5+?

asked

DDS 1

commented

Ramr-msft 17,616

1 answer

Custom Speech quality issue.

I am developing a speech to text solution using the Azure Custom Speech API as a backend. I noticed a drop in the quality of custom trained models. Previously, I was kind of impressed by the result of a custom trained model where I can train with Text…

asked

nattawutb 1

commented

romungi-MSFT 42,206 Microsoft Employee

0 answers

Can't send API Requests from python code hosted on AWS Lightsail instance

I have a Django website that is hosted on an AWS Lightsail instance. I am trying to add a speech to text tool to it. I have tried the Microsoft Azure Speech to text APIs for this purpose and have used sample code from the website. The code works fine when I'm…

asked

Muzzamil Anwaar 1

commented

GiftA-MSFT 11,151

1 answer

How do I calculate the cost of creating a custom voice on Cognitive Services/Speech Service?

I have signed up for Speech Service and I have the data for training a model to create a custom voice by cloning the existing transcribed utterances. However, before I kick off the training of the model, I need to understand how much it will cost me.

asked

Greg Solovyev 1

commented

GiftA-MSFT 11,151

1 answer

Dutch Speaker Recognition

Hi, I have been doing some research on the Speech-to-Text API in Microsoft Cognitive Services. We want to develop a Speech-to-Text application in Dutch which would also have the ability to recognize multiple speakers. I found out that on…

asked

Max Willemsen 1

answered

GiftA-MSFT 11,151

1 answer

using start_keyword_recognition for keyword detection

I am following these documents for keyword detection: azure.cognitiveservices.speech.recognizer and custom-keyword-basics. I have followed each step and created the keyword model. However, I am stuck at using the recognizer class and…

asked

Tushar Saurabh 1

answered

GiftA-MSFT 11,151

0 answers

Cannot delete Custom Speech model due to transcription reference

I am attempting to delete a Cognitive Service Custom Speech model. However, the following error comes up: "The model is referenced by a transcription. Please delete all transcriptions conducted with this model first." I have had this issue on a model from a…

asked

RurouniJones 1

commented

RurouniJones 1

0 answers

MP3 Content Type for Creating a MediaSource

I am using Azure SSML TTS to get a C# Stream object in "audio-16khz-64kbitrate-mono-mp3" format and other customizations. I subsequently create a byte[] stream from it for cross-platform Xamarin distribution. For UWP I convert it to a…

asked

Marc George 21

commented

Roy Li - MSFT 32,011 Microsoft Vendor

1 answer

speech to text sdk

Is there a Golang SDK for speech to text? And is the protocol gRPC?

asked

sky 1

commented

GiftA-MSFT 11,151

1,398 questions with Azure AI Speech tags
