How to set variables for SSML messages and speech style in Text-to-Speech
Hello. I am quite new, as is probably apparent. I am currently trying to set a variable for SSML Text-to-Speech; however, after going through every piece of documentation I could find, I still have no solution. I am using Python. I would be grateful if anyone…
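A common pattern for "variables in SSML" with the Python Speech SDK is to build the SSML document as an ordinary Python string, interpolating runtime values before handing it to the synthesizer. A minimal sketch, assuming the Python Speech SDK package; the voice name, style, and the key/region placeholders are illustrative:

```python
# Sketch: injecting Python variables into an SSML string before synthesis.
from xml.sax.saxutils import escape

def build_ssml(text: str, voice: str = "en-US-JennyNeural", style: str = "cheerful") -> str:
    """Interpolate runtime values (text, voice, style) into an SSML document."""
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="https://www.w3.org/2001/mstts" xml:lang="en-US">'
        f'<voice name="{voice}">'
        f'<mstts:express-as style="{style}">{escape(text)}</mstts:express-as>'
        '</voice></speak>'
    )

def speak(ssml: str, key: str, region: str) -> None:
    # Requires: pip install azure-cognitiveservices-speech
    # key/region are placeholders for your own subscription values.
    import azure.cognitiveservices.speech as speechsdk
    config = speechsdk.SpeechConfig(subscription=key, region=region)
    synthesizer = speechsdk.SpeechSynthesizer(speech_config=config)
    synthesizer.speak_ssml_async(ssml).get()
```

Since the SSML is just a string, any Python variable (a name, a temperature reading, a timestamp) can be dropped into `build_ssml` at call time.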
Chatbot in Android WebView
Hello, I have a LUIS-based chatbot integrated into a website. The chatbot works fine in desktop and mobile browsers (Google Chrome), but when I load the same webpage in an Android WebView it causes a problem: when I give voice input, it does not return…
Azure Text-to-Speech error
How do I solve an error in Azure Speech Text-to-Speech when searching turns up 60K+ matches? The error is "Failed to fetch voices" in all its splendor. Doh. "Where is the Any key?" as Homer asked. What am I doing? You mean short of…
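"Failed to fetch voices" frequently comes down to the wrong regional endpoint or an invalid key, so one way to isolate it is to call the Speech service's voices-list REST endpoint directly and see what comes back. A sketch, assuming the documented `tts.speech.microsoft.com` endpoint pattern; region and key are placeholders:

```python
import json
import urllib.request

def voices_list_url(region: str) -> str:
    # Regional Text-to-Speech endpoint that enumerates available voices.
    return f"https://{region}.tts.speech.microsoft.com/cognitiveservices/voices/list"

def fetch_voices(region: str, key: str) -> list:
    # Not executed here: requires a real subscription key.
    # A 401/403 response would point at the key; a DNS/connection error at the region.
    req = urllib.request.Request(
        voices_list_url(region),
        headers={"Ocp-Apim-Subscription-Key": key},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

If this call succeeds but the original tool still fails, the problem is on the client side rather than with the subscription.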
SpeechServices availability, authorizationToken validation
Hi, I implemented MS speech services in my macOS app (Objective-C). Everything works nicely. However, I would like to check for speech service availability and check whether the authorizationToken is valid, so I can inform the user that everything is okay. I would like to…
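One lightweight availability check is to request a fresh authorization token from the STS endpoint: a 200 response confirms both that the service is reachable and that the subscription key is valid. Tokens issued this way are short-lived (roughly 10 minutes), so re-requesting one before expiry doubles as "validation". A Python sketch of the idea (the app in question is Objective-C, but the endpoint and header are the same); region and key are placeholders:

```python
import urllib.error
import urllib.request

def token_url(region: str) -> str:
    # STS endpoint that exchanges a subscription key for a short-lived token.
    return f"https://{region}.api.cognitive.microsoft.com/sts/v1.0/issueToken"

def service_available(region: str, key: str) -> bool:
    # Not executed here: requires a real subscription key.
    req = urllib.request.Request(
        token_url(region), data=b"", headers={"Ocp-Apim-Subscription-Key": key}
    )
    try:
        with urllib.request.urlopen(req, timeout=5) as resp:
            return resp.status == 200
    except (urllib.error.HTTPError, urllib.error.URLError):
        return False
```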
recognizeOnceAsync not working for input from microphone on browser
I copied and pasted the code linked below into an empty file, then hardcoded my subscription key and region. When I open the file and click the button, the browser asks for permission to access the microphone. When I click Allow, the recognizeOnceAsync…
Speech-to-text authentication error (401) in GitHub SDK C++ console samples
I am testing Azure Cognitive Services Speech. The pricing tier is Free (F0). I followed the instructions for creating the instance and service; it is listed as Cognitive Services, API type: Speech. I am getting error code = 1: ErrorDetails=WebSocket upgrade…
Speech to Text SRT with Timestamp
Hi, I want a SubRip subtitle (SRT) file as output from Speech to Text. I tried to find the webpage where I can upload video/audio to check the result in SRT format, but I was not able to find any such link. Please provide the link where I can upload…
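There is no upload page that emits SRT directly, but the Speech SDK's recognition results carry `offset` and `duration` fields in 100-nanosecond ticks, which is enough to assemble an SRT file yourself. A sketch of the timestamp conversion and entry formatting (the recognition loop that produces the offsets is omitted):

```python
def ticks_to_srt(ticks: int) -> str:
    """Convert Speech SDK 100-nanosecond ticks to an SRT timestamp (HH:MM:SS,mmm)."""
    ms = ticks // 10_000            # 10,000 ticks per millisecond
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1_000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

def srt_entry(index: int, offset_ticks: int, duration_ticks: int, text: str) -> str:
    """Format one numbered SRT cue from a recognition result's offset/duration."""
    start = ticks_to_srt(offset_ticks)
    end = ticks_to_srt(offset_ticks + duration_ticks)
    return f"{index}\n{start} --> {end}\n{text}\n"
```

Feeding each recognized phrase through `srt_entry` with a running index and joining the entries with blank lines yields a valid `.srt` file.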
Some synthesized text is played together with the next synthesized text
An iOS app (Swift) synthesizes sound with the MicrosoftCognitiveServicesSpeech-iOS SDK ver. 1.15.0. Several sentences are played correctly, but then a portion of the synthesized text doesn't play in time; it plays together with the next synthesized text. Result from next…
Internal server error during custom speech model training
Hi team, we are facing an internal server error while training a custom speech model: "Please recreate the task in a while. If the failure still happens, please create an Azure support request." We have tried to train the model multiple times…
Speech to Text - labeled testing data for model training: include or exclude numbers and dates?
A recommendation from the development team is needed. It appears that when text transcriptions of the audio are uploaded for training, a normalization process is applied to the text, and that if the audio has decimals and dates in a row, the normalization…
Intonation plus Rate has not worked correctly for the last 3 days
Please correct this problem ASAP: intonation is not working with the rate (speed) setting. I am really enjoying Audio Content Creation so far, but I have to ask a question, as there seems to be a problem on the Audio Content Creation page. I am…
File location when audio logging is enabled in SpeechConfig
Hi, where is the file generated when audio logging is enabled in the SpeechConfig object?
Where is the cloud speech SDK?
I need to write a program that sends an utterance such as "What time is it?" to the Azure cloud and gets back the response audio. I have done something similar with the Amazon Alexa SDK as well as the Google Assistant SDK, but I cannot find the right SDK to…
Text-to-speech - time synchronization
Hello, I'm using the Speech Studio > Audio Content Creation tool to produce audio files with SSML. I need the audio files synchronized across different languages. In an SSML document with two sentences, is there a way to set the starting time of the…
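SSML has no absolute "start at time T" attribute, but the `<break time="…"/>` element inserts a fixed pause, so one workable approach is to pad each sentence with leading silence until it starts where you want. Azure's SSML caps a single `<break>` at 5000 ms, so longer offsets have to be split across several break elements. A sketch of that padding logic (the cap value is taken from the SSML documentation; everything else is illustrative):

```python
def sentence_with_lead_in(text: str, start_ms: int) -> str:
    """Prefix a sentence with <break> elements totalling start_ms of silence.

    Azure SSML limits one <break> to 5000 ms, so longer offsets are split
    into a chain of break elements.
    """
    breaks = []
    remaining = start_ms
    while remaining > 0:
        chunk = min(remaining, 5000)
        breaks.append(f'<break time="{chunk}ms"/>')
        remaining -= chunk
    return "".join(breaks) + text
```

Giving the second sentence the same computed lead-in in every language keeps the per-language audio files roughly aligned, since only the silence (not the speech rate) is being controlled.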
Can I use a personalized speech recognizer in Windows?
Hi, I've created a personalized speech recognition model in Microsoft Azure. I'd like to know how to use this model in Cortana, or in Windows in general, so as to open folders, documents, programs and so on. I'd also like to know whether it would be…
Is there a way to provide hints (words) to Speech to text service along with the audio file
I am wondering if I can provide hints (in the form of words) to the Speech to Text service along with the audio (.wav) file, so that they may help the service transcribe the audio more accurately. Please let me know.
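The Speech SDK supports exactly this through `PhraseListGrammar`: words or phrases added to it bias recognition toward those terms for the current session. A minimal Python sketch, assuming the Python Speech SDK package; the file path, key, and region are placeholders:

```python
def clean_hints(hints: list) -> list:
    """Drop blanks and duplicates from a list of hint phrases."""
    return sorted({h.strip() for h in hints if h.strip()})

def recognize_with_hints(wav_path: str, key: str, region: str, hints: list) -> str:
    # Requires: pip install azure-cognitiveservices-speech
    # Not executed here: needs a real key/region and a .wav file.
    import azure.cognitiveservices.speech as speechsdk
    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    audio_config = speechsdk.audio.AudioConfig(filename=wav_path)
    recognizer = speechsdk.SpeechRecognizer(
        speech_config=speech_config, audio_config=audio_config
    )
    phrase_list = speechsdk.PhraseListGrammar.from_recognizer(recognizer)
    for hint in clean_hints(hints):
        phrase_list.addPhrase(hint)  # bias recognition toward this term
    return recognizer.recognize_once().text
```

Phrase lists are applied per recognizer instance, so the hints must be re-added each time a new recognizer is created.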
Adding phrases using PhraseListGrammar with the Microsoft Azure SDK
Hi, I tried the following code to add a list of medicines with PhraseListGrammar. My code still produces the same inaccuracy, so I suspect the phrases are not taking effect. CODE SAMPLE * import azure.cognitiveservices.speech as speechsdk > import time >…
Missing apostrophes when uploading human-labeled transcript for custom speech
Hi, I am currently trying to create a custom STT model using the Custom Speech service. After uploading my audio + human-labeled transcript (a .txt file, tab-separated, UTF-8 with BOM), a lot of the apostrophes are missing in the human-labeled transcription…
How can I get a new ca-bundle.crt file?
I first used OpenSSL 1.0.1 to convert audio to text on CentOS 7.3, but I could not get a result after setting SSL_CERT_DIR and SSL_CERT_FILE. This doc says OpenSSL 1.1.1b is required, so I installed OpenSSL 1.1.1b from source. After I…
Long Audio C# sample: how to select a standard neural voice
Hello, the docs state that the Long Audio API can use both public and custom neural voices. When I try to use the batchsynthesis action from the C# samples, I get: CustomVoice-API batchsynthesis create subscriptionKey [YourSubscriptionKey] hostURI…