Text to speech is not working

Question

Oliver Weidner 1

Hello guys,
I was really excited to try Azure text-to-speech, but even the sample code isn't working.

This one is from the official page:
https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/get-started-text-to-speech?tabs=windows%2Cterminal&pivots=programming-language-python

> import os  
> import azure.cognitiveservices.speech as speechsdk  
>   
> # This example requires environment variables named "SPEECH_KEY" and "SPEECH_REGION"  
> speech_config = speechsdk.SpeechConfig(subscription=os.environ.get('SPEECH_KEY'), region=os.environ.get('SPEECH_REGION'))  
> audio_config = speechsdk.audio.AudioOutputConfig(use_default_speaker=True)  
>   
> # The language of the voice that speaks.  
> speech_config.speech_synthesis_voice_name='en-US-JennyNeural'  
>   
> speech_synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=audio_config)  
>   
> # Get text from the console and synthesize to the default speaker.  
> print("Enter some text that you want to speak >")  
> text = input()  
>   
> speech_synthesis_result = speech_synthesizer.speak_text_async(text).get()  
>   
> if speech_synthesis_result.reason == speechsdk.ResultReason.SynthesizingAudioCompleted:  
>     print("Speech synthesized for text [{}]".format(text))  
> elif speech_synthesis_result.reason == speechsdk.ResultReason.Canceled:  
>     cancellation_details = speech_synthesis_result.cancellation_details  
>     print("Speech synthesis canceled: {}".format(cancellation_details.reason))  
>     if cancellation_details.reason == speechsdk.CancellationReason.Error:  
>         if cancellation_details.error_details:  
>             print("Error details: {}".format(cancellation_details.error_details))  
>             print("Did you set the speech resource key and region values?")

I am using Python 3.9 and have installed the Microsoft Visual C++ Redistributable for Visual Studio 2015, 2017, 2019, and 2022. I also tried setting the environment variables, but that didn't work either.

This is my output:

> Enter some text that you want to speak >  
> hello there  
> Speech synthesis canceled: CancellationReason.Error  
> Error details: USP error: timeout waiting for the first audio chunk  
> Did you set the speech resource key and region values?
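The last line of that output asks whether the key and region were set. A minimal sketch for ruling that out before calling the service, assuming the `SPEECH_KEY` and `SPEECH_REGION` variable names mentioned in the sample's comment; `load_speech_settings` is a hypothetical helper, not part of the Speech SDK:

```python
import os


def load_speech_settings(env=None):
    """Return (key, region) from the environment, or raise if either is missing.

    Hypothetical helper for illustration; not part of the Speech SDK.
    """
    env = os.environ if env is None else env
    # Treat unset and whitespace-only values the same way.
    missing = [name for name in ("SPEECH_KEY", "SPEECH_REGION")
               if not env.get(name, "").strip()]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return env["SPEECH_KEY"].strip(), env["SPEECH_REGION"].strip()
```

If this raises, the variables are not visible to the Python process; a terminal or IDE that was already open when the variables were set usually needs to be restarted. If it returns, the credentials are at least present, and a timeout like the one above probably has another cause.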

I also generated a new API key and a new text-to-speech module. I don't know what the problem is...

This code doesn't show any error; it just creates an empty test.wav file:

> def synthesize_to_speaker():  
>     # Find your key and resource region under the 'Keys and Endpoint' tab in your Speech resource in Azure Portal  
>     # Remember to delete the brackets <> when pasting your key and region!  
>     speech_config = speechsdk.SpeechConfig(subscription=key, region=region)  
>     # In this sample we are using the default speaker  
>     # Learn how to customize your speaker using SSML in Azure Cognitive Services Speech documentation  
>     # audio_config = AudioOutputConfig(use_default_speaker=True)  
>     audio_config = speechsdk.audio.AudioOutputConfig(filename="file.mp3")  
>     synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=audio_config)  
>     synthesizer.speak_text_async("Text Text Text wie geht es dir du fettes bier ich hau mir ne nudel nei")  
>   
>   
> synthesize_to_speaker()
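One thing worth checking in the snippet above: `speak_text_async` returns a future, and the script never calls `.get()` on it, so the program may exit before any audio is written. The file is also named `.mp3`, while the SDK's default output is a WAV (RIFF) format. The following is a minimal sketch that waits for the result and requests MP3 explicitly. `synthesize_to_file` and `output_is_empty` are hypothetical helper names, and choosing MP3 is an assumption based on the file name. It would not by itself resolve a timeout like the one shown earlier.

```python
import os


def output_is_empty(path):
    """Return True if the file is missing or has zero bytes."""
    return not os.path.exists(path) or os.path.getsize(path) == 0


def synthesize_to_file(text, filename, key, region):
    """Synthesize text to a file and wait until the SDK reports a result."""
    # Imported here so output_is_empty() can be used without the SDK installed.
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    # Assumption: MP3 is wanted because the target file ends in .mp3.
    speech_config.set_speech_synthesis_output_format(
        speechsdk.SpeechSynthesisOutputFormat.Audio16Khz32KBitRateMonoMp3)
    audio_config = speechsdk.audio.AudioOutputConfig(filename=filename)
    synthesizer = speechsdk.SpeechSynthesizer(
        speech_config=speech_config, audio_config=audio_config)

    # .get() blocks until synthesis has completed or been canceled.
    result = synthesizer.speak_text_async(text).get()
    if result.reason == speechsdk.ResultReason.Canceled:
        details = result.cancellation_details
        print("Speech synthesis canceled: {}".format(details.reason))
        if details.error_details:
            print("Error details: {}".format(details.error_details))
    return result
```

After calling `synthesize_to_file(...)`, `output_is_empty(filename)` gives a quick check of whether anything was actually written.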

I hope someone can help me.

2 answers

Answer 1

I just tried the speech-to-text module with this code, and it works totally fine.

> def recognize_from_microphone():  
>     # This example requires environment variables named "SPEECH_KEY" and "SPEECH_REGION"  
>     speech_config = speechsdk.SpeechConfig(subscription=key, region=region)  
>     speech_config.speech_recognition_language="en-US"  
>   
>     audio_config = speechsdk.audio.AudioConfig(use_default_microphone=True)  
>     speech_recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config, audio_config=audio_config)  
>   
>     print("Speak into your microphone.")  
>     speech_recognition_result = speech_recognizer.recognize_once_async().get()  
>   
>     if speech_recognition_result.reason == speechsdk.ResultReason.RecognizedSpeech:  
>         print("Recognized: {}".format(speech_recognition_result.text))  
>     elif speech_recognition_result.reason == speechsdk.ResultReason.NoMatch:  
>         print("No speech could be recognized: {}".format(speech_recognition_result.no_match_details))  
>     elif speech_recognition_result.reason == speechsdk.ResultReason.Canceled:  
>         cancellation_details = speech_recognition_result.cancellation_details  
>         print("Speech Recognition canceled: {}".format(cancellation_details.reason))  
>         if cancellation_details.reason == speechsdk.CancellationReason.Error:  
>             print("Error details: {}".format(cancellation_details.error_details))  
>             print("Did you set the speech resource key and region values?")  
>   
> recognize_from_microphone()

Grmacjon-MSFT 19,301 Reputation points Moderator

2022-10-29T01:49:49.703+00:00

Hi @Oliver Weidner,

Thanks for bringing this to our attention. To clarify, are you still facing issues with the code sample from the Azure docs? If so, please submit a feedback request on the docs page so we can update the code: https://github.com/MicrosoftDocs/azure-docs/issues/new
Feel free to tag me as well.

Best,
Grace

Answer 2

This issue was resolved in this GitHub issue: https://github.com/MicrosoftDocs/azure-docs/issues/101028

Posting the solution here for more awareness:

"*There is a known issue introduced by some security updates October 12th that might cause connectivity issues using the TTS endpoints. There is an out of band update available for Windows 11 that fixes these issues. The update may be manually installed by following the instructions here:
Windows 11 21H2: https://support.microsoft.com/topic/october-17-2022-kb5020387-os-build-22000-1100-out-of-band-5e723873-2769-4e3d-8882-5cb044455a92
Windows 11 22H2: https://support.microsoft.com/topic/october-25-2022-kb5018496-os-build-22621-755-preview-64040bea-1e02-4b6d-bad1-b036200c2cb3*"
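
To see whether a given machine already has one of these updates, the installed OS build can be compared against the build numbers in the two KB titles above (22000.1100 and 22621.755). A rough sketch follows. `has_tts_fix` and `read_windows_build` are hypothetical helper names, and the comparison assumes that later revisions of the same build also include the fix, which the quoted note does not state explicitly.

```python
# Minimum OS builds taken from the two KB titles quoted above.
FIXED_BUILDS = {22000: 1100, 22621: 755}


def has_tts_fix(build, revision):
    """Return True or False for the two listed builds, None for any other build."""
    minimum = FIXED_BUILDS.get(build)
    if minimum is None:
        return None  # Not one of the builds named in the note; unknown.
    # Assumption: revisions at or above the listed one include the fix.
    return revision >= minimum


def read_windows_build():
    """Read (build, revision) from the registry. Works on Windows only."""
    import winreg

    path = r"SOFTWARE\Microsoft\Windows NT\CurrentVersion"
    with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, path) as key:
        build = int(winreg.QueryValueEx(key, "CurrentBuildNumber")[0])
        revision = int(winreg.QueryValueEx(key, "UBR")[0])
    return build, revision
```

On Windows, `has_tts_fix(*read_windows_build())` returning `False` suggests the out-of-band update is still missing; `None` means the build is not one covered by the note.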

Thanks,
Grace
