requests.exceptions.ConnectTimeout error in Azure Cognitive Services Text-to-speech REST API

Question

requests.exceptions.ConnectTimeout error in Azure Cognitive Services Text-to-speech REST API

eera5607 20

So, I have been trying process a folder with thousands of text files to convert each one to speech using Azure Cognitive Services Text-to-speech REST API. It works fine until it doesn't. I get errors after several successful conversions. I would like to have a stable connection so I can reliably leave the script running and not have to manually restart each time I get an error.

  TimeoutError: [WinError 10060] A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond
    
    urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='eastus.api.cognitive.microsoft.com', port=443): Max retries exceeded with url: /sts/v1.0/issueToken (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x000001F63AF32650>, 'Connection to eastus.api.cognitive.microsoft.com timed out. (connect timeout=None)'))
    
    raise ConnectTimeout(e, request=request)
    requests.exceptions.ConnectTimeout: HTTPSConnectionPool(host='eastus.api.cognitive.microsoft.com', port=443): Max retries exceeded with url: /sts/v1.0/issueToken (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x000001F63AF32650>, 'Connection to eastus.api.cognitive.microsoft.com timed out. (connect timeout=None)'))

This is my current script:

 import os
    import requests
    import time
    import chardet
    
    subscription_key = 'here my subscription key'
    region = 'eastus'
    voice_name = 'es-MX-DaliaNeural'
    output_format = 'audio-24khz-96kbitrate-mono-mp3'
    
    tts_url = f'https://{region}.tts.speech.microsoft.com/cognitiveservices/v1'
    headers = {
        'Authorization': '',
        'Content-Type': 'application/ssml+xml',
        'X-Microsoft-OutputFormat': output_format,
        'User-Agent': 'YOUR_RESOURCE_NAME'
    }
    
    # looping through all text files in the input folder
    input_folder = 'C:/path/to/text/files'
    output_folder = 'C:/path/to/folder'
    for filename in os.listdir(input_folder):
        # Check if the file is a text file
        if filename.endswith('.txt'):
            # Read the contents of the file and detect the encoding
            with open(os.path.join(input_folder, filename), 'rb') as f:
                rawdata = f.read()
                encoding = chardet.detect(rawdata)['encoding']
                text = rawdata.decode(encoding)
    
            # creating the SSML body for the TTS request
            ssml = f'<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="https://www.w3.org/2001/mstts" xml:lang="es-MX"><voice name="{voice_name}">{text}</voice></speak>'
    
            # getting the access token for the TTS service
            token_url = f'https://{region}.api.cognitive.microsoft.com/sts/v1.0/issueToken'
            token_headers = {'Ocp-Apim-Subscription-Key': subscription_key}
            response = requests.post(token_url, headers=token_headers)
            access_token = response.text
    
            headers['Authorization'] = f'Bearer {access_token}'
    
            response = requests. Post(tts_url, headers=headers, data=ssml.encode('utf-8'))
    
            if response.status_code == 200:
                # save the audio content to a file
                audio_filename = os.path.splitext(filename)[0] + '.mp3'
                with open(os.path.join(output_folder, audio_filename), 'wb') as f:
                    f.write(response.content)
                print(f'Successfully converted "{filename}" to speech')
            else:
                print(f'Error converting "{filename}" to speech: {response.content}')
    
            time. Sleep(30)

I leave 30 seconds between each conversion, but it isn't working. It converts 20-30 files and then the errors. Any help to get a more stable process? Thanks.

eera5607 20 Reputation points

2023-04-20T15:08:27.9166667+00:00

Thank you! I sent a support request asking for a quota increase. I'll first try with that. If it doesn't work I'll try the other solutions. Thank you!
eera5607 20 Reputation points

2023-04-20T15:15:37.16+00:00

Only one question: Am I using my Standard (S0) resource in that script? Do I have to specify it? Or is the subscription key enough?
romungi-MSFT 48,906 Reputation points Microsoft Employee Moderator

2023-04-24T04:54:26.9833333+00:00

@eera5607 The subscription info should be enough to determine the limits of your resource. Thanks!!

1 answer

Your answer

eera5607 20 Reputation points

2023-04-20T15:08:27.9166667+00:00

Thank you! I sent a support request asking for a quota increase. I'll first try with that. If it doesn't work I'll try the other solutions. Thank you!
eera5607 20 Reputation points

2023-04-20T15:15:37.16+00:00

Only one question: Am I using my Standard (S0) resource in that script? Do I have to specify it? Or is the subscription key enough?
romungi-MSFT 48,906 Reputation points Microsoft Employee Moderator

2023-04-24T04:54:26.9833333+00:00

@eera5607 The subscription info should be enough to determine the limits of your resource. Thanks!!

Answer 1

@eera5607 I think you might be reaching your TTS limits of 200 TPS for a resource before the errors are seen. You can increase this upto 1000 by raising a quota increase request through Azure support but the limits are still cumulative of all the calls made to the service i.e REST API, SDK, CLI or speech studio.

This increase should resolve any issues related to limits or quotas.

Since you are using the REST API to generate a token and then call the TTS URL for short audio it might be easier to migrate to the batch synthesis API as it will soon replace the Long Audio API too for larger files. Other options available, is to use the SDK instead of REST API which can help you collect logs to trace your requests if the failures continue to occur.

If you are still seeing issues with timeouts, then you might want to use the request ids or SDK logs to trace the reason for timeouts since as it could also be an issue with networks as seen in one of the threads I worked recently.

If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Share via

requests.exceptions.ConnectTimeout error in Azure Cognitive Services Text-to-speech REST API

1 answer

Your answer