Voice Model Selection Issue with WebSocket API in Speech SDK

Catherine Lee 0 Reputation points
2024-09-23T04:59:15.3833333+00:00

I'm trying to use the voice 'en-US-AnaNeural' in the following code:

speech_config = speechsdk.SpeechConfig(endpoint=f"wss://{server_config.speech_region}.tts.speech.microsoft.com/cognitiveservices/websocket/v2", subscription=server_config.speech_key) speech_config.speech_synthesis_voice_name = "en-US-AnaNeural"

However, the output is a male voice instead. I tried other voice models, and most work, but a few always result in the same male voice. It seems these specific voice models are not supported. Could this be because I'm using wss instead of API requests? There are no errors during runtime."

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,061 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.