Voice Model Selection Issue with WebSocket API in Speech SDK
I'm trying to use the voice 'en-US-AnaNeural' in the following code:
speech_config = speechsdk.SpeechConfig(endpoint=f"wss://{server_config.speech_region}.tts.speech.microsoft.com/cognitiveservices/websocket/v2", subscription=server_config.speech_key)
speech_config.speech_synthesis_voice_name = "en-US-AnaNeural"
However, the output is a male voice instead. I tried other voice models, and most of them work, but a few always produce the same male voice. It seems these specific voice models are not supported. Could this be because I'm using the wss endpoint instead of regular API requests? There are no errors at runtime.
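One way to narrow this down might be to ask the service which voices it actually offers for the region and check whether 'en-US-AnaNeural' is among them. The following is only a minimal sketch: `is_voice_available` and `fetch_voice_names` are hypothetical helper names, and it assumes the standard region-based `SpeechConfig` rather than the websocket v2 endpoint, since it is not confirmed here that voice listing behaves the same over that endpoint.

```python
def is_voice_available(available_short_names, wanted):
    """Case-insensitive check that `wanted` is one of the listed voice short names."""
    wanted = wanted.strip().lower()
    return any(name.lower() == wanted for name in available_short_names)


def fetch_voice_names(speech_key, region, locale="en-US"):
    """Return the short names of the voices the service reports for `locale`.

    Needs network access and a valid key. Hypothetical helper, not part of the SDK.
    """
    # Imported here so the pure helper above works without the SDK installed.
    import azure.cognitiveservices.speech as speechsdk

    # Assumption: standard region-based config, not the websocket v2 endpoint.
    config = speechsdk.SpeechConfig(subscription=speech_key, region=region)
    # audio_config=None: no audio output device is needed just to list voices.
    synthesizer = speechsdk.SpeechSynthesizer(speech_config=config, audio_config=None)

    result = synthesizer.get_voices_async(locale).get()
    if result.reason != speechsdk.ResultReason.VoicesListRetrieved:
        raise RuntimeError(f"Could not retrieve voices: {result.error_details}")
    return [voice.short_name for voice in result.voices]
```

With the values from the question, this could be called as `names = fetch_voice_names(server_config.speech_key, server_config.speech_region)` followed by `is_voice_available(names, "en-US-AnaNeural")`. If the voice is listed for the region but the websocket v2 endpoint still produces a different voice, that would point at the endpoint rather than at regional availability.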