The timbre of the voice of "zh-CN-XiaochenNeural" has changed, and the current timbre is completely different from what it was a few weeks ago. The results of the test using "Batch synthesis API (Preview) for text to speech" or "Speech Studio" were different from those of a few weeks ago.My SSML input would look something like this:“
<speak version='1.0' xml:lang='en-US'><voice xml:lang='zh-CN' name='zh-CN-XiaochenNeural'><prosody rate='+60.00%'><break time="750ms"/> 这天的最后,是辅导员带我去校医务室做了伤口处理。她叹着气坐在病床旁边,看我的眼神充满了怜悯。你们的事我都知道了,老师相信,这次也不是你的错......</prosody></voice></speak>”。
I still have a voice mp3 file generated a few weeks ago locally, and I can't generate the same voice as a few weeks ago. Are there any adjustments made to the "zh-CN-XiaochenNeural" voice pack?
Batch synthesis API (Preview) :
https://eastasia.customvoice.api.speech.microsoft.com/api/texttospeech/3.1-preview1/batchsynthesis/
2023/11/22
The new finding is that the results obtained by using the Microsoft Speech API are correct, but this is a free presentation service that is unstable.
Microsoft Speech API:
https://southeastasia.api.speech.microsoft.com/accfreetrial/texttospeech/acc/v3.0-beta1/vcg/speak
So why does zh-CN-XiaochenNeural in Azure Speech API(Batch synthesis API (Preview)) suddenly become a different voice compared to Microsoft Speech API?