Hey there, I am facing the same issue with all new neural and multilingual but also German focuses voices. The pronunciation for foreign words inside German sentences is really bad and makes it unusable for our usecases.
- Heute gibt es Lasagne.
- Heute gibt es Steak.
- Das Bouquet ist reintönig und frisch.
I remember that it was indeed better some time ago. Also we cant use ssml or language tags since the text to speech is ai generated and send through api calls.
I also checked all regions and countless voice models, all with the same result.
Any idea what to do here?