@Steinkrug, Michelle I have just received some feedback from product team that this could be an issue with language detection by the model and it is observed that in some cases the model identifies the word with different language id, in this case it is en-US
so the pronunciation sounds as English with a German voice. One workaround that has been suggested is to use the <lang> tag in the SSML for such a discrepancy to ensure the model explicitly pronounces the word in German. This is not an ideal scenario if you are using a real time scenario as input but if you are creating audio files for offline use you could use the workaround and generate an appropriate sounding file.
If an answer is helpful, please click on or upvote which might help other community members reading this thread.