Hello. Microsoft’s built-in voices such as Christopher, Jenny, Aria, Guy, and others do not support “alpha tags” like [chuckles], [announcing], or [friendly]. Those kinds of inline cues aren’t recognized. What they do support is SSML (Speech Synthesis Markup Language) and Microsoft’s own SSML extensions, which let you adjust tone, style, pauses, and other prosody features. For example, you can use <mstts:express-as style="friendly"> or insert an <audio> clip if you want a laugh or sound effect.
You can find the full list of supported tags and styles here:
- SSML reference for Speech Service: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-synthesis-markup
- Microsoft SSML extensions (express-as, role, emotions): https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-synthesis-markup-structure
So while you can’t drop in “[chuckles]” directly, you can achieve similar results with SSML styles or by embedding audio.