Hello d m,
Welcome to the Microsoft Q&A and thank you for posting your questions here.
For more clarity about your questions on batch synthesis text to speech avatar in Azure AI Speech:
Is it possible to get subtitles or a timed script with batch synthesis text to speech avatar?
Yes, you can get a timed script using Azure's Batch Synthesis API for text-to-speech avatars - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/text-to-speech-avatar/batch-synthesis-avatar-properties)
is it possible to get subtitles on the generated video Or even better, getting a script of the text with time stamps.
API does not directly generate subtitles, you can use the timing details from the synthesis summary to create subtitles - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/text-to-speech-avatar/batch-synthesis-avatar
Can i do this with this Azure's API? Or would I need to do something like using speech to text on my generated audio to get that "time-stamped script"?
You can achieve this with Azure's API by leveraging the timing details provided in the synthesis summary. Check the two links above.
I hope this is helpful! Do not hesitate to let me know if you have any other questions.
Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.