Hi Guillermo Proano,
Thanks for reaching out to Microsoft Q&A.
Currently, Azure Text-to-Speech transcription services do not support multiple speaker identification.
While Azure Speech-to-Text provides a Real-Time Diarization feature in Azure is capable of distinguishing speakers' voices through single-channel audio in streaming mode. This means it can provide live (real-time) speech-to-text transcription by identifying different speakers as they talk. This feature is particularly useful for live conversations or meetings where it can tag each speaker's contribution in real-time.
Kindly go through below documents for reference.
Real-time diarization quickstart - Speech service - Azure AI services | Microsoft Learn
Regarding the deprecation of the Speaker recognition feature in the Speech API, there is currently no direct replacement announced. However, you can prefer other Azure AI Speech capabilities as per your need.
Thank You.