Hi @RES,
Thank you for reaching out to Microsoft Q&A forum!
In the Azure Speech SDK for speaker diarization, the speakers are automatically identified by the SDK, and there is no strict limit on the number of speakers. The SDK will dynamically handle the number of speakers based on the conversation.
The number of participants can affect the accuracy of diarization. If there are too many speakers, the system might struggle to accurately distinguish between them, especially if voices are similar or overlap.
In real-time transcription using ConversationTranscriber
, there is no fixed limit on the number of speakers. The system automatically identifies and labels speakers dynamically. To set a maximum number of speakers, you would need to use batch transcription, which is not available in real-time transcription.
In Conversation Transcription Multichannel Diarization (Preview) of Real-time conversation transcription multichannel diarization (preview), there is no strict limit on the number of participants. You can manage participants by adding voice signatures for better speaker identification. You can also use the DifferentiateGuestSpeakers option for unknown speakers if voice signatures are not provided.
See: Real-time conversation transcription multichannel diarization (preview).
I hope this helps. Thank you.