@Shyamal Goel Based on my experience of using the speech service, I think you are using the right sample and settings as mentioned in SDK sample. There is another feature that uses multi channel audio for conversation transcription but it is in preview and recently it is announced that the feature is being retired. This uses speaker profiles to recognize speakers with their signature.
Conversation transcription multichannel diarization (preview) is retiring on March 28, 2025. For more information about migrating to other speech to text features, see Migrate away from conversation transcription multichannel diarization.
There is also a new feature in preview, fast transcription API but it uses audio files and it is currently available through REST API.
With respect to your scenario of overlap, I think you can raise an issue in the same speech SDK repo and check guidance from the SDK team on any properties that are available to set.
In the current sample, for the issue with Unknow speakers, the recommendation is to set a property that is not documented in reference. So, if there is any property for this scenario the SDK team would best advise.
You might see
Speaker ID=Unknown
in some of the early intermediate results when the speaker is not yet identified. Without intermediate diarization results (if you don't set thePropertyId.SpeechServiceResponse_DiarizeIntermediateResults
property to "true"), the speaker ID is always "Unknown".
If this answers your query, do click Accept Answer
and Yes
for was this answer helpful. And, if you have any further query do let us know.