How to enroll speaker voice for speaker recognition using python code?

Question

How to enroll speaker voice for speaker recognition using python code?

Shubhanshi Gangil 0

I want to recognize speaker by their names along with transcription from meeting recordings. How to do that on python? I checked the documentation, and it says- The Speech SDK for Python doesn't support speaker recognition. Please select another programming language or the Python reference and samples linked from the beginning of this article.

I couldn't find any other source.

Chandra Boorla 14,685 Reputation points Microsoft External Staff Moderator

2024-09-12T18:09:45.85+00:00

Hi @Shubhanshi Gangil

Just checking in to see if the below answer provided by @ Sina Salam helped.

If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

1 answer

Your answer

Chandra Boorla 14,685 Reputation points Microsoft External Staff Moderator

2024-09-12T18:09:45.85+00:00

Hi @Shubhanshi Gangil

Just checking in to see if the below answer provided by @ Sina Salam helped.

If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Answer 1

Hello Shubhanshi Gangil,

Welcome to the Microsoft Q&A and thank you for posting your questions here.

I understand that your project is in need of Speech SDK for Python to support speaker recognition.

Speaker voice identification is not directly supported, additional tools or services are necessary. You can switch your focus to speaker voice embeddings, which are vector representations of a speaker’s voice, can be extracted using models like VoxCeleb or DeepSBD and compared to classify speakers. Also, Python Libraries such as pyAudioAnalysis and librosa can assist with audio analysis tasks and may help implement speaker differentiation with the right algorithms. However, you need to understand that achieving real-time speaker recognition using just the Speech SDK for Python alone is not straightforward without integrating additional tools or services.

If you will be interested to review the links below about Azure AI, Azure Speach SDK, using Python, with the option above in step-by-step resources for Speaker Voice Identification:

I hope this is helpful! Do not hesitate to let me know if you have any other questions.

Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful

Chandra Boorla 14,685 Reputation points Microsoft External Staff Moderator

2024-09-13T18:21:01.5566667+00:00

Hi @Shubhanshi Gangil

Just checking in to see if the above answer provided by @ Sina Salam helped.

If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Share via

How to enroll speaker voice for speaker recognition using python code?

1 answer

Your answer