Audio Percept Audio for Non-Human Voice Identification.

Question

Hello.

Azure percept audio is mainly integrating LUIS and Speech Service of azure as an underline technology to give Voice assistance feature only after listening to a "Keyword" or "Wake-Up Word" , eg. Computer.... Turn on the AC.

Can a non human voice be detected by percept audio like a Baby crying noise, Machinery voice, Car engine voice?

Or Any way to Bring opensource audio AI models to azure percept Audio just how we can bring open source models to Percept Eyemodule (Vision).?

Answer

Hello @vinit sawant , I hope below information would be helpful for your initial query.

Speech Identification verify and identify speakers by their unique voice characteristics, by using voice biometry. You provide audio training data for a single speaker, which creates an enrollment profile based on the unique characteristics of the speaker's voice. You can then cross-check audio voice samples against this profile to verify that the speaker is the same person (speaker verification). You can also cross-check audio voice samples against a group of enrolled speaker profiles to see if it matches any profile in the group (speaker identification). Then you do recognition, it will recognize based on the file.
Characteristics somehow like nose, eye, lip,... in face recognition.

Audio Percept Audio for Non-Human Voice Identification.

1 answer