Audio Percept Audio for Non-Human Voice Identification.

vinit sawant 11 Reputation points
2022-09-15T13:10:56.973+00:00

Hello.

Azure percept audio is mainly integrating LUIS and Speech Service of azure as an underline technology to give Voice assistance feature only after listening to a "Keyword" or "Wake-Up Word" , eg. Computer.... Turn on the AC.

Can a non human voice be detected by percept audio like a Baby crying noise, Machinery voice, Car engine voice?

Or Any way to Bring opensource audio AI models to azure percept Audio just how we can bring open source models to Percept Eyemodule (Vision).?

Azure Percept
Azure Percept
A comprehensive Azure platform with added security for creating edge artificial intelligence solutions.
70 questions
Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,377 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. QuantumCache 20,026 Reputation points
    2022-09-16T00:41:17.347+00:00

    Hello @vinit sawant , I hope below information would be helpful for your initial query.

    Speech Identification verify and identify speakers by their unique voice characteristics, by using voice biometry. You provide audio training data for a single speaker, which creates an enrollment profile based on the unique characteristics of the speaker's voice. You can then cross-check audio voice samples against this profile to verify that the speaker is the same person (speaker verification). You can also cross-check audio voice samples against a group of enrolled speaker profiles to see if it matches any profile in the group (speaker identification). Then you do recognition, it will recognize based on the file.
    Characteristics somehow like nose, eye, lip,... in face recognition.