Text Independent - Verify Profile

Verify Profile
Verifies existing profiles against input audio.

Limitations:

  • Minimum audio input length is 1 second
  • Maximum audio input length is 120 seconds
  • Minimum effective speech length (excluding silence and other "non-speech" frames) is 4 seconds This limitation can be disabled by setting "ignoreMinLength" to true.
  • Minimum audio Signal-to-noise ratio (SNR) is 2dB
POST {endpoint}/speaker-recognition/verification/text-independent/profiles/{profileId}:verify?api-version=2021-09-05
POST {endpoint}/speaker-recognition/verification/text-independent/profiles/{profileId}:verify?api-version=2021-09-05&ignoreMinLength={ignoreMinLength}

URI Parameters

Name In Required Type Description
endpoint
path True

string

Supported Cognitive Services endpoints (protocol and hostname, for example: https://westus.api.cognitive.microsoft.com).

profileId
path True

string

uuid

Unique identifier for profile id (guid).

api-version
query True

string

Specifies the version of the operation to use for this request.

ignoreMinLength
query

boolean

If true, a voice print will be created immediately for this profile regardless of how much speech is supplied or stored. Default is false.

Request Header

Media Types: "audio/wav; codecs=audio/pcm"

Name Required Type Description
Ocp-Apim-Subscription-Key True

string

Request Body

Media Types: "audio/wav; codecs=audio/pcm"

Name Type Description
audioData

object

Binary audio file. Supported formats are audio/wav; codecs=audio/pcm. Supports audio up to 5MB.

Responses

Name Type Description
200 OK

RecognitionInfo

OK

Other Status Codes

SpeakerErrorInfo

Failure

Headers

x-ms-error-code: string

Security

Ocp-Apim-Subscription-Key

Type: apiKey
In: header

Examples

Successful Query

Sample Request

POST https://westus.api.cognitive.microsoft.com/speaker-recognition/verification/text-independent/profiles/49a36324-fc4b-4387-aa06-090cfbf0064f:verify?api-version=2021-09-05


"{binary file date}"

Sample Response

Content-Type: application/json
{
  "recognitionResult": "accept",
  "score": 0.63
}
Content-Type: application/json
x-ms-error-code: Error Code
{
  "error": {
    "code": "Error Code",
    "message": "Erro Messae"
  }
}

Definitions

Name Description
Error
RecognitionInfo

Speaker recognition result

ResultType

Does audio belong to targeted profile?

SpeakerErrorInfo

Speaker error message

Error

Name Type Description
code

string

message

string

RecognitionInfo

Speaker recognition result

Name Type Description
recognitionResult

ResultType

Does audio belong to targeted profile?

score

number

A float number indicating the similarity between input audio and targeted voice print. This number must be between 0 and 1. A higher number means higher similarity.

ResultType

Does audio belong to targeted profile?

Name Type Description
accept

string

reject

string

SpeakerErrorInfo

Speaker error message

Name Type Description
error

Error