ServerVad interface

Base model for VAD-based turn detection.

Extends

Properties

autoTruncate

Whether to automatically truncate the audio buffer when speech stops.

createResponse

Whether to automatically create a response when speech stops.

endOfUtteranceDetection

Configuration for end-of-utterance detection.

interruptResponse

Whether to allow the user's speech to interrupt the assistant's response.

prefixPaddingInMs

Amount of audio to include before speech is detected, in milliseconds.

silenceDurationInMs

Duration of silence required to end speech detection, in milliseconds.

threshold

Activation threshold for VAD detection. Range: 0.0 to 1.0.

type

Property Details

autoTruncate

Whether to automatically truncate the audio buffer when speech stops.

autoTruncate?: boolean

Property Value

boolean

createResponse

Whether to automatically create a response when speech stops.

createResponse?: boolean

Property Value

boolean

endOfUtteranceDetection

Configuration for end-of-utterance detection.

endOfUtteranceDetection?: EouDetectionUnion

Property Value

interruptResponse

Whether to allow the user's speech to interrupt the assistant's response.

interruptResponse?: boolean

Property Value

boolean

prefixPaddingInMs

Amount of audio to include before speech is detected, in milliseconds.

prefixPaddingInMs?: number

Property Value

number

silenceDurationInMs

Duration of silence required to end speech detection, in milliseconds.

silenceDurationInMs?: number

Property Value

number

threshold

Activation threshold for VAD detection. Range: 0.0 to 1.0.

threshold?: number

Property Value

number

type

type: "server_vad"

Property Value

"server_vad"