RequestAudioContentPart interface

An audio content part for a request. This is supported only by realtime models (e.g., gpt-realtime). For text-based models, use input_text instead.

Extends

Properties

audio

Base64-encoded audio bytes. They are parsed in the format specified by the session's input audio configuration, which defaults to PCM 16-bit, 24 kHz, mono if not specified.

transcript

Optional transcript of the audio content. It is not sent to the model, but it is attached to the message item for reference.

type

The content part type. Always "input_audio".

Property Details

audio

Base64-encoded audio bytes. They are parsed in the format specified by the session's input audio configuration, which defaults to PCM 16-bit, 24 kHz, mono if not specified.

audio: string

Property Value

string
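The following is a minimal sketch of how raw samples might be turned into the Base64 string this property expects. It assumes the session uses the default PCM 16-bit mono format and that samples are serialized little-endian (the usual convention for PCM16, but an assumption here). The helper name `encodePcm16ToBase64` is hypothetical and not part of the documented interface.

```typescript
// Hypothetical helper: converts 16-bit PCM samples to a Base64 string.
// Assumes little-endian byte order for each sample.
function encodePcm16ToBase64(samples: Int16Array): string {
  const bytes = new Uint8Array(samples.length * 2);
  const view = new DataView(bytes.buffer);

  // Write each sample as two bytes, least significant byte first.
  samples.forEach((sample, i) => view.setInt16(i * 2, sample, true));

  // btoa works on a "binary string" in which each character is one byte.
  let binary = "";
  for (let i = 0; i < bytes.length; i++) {
    binary += String.fromCharCode(bytes[i]);
  }
  return btoa(binary);
}
```

For long recordings, building the binary string one character at a time can be slow; a chunked approach or a runtime-specific encoder may be preferable.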

transcript

Optional transcript of the audio content. It is not sent to the model, but it is attached to the message item for reference.

transcript?: string

Property Value

string

type

The content part type. Always "input_audio".

type: "input_audio"

Property Value

"input_audio"
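Putting the three properties together, a value of this shape might be built as sketched below. The interface is redeclared locally so the snippet is self-contained; in real code it would come from the SDK package instead. The helper `makeAudioPart` is a hypothetical name used only for illustration.

```typescript
// Local mirror of the documented shape, declared here only so the
// example stands alone. Real code would import the SDK's own type.
interface RequestAudioContentPart {
  type: "input_audio";
  audio: string;        // Base64-encoded audio bytes
  transcript?: string;  // optional; kept for reference, not sent to the model
}

// Hypothetical helper: builds a content part, omitting `transcript`
// entirely when none is supplied.
function makeAudioPart(
  audioBase64: string,
  transcript?: string
): RequestAudioContentPart {
  const part: RequestAudioContentPart = {
    type: "input_audio",
    audio: audioBase64,
  };
  if (transcript !== undefined) {
    part.transcript = transcript;
  }
  return part;
}

const withTranscript = makeAudioPart("AAABAP//", "hello");
const withoutTranscript = makeAudioPart("AAABAP//");
```

Because `type` is the string literal "input_audio", TypeScript can use it as a discriminant when this part appears in a union with other content part types.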