RequestAudioContentPart interface
An audio content part for a request. This is supported only by realtime models (e.g., gpt-realtime). For text-based models, use input_text instead.
- Extends
Properties
| audio | Base64-encoded audio bytes, these will be parsed as the format specified in the session input audio type configuration. This defaults to PCM 16-bit 24kHz mono if not specified. |
| transcript | Optional transcript of the audio content. This is not sent to the model, but will be attached to the message item for reference. |
| type |
Property Details
audio
Base64-encoded audio bytes, these will be parsed as the format specified in the session input audio type configuration. This defaults to PCM 16-bit 24kHz mono if not specified.
audio: string
Property Value
string
transcript
Optional transcript of the audio content. This is not sent to the model, but will be attached to the message item for reference.
transcript?: string
Property Value
string
type
type: "input_audio"
Property Value
"input_audio"