How to make audio file input with gpt-4o

Question

How to make audio file input with gpt-4o

Vitalii Rentiuk 20

Hi! I was trying to use Azure AI Studio gpt4-o model with audio input. It looks like this:

messages = [{"role": "user",

"content": [{"type": "input_audio", "input_audio": {

"data": encoded_string,

"format": "wav"}}]

}]
This code was taken from OpenAI documentation https://platform.openai.com/docs/guides/audio . Although in AI studio it says that audio channel is available, I get the following response: "Error Error code: 400 - {'error': {'message': "Invalid value: 'input_audio'. Supported values are: 'text', 'image_url', 'audio_url', and 'refusal'.", 'type': 'invalid_request_error', 'param': 'messages[1].content[0].type', 'code': 'invalid_value'}}". I'm using "2024-10-01-preview" api version.

Is usage of audio files, not urls not available for now and will it be available in future?

Vitalii Rentiuk 20 Reputation points

2024-12-05T10:25:31.0566667+00:00

I can see. So for now audio files are not fully integrated in /chat/completions based on Audio file and to use them one should use standalone library you gave me a link to, is that right?
VasaviLankipalle-MSFT 18,706 Reputation points Moderator

2024-12-05T21:13:25.43+00:00

Hello @Vitalii Rentiuk , yes, I agree with that as per the documentation.

Answer accepted by question author

0 additional answers

Your answer

Vitalii Rentiuk 20 Reputation points

2024-12-05T10:25:31.0566667+00:00

I can see. So for now audio files are not fully integrated in /chat/completions based on Audio file and to use them one should use standalone library you gave me a link to, is that right?
VasaviLankipalle-MSFT 18,706 Reputation points Moderator

2024-12-05T21:13:25.43+00:00

Hello @Vitalii Rentiuk , yes, I agree with that as per the documentation.

Answer 1

VasaviLankipalle-MSFT 18,706 Moderator

Hello @Vitalii Rentiuk , Thanks for using Microsoft Q&A Platform.

If you are working with SDK then the supported audio formats are listed in the sample GitHub repository. Here are the supported formats with python SDK:

User's image To learn more about how to use the Audio file as input here is the reference sample code: https://github.com/azure-samples/aoai-realtime-audio-sdk

Is this something you are looking for?

Share via

How to make audio file input with gpt-4o

0 additional answers

Your answer