@Alexander Novikov I believe these two versions support the vision model currently, if you are using the first API version(2023-12-01-preview) and seen this error, could you try the second one (2024-02-15-preview) to check if it is consistent?
-
2023-12-01-preview
Swagger spec -
2024-02-15-preview
Swagger spec
Also, in the messages input you can try to use the image_url first and then the text. Something similar to below and check if the behavior improves?
"role": "user",
"content": [
{
{
"type": "image_url",
"image_url": {
"url": <your_url>
},
"type": "text",
"text": "Describe this picture:"
}
}
]
}