Got Error in chat completion API: At most 1 image(s) may be provided in one request

Question

Got Error in chat completion API: At most 1 image(s) may be provided in one request

Billy Zhou 20

Description:

Got Error when send two images to LLM Llama-4-Maverick-17B-128E-Instruct-FP8, is it because this model's API does not support multiple images in a single request?

it's working fine if only one image.

Error:

raise HttpResponseError(response=response)

azure.core.exceptions.HttpResponseError: (Bad Request) {"object":"error","message":"At most 1 image(s) may be provided in one request.","type":"BadRequestError","param":null,"code":400}

Code: Bad Request

Message: {"object":"error","message":"At most 1 image(s) may be provided in one request.","type":"BadRequestError","param":null,"code":400}

Sample Code:

from langchain_azure_ai.chat_models import AzureAIChatCompletionsModel
from typing import List
import base64

def get_llama_maverick_instruct_llm() -> AzureAIChatCompletionsModel:
    return AzureAIChatCompletionsModel(
        streaming=False,
        disable_streaming=True,
        model_kwargs={},
        endpoint="https://Llama4-Maverick-xxxx.com",
        model_name="Llama4-Maverick-xxxx",
        credential="xxxx",
        callbacks=None,
    )

def encode_image_to_base64(image_path: str) -> str:
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode("utf-8")

def summarize_images(image_paths: List[str]) -> str:
    model = get_llama_maverick_instruct_llm()
    image_contents = [{"type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{encode_image_to_base64(path)}"}} for path in image_paths]
    messages = [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Summarize the content of these images:"},
                *image_contents
            ],
        }
    ]
    response = model.invoke(messages)
    return response.content
image_paths = ["simple-test-09.png", "test-05.png"]
summary = summarize_images(image_paths)
print("Summary:", summary)

JAYA SHANKAR G S 3,960 Microsoft External Staff Moderator

Hello @Billy Zhou ,

I have tried with Azure AI inference sdk with gpt-4.1-mini model and it was working fine.
So, please check once with Azure AI inference sdk and let us know.

Sample code

data_url1 = f"data:image/{image_format};base64,{image_data1}"
data_url2 = f"data:image/{image_format};base64,{image_data2}"

from azure.ai.inference.models import TextContentItem, ImageContentItem, ImageUrl
response = client.complete(
    messages=[
        SystemMessage("You are a helpful assistant that can generate responses based on images."),
        UserMessage(content=[
            TextContentItem(text="Which conclusion can be extracted from the following chart?"),
            ImageContentItem(image_url=ImageUrl(url=data_url1)),
            ImageContentItem(image_url=ImageUrl(url=data_url2))
        ]),
        
    ],
    temperature=1,
    max_tokens=2048,
)

Thank you

Billy Zhou 20 Reputation points

2025-05-06T08:47:49.6933333+00:00

Yes, GPT-4.1-mini is working fine, but Llama4-Maverick (Llama-4-Maverick-17B-128E-Instruct-FP8) is not. Is this due to the model itself?
JAYA SHANKAR G S 3,960 Reputation points Microsoft External Staff Moderator

2025-05-06T10:52:06.7133333+00:00

Hello @Billy Zhou ,

As per this documentation it is mentioned

Some models support only one image for each turn in the chat conversation and only the last image is retained in context. If you add multiple images, it results in an error.

This might be the reason you are getting this error, but we cannot confirm this is the only reason. Please check it with Azure inference sdk once and let us know.

Thank you

Billy Zhou 20

Hi @JAYA SHANKAR G S , I tried the Azure inference SDK and encountered the same error.

Error:

azure.core.exceptions.HttpResponseError: (Bad Request) {"object":"error","message":"At most 1 image(s) may be provided in one request.","type":"BadRequestError","param":null,"code":400}

Code: Bad Request

Message: {"object":"error","message":"At most 1 image(s) may be provided in one request.","type":"BadRequestError","param":null,"code":400}

Sample code:

import base64
from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential
from azure.ai.inference.models import TextContentItem, ImageContentItem, ImageUrl, SystemMessage, UserMessage

def encode_image_to_base64(image_path: str) -> str:
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode("utf-8")

# For Serverless API or Managed Compute endpoints
client = ChatCompletionsClient(
    endpoint="https://Llama4-Maverick-xxxxxxx",
    credential=AzureKeyCredential("xxxxx"),

)

data_url1 = f"data:image/png;base64,{encode_image_to_base64("11.png")}"
data_url2 = f"data:image/png;base64,{encode_image_to_base64("12.png")}"

response = client.complete(
    messages=[
        SystemMessage("You are a helpful assistant that can generate responses based on images."),
        UserMessage(content=[
            TextContentItem(text="Which conclusion can be extracted from the following chart?"),
            ImageContentItem(image_url=ImageUrl(url=data_url1)),
            ImageContentItem(image_url=ImageUrl(url=data_url2))
        ]),

    ],
    temperature=1,
    max_tokens=2048,
)

1 answer

Your answer

JAYA SHANKAR G S 3,960 Reputation points Microsoft External Staff Moderator

2025-05-06T08:13:47.7833333+00:00

Hello @Billy Zhou ,

I have tried with Azure AI inference sdk with gpt-4.1-mini model and it was working fine.
So, please check once with Azure AI inference sdk and let us know.

Sample code

data_url1 = f"data:image/{image_format};base64,{image_data1}" data_url2 = f"data:image/{image_format};base64,{image_data2}" from azure.ai.inference.models import TextContentItem, ImageContentItem, ImageUrl response = client.complete( messages=[ SystemMessage("You are a helpful assistant that can generate responses based on images."), UserMessage(content=[ TextContentItem(text="Which conclusion can be extracted from the following chart?"), ImageContentItem(image_url=ImageUrl(url=data_url1)), ImageContentItem(image_url=ImageUrl(url=data_url2)) ]), ], temperature=1, max_tokens=2048, )

Thank you
Billy Zhou 20 Reputation points

2025-05-06T08:47:49.6933333+00:00

Yes, GPT-4.1-mini is working fine, but Llama4-Maverick (Llama-4-Maverick-17B-128E-Instruct-FP8) is not. Is this due to the model itself?
JAYA SHANKAR G S 3,960 Reputation points Microsoft External Staff Moderator

2025-05-06T10:52:06.7133333+00:00

Hello @Billy Zhou ,

As per this documentation it is mentioned

Some models support only one image for each turn in the chat conversation and only the last image is retained in context. If you add multiple images, it results in an error.

This might be the reason you are getting this error, but we cannot confirm this is the only reason. Please check it with Azure inference sdk once and let us know.

Thank you
Billy Zhou 20 Reputation points

2025-05-07T01:48:18.5733333+00:00

Hi @JAYA SHANKAR G S , I tried the Azure inference SDK and encountered the same error.

Error:

azure.core.exceptions.HttpResponseError: (Bad Request) {"object":"error","message":"At most 1 image(s) may be provided in one request.","type":"BadRequestError","param":null,"code":400}

Code: Bad Request

Message: {"object":"error","message":"At most 1 image(s) may be provided in one request.","type":"BadRequestError","param":null,"code":400}

Sample code:

import base64 from azure.ai.inference import ChatCompletionsClient from azure.core.credentials import AzureKeyCredential from azure.ai.inference.models import TextContentItem, ImageContentItem, ImageUrl, SystemMessage, UserMessage def encode_image_to_base64(image_path: str) -> str: with open(image_path, "rb") as image_file: return base64.b64encode(image_file.read()).decode("utf-8") # For Serverless API or Managed Compute endpoints client = ChatCompletionsClient( endpoint="https://Llama4-Maverick-xxxxxxx", credential=AzureKeyCredential("xxxxx"), ) data_url1 = f"data:image/png;base64,{encode_image_to_base64("11.png")}" data_url2 = f"data:image/png;base64,{encode_image_to_base64("12.png")}" response = client.complete( messages=[ SystemMessage("You are a helpful assistant that can generate responses based on images."), UserMessage(content=[ TextContentItem(text="Which conclusion can be extracted from the following chart?"), ImageContentItem(image_url=ImageUrl(url=data_url1)), ImageContentItem(image_url=ImageUrl(url=data_url2)) ]), ], temperature=1, max_tokens=2048, )

Answer 1

Hello @Billy Zhou ,

It is probably limitation from model side.

So, possible workaround would be to make request with 1 image per request and append the summaries into list like below.

def summarize_images_one_by_one(image_paths: List[str]) -> str:
    model = get_llama_maverick_instruct_llm()
    summaries = []
    for path in image_paths:
        message = [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": "Summarize the content of this image:"},
                    {"type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{encode_image_to_base64(path)}"}},
                ],
            }
        ]
        response = model.invoke(message)
        summaries.append(response.content)
    return "\n".join(summaries)

Or combine the images into 1 and make single request,

from PIL import Image
import io
import base64

image_paths = [r" 2025-03-28 145848.png", r"2025-03-28 145737.png"]
images = [Image.open(x) for x in image_paths]
widths, heights = zip(*(i.size for i in images))

total_width = sum(widths)
max_height = max(heights)

new_im = Image.new('RGB', (total_width, max_height))

x_offset = 0
for im in images:
  new_im.paste(im, (x_offset,0))
  x_offset += im.size[0]

new_im.save('test.jpg')


def summarize_images(image_path: List[str]) -> str:
    model = get_llama_maverick_instruct_llm()
    image_contents = f"data:image/jpeg;base64,{encode_image_to_base64(image_path)}"
    messages = [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Summarize the input image"},
                {"type": "image_url", "image_url": {"url": image_contents}}
            ],
        }
        
    ]
    response = model.invoke(messages
    )
    return response
summary = summarize_images("test.jpg")
print("Summary:", summary)

Sample output i got:

'The image shows two Microsoft Azure portal windows side by side, with the left window displaying a PowerShell terminal and the right window showing a diagnostic settings page for a storage account.\n\n**Left Window: PowerShell Terminal**\n\n* The terminal is open to a directory path `/home/jaya/storage`\n* A Terraform plan is being executed, with the output displayed in the terminal\n* The plan involves creating and modifying resources, including a storage account and diagnostic settings\n* The output indicates that 1 resource will be added, 1 changed, and 0 destroyed\n* The user is prompted to confirm the actions by typing \'yes\'\n* After confirming, the Terraform apply command is executed, and the resources are created/modified successfully\n\n**Right Window: Diagnostic Settings Page**\n\n* The page is titled "samyustorage | Diagnostic settings" and displays the diagnostic settings for a storage account named "samyustorage"\n* The storage account is part of a resource group named "samyutha-terraform"\n* The diagnostic settings are enabled for the storage account, as well as for a blob storage account within it\n* Other storage accounts (queue, table, file) have their diagnostic settings disabled\n\n**Overall**\n\n* The image suggests that the user is using Terraform to manage Azure resources, including storage accounts and diagnostic settings\n* The Terraform plan and apply commands are being used to create and modify these resources\n* The diagnostic settings page provides a visual representation of the diagnostic settings for the storage account and its sub-resources.'

but i would recommend going with single image in a request till model supports.

UPDATE

Now the issue is resolved, getting expected response from the model.

Code

from PIL import Image
import io
import base64,requests,json
AZURE_API_KEY = "api_key"
url = "https://depolylamba.services.ai.azure.com/models/chat/completions?api-version=2024-05-01-preview"
head = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {AZURE_API_KEY}"}

image_paths = [r"C:\Users\v-jgs\Pictures\Screenshots\Screenshot 2025-03-28 145848.png", r"C:\Users\v-jgs\Pictures\Screenshots\Screenshot 2025-03-28 145737.png"]
image_contents = [{"type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{encode_image_to_base64(path)}"}} for path in image_paths]
messages = [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Summarize the content of these images:"},
                *image_contents
            ],
        }
    ]

body = {
        "messages": messages,
        "model": "Llama-4-Maverick-17B-128E-Instruct-FP8"
    }

t = requests.post(url,headers=head,data=json.dumps(body))

Output

The images show a user configuring diagnostic settings for an Azure storage account using Terraform.

**Image 1: Terraform Configuration and Deployment**

*   The first image displays a PowerShell terminal where Terraform is being used to configure and deploy Azure resources.
*   The Terraform configuration includes a diagnostic setting for a storage account, enabling metrics for "AllMetrics."
*   The user has applied the Terraform configuration, and the output shows the creation and modification of Azure resources, including a storage account and diagnostic settings.
*   The successful deployment is indicated by the message "Apply complete!" with details on the resources added, changed, or destroyed.

**Image 2: Azure Portal - Diagnostic Settings Verification**

*   The second image shows the Azure portal, specifically the diagnostic settings page for the "samyustorage" storage account.
*   The page lists various resources within the storage account, including the storage account itself and its components like blob, queue, table, and file.
*   The diagnostic status for the storage account and blob is shown as "Enabled," indicating that diagnostic settings have been successfully applied.
*   The other components (queue, table, and file) have their diagnostic status listed as "Disabled."

**Summary**

In summary, the images illustrate the process of configuring diagnostic settings for an Azure storage account using Terraform and verifying the configuration through the Azure portal. The Terraform deployment enables diagnostic metrics for the storage account, and the Azure portal confirms that the diagnostic settings are enabled for the storage account and its blob component.

Please try from your end and let us know if you have any query.

Thank you

Billy Zhou 20 Reputation points

2025-05-07T13:35:14.2+00:00

Hi @JAYA SHANKAR G S ,

I asked the same question on the meta-llama GitHub, and they said that the Maverick model supports multiple images in a single request. Could you please help investigate this issue further, as our business requires processing multiple images in a single request.

https://github.com/meta-llama/llama-cookbook/issues/938
JAYA SHANKAR G S 3,960 Reputation points Microsoft External Staff Moderator

2025-05-08T08:38:18.2633333+00:00

Hello Billy Zhou,

I have tried with other models like GPT-4.1-mini, it is working fine so no issue with sdk.
Also, i have tried with rest api getting same error, checking with our internal team will update you once we get the details.

Thank you
JAYA SHANKAR G S 3,960 Reputation points Microsoft External Staff Moderator

2025-05-23T03:48:46.5833333+00:00

Hello Billy Zhou,

Now we are getting the results for more than 1 image input, the issue is fixed please try from your end and confirm here.

Thank you

Share via

Got Error in chat completion API: At most 1 image(s) may be provided in one request

1 answer

Your answer