
o3-mini Server Error: "Sorry, something went wrong."

Andreas Gnyp 10 Reputation points
2025-11-29T17:29:58.25+00:00

I get this message constantly:

{"code"=>"server_error", "message"=>"Sorry, something went wrong."}

I'm using o3-mini with the Assistants API and File Search, which is backed by a vector store (the vector store itself works fine).

Foundry Tools

Formerly known as Azure AI Services or Azure Cognitive Services, Foundry Tools is a unified collection of prebuilt AI capabilities within the Microsoft Foundry platform.


1 answer

  1. Adam Zachary 2,265 Reputation points
    2025-11-30T02:53:57.5633333+00:00

    I hit the same {"code"=>"server_error", "message"=>"Sorry, something went wrong."} with o3-mini when using Assistants plus file_search on Azure. In my case it was caused by a combination of three things.

    Remove reasoning_effort anywhere you use o3-mini with Assistants and file_search.
    Do not set reasoning_effort when you create the assistant, and do not pass it when you create the run. With o3-mini on Azure, reasoning_effort works for normal responses, but it currently breaks when you use Assistants plus file_search and a vector store. As soon as I removed reasoning_effort from both the assistant and the run, the constant server_error stopped.

    Update to a recent preview API version and a region where this combination is stable.
    Make sure you are calling a recent preview such as 2025-04-01-preview for the Assistants API, and that your o3-mini deployment is in a region where Assistants plus file_search is supported, for example East US. When I switched from an older preview version and moved the deployment to a supported region, o3-mini + file_search + vector_store started working consistently.

    Avoid forcing file_search unless you really need to.
    If you are setting tool_choice to force file_search, try running first with tool_choice set to auto. If you must force file_search, add simple retry logic around the run call, because you can still get occasional transient server_error responses.
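The retry logic mentioned above could be sketched roughly as follows. This is a generic helper, not part of any SDK: `call_with_retry` and `looks_like_server_error` are hypothetical names, and the string match on "server_error" is an assumption about how the failure surfaces in your code, so adapt the check to whatever exception or run status you actually see.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retry(
    fn: Callable[[], T],
    is_transient: Callable[[Exception], bool],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn(), retrying with exponential backoff while is_transient(exc) is true."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except Exception as exc:
            # Give up on the last attempt or on errors that are not transient.
            if attempt == max_attempts or not is_transient(exc):
                raise
            # Wait base_delay, then 2x, 4x, ... plus up to 25% random jitter.
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay * 0.25))
    raise AssertionError("unreachable")


def looks_like_server_error(exc: Exception) -> bool:
    """Hypothetical check: treat any error whose text mentions server_error as transient."""
    return "server_error" in str(exc)


# Usage sketch (assumes `client`, `thread`, and `assistant` already exist):
# run = call_with_retry(
#     lambda: client.beta.threads.runs.create(
#         thread_id=thread.id, assistant_id=assistant.id
#     ),
#     looks_like_server_error,
# )
```

If the error only shows up after the run has been created (for example as a failed run status), the callable passed in would need to wait for the run to finish and raise on failure, so that the helper sees an exception to retry on.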

    A minimal pattern that works for me now looks like this:

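    # Assumes `client` is an Azure OpenAI client from the openai Python SDK, and that
    # `vector_store` and `thread` were created earlier.
    assistant = client.beta.assistants.create(
        model="o3-mini",  # on Azure, this is the name of your o3-mini deployment
        instructions="Answer using the uploaded files.",
        tools=[{"type": "file_search"}],  # no reasoning_effort here
    )

    # Attach the vector store so file_search has something to search.
    assistant = client.beta.assistants.update(
        assistant_id=assistant.id,
        tool_resources={"file_search": {"vector_store_ids": [vector_store.id]}},
    )

    run = client.beta.threads.runs.create(
        thread_id=thread.id,
        assistant_id=assistant.id,
        # no reasoning_effort here either
        # optionally do not force tool_choice
    )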
    

    If you still get server_error on every run after removing reasoning_effort, updating the API version, and testing a supported region, capture the x-ms-request-id from the response and open an Azure support ticket. At that point it is likely a platform issue rather than your code.
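For the support ticket, a small helper like the one below can pull the request ID out of the response headers. It is only a sketch: `find_header` and `request_id_from_exception` are hypothetical names, and the assumption that the raised exception carries the HTTP response at `exc.response` (with a `headers` mapping) should be checked against the SDK version you use.

```python
from typing import Mapping, Optional


def find_header(headers: Mapping[str, str], name: str = "x-ms-request-id") -> Optional[str]:
    """Return the value of a header, matching the name case-insensitively."""
    wanted = name.lower()
    for key, value in headers.items():
        if key.lower() == wanted:
            return value
    return None


def request_id_from_exception(exc: BaseException) -> Optional[str]:
    """Look for x-ms-request-id on exc.response.headers, if the exception has one.

    Assumption: the SDK error exposes the HTTP response as `exc.response`.
    Returns None when that attribute or the header is missing.
    """
    response = getattr(exc, "response", None)
    headers = getattr(response, "headers", None)
    if headers is None:
        return None
    return find_header(headers)


# Usage sketch (assumes `client`, `thread`, and `assistant` already exist):
# try:
#     run = client.beta.threads.runs.create(
#         thread_id=thread.id, assistant_id=assistant.id
#     )
# except Exception as exc:
#     print("x-ms-request-id:", request_id_from_exception(exc))
#     raise
```

Logging the ID next to the timestamp and region of the failing call gives support the details they usually ask for first.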
