Formerly known as Azure AI Services or Azure Cognitive Services is a unified collection of prebuilt AI capabilities within the Microsoft Foundry platform
I hit the same {"code"=>"server_error", "message"=>"Sorry, something went wrong."} with o3-mini when using Assistants plus file_search on Azure. In my case it was caused by a combination of three things.
Remove reasoning_effort anywhere you use o3-mini with Assistants and file_search Do not set reasoning_effort when you create the assistant, and do not pass it when you create the run. With o3-mini on Azure, reasoning_effort works for normal responses, but it currently breaks when you use Assistants plus file_search and a vector store. As soon as I removed reasoning_effort from both the assistant and the run, the constant server_error stopped.
Update to a recent preview API version and a region where this combination is stable Make sure you are calling a recent preview like 2025-04-01-preview for the Assistants API and that your o3-mini deployment is in a region where Assistants plus file_search is supported, for example East US. When I switched from an older preview version and moved the deployment to a supported region, o3-mini + file_search + vector_store started working consistently.
Avoid forcing file_search unless you really need to If you are setting tool_choice to force file_search, try running first with tool_choice set to auto. If you must force file_search, add simple retry logic around the run call, because you can still get occasional transient server_error responses.
A minimal pattern that works for me now looks like this:
assistant = client.beta.assistants.create(
model="o3-mini",
instructions="Answer using the uploaded files.",
tools=[{"type": "file_search"}], # no reasoning_effort here
)
assistant = client.beta.assistants.update(
assistant_id=assistant.id,
tool_resources={"file_search": {"vector_store_ids": [vector_store.id]}},
)
run = client.beta.threads.runs.create(
thread_id=thread.id,
assistant_id=assistant.id,
# no reasoning_effort here either
# optionally do not force tool_choice
)
If after removing reasoning_effort, updating the API version, and testing a supported region you still get server_error on every run, capture the x-ms-request-id from the response and open an Azure support ticket, because at that point it is likely a platform issue rather than your code.