Assistant message missing after completed run in Azure OpenAI Assistants API

Gabriel Henrique Medeiros Santos 5 Reputation points
2025-11-12T13:45:23.34+00:00

When using the Assistants API on Azure OpenAI, the assistant message is sometimes missing even though the run status is completed.

  • Reproduction steps:
    1. Create a thread with POST /threads.
    2. Add a user message with POST /threads/{thread_id}/messages.
    3. Execute a run with POST /threads/{thread_id}/runs.
    4. Poll until status = completed.
    5. Retrieve messages via GET /threads/{thread_id}/messages.
    Expected result:
    After the run completes, a new message from the assistant (role=assistant, assistant_id not null) should be appended to the thread.

    Actual result:
    The latest message in the thread is still the user message. No assistant message is returned, even after several seconds of polling. GET /threads/{thread_id}/messages?run_id={run_id} also returns an empty list.

    Additional info:
    • API version: 2025-04-01-preview
    • Region: brazilsouth
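
    For anyone trying to reproduce this, the URL construction and polling loop from the steps above can be sketched roughly as follows. The helper names and the timeout are mine, not from any SDK; only the REST paths and the api-version value come from the report:

    ```python
    import time

    API_VERSION = "2025-04-01-preview"  # version reported above

    def assistants_url(endpoint: str, path: str, api_version: str = API_VERSION) -> str:
        """Build an Assistants REST URL such as <endpoint>/openai/threads?api-version=..."""
        return f"{endpoint.rstrip('/')}/openai/{path.lstrip('/')}?api-version={api_version}"

    # Terminal run states per step 4 of the repro
    TERMINAL = {"completed", "failed", "cancelled", "expired"}

    def poll_until_terminal(get_status, interval: float = 2.0, timeout: float = 120.0) -> str:
        """Step 4: poll a status-returning callable until the run reaches a terminal state."""
        deadline = time.monotonic() + timeout
        while time.monotonic() < deadline:
            status = get_status()
            if status in TERMINAL:
                return status
            time.sleep(interval)
        raise TimeoutError("run did not reach a terminal state in time")
    ```

    `get_status` would wrap a GET on `assistants_url(endpoint, f"threads/{thread_id}/runs/{run_id}")` with the `api-key` header; the point is that even this minimal loop sees `completed` while step 5 still returns no assistant message.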
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

2 answers

  1. Q&A Assist
    2025-11-12T13:45:54.63+00:00
    AI-generated content: this answer was created with AI from Q&A Assist.

    It appears that the assistant's message is not being appended to the thread after a run completes in the Azure OpenAI Assistants API. Here are a few potential causes and troubleshooting steps:

    1. Check for Errors in the API Response: Ensure that there are no errors being returned in the API response when you execute the run. Sometimes, the run may complete without generating an assistant message due to internal errors.
    2. Review Thread and Message Limits: Verify if there are any limits on the number of messages in a thread or if the assistant's message is being suppressed due to some configuration settings.
    3. Polling Timing: Although you mentioned polling for several seconds, ensure that you are giving enough time for the assistant to process and respond, especially if the model is under heavy load.
    4. API Version and Region: Since you are using a preview version of the API (2025-04-01-preview), there may be bugs or limitations that are not present in the stable release. Consider checking the Azure OpenAI documentation or forums for any known issues related to this version or region (brazilsouth).
    5. Diagnostic Settings: If you have diagnostic settings enabled, check the activity logs for any anomalies or issues that may provide insight into why the assistant message is missing.

    If the issue persists, consider reaching out to Azure support or checking community forums for additional assistance.
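
    For point 1, a quick way to surface run-level errors is to inspect `last_error` and `incomplete_details` on the retrieved run. The helper below is illustrative (not part of the openai SDK); the attribute names it reads do exist on run objects returned by `client.beta.threads.runs.retrieve(...)`:

    ```python
    def diagnose_run(run) -> str:
        """Explain why a run may have finished without an assistant message."""
        last_error = getattr(run, "last_error", None)
        if last_error:
            # e.g. server_error, rate_limit_exceeded
            return f"last_error: {last_error.code}: {last_error.message}"
        incomplete = getattr(run, "incomplete_details", None)
        if incomplete:
            # e.g. max_completion_tokens reached
            return f"incomplete: {incomplete.reason}"
        if run.status != "completed":
            return f"run ended in status: {run.status}"
        return "completed with no reported error; inspect the run steps next"
    ```

    With the SDK you would call it as `print(diagnose_run(client.beta.threads.runs.retrieve(thread_id=thread_id, run_id=run_id)))`.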



  2. Manas Mohanty 13,340 Reputation points Moderator
    2025-11-18T13:39:27.9266667+00:00

    Hi Gabriel Henrique Medeiros Santos

    I referred to the reference code available here to replicate the issue.

    I used a gpt-4o-mini deployment in Brazil South as part of a trial and tested both api_version="2024-08-01-preview" and api_version="2025-04-01-preview".

    In both cases I was able to view the sine-wave image at the end.

    Suggestions based on these trials:

    Please make sure that you are using one of the supported models, as listed here.

    Please test with api_version="2024-08-01-preview" to see whether the issue is specific to the API version.

    Please share the deployment name and image link in a private message if the issue persists.

    Please test with another available region.

    
    #!/usr/bin/env python3
    # -*- coding: utf-8 -*-
    
    """
    Azure OpenAI Assistants API (Python SDK) – Minimal, clean, end-to-end example.
    
    Flow:
      1) Create assistant (with Code Interpreter tool)
      2) Create thread
      3) Add a user message
      4) Create & poll a run until terminal state
      5) List thread messages, print assistant text, and save any image output
    
    Requirements:
      - pip install openai
      - Environment variables:
          AZURE_OPENAI_API_KEY   -> your Azure OpenAI key
          AZURE_OPENAI_ENDPOINT  -> e.g. https://<resource>.openai.azure.com
      - Replace `MODEL_DEPLOYMENT_NAME` with your deployed model name (e.g., "gpt-4o-mini")
    
    Notes:
      - Messages list is returned newest-first; assistant reply is typically at index 0.
      - Code Interpreter outputs can include files (images). This sample saves the first image.
    """
    
    import os
    import json
    import time
    from typing import Optional
    
    from openai import AzureOpenAI
    from PIL import Image  # pillow is optional, used here to open image after download
    
    # ---------- Configuration ----------
    API_KEY = os.getenv("AZURE_OPENAI_API_KEY")
    AZURE_ENDPOINT = os.getenv("AZURE_OPENAI_ENDPOINT")
    API_VERSION = "2025-04-01-preview"   # use the Preview that matches your resource
    MODEL_DEPLOYMENT_NAME = "gpt-4o-mini"  # <-- change to your deployment name
    
    if not API_KEY or not AZURE_ENDPOINT:
        raise RuntimeError("Please set AZURE_OPENAI_API_KEY and AZURE_OPENAI_ENDPOINT env vars.")
    
    client = AzureOpenAI(
        api_key=API_KEY,
        api_version=API_VERSION,
        azure_endpoint=AZURE_ENDPOINT,
    )
    
    # ---------- 1) Create an Assistant ----------
    def create_assistant() -> str:
        print("[*] Creating assistant with Code Interpreter…")
        assistant = client.beta.assistants.create(
            name="Data Visualization",
            instructions=(
                "You are a helpful AI assistant who makes interesting visualizations based on data. "
                "You have access to a sandboxed environment for writing and testing code. "
                "When you are asked to create a visualization you should follow these steps:\n"
                "1. Write the code.\n"
                "2. Anytime you write new code display a preview of the code to show your work.\n"
                "3. Run the code to confirm that it runs.\n"
                "4. If the code is successful display the visualization.\n"
                "5. If the code is unsuccessful display the error message and try to revise the code and rerun."
            ),
            tools=[{"type": "code_interpreter"}],
            model=MODEL_DEPLOYMENT_NAME,  # must be your deployment name, not a plain model family
        )
        print(f"[+] Assistant created: {assistant.id}")
        return assistant.id
    
    # ---------- 2) Create a Thread ----------
    def create_thread() -> str:
        print("[*] Creating thread…")
        thread = client.beta.threads.create()
        print(f"[+] Thread created: {thread.id}")
        return thread.id
    
    # ---------- 3) Add a user message ----------
    def add_user_message(thread_id: str, text: str) -> None:
        print(f"[*] Adding user message: {text}")
        client.beta.threads.messages.create(
            thread_id=thread_id,
            role="user",
            content=[{"type": "text", "text": text}],
        )
    
    # ---------- 4) Create & poll a run ----------
    TERMINAL_STATES = {"completed", "failed", "cancelled", "expired"}
    
    def create_run(thread_id: str, assistant_id: str) -> str:
        print("[*] Creating run…")
        run = client.beta.threads.runs.create(
            thread_id=thread_id,
            assistant_id=assistant_id,
            # Optional: set limits if needed (ensure non-zero to avoid empty output)
            # max_prompt_tokens=4000,
            # max_completion_tokens=1024,
        )
        print(f"[+] Run created: {run.id}")
        return run.id
    
    def poll_run(thread_id: str, run_id: str, poll_interval: float = 2.0) -> str:
        print("[*] Polling run status… (Ctrl+C to stop)")
        start = time.time()
        while True:
            run = client.beta.threads.runs.retrieve(thread_id=thread_id, run_id=run_id)
            status = run.status
            elapsed = time.time() - start
            print(f"  - Status: {status} | Elapsed: {int(elapsed // 60)}m {int(elapsed % 60)}s")
    
            # Handle requires_action (tools/functions). If you see this, you must submit tool outputs.
            if status == "requires_action":
                print("[!] Run requires_action. This sample does not implement tool output submission.")
                print("    Provide tool outputs via `submit_tool_outputs` and continue polling.")
                # break or continue as needed; here we continue polling to show status.
                time.sleep(poll_interval)
                continue
    
            if status in TERMINAL_STATES:
                print(f"[+] Run reached terminal state: {status}")
                return status
    
            time.sleep(poll_interval)
    
    # ---------- 5) Retrieve messages & save any image from Code Interpreter ----------
    def list_messages(thread_id: str):
        print("[*] Listing messages (newest-first)…")
        messages = client.beta.threads.messages.list(thread_id=thread_id)
        return messages
    
    def find_first_assistant_text(messages) -> Optional[str]:
        """
        Return the first assistant text block (if present) from newest-first messages.
        """
        try:
            for msg in messages.data:
                if msg.role == "assistant":
                    # Each message may have multiple content items; pick the first text item.
                    # Content items are SDK model objects, so use attribute access
                    # rather than dict-style .get()/[] indexing.
                    for item in msg.content:
                        if item.type == "text":
                            return item.text.value
        except Exception as e:
            print(f"[!] Failed to parse assistant text: {e}")
        return None
    
    def save_first_code_interpreter_image(messages, out_path="sinewave.png") -> Optional[str]:
        """
        Scan newest-first messages for the first code-interpreter image and save it.
        Returns the local file path if saved; otherwise None.
        """
        try:
            for msg in messages.data:
                if msg.role != "assistant":
                    continue
                for item in msg.content:
                    # Code Interpreter image content parts look like:
                    # { "type": "image_file", "image_file": { "file_id": "<id>" } }
                    # but arrive as SDK model objects, so use attribute access.
                    if item.type == "image_file":
                        image_file_id = item.image_file.file_id
                        print(f"[*] Found image file id: {image_file_id}")
    
                        content = client.files.content(image_file_id)
                        content.write_to_file(out_path)
    
                        print(f"[+] Image saved to: {out_path}")
                        return out_path
        except Exception as e:
            print(f"[!] No image saved (parsing or download error): {e}")
        return None
    
    def show_image(path: str) -> None:
        try:
            img = Image.open(path)
            img.show()
            print(f"[+] Opened image viewer for: {path}")
        except Exception as e:
            print(f"[!] Could not open image viewer: {e}")
    
    # ---------- Main ----------
    def main():
        assistant_id = create_assistant()
        thread_id = create_thread()
    
        # User request to create a visualization
        add_user_message(thread_id, "Create a visualization of a sinewave")
    
        run_id = create_run(thread_id, assistant_id)
        status = poll_run(thread_id, run_id, poll_interval=2.0)
    
        # Fetch and display messages
        messages = list_messages(thread_id)
    
        # Print assistant text if present
        text = find_first_assistant_text(messages)
        if text:
            print("\n===== Assistant Reply (text) =====\n")
            print(text)
            print("\n==================================\n")
        else:
            print("[!] No assistant text found in thread messages.")
    
        # Try saving first image produced by Code Interpreter
        image_path = save_first_code_interpreter_image(messages, out_path="sinewave.png")
        if image_path:
            # Optional: open the image (platform-dependent viewer)
            show_image(image_path)
    
        # For debugging: print raw messages JSON (indent for readability)
        try:
            print("\n===== Raw Messages (JSON) =====\n")
            print(messages.model_dump_json(indent=2))
        except Exception as e:
            print(f"[!] Could not dump messages JSON: {e}")
    
    if __name__ == "__main__":
        main()

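    Since the sample above explicitly skips tool-output submission when a run hits `requires_action`, here is a hedged sketch of that step. `build_tool_outputs` and `handle_call` are my own names; `submit_tool_outputs` and the `required_action.submit_tool_outputs.tool_calls` shape come from the SDK:

    ```python
    import json

    def build_tool_outputs(tool_calls, handle_call):
        """Map each pending tool call to the {"tool_call_id", "output"} shape
        expected by runs.submit_tool_outputs. handle_call(name, args) is a
        user-supplied function that executes the tool and returns a string."""
        outputs = []
        for call in tool_calls:
            args = json.loads(call.function.arguments or "{}")
            result = handle_call(call.function.name, args)
            outputs.append({"tool_call_id": call.id, "output": str(result)})
        return outputs

    # Inside the polling loop, on status == "requires_action", one would do:
    #   calls = run.required_action.submit_tool_outputs.tool_calls
    #   client.beta.threads.runs.submit_tool_outputs(
    #       thread_id=thread_id,
    #       run_id=run_id,
    #       tool_outputs=build_tool_outputs(calls, my_handler),
    #   )
    ```

    Until the pending tool outputs are submitted, the run will not produce an assistant message, which is one legitimate way to see a thread with no assistant reply.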
    Looking forward to hearing from you.

    Thank you.

