Introduction/Summary of Problem:
I am consistently encountering 408 Request Timeout errors when attempting to call a deployed chat model (gpt-oss-120b) via an Azure AI Studio (Foundry) inference endpoint. This is happening within the Playground experience in the Azure AI Studio portal.
Detailed Scenario:
I have an Azure AI Studio setup in the East US 2 region where I've deployed a chat model, gpt-oss-120b, through the Model Catalog. When interacting with this deployed model using the 'Playground' in the AI Studio portal, I receive a 408 Request Timeout error. This issue is consistent and reproducible.
Observed Outcome / Error Details (Crucial for Microsoft to diagnose):
The browser console shows the following POST request and subsequent 408 Request Timeout. The request is made to an endpoint formatted as https://[my-ai-hub-name].services.ai.azure.com/models/chat/completions?api-version=2024-05-01-preview.
POST https://[REDACTED_HUB_NAME].services.ai.azure.com/models/chat/completions?api-version=2024-05-01-preview 408 (Request Timeout)
XMLHttpRequest.send @ manualChunk_common_core-62bed741.js:1
... [truncated stack trace, include if it fits concisely] ...
Response message from Copilot/UI: Timeout: The operation was timeout. | Apim-request-id: 2cd4bad2-87bf-421b-82f9-e5d9b5dd8c20
Request Body (from browser console):
JSON
{
Relevant Request Headers (from browser console - REDACT ALL SENSITIVE INFO):
:authority: [REDACTED_HUB_DOMAIN]
:method: POST
:path: /models/chat/completions?api-version=2024-05-01-preview
:scheme: https
api-key: [REDACTED_API_KEY]
content-type: application/json
origin: https://ai.azure.com
referer: https://ai.azure.com/
request-id: [REDACTED_REQUEST_ID]
traceparent: [REDACTED_TRACEPARENT]
user-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/139.0.0.0 Safari/537.36
x-ms-client-request-id: [REDACTED_CLIENT_REQUEST_ID]
x-ms-useragent: AzureOpenAI.Studio/ai.azure.com
APIM from playground when sending 1 message "test message":
Chat history
test message
Copilot said:
Azure AI Foundry
AI-generated content may be incorrect
Error
Timeout: The operation was timeout. | Apim-request-id: 1ac28c24-fbef-4be2-bcba-48ca94f7275b
New chat session startedThe assistant setup has been updated. Previous messages won't be used as context for new queries.New chat session started
test message again
Copilot said:
Azure AI Foundry
AI-generated content may be incorrect
Error
Timeout: The operation was timeout. | Apim-request-id: e34ecede-a08a-460b-a986-41a905696df1__
Environment Details:__
Location: East US 2
Deployed Model Name: gpt-oss-120b
API Version: 2024-05-01-preview
Approximate Timestamp of a recent failure (include time zone): [Insert specific date and time, e.g., "August 19, 2025, 1:40 PM EDT"]
Troubleshooting Efforts:
Confirmed that the endpoint URL and API key align with what the Azure AI Studio playground itself is using for this deployment.
Attempted to use az ml CLI commands to list related workspaces or endpoints, but these commands did not return expected results for deployments made via the AI Studio project, suggesting a different management approach for 'Foundry Models'.
I've checked for associated Application Insights resources for the AI Studio project's deployments, but I haven't been able to confirm one is connected for this telemetry.
I am aware of recent reports regarding intermittent 408 timeouts for Azure AI Studio/Foundry services in the East US region and am seeking to confirm if this is part of that broader issue or specific to my deployment.
XMLHttpRequest.send @ manualChunk_common_core-62bed741.js:1
... [truncated stack trace, include if it fits concisely] ...
Response message from Copilot/UI: Timeout: The operation was timeout. | Apim-request-id: 2cd4bad2-87bf-421b-82f9-e5d9b5dd8c20
__Request Body (from browser console):__
JSON
```json
{
Relevant Request Headers (from browser console - REDACT ALL SENSITIVE INFO):
:authority: [REDACTED_HUB_DOMAIN]
:method: POST
:path: /models/chat/completions?api-version=2024-05-01-preview
:scheme: https
api-key: [REDACTED_API_KEY]
content-type: application/json
origin: https://ai.azure.com
referer: https://ai.azure.com/
request-id: [REDACTED_REQUEST_ID]
traceparent: [REDACTED_TRACEPARENT]
user-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/139.0.0.0 Safari/537.36
x-ms-client-request-id: [REDACTED_CLIENT_REQUEST_ID]
x-ms-useragent: AzureOpenAI.Studio/ai.azure.com
Environment Details:
__Location:__ `East US 2`
__Deployed Model Name:__ `gpt-oss-120b`
__API Version:__ `2024-05-01-preview`
__Approximate Timestamp of a recent failure (include time zone):__ [Insert specific date and time, e.g., "August 19, 2025, 1:40 PM EDT"]
Troubleshooting Efforts:
Confirmed that the endpoint URL and API key align with what the Azure AI Studio playground itself is using for this deployment.
Attempted to use az ml CLI commands to list related workspaces or endpoints, but these commands did not return expected results for deployments made via the AI Studio project, suggesting a different management approach for 'Foundry Models'.
I've checked for associated Application Insights resources for the AI Studio project's deployments, but I haven't been able to confirm one is connected for this telemetry.
I am aware of recent reports regarding intermittent 408 timeouts for Azure AI Studio/Foundry services in the East US region and am seeking to confirm if this is part of that broader issue or specific to my deployment.
Checked Azure Service Health for East US 2 for any related incidents (none currently reported).
- Verified the model deployment status in the AI Studio portal shows 'Healthy'.
I can provide more information privately. I am surprised I had to pay $29/month just to post to a public forum to verify why Azure's own service is not working.... please advise on resolution.
I observed this in my app first, then removed my app from the situation and went straight to playground and same behavior. I have used timeout params as long as 3 minutes and still same issue.