Share via

Azure OpenAI Conversations API endpoint returns 404 on all tested configurations

administrator 40 Reputation points
2026-06-10T15:20:14.8933333+00:00

Hi,

The Conversations API (POST /openai/v1/conversations) per documentation at https://learn.microsoft.com/azure/foundry/openai/latest#conversations returns 404 across multiple Azure OpenAI resource configurations we've tested:

  • Resource 1: OpenAI, Sweden Central
    • Resource 2: AIServices, East US 2

Tested both .openai.azure.com and .services.ai.azure.com hostnames, and api-version values v1, preview, and various date-formatted versions. All return HTTP 404 "Resource not found".

The companion Responses API (POST /openai/v1/responses) works correctly on the same resources/endpoints — returning HTTP 200 with valid responses. This confirms our auth, host, and v1 API surface are functioning. Only the /v1/conversations endpoint specifically is not reachable.

Could anyone confirm:

  1. Is the Conversations API currently in private preview requiring allowlist enrollment?
  2. If so, can subscription <sub-id> be added to that allowlist?
  3. If the API is meant to be GA / public preview, is there a feature registration or other enablement step we're missing?
Azure OpenAI in Foundry Models

Answer accepted by question author

Divyesh Govaerdhanan 11,065 Reputation points MVP Volunteer Moderator
2026-06-10T22:19:44.6166667+00:00

Hi administrator,

Welcome to Microsoft Q&A,

The POST /openai/v1/conversations endpoint exists in the reference spec, but it is not yet part of the deployed v1 GA surface. No allowlist or feature registration is required. It simply hasn't rolled out yet. This is exactly why the Responses API returns 200 for the same resource, whereas Conversations returns 404.

In the meantime, Use previous_response_id in the Responses API to maintain multi-turn conversation state server-side, without manually managing history arrays:

POST {endpoint}/openai/v1/responses
{
  "model": "gpt-4.1",
  "input": "your follow-up message",
  "previous_response_id": "<id from previous response>"
}


The service chains context automatically, which covers the same use case as a persistent conversation_id.

To track when Conversations support goes live, monitor the API version changelog.

Please Upvote and accept the answer if it helps!!

Was this answer helpful?

1 person found this answer helpful.
0 comments No comments

1 additional answer

Sort by: Most helpful
  1. Karnam Venkata Rajeswari 3,665 Reputation points Microsoft External Staff Moderator
    2026-06-11T10:36:00.3133333+00:00

    Hello @administrator ,

    Welcome to Microsoft Q&A .Thank you for reaching out to us.

    The Conversations API endpoint is documented but not yet fully deployed across all environments

    The POST /openai/v1/conversations endpoint is available in the official REST API reference, however current platform behavior indicates that it is not yet exposed as part of the actively deployed v1 Generally Available surface across all resources and regions.

    At the same time, the API lifecycle guidance confirms that the v1 API currently supports only a subset of capabilities, with additional features continuing to roll out progressively

    This explains the observed pattern:

    • POST /openai/v1/responses is suppoetedand available and returns HTTP 200
    • POST /openai/v1/conversations is not yet exposed which returns HTTP 404

    As asked,

    1. Private preview / allowlist requirement
      1. There is no official documentation indicating that the Conversations API requires allowlist enrollment.
      2. No tenant-level or subscription-level access approval is referenced.
    2. Allowlist enablement possibility Since no allowlist mechanism is defined, no formal onboarding or enablement request process exists for this endpoint.
    3. Feature registration or additional enablement
      • There is no documented feature registration or manual enablement step associated with /openai/v1/conversations.
      • The behavior is instead aligned with feature rollout timing across environments.

    For multi-turn conversational scenarios, the Responses API can be used effectively to achieve the same outcome.

    POST {endpoint}/openai/v1/responses
    {
      "model": "<deployment-name>",
      "input": "follow-up message",
      "previous_response_id": "<id from previous response>"
    }
    

    This helps to -

    1. Maintain multi-turn conversation context automatically
    2. Eliminate the need to manage full message history manually
    3. Provides functional equivalence to a persistent conversation_id workflow
    4. Fully supports across current GA environments

    To ensure consistency and eliminate edge conditions, the following checks can be helpful

    1. Confirm usage of the correct v1 route pattern:
      • /openai/v1/...
    2. Ensuring request structure follows v1 design:
      • Deployment name passed as "model" in request body
    3. Validating REST execution independently:
      • Include proper headers (API key or Bearer token)
      • Capture apim-request-id for traceability
    4. Collecting request diagnostics:
      • Full request URL
      • UTC timestamp
      • Response headers and payload
    5. Verifying behavior across:
      • Multiple regions
      • Multiple resource types (OpenAI vs AI Services)

    Since this behavior is tied to feature rollout, availability is expected to evolve over time.

    The following references might be helpful , please check them out

    Thank you

     

    Please "Accept" the answer with an "Upvote" by clicking on "Accept Answer" and thumbs-up button if the response was helpful. This will be benefitting other community members who face the same issue.

    Was this answer helpful?

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.