An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
Hi GenixPRO,
Thanks for sharing the details — I understand how this can be confusing, especially when transitioning from the Assistants API to Foundry prompt agents.
In Azure AI Foundry Agent Service, creating a conversation alone will not generate a reply. A conversation simply holds context and history — it does not execute the model.
To actually get a response from your agent, you must explicitly create a response. This is the step where the model runs and produces output.
The expected flow is:
- Create agent
- Create (or use) conversation
- Create response (this triggers execution)
- Retrieve the response
There’s no need to manually call a separate “responses endpoint” from your frontend code.
When using Foundry:
- All operations (agents, conversations, responses) are invoked through the project endpoint
- The SDK or REST API internally handles the correct routes
- The key requirement is calling
responses.create()
Based on your scenario, the issue is most likely that the response creation step is missing.
You can update your implementation as follows:
// Create a conversation
const conversation = await client.conversations.create();
// Trigger the agent by creating a response
const response = await client.responses.create({
agent_reference: {
name: "your-agent-name",
version: "your-agent-version"
},
conversation_id: conversation.id,
input: "Hello"
});
// Retrieve the generated output
const result = await client.responses.get(response.id);
console.log(result.output_text);
For production scenarios, it is recommended to route requests through a backend service rather than calling Foundry directly from a mobile frontend. This helps with:
- Secure credential management
- Better control and observability
- Alignment with recommended authentication methods (such as Microsoft Entra ID)
- A conversation only stores context — it does not run the model
- You must call
responses.create()to generate output - There is no need to manually call a separate “responses endpoint”
- Using a backend layer is the preferred integration pattern
Build with agents, conversations, and responses
Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help.