Formerly known as Azure AI Services or Azure Cognitive Services is a unified collection of prebuilt AI capabilities within the Microsoft Foundry platform
Hello @Benoit Lalonde ,
Welcome to Microsoft Q&A .Thank you for reaching out to us.
The behavior observed aligns with how multi-agent workflows function when executed within a shared conversation context. In sequential workflows, each agent processes input and contributes output to the same conversation stream, which is then rendered by the client interface such as Teams or Copilot.
When multiple agents operate using a common conversationId, the conversation maintains continuity across steps. This allows downstream agents to receive prior outputs as context, which is essential for chaining logic and building a coherent final response. However, this shared context also results in intermediate outputs being included in the activity stream.
Although autoSend: false is configured for certain steps, this setting primarily controls whether a workflow step explicitly emits a message, but it does not guarantee suppression of all outputs from appearing in the conversation. Since the conversation is shared and activity events are streamed through the bot runtime, intermediate outputs may still be surfaced.
It is important to clarify that there is currently no workflow-only configuration that ensures intermediate agent outputs remain completely hidden while retaining a shared conversation context. This behavior is tied to how conversation history persisted and rendered as part of the workflow execution model.
Please check if the following workarounds help -
- Introducing filtering at the bot layer - A middleware layer can be added in the bot pipeline to control which activities are passed to the client.
- Middleware processes every incoming and outgoing activity in the conversation pipeline
- It enables inspection and modification of messages before they are rendered
- Filtering can be applied based on:
- Agent identity
- Message metadata or structure
- Only the final agent response can be allowed to pass through while suppressing intermediate outputs
- Using external orchestration - Instead of relying on conversation-bound workflows
- Execute intermediate agents outside the client-facing conversation
- Pass outputs internally between agents
- Send only the final response back to the user interface
- Applying limited workflow adjustments - This may reduce visibility but does not guarantee suppression in all cases.
- Enable autoSend only for the final agent
- Keep intermediate outputs as structured data rather than conversational messages
The following references might be helpful , please check them out
- Build a workflow in Microsoft Foundry - Microsoft Foundry | Microsoft Learn
- Build with agents, conversations, and responses in Foundry Agent Service - Microsoft Foundry | Microsoft Learn
- Middleware - Bot Service | Microsoft Learn
Thank you