If you can't find answers to your questions in this document and still need help, check the Azure AI services support options guide. Azure AI Foundry Agent Service is part of Azure AI services.
Do you store any data used in the Foundry Agent Service API?
Yes. Foundry Agent Service is a stateful API, meaning it retains data. There are two types of data stored in the Foundry Agent Service API:
- Stateful entities: Threads, messages, and runs created during usage.
- Files: Uploaded during Foundry Agent Service setup or as part of a message.
Where is this data stored?
Basic Setup: Data is stored in a secure, Microsoft-managed storage account that is logically separated.
Standard Setup: Data is stored in your own Azure resources, giving you full ownership and control.
How long is this data stored?
All data persists in the system until you explicitly delete it. To delete a thread, use the delete function with the ID of that thread. Clearing a run in the Foundry Agent Service playground doesn't delete threads. However, once you delete a thread with the delete function, it no longer appears on the threads page.
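As an illustration of deleting threads by ID, here is a minimal sketch. It assumes a client object that exposes `threads.delete(thread_id)`; that call shape follows recent versions of the Azure AI Agents Python SDK, but treat it as an assumption and check the SDK version you have installed. `delete_threads` is a hypothetical helper name, not part of any SDK.

```python
def delete_threads(client, thread_ids):
    """Delete each thread by ID and report which deletions succeeded.

    `client` is assumed to expose `client.threads.delete(thread_id)`.
    Returns (deleted_ids, failures), where failures is a list of
    (thread_id, error_message) pairs.
    """
    deleted, failures = [], []
    for thread_id in thread_ids:
        try:
            # Assumed call shape; verify it against your installed SDK.
            client.threads.delete(thread_id)
            deleted.append(thread_id)
        except Exception as exc:  # keep going so one failure doesn't stop the rest
            failures.append((thread_id, str(exc)))
    return deleted, failures
```

Collecting failures instead of raising on the first one makes it easier to clean up many threads in a single pass and retry only the IDs that failed.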
Does Foundry Agent Service support customer-managed key encryption (CMK)?
Yes. Foundry Agent Service currently supports CMK for Threads and Files.
Is my data used by Microsoft for training models?
No. Data is not used by Microsoft for training models. See the Responsible AI documentation for more information.
Where is data stored geographically?
Azure AI Foundry Agent Service endpoints are regional, and data is stored in the same region as the endpoint. For more information, see the Azure data residency documentation.
How am I charged for Foundry Agent Service?
- Inference cost (input and output) of the base model you're using for each agent (for example, gpt-4-0125). If you've created multiple agents, you're charged for the base model attached to each agent.
- Code Interpreter sessions, if you've enabled the Code Interpreter tool. For example, if your agent calls Code Interpreter simultaneously in two different threads, two Code Interpreter sessions are created, and each is charged. Each session is active for one hour by default, so you pay this fee only once if your user keeps giving instructions to Code Interpreter in the same thread for up to one hour.
- File search is billed based on the vector storage used.
For more information, see the pricing page.
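To make the Code Interpreter session rule concrete, the following sketch counts sessions from a list of calls. It is an illustration only, not the billing implementation. It assumes that a session starts with the first call in a thread, lasts a fixed one hour from that start, and that a call made exactly one hour or more after the start opens a new session. The function name and input shape are hypothetical.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# Default session lifetime described above.
SESSION_LIFETIME = timedelta(hours=1)

def count_code_interpreter_sessions(calls, lifetime=SESSION_LIFETIME):
    """Count sessions for an iterable of (thread_id, called_at) pairs.

    Assumption: each thread has at most one active session, and a session
    covers every call made less than `lifetime` after the session started.
    """
    calls_by_thread = defaultdict(list)
    for thread_id, called_at in calls:
        calls_by_thread[thread_id].append(called_at)

    sessions = 0
    for times in calls_by_thread.values():
        session_start = None
        for called_at in sorted(times):
            # Open a new session if none exists or the current one has expired.
            if session_start is None or called_at - session_start >= lifetime:
                session_start = called_at
                sessions += 1
    return sessions

start = datetime(2024, 1, 1, 9, 0)
example_calls = [
    ("thread-a", start),                          # opens a session in thread-a
    ("thread-b", start),                          # simultaneous call, separate session
    ("thread-a", start + timedelta(minutes=30)),  # reuses the thread-a session
    ("thread-a", start + timedelta(minutes=75)),  # first session expired, new one
]
print(count_code_interpreter_sessions(example_calls))  # prints 3
```

Multiply the session count by the per-session rate on the pricing page to estimate the Code Interpreter portion of the bill.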
Is there any additional pricing or quota for using Foundry Agent Service?
No. The quotas that apply are the quotas for the models you use with Foundry Agent Service.
Does the Foundry Agent Service API support non-Azure OpenAI models?
Yes, the Foundry Agent Service API supports non-Azure OpenAI models. See the models page for more information.