Question 1

Do you store any data used in the Foundry Agent Service API?

Accepted Answer

Yes. Foundry Agent Service is a stateful API, meaning it retains data. There are two types of data stored in the Foundry Agent Service API:

Stateful entities: Threads, messages, and runs created during usage.
Files: Uploaded during Foundry Agent Service setup or as part of a message.

Question 2

Where is this data stored?

Accepted Answer

Basic Setup: Data is stored in a secure, Microsoft-managed storage account that is logically separated.
Standard Setup: Data is stored in your own Azure resources, giving you full ownership and control.

Question 3

How long is this data stored?

Accepted Answer

All data persists in the system until you explicitly delete it. To delete a thread, call the delete function with the ID of that thread. Clearing the run in the Foundry Agent Service Playground doesn't delete threads. Threads that you delete with the delete function no longer appear on the threads page.
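As a minimal sketch of the cleanup described above, the helper below calls a delete function once per thread ID. The names `delete_threads` and `delete_thread` are hypothetical and not part of any SDK; the sketch assumes only that your client exposes some callable that takes a thread ID and deletes that thread.

```python
from typing import Callable, Iterable, List


def delete_threads(
    delete_thread: Callable[[str], object],
    thread_ids: Iterable[str],
) -> List[str]:
    """Delete each thread by ID and return the IDs that were deleted.

    `delete_thread` is a stand-in (assumption) for whatever delete
    function your SDK or REST wrapper provides; it must accept a
    single thread ID.
    """
    deleted: List[str] = []
    for thread_id in thread_ids:
        delete_thread(thread_id)  # data persists until this call is made
        deleted.append(thread_id)
    return deleted
```

In practice you would pass your client's own thread delete method as `delete_thread`. Check the reference for the SDK version you use, because the exact method name is not stated in this FAQ.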

Question 4

Does Foundry Agent Service support customer-managed key encryption (CMK)?

Accepted Answer

Yes. Foundry Agent Service currently supports CMK for threads and files.

Question 5

Is my data used by Microsoft for training models?

Accepted Answer

No. Data is not used by Microsoft for training models. See the Responsible AI documentation for more information.

Question 6

Where is data stored geographically?

Accepted Answer

Azure AI Foundry Agent Service endpoints are regional, and data is stored in the same region as the endpoint. For more information, see the Azure data residency documentation.

Question 7

How am I charged for Foundry Agent Service?

Accepted Answer

Charges come from three sources:

Inference: You pay the inference cost (input and output) of the base model that each agent uses (for example, gpt-4-0125). If you've created multiple agents, you're charged for the base model attached to each agent.
Code Interpreter: If you've enabled the Code Interpreter tool, you're charged per session. For example, if your agent calls Code Interpreter simultaneously in two different threads, that creates two Code Interpreter sessions, and each is charged. Each session is active for one hour by default, so you pay this fee only once if your user keeps giving instructions to Code Interpreter in the same thread for up to one hour.
File search: File search is billed based on the vector storage used.
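The Code Interpreter session rule can be illustrated with a small counting sketch. It is not a billing calculator: the function name `count_code_interpreter_sessions` and the input shape are hypothetical, and it assumes the one-hour lifetime is measured from the moment a session starts in a thread, which this FAQ does not state explicitly.

```python
from datetime import datetime, timedelta

# Default session lifetime given in the answer above.
SESSION_LIFETIME = timedelta(hours=1)


def count_code_interpreter_sessions(calls):
    """Count billable sessions from (thread_id, timestamp) pairs.

    Assumption: a call reuses the thread's current session if that
    session started less than one hour earlier; otherwise it starts
    a new session.
    """
    session_start = {}  # thread_id -> start time of the current session
    sessions = 0
    for thread_id, when in sorted(calls, key=lambda call: call[1]):
        start = session_start.get(thread_id)
        if start is None or when - start >= SESSION_LIFETIME:
            session_start[thread_id] = when
            sessions += 1
    return sessions


t0 = datetime(2024, 1, 1, 9, 0)
example_calls = [
    ("thread_a", t0),                          # new session in thread_a
    ("thread_b", t0),                          # separate thread, new session
    ("thread_a", t0 + timedelta(minutes=30)),  # reuses thread_a's session
    ("thread_a", t0 + timedelta(minutes=90)),  # more than an hour later, new session
]
print(count_code_interpreter_sessions(example_calls))  # prints 3
```

Two simultaneous threads give two sessions, a follow-up within the hour adds none, and a call after the session has expired adds a third.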

For more information, see the pricing page.

Question 8

Is there any additional pricing or quota for using Foundry Agent Service?

Accepted Answer

No. The quotas that apply are the existing quotas for the models you use with Foundry Agent Service.

Question 9

Does the Foundry Agent Service API support non-Azure OpenAI models?

Accepted Answer

Yes, the Foundry Agent Service API supports non-Azure OpenAI models. See the models page for more information.

Azure AI Foundry Agent Service frequently asked questions