Hi GenixPRO
Thanks for your detailed question. This is a common point of confusion when working with the Assistants API and moving to Foundry Agents. Let me clarify the behavior and your issues step by step.
1. About memory and vector stores in Assistants API
In the Assistants API, you do not need to attach memory separately.
- Memory is already handled by the Thread
- Thread stores the conversation history automatically
- You simply keep adding messages to the same thread
So there is no need to attach memory separately to:
- the assistant
- the thread
This is expected behavior.
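To illustrate, here is a minimal sketch of reusing one thread across turns. It assumes `client` is an `openai.AzureOpenAI` (or `openai.OpenAI`) instance from the `openai` Python package; the helper name `ask` is only for illustration.

```python
def ask(client, thread_id, assistant_id, text):
    """Append a user message to an existing thread and run the assistant on it.

    Assumption: `client` exposes the Assistants API under `client.beta`,
    as the `openai` Python SDK does. No memory object is attached anywhere;
    the thread itself carries the conversation history.
    """
    client.beta.threads.messages.create(
        thread_id=thread_id, role="user", content=text
    )
    run = client.beta.threads.runs.create_and_poll(
        thread_id=thread_id, assistant_id=assistant_id
    )
    return run.status
```

In use, you would create the thread once with `client.beta.threads.create()` and then call `ask(...)` with the same `thread.id` for every follow-up question, so that later turns can see the earlier ones.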
Now about vector stores:
- Vector stores are only used for file search (RAG scenarios)
- They are used to read and search content from uploaded files
- They are not used to store generated files like PDFs or charts
So, if your expectation is that generated files will be saved into vector store, that will not happen.
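As a rough sketch of where a vector store fits, the helper below builds the arguments for an assistant that reads from existing vector stores through the file search tool. The helper name is hypothetical; the `tools` and `tool_resources` shapes follow the Assistants API as I understand it, so please verify them against your SDK version.

```python
def file_search_assistant_args(model, vector_store_ids, instructions=""):
    """Build keyword arguments for an assistant that searches uploaded files.

    The vector store is an input to the file_search tool (read and search).
    Nothing here tells the service to write generated files back into it.
    """
    return {
        "model": model,  # on Azure this is your deployment name
        "instructions": instructions,
        "tools": [{"type": "file_search"}],
        "tool_resources": {
            "file_search": {"vector_store_ids": list(vector_store_ids)}
        },
    }
```

You would pass the result to `client.beta.assistants.create(**args)`.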
2. Why your generated files are not retrievable
When using tools like code interpreter:
- Files (like charts or PDFs) are generated during the run
- These files are stored temporarily as part of the tool execution
- They are not automatically persisted for reuse later
That is why:
- your first assistant creates files but cannot retrieve them later
- the second assistant cannot access those outputs
This is a limitation of the Assistants API design.
3. Important point about Assistants API
The Assistants API is being retired, so this behavior will not be improved further.
You can check here: Assistants API concepts and deprecation notice [learn.microsoft.com]
Microsoft recommends moving to Foundry Agents.
4. How Foundry Agents solve this better
Foundry Agents are designed as a more complete system:
- They manage conversations (threads) automatically
- They support tools and workflows in a more structured way
- They allow better integration with storage and enterprise data
They also give you more control for:
- where data is stored
- how tools execute
- how outputs are managed
Agents are built as a full platform instead of just an API layer. [dev.to]
5. Recommended approach
If you stay on the Assistants API:
- Do not rely on the vector store for generated files
- Capture generated file output in your application code
- Store files externally (blob storage or database)
- Pass file references back to users
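The steps above can be sketched roughly as follows. This assumes each message has been converted to a plain dict (for example with `message.model_dump()` in the `openai` Python SDK) and that generated files appear either as `image_file` content parts or as `file_path` annotations on text parts. The function names and the local output folder are illustrative only; in production you would likely upload to blob storage instead.

```python
from pathlib import Path


def generated_file_ids(messages):
    """Return file IDs referenced by assistant messages, in order, de-duplicated."""
    ids = []
    for message in messages:
        if message.get("role") != "assistant":
            continue
        for part in message.get("content", []):
            if part.get("type") == "image_file":
                # Charts from code interpreter usually arrive as image parts.
                ids.append(part["image_file"]["file_id"])
            elif part.get("type") == "text":
                # Downloadable files (PDF, CSV, ...) arrive as annotations.
                for ann in part["text"].get("annotations", []):
                    if ann.get("type") == "file_path":
                        ids.append(ann["file_path"]["file_id"])
    return list(dict.fromkeys(ids))


def persist_files(file_ids, fetch_bytes, out_dir):
    """Save each file under out_dir and return a {file_id: saved_path} mapping.

    `fetch_bytes` is any callable that turns a file ID into bytes, so the
    storage step does not depend on a specific SDK.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = {}
    for file_id in file_ids:
        path = out / file_id
        path.write_bytes(fetch_bytes(file_id))
        saved[file_id] = str(path)
    return saved
```

With the `openai` SDK, `fetch_bytes` could be `lambda fid: client.files.content(fid).read()`. The returned mapping is what you pass back to the user or to a second assistant, instead of expecting it to find the files on its own.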
If you move to Foundry Agents:
- Treat the agent as a long-running service
- Use external storage for generated outputs
- Design a flow where tool output is saved and returned explicitly
This should help explain the behavior you are seeing. Do let me know if you have any further queries.
Thank you!