Hello AyusmanBasu-6332,
I understand that you're running into a frustrating issue with the Fabric Data Agent, which returns the error "tool_user_error: Fabric run failed during execution.", and that the troubleshooting page you already checked did not provide useful insights.
A failure reported at the tool-execution stage suggests the problem is in the service-to-service execution layer between Foundry Agent and Fabric, not inside the model itself.
Below is a consolidated checklist covering configuration, permissions, networking, and quota angles.
1. Role & Identity Validation
Even if you are an Entra ID admin, what matters is the identity that Foundry Agent uses at runtime.
Verify that the Foundry project's Managed Identity has:
- Azure AI User
- Azure AI Project Manager
- (If applicable) Cognitive Services OpenAI Contributor
Also verify that the same Managed Identity has the required role in the Fabric Workspace.
Remove and re-add the Managed Identity to the Fabric Workspace to force a fresh token trust evaluation.
Sometimes token audience validation or cached identity bindings desync internally.
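As a rough way to audit the Azure-side roles, the sketch below compares the roles assigned to the managed identity against the list above. It assumes you have exported the assignments as JSON, for example with `az role assignment list --assignee <principal-id> --all -o json`, where each entry carries a `roleDefinitionName` field. The helper name `missing_roles` and the sample data are illustrative, and the Fabric Workspace role still has to be checked separately in the Fabric portal.

```python
import json

# Roles named in this answer; adjust to whatever your setup actually requires.
REQUIRED_ROLES = {"Azure AI User", "Azure AI Project Manager"}


def missing_roles(assignments_json: str, required=REQUIRED_ROLES) -> set:
    """Return the required role names that do not appear in the exported assignments."""
    assignments = json.loads(assignments_json)
    assigned = {entry.get("roleDefinitionName") for entry in assignments}
    return set(required) - assigned


# Illustrative sample only, not real CLI output.
sample = json.dumps([{"roleDefinitionName": "Azure AI User"}])
print(missing_roles(sample))  # roles from REQUIRED_ROLES that are not assigned
```

An empty set means every role in `REQUIRED_ROLES` is present at some scope; it does not prove the scope is the right one, so check the `scope` field of each entry as well.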
2. Fabric Capacity / Concurrency Limits
This is a common cause.
Even if manual runs succeed in Fabric, external invocation from Foundry can fail if:
- Fabric capacity is throttled
- Concurrency limits are reached
- Background job limits are exceeded
- Capacity was paused or auto-scaled
Check the Fabric capacity status, the workspace compute health, and the active job count.
There is no specific public “Agent Tool quota,” but Fabric capacity and cross-service throttling absolutely apply.
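For the "capacity was paused" case, a quick check can be scripted. The sketch below assumes the response shape of the Fabric REST API's List Capacities call (`GET https://api.fabric.microsoft.com/v1/capacities`), namely a `value` array whose items have `displayName` and `state` fields; please confirm that shape against the current API reference before relying on it. The function only inspects an already-fetched payload, and the sample data is made up.

```python
def inactive_capacities(payload: dict) -> list:
    """Return display names of capacities whose state is anything other than 'Active'."""
    names = []
    for capacity in payload.get("value", []):
        if capacity.get("state") != "Active":
            # Fall back to the id if no display name is present.
            names.append(capacity.get("displayName") or capacity.get("id", "<unknown>"))
    return names


# Made-up payload in the assumed response shape.
sample_payload = {
    "value": [
        {"displayName": "cap-a", "state": "Active"},
        {"displayName": "cap-b", "state": "Inactive"},
    ]
}
print(inactive_capacities(sample_payload))
```

This only detects a paused or inactive capacity. Throttling and concurrency pressure do not show up in the `state` field, so those still need to be reviewed in the Fabric Capacity Metrics app.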
3. Resource & Network Configuration Changes
Since this stopped suddenly, check whether any changes occurred in:
- VNET configuration
- Private endpoints
- DNS resolution
- Firewall rules
- Cross-region routing
If you are using private endpoints, confirm:
- The endpoint wasn’t modified after agent creation
- DNS still resolves correctly from Foundry service context
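You cannot run code inside the Foundry service context itself, but as an approximation you can test resolution from a machine in the same virtual network. The sketch below uses only the Python standard library: it resolves a hostname and reports whether every returned address is private, which is what you would expect when a private endpoint is in effect. The function names are illustrative, and the hostname is a placeholder you need to replace with your own endpoint.

```python
import ipaddress
import socket


def resolve_ips(hostname: str, port: int = 443) -> list:
    """Resolve a hostname and return its unique IP addresses, sorted."""
    infos = socket.getaddrinfo(hostname, port, proto=socket.IPPROTO_TCP)
    return sorted({info[4][0] for info in infos})


def all_private(ips) -> bool:
    """True only if the list is non-empty and every address is in a private range."""
    return bool(ips) and all(ipaddress.ip_address(ip).is_private for ip in ips)


# Usage (replace the placeholder with your endpoint's hostname):
# ips = resolve_ips("<your-endpoint-hostname>")
# print(ips, all_private(ips))
```

If the hostname resolves to a public address from inside the VNET, the private DNS zone link or record has probably changed since the agent was created.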
4. Regional Alignment
Confirm that the Foundry project region and the Fabric workspace region match.
Cross-region invocation can introduce routing failures or latency-related execution failures.
Ideally, both services should be in the same region.
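When comparing regions, note that different portals and tools may show the same region in different forms, such as a display name ("East US 2") versus a short name ("eastus2"). A minimal sketch of a comparison that ignores case, spaces, and punctuation is shown below; the helper names are illustrative and the region strings are examples only.

```python
def normalize_region(name: str) -> str:
    """Lower-case a region name and drop everything except letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def same_region(a: str, b: str) -> bool:
    """Compare two region names regardless of display-name or short-name form."""
    return normalize_region(a) == normalize_region(b)


print(same_region("East US 2", "eastus2"))
print(same_region("West Europe", "northeurope"))
```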
5. Agent & Connection Validation
Even though you recreated the connection, verify:
- The Fabric Data Agent is published
- The connection shows as healthy
- The agent instructions clearly reference when to use the Fabric tool
- The connection is properly selected in the Agent configuration
Sometimes a connection exists but isn’t fully published or bound.
6. Backend / Service Incident Possibility
Because it was working and then suddenly stopped, this may point to:
- A backend update
- A short-lived service regression
- Token exchange failure between services
Check Azure Service Health for both Azure AI Foundry and Microsoft Fabric.
Even brief degradation can produce this exact generic tool_user_error.
7. Model Compatibility
Since changing models didn’t help, this is unlikely to be the root cause. However, confirm:
- The model (e.g., GPT-4.1) is supported in your region
- You are not hitting TPM/RPM model limits
If it were a model quota issue, you would typically see a quota or rate limit error rather than a Fabric execution failure.
Most Likely Causes
Given your scenario, the highest probability causes are:
- Fabric capacity throttling
- Managed Identity token trust refresh issue
- Backend regression between Foundry and Fabric
Recommended Steps
- Check Fabric capacity health and active job usage.
- Remove and re-add Managed Identity permissions in Fabric.
- Fully delete and recreate the Fabric connection.
- Confirm region alignment.
- Check Azure Service Health.
Given that it worked earlier without configuration changes, this is very unlikely to be a misconfiguration on your side and more likely a capacity or backend execution issue.
I hope this helps. Do let me know if you have any further queries.
Thank you!