Container Apps Job Maximum Allowed Cores exceeded
I have an event driven Container Apps Job which has been running well for about a month. It is using a General Purpose D-series workload profile with 4 vCPU and 16GB memory. The profile is used only by this job as is currently set to a min of 0 instances and a max of 1 instance. The job runs on .5 CPU, 1GB memory and has a max of 8 replicas. Recently we have started noticing frequent issues with running even a single job. The system logs show messages like this:
Maximum Allowed Cores exceeded for the Managed Environment. Please check https://learn.microsoft.com/en-us/azure/container-apps/quotas for resource limits
and
0/4 nodes are available: 1 node(s) had untolerated taint {virtual-kubelet.io/provider: legion}, 3 node(s) didn't match Pod's node affinity/selector. preemption: 0/4 nodes are available: 4 Preemption is not helpful for scheduling.
However, when I look at my usage quotas, we are nowhere near any of them:
{
"value": [
{
"currentValue": 0,
"limit": 200,
"name": {
"localizedValue": "Managed Environment General Purpose Cores",
"value": "ManagedEnvironmentGeneralPurposeCores"
},
"usage": 0
},
{
"currentValue": 0,
"limit": 200,
"name": {
"localizedValue": "Managed Environment Memory Optimized Cores",
"value": "ManagedEnvironmentMemoryOptimizedCores"
},
"usage": 0
},
{
"currentValue": 7,
"limit": 100,
"name": {
"localizedValue": "Managed Environment Consumption Cores",
"value": "ManagedEnvironmentConsumptionCores"
},
"usage": 7
}
]
}
Also, in the portal I see this which also shows no quotas even being close:
What might be causing these errors if we are not exceeding any quotas? Or is there one that I am missing?
Interestingly, I am also noticing that even when there is only 1 message, when this error is encountered, there will be multiple jobs trying to run at the same time even though parallelism is set to 1.