An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
This behavior is consistent with a service-side issue in Azure OpenAI Global Batch rather than with batch content, API version, or configuration.
From the available guidance, there are no client-side troubleshooting steps that explain or resolve a situation where:
- Minimal, valid batches (for example, a single "Say hello." request) remain stuck in
validating, and - The issue reproduces across multiple regions and API versions after previously successful runs the same day.
The documented troubleshooting items for Global Batch focus on:
- Input validation errors (for example,
invalid_json_line,model_not_found,invalid_request,too_many_tasks,url_mismatch,model_mismatch,empty_file,duplicate_custom_id,input_modified,input_no_permissions). These surface as explicit error codes, not as jobs stuck indefinitely invalidating. - Known client-side issues such as:
- Resources deployed via Azure CLI having non-standard endpoint subdomains that prevent Global Batch from working out of the box.
- UTF-8-BOM–encoded JSONL files being rejected.
- Modifying the input blob after submission causing the job to fail.
None of these match a scenario where jobs never leave validating and do not transition to a failed state with an error code.
Given the symptoms and the prior similar incident that required a backend fix, the next actionable steps are:
- Treat this as a potential regional or platform incident for Azure OpenAI Global Batch.
- Check Azure Status and Service Health in the Azure portal for incidents affecting Azure OpenAI in the impacted regions (East US 2 and North Central US).
- If no incident is shown, open a support case with Azure Support, providing:
- Resource regions and deployment names (for example,
gpt-4.1-global-batchin East US 2 and North Central US). - The affected batch IDs (
batch_0aea3966,batch_43ea94a8,batch_8c9946fc) and the last successful batch ID (batch_9495cf59) with timestamps. - Confirmation that minimal test batches with valid JSONL input are also stuck in
validating.
- Resource regions and deployment names (for example,
The support team can then correlate these batch IDs and regions with backend telemetry and apply or request the necessary service-side remediation.
References:
- Getting started with Azure OpenAI batch deployments (rest-api)
- Getting started with Azure OpenAI batch deployments (ai-foundry-portal)
- Getting started with Azure OpenAI batch deployments (programming-language-python)
- Document Intelligence classification and extraction requests timing out - Microsoft Q&A