Summary
Calls to a mistral-document-ai-2512 serverless deployment on Microsoft Foundry intermittently return HTTP 408 "upstream request timeout" and HTTP 503 "service unavailable" when the payload uses document_url with a base64-encoded PDF. The exact same endpoint, deployment, model and credentials succeed reliably when the payload uses image_url with a base64-encoded image. PDF inputs were working consistently until a few days ago, without any change on our side.
Endpoint pattern
https://<resource>.cognitiveservices.azure.com/providers/mistral/azure/ocr
Model
mistral-document-ai-2512 (deployed as global-standard serverless)
Failing request body (PDF)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "document_url",
"document_url": "data:application/pdf;base64,<...>"
},
"include_image_base64": false
}
Working request body (image, same endpoint)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "image_url",
"image_url": "data:image/png;base64,<...>"
},
"include_image_base64": false
}
Error responses
Two distinct errors are returned, both server-side:
408 - upstream request timeout
503 - service unavailable
The errors appear non-deterministic — the same PDF payload can return 408 on one attempt and 503 on the next. Far below our 50 RPM quota in every case (no 429 ever observed).
Reproduction details
- PDF size tested: small (single page, well under the documented 30 MB / 30 page limit)
- Also tested with
pages: [0] to restrict processing to the first page only — same 408 / 503 mix
- Tried with
include_image_base64 set to both true and false — same behavior
- Client-side timeout set to 180000 ms — errors are clearly server-side, not client-side
- Multiple retries with 30 to 45 second spacing all fail
- Image inputs (PNG, same byte range) succeed within seconds on the same endpoint
What we have tried
- Increased client timeout up to 180 seconds
- Added retry logic (up to 5 attempts, 45 seconds between tries)
- Sent only
pages: [0] to minimize processing load
- Toggled
include_image_base64
- Verified the payload matches the documented API contract
Questions
- Is there a known regression or capacity issue affecting the
document_url (PDF) path of mistral-document-ai-2512 over the past few days?
- Is the PDF processing pipeline routed differently from the
image_url pipeline? The clear asymmetry (PDFs fail, images succeed on the same endpoint) suggests separate backend handling.
- Are 408 and 503 expected responses when underlying capacity is constrained, even when the client is well below its assigned RPM quota?
- Are there recommended workarounds while this is investigated, other than client-side PDF rasterization to PNG images?
Thanks for any guidance from the Foundry or Mistral team.
Summary
Calls to a mistral-document-ai-2512 serverless deployment on Microsoft Foundry intermittently return HTTP 408 "upstream request timeout" and HTTP 503 "service unavailable" when the payload uses document_url with a base64-encoded PDF. The exact same endpoint, deployment, model and credentials succeed reliably when the payload uses image_url with a base64-encoded image. PDF inputs were working consistently until a few days ago, without any change on our side.
Endpoint pattern
https://<resource>.cognitiveservices.azure.com/providers/mistral/azure/ocr
Model
mistral-document-ai-2512 (deployed as global-standard serverless)
Failing request body (PDF)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "document_url",
"document_url": "data:application/pdf;base64,<...>"
},
"include_image_base64": false
}
Working request body (image, same endpoint)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "image_url",
"image_url": "data:image/png;base64,<...>"
},
"include_image_base64": false
}
Error responses
Two distinct errors are returned, both server-side:
408 - upstream request timeout
503 - service unavailable
The errors appear non-deterministic — the same PDF payload can return 408 on one attempt and 503 on the next. Far below our 50 RPM quota in every case (no 429 ever observed).
Reproduction details
- PDF size tested: small (single page, well under the documented 30 MB / 30 page limit)
- Also tested with
pages: [0] to restrict processing to the first page only — same 408 / 503 mix
- Tried with
include_image_base64 set to both true and false — same behavior
- Client-side timeout set to 180000 ms — errors are clearly server-side, not client-side
- Multiple retries with 30 to 45 second spacing all fail
- Image inputs (PNG, same byte range) succeed within seconds on the same endpoint
What we have tried
- Increased client timeout up to 180 seconds
- Added retry logic (up to 5 attempts, 45 seconds between tries)
- Sent only
pages: [0] to minimize processing load
- Toggled
include_image_base64
- Verified the payload matches the documented API contract
Questions
- Is there a known regression or capacity issue affecting the
document_url (PDF) path of mistral-document-ai-2512 over the past few days?
- Is the PDF processing pipeline routed differently from the
image_url pipeline? The clear asymmetry (PDFs fail, images succeed on the same endpoint) suggests separate backend handling.
- Are 408 and 503 expected responses when underlying capacity is constrained, even when the client is well below its assigned RPM quota?
- Are there recommended workarounds while this is investigated, other than client-side PDF rasterization to PNG images?
Thanks for any guidance from the Foundry or Mistral team.
Summary
Calls to a mistral-document-ai-2512 serverless deployment on Microsoft Foundry intermittently return HTTP 408 "upstream request timeout" and HTTP 503 "service unavailable" when the payload uses document_url with a base64-encoded PDF. The exact same endpoint, deployment, model and credentials succeed reliably when the payload uses image_url with a base64-encoded image. PDF inputs were working consistently until a few days ago, without any change on our side.
Endpoint pattern
https://<resource>.cognitiveservices.azure.com/providers/mistral/azure/ocr
Model
mistral-document-ai-2512 (deployed as global-standard serverless)
Failing request body (PDF)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "document_url",
"document_url": "data:application/pdf;base64,<...>"
},
"include_image_base64": false
}
Working request body (image, same endpoint)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "image_url",
"image_url": "data:image/png;base64,<...>"
},
"include_image_base64": false
}
Error responses
Two distinct errors are returned, both server-side:
408 - upstream request timeout
503 - service unavailable
The errors appear non-deterministic — the same PDF payload can return 408 on one attempt and 503 on the next. Far below our 50 RPM quota in every case (no 429 ever observed).
Reproduction details
- PDF size tested: small (single page, well under the documented 30 MB / 30 page limit)
- Also tested with
pages: [0] to restrict processing to the first page only — same 408 / 503 mix
- Tried with
include_image_base64 set to both true and false — same behavior
- Client-side timeout set to 180000 ms — errors are clearly server-side, not client-side
- Multiple retries with 30 to 45 second spacing all fail
- Image inputs (PNG, same byte range) succeed within seconds on the same endpoint
What we have tried
- Increased client timeout up to 180 seconds
- Added retry logic (up to 5 attempts, 45 seconds between tries)
- Sent only
pages: [0] to minimize processing load
- Toggled
include_image_base64
- Verified the payload matches the documented API contract
Questions
- Is there a known regression or capacity issue affecting the
document_url (PDF) path of mistral-document-ai-2512 over the past few days?
- Is the PDF processing pipeline routed differently from the
image_url pipeline? The clear asymmetry (PDFs fail, images succeed on the same endpoint) suggests separate backend handling.
- Are 408 and 503 expected responses when underlying capacity is constrained, even when the client is well below its assigned RPM quota?
- Are there recommended workarounds while this is investigated, other than client-side PDF rasterization to PNG images?
Thanks for any guidance from the Foundry or Mistral team.
Summary
Calls to a mistral-document-ai-2512 serverless deployment on Microsoft Foundry intermittently return HTTP 408 "upstream request timeout" and HTTP 503 "service unavailable" when the payload uses document_url with a base64-encoded PDF. The exact same endpoint, deployment, model and credentials succeed reliably when the payload uses image_url with a base64-encoded image. PDF inputs were working consistently until a few days ago, without any change on our side.
Endpoint pattern
https://<resource>.cognitiveservices.azure.com/providers/mistral/azure/ocr
Model
mistral-document-ai-2512 (deployed as global-standard serverless)
Failing request body (PDF)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "document_url",
"document_url": "data:application/pdf;base64,<...>"
},
"include_image_base64": false
}
Working request body (image, same endpoint)
{
"model": "mistral-document-ai-2512",
"document": {
"type": "image_url",
"image_url": "data:image/png;base64,<...>"
},
"include_image_base64": false
}
Error responses
Two distinct errors are returned, both server-side:
408 - upstream request timeout
503 - service unavailable
The errors appear non-deterministic — the same PDF payload can return 408 on one attempt and 503 on the next. Far below our 50 RPM quota in every case (no 429 ever observed).
Reproduction details
- PDF size tested: small (single page, well under the documented 30 MB / 30 page limit)
- Also tested with
pages: [0] to restrict processing to the first page only — same 408 / 503 mix
- Tried with
include_image_base64 set to both true and false — same behavior
- Client-side timeout set to 180000 ms — errors are clearly server-side, not client-side
- Multiple retries with 30 to 45 second spacing all fail
- Image inputs (PNG, same byte range) succeed within seconds on the same endpoint
What we have tried
- Increased client timeout up to 180 seconds
- Added retry logic (up to 5 attempts, 45 seconds between tries)
- Sent only
pages: [0] to minimize processing load
- Toggled
include_image_base64
- Verified the payload matches the documented API contract
Questions
- Is there a known regression or capacity issue affecting the
document_url (PDF) path of mistral-document-ai-2512 over the past few days?
- Is the PDF processing pipeline routed differently from the
image_url pipeline? The clear asymmetry (PDFs fail, images succeed on the same endpoint) suggests separate backend handling.
- Are 408 and 503 expected responses when underlying capacity is constrained, even when the client is well below its assigned RPM quota?
- Are there recommended workarounds while this is investigated, other than client-side PDF rasterization to PNG images?
Thanks for any guidance from the Foundry or Mistral team.