Share via

Document Intelligence Studio, Train custom model, internal server error

KYOUNGTAEK OH 0 Reputation points
2026-02-25T22:50:52.1466667+00:00

I am trying to create new custom model.

Today, I got an error(Internal Server Error).

I'v made lots of custom models so I think Azure has some problem.

What should I do?

Foundry Tools
Foundry Tools
Formerly known as Azure AI Services or Azure Cognitive Services is a unified collection of prebuilt AI capabilities within the Microsoft Foundry platform
{count} votes

1 answer

Sort by: Most helpful
  1. Anshika Varshney 7,815 Reputation points Microsoft External Staff Moderator
    2026-02-26T00:26:08.7433333+00:00

    Hi KYOUNGTAEK OH,

    Thanks for reporting this. You’re not alone intermittent “Internal Server Error” issues during custom model training in Document Intelligence Studio have been observed before, even for users who’ve successfully trained many models in the past. In most cases, the issue isn’t the dataset itself but a temporary backend or configuration condition.

    Here are the troubleshooting steps that usually resolve this:

    1. Validate the training files first (important)

    Even if the same files worked earlier, run all training documents through the Layout model in Studio. If any document fails layout analysis, custom training can return an Internal Server Error instead of a validation message. This is one of the most common hidden causes.

    1. Check service limits and quotas

    In the Azure Portal, open your Document Intelligence resource → Usage + quotas and confirm:

    • You haven’t hit the concurrent training job limit
    • Your dataset size and page count are within documented limits for your model type Quota exhaustion can surface as a generic server error rather than a clear message in Studio. [learn.microsoft.com]
    1. Retry after a short pause or switch regions

    Several users have reported this error during periods of regional backend load. If possible:

    • Wait a bit and retry
    • Or temporarily create a new Document Intelligence resource in another region and test training there If it succeeds, that confirms a transient service-side issue rather than a data problem.
    1. Delete and recreate the failed model

    If the model creation partially succeeded in the background, retries may keep failing.

    • Delete the failed/stuck model from Studio
    • Wait a few minutes
    • Start a fresh training run with the same dataset This has resolved similar “internal error” cases reported by other users. [learn.microsoft.com]
    1. Use API logs for confirmation (optional but helpful)

    If the error continues, triggering training via the REST API or SDK can return more descriptive error details than the Studio UI, which helps confirm whether the failure is data‑related or service‑side. [github.com]

    If all of the above checks pass and the error persists across retries or regions, it’s very likely a service-side issue. Do let me know if you have any further queries.

    If this answers your query, please do click Accept Answer and Yes for was this answer helpful.

    Thankyou!

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.