Azure Form Recognizer v3.0 - How can an existing model (custom neural) be retrained without labelling everything again?

Md Asfaque 1 Reputation point
2022-10-17T10:59:29.303+00:00

Hi, I want to retrain the model whenever a new document arrives for an existing model. Do I need to label all the existing fields again for that document? Or is there another option where I just pass the document and the model gets trained, using the REST API or some other method?

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

1 answer

  1. Ramr-msft 17,621 Reputation points
    2022-10-18T05:35:56.513+00:00

    @Md Asfaque Thanks for the question. You only need to label the extra fields, if there are any. Prebuilt models can't be trained; you can only use them as they are through the REST API.

    When you use the Form Recognizer custom model, you provide your own training data so the model can train to your industry-specific forms. You can train a model with five filled-in forms or an empty form (include the word "empty" in the file name) plus two filled-in forms. Even if you have enough filled-in forms to train with, adding an empty form to your training data set can improve the accuracy of the model.
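As a rough illustration of the minimum described above, here is a small sketch that checks a list of training file names against that rule. The function names `split_forms` and `meets_minimum` are made up for this example and are not part of any Azure SDK; the only convention taken from the answer is that an empty form has the word "empty" in its file name.

```python
from pathlib import Path


def split_forms(file_names):
    """Separate empty forms ("empty" in the file name) from filled-in forms."""
    empty = [n for n in file_names if "empty" in Path(n).stem.lower()]
    filled = [n for n in file_names if n not in empty]
    return empty, filled


def meets_minimum(file_names):
    """True if the set has five filled-in forms, or one empty plus two filled-in."""
    empty, filled = split_forms(file_names)
    return len(filled) >= 5 or (len(empty) >= 1 and len(filled) >= 2)


if __name__ == "__main__":
    training_set = ["invoice_empty.pdf", "invoice_01.pdf", "invoice_02.pdf"]
    print(meets_minimum(training_set))
```

This only checks file counts locally; it says nothing about labels or document quality.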

    It's important to use a data set that's optimized for training. Use the following tips to ensure you get the best results from the Train Model operation:

    • If possible, use text-based PDF documents instead of image-based documents. Scanned PDFs are handled as images.
    • Use one empty form and two filled-in forms if you have them available.
    • For filled-in forms, use examples that have all of their fields filled in.
    • Use forms with different values in each field.
    • If your form images are of lower quality, use a larger data set (10-15 images, for example).
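Once the new labelled document has been added to the training container, training can be started over REST. Below is a minimal sketch of how that request could be prepared, assuming the v3.0 `documentModels:build` operation with API version `2022-08-31` and a blob container SAS URL that already holds the documents and their label files. The helper name `make_build_request` and all the values passed to it are placeholders for illustration. The sketch also assumes that each retraining run uses a new model ID rather than overwriting the existing model.

```python
import json
import urllib.request


def make_build_request(endpoint, api_key, model_id, container_sas_url,
                       build_mode="neural", api_version="2022-08-31"):
    """Prepare (but do not send) a request that starts a custom model build."""
    url = (f"{endpoint.rstrip('/')}/formrecognizer/documentModels:build"
           f"?api-version={api_version}")
    body = {
        "modelId": model_id,          # assumed to be a new, unused ID
        "buildMode": build_mode,      # "neural" or "template"
        "azureBlobSource": {"containerUrl": container_sas_url},
    }
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values; replace with your own resource, key, and container.
    req = make_build_request(
        "https://<your-resource>.cognitiveservices.azure.com",
        "<your-key>",
        "my-model-v2",
        "<container-sas-url>",
    )
    print(req.get_method(), req.full_url)
    # To actually start the build: urllib.request.urlopen(req)
```

The build runs asynchronously, so the response to this request is not the trained model itself; check the REST reference for how to poll the operation status.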