Azure form recognizer less accuracy and more time

ta 1 Reputation point
2021-07-22T13:52:42.967+00:00

I really need some suggestions regarding azure form recognizer. Can I ask please?

I am working on app where user will upload image of ID cards, (format can be jpeg, jpg, pdf). and i have to extract information with mapping. for that i have used form recognizer. but the problem was the accuracy is less for bad images and it was collection some garbage sometimes, and the time to process is also 45 seconds for one image.

so i thought to use azure ocr for it, i used classifier code and then azure ocr. now i have to map the json file.

my output will be kind of like this below:
public virtual int Id { get; set; }

my question is , am i in the wrong path? or the right one. if i am in the wrong path what should i follow?
also is there any AZURE classifier which i can use before form recognizer?
and is there any automatic json mapping service in azure?

Would you kindly give your suggestions what i should do to increase the accuracy and reduce the timing/?

should i move to OCR? or stick to form recognizer?
or should I build a custom model using python?

public virtual string FirstName { get; set; }

public virtual string LastName { get; set; }

public virtual string Sex { get; set; }

public virtual DateTime BirthDate { get; set; }

public virtual string IdNumber { get; set; }

public virtual string Address { get; set; }

public virtual string Province { get; set; }

public virtual string PostalCode { get; set; }

public virtual string ProvinceLetters { get; set; }

public virtual string City { get; set; }

public virtual string DdNumber { get; set; }

public virtual string Class { get; set; }

public virtual string Rest { get; set; }

public virtual DateTime IssueDate { get; set; }

public virtual DateTime ExpiryDate { get; set; }

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,447 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Ramr-msft 17,621 Reputation points
    2021-07-23T03:29:29.527+00:00

    @ta Generally we start with 5 documents as training set and you should be able to add more documents to your training incrementally to see an improvement in the results. If you don’t see improvements after doing that, We will forward to the form recognizer team. Please follow the document to Train a custom model using the sample labeling tool.

    Have you checked out the Knowledge Extraction Recipes resource? https://github.com/microsoft/knowledge-extraction-recipes-forms

    Comparison of form recognizer solution: https://cazton.com/blogs/executive/form-recognition-azure-aws-gcp