How to process only the page or two of a form when the input file has many more irrelevant pages.

Rob Blackin 0 Reputation points
2023-08-31T18:23:24.9866667+00:00

We have a 10-page document, but the form itself occupies only one or two of those pages. However, when we analyze it using the model ID, we get charged for all 10 pages.

Is there a way to be charged only for the 2 pages containing the form the model was trained on?

The only way I can think of is:

  • Run OCR on the document, identify the start and end of the form, and then either split the document or pass in page numbers (we would still get charged for 10 pages of OCR)

Is there any other way?

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

1 answer

  1. VasaviLankipalle-MSFT 17,021 Reputation points
    2023-08-31T22:11:57.5033333+00:00

    Hello @Rob Blackin, thanks for using the Microsoft Q&A platform.

    Document Intelligence is billed by the number of pages analyzed. When a document is submitted for analysis, all of its pages are analyzed by default unless you set a page range in the "pages" parameter of the request: https://azure.microsoft.com/en-us/pricing/details/form-recognizer/

    In your case, you can set the page range to those 2 pages, and you will be charged only for the pages that are analyzed. This is the best option; alternatively, you can split the document yourself and upload only the pages you need. I reproduced this, and the page range filter worked for me.
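As a rough illustration of the page range approach, here is a minimal Python sketch. It assumes the `azure-ai-formrecognizer` package and its `DocumentAnalysisClient.begin_analyze_document` method, which accepts a `pages` keyword such as "3-4". The helper names `to_pages_param` and `analyze_form_pages` are hypothetical and not part of the SDK, and the endpoint, key, model ID, and file path are placeholders you would supply yourself.

```python
from typing import Iterable


def to_pages_param(page_numbers: Iterable[int]) -> str:
    """Collapse 1-based page numbers into a range string such as "3-4,7".

    Hypothetical helper (not part of the Azure SDK) that formats the value
    passed as the "pages" parameter.
    """
    pages = sorted(set(page_numbers))
    if not pages:
        raise ValueError("at least one page number is required")
    if pages[0] < 1:
        raise ValueError("page numbers are 1-based")

    ranges = []
    start = prev = pages[0]
    for page in pages[1:]:
        if page == prev + 1:
            # Still inside a consecutive run, so extend it.
            prev = page
            continue
        ranges.append((start, prev))
        start = prev = page
    ranges.append((start, prev))

    return ",".join(str(a) if a == b else f"{a}-{b}" for a, b in ranges)


def analyze_form_pages(endpoint, key, model_id, path, page_numbers):
    """Analyze only the given pages with a custom model (sketch).

    Assumes the azure-ai-formrecognizer package is installed; it is imported
    lazily so that to_pages_param can be used without the SDK.
    """
    from azure.ai.formrecognizer import DocumentAnalysisClient
    from azure.core.credentials import AzureKeyCredential

    client = DocumentAnalysisClient(endpoint, AzureKeyCredential(key))
    with open(path, "rb") as f:
        poller = client.begin_analyze_document(
            model_id, document=f, pages=to_pages_param(page_numbers)
        )
        return poller.result()
```

For a 10-page file whose form sits on pages 3 and 4, calling `analyze_form_pages(endpoint, key, model_id, "scan.pdf", [3, 4])` would send `pages="3-4"`. You still need to know in advance which pages hold the form; the sketch does not locate them for you.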


    Please try this and let us know if you still have issues.

    If you have billing issues, please raise a support ticket here: https://portal.azure.com/#blade/Microsoft_Azure_Support/HelpAndSupportBlade/overview

    I hope this helps.

    Regards,
    Vasavi

    Please accept the answer and vote 'yes' if you found it helpful, to support the community. Thanks.

