PDF Extractor

Shambhu Rai 1,411 Reputation points
2022-04-19T13:42:56.037+00:00

Hi Expert,

I wanted to use Form Recognizer to load the data from PDF files using pipeline is there any perquisites conditions or criteria to use the data fir curation and further transformation purpose

Azure AI Document Intelligence
{count} votes

Answer accepted by question author
  1. YutongTie-MSFT 54,011 Reputation points Moderator
    2022-04-19T21:40:35.393+00:00

    Hello @Shambhu Rai

    Sure, I can provide you a custom training example in Form Recognizer Studio to see the requirement, life cycle and challenge.

    There are some points you should know before:

    1. Language Support, please check if your target language is support here: https://learn.microsoft.com/en-us/azure/applied-ai-services/form-recognizer/language-support
    2. Supported document format, please check if your invoice is good for the format requirement: https://learn.microsoft.com/en-us/azure/applied-ai-services/form-recognizer/concept-model-overview#input-requirements 3. QuickStart guidance, you can refer to this guidance to try our product quick to see if it fulfill your need.
      https://learn.microsoft.com/en-us/azure/applied-ai-services/form-recognizer/quickstarts/try-v3-form-recognizer-studio

    For the life cycle, the documents you upload will not change or disappear from the blob. After the training, the result will be in JSON file in the same blob as below screenshot.
    194461-image.png

    For challenge, for now I feel Form Recognizer is a good fit for common scenario. Based on my knowledge, some of the customer is suffering from the multipage table is not supported now.

    Please check above information and let me know if you have other concern, I am glad to help.

    Regards,
    Yutong

    -Please kindly accept the answer if you feel helpful to help the community, thanks a lot.

    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.