Analysis doesn't tag correctly

Cristian Felipe Calderon Perez 0 Reputation points
2024-12-03T14:48:26.35+00:00

Hello, I am reading a PDF whose structure is the basis of a business document. I need to identify the names and documents of the people in it, but the extraction is wrong: the fields are returned as separate pieces instead of joined together, and I cannot make it work.

Azure Document Intelligence in Foundry Tools

2 answers

  1. Harsh Jain 75 Reputation points
    2024-12-04T17:22:59.9966667+00:00

    Hey @Cristian Felipe Calderon Perez ,

    I have reviewed your query and will do my best to help you resolve the issue in the simplest way possible. Please follow these steps:

    1. Verify File Format: Ensure your file is in one of the following supported formats: JPG, PNG, BMP, PDF (text or scanned), or TIFF.
    2. Upload to Azure: Upload the file to your Azure Storage Account.
    3. Train the Model:
      • Mark the Bounding Box around the necessary data in the file.
      • Ensure all files used for training are in a similar format.
    4. Validate the Model: Once the training is complete, verify the performance of the trained model.
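
The format check in step 1 can be automated before anything is uploaded. The following is only a minimal sketch: it looks at the file extension, not the file contents, and the helper name `is_supported` and the `.jpeg` and `.tif` spellings are assumptions added for illustration, not part of the service.

```python
from pathlib import Path

# Extensions for the formats listed in step 1 (JPG, PNG, BMP, PDF, TIFF).
# The ".jpeg" and ".tif" variants are an assumption added for convenience.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".pdf", ".tif", ".tiff"}


def is_supported(filename: str) -> bool:
    """Return True if the file extension matches one of the listed formats.

    This checks the name only; it does not open or validate the file.
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

Running this over a folder of training files before upload is a cheap way to catch an unsupported file early.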

    I hope this resolves your issue. If you have any further questions or need assistance, feel free to reach out.

  2. Pavankumar Purilla 11,575 Reputation points Microsoft External Staff Moderator
    2024-12-03T16:49:26.2566667+00:00

    Hi Cristian Felipe Calderon Perez,
    Greetings & Welcome to the Microsoft Q&A forum! Thank you for sharing your query.

    I understand that names and document fields are being extracted incorrectly from your PDF: the fields come back as separate pieces instead of being read as a single entity.

    Here are some steps you can take to troubleshoot the issue:

    • Check the quality of the PDF document: Make sure the PDF is of good quality and is not corrupted. A poor-quality or corrupted document may reduce the accuracy of the extraction.
    • Check the layout of the PDF document: Make sure the layout is consistent and follows a standard structure. An inconsistent or non-standard layout may reduce the accuracy of the extraction.
    • Adjust the extraction settings: You can adjust the extraction settings in Azure Document Intelligence to improve the accuracy of the extraction. For example, you can adjust the field mapping options to specify how fields should be processed, or you can adjust the confidence threshold to control the level of confidence required for a field to be extracted.
    • Train a custom model: If the extraction is still not accurate, you can train a custom model in Azure Document Intelligence to improve the accuracy of the extraction. You can use the labeling tool to label the data in your PDF document and train a custom model to extract the data based on the labeled data.
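
One way to approach the confidence threshold and the split-field problem is to post-process the results in your own code. The sketch below assumes the extracted fields have already been copied into plain dictionaries of the form `{"value": ..., "confidence": ...}`. The function names, the field names (`FirstName`, `LastName`, `FullName`, `DocumentNumber`) and the sample values are hypothetical and only illustrate the idea; they are not part of the Azure SDK or of the original document.

```python
from typing import Any, Dict, List

Fields = Dict[str, Dict[str, Any]]


def filter_by_confidence(fields: Fields, threshold: float = 0.8) -> Fields:
    """Keep only the fields whose confidence is at or above the threshold."""
    return {name: f for name, f in fields.items() if f["confidence"] >= threshold}


def join_fragments(fields: Fields, parts: List[str], target: str) -> Fields:
    """Merge separately extracted fragments into a single field.

    The fragments are joined in the order given by `parts`. The merged field
    takes the lowest confidence of its fragments, a conservative choice.
    """
    present = [fields[p] for p in parts if p in fields and fields[p]["value"]]
    merged = {name: f for name, f in fields.items() if name not in parts}
    if present:
        merged[target] = {
            "value": " ".join(f["value"].strip() for f in present),
            "confidence": min(f["confidence"] for f in present),
        }
    return merged


# Made-up sample data standing in for an analysis result.
sample: Fields = {
    "FirstName": {"value": "Ana", "confidence": 0.95},
    "LastName": {"value": "Gomez ", "confidence": 0.90},
    "DocumentNumber": {"value": "12345", "confidence": 0.40},
}

kept = filter_by_confidence(sample, threshold=0.8)
person = join_fragments(kept, ["FirstName", "LastName"], "FullName")
print(person)
```

Low-confidence fields such as the sample `DocumentNumber` are dropped here; in practice they could instead be routed to manual review. If the fragments always belong together, labeling them as one field when training the custom model may remove the need for this step.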

    If you continue to experience difficulties, please feel free to reach out and we will escalate the issue to the appropriate team to ensure it is resolved promptly.

    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, please click "Accept Answer" and select "Yes" for "Was this answer helpful?".
