Hi Cristian Felipe Calderon Perez,
Greetings & Welcome to the Microsoft Q&A forum! Thank you for sharing your query.
I understand that names and document fields are being extracted incorrectly from your PDF: values that should be read as a single entity are being split into separate fields.
Here are some steps you can take to troubleshoot the issue:
- Check the quality of the PDF document: Make sure that the PDF document is of good quality and is not corrupted. If the document is of poor quality or is corrupted, it may affect the accuracy of the extraction.
- Check the layout of the PDF document: Make sure that the layout of the PDF document is consistent and follows a standard structure. If the layout is inconsistent or does not follow a standard structure, it may affect the accuracy of the extraction.
- Review the confidence scores: Azure Document Intelligence returns a confidence score with each extracted field. You can apply a confidence threshold in your own application to decide which fields to accept and which to flag for review. If the model returns a value such as a full name as separate fields, you can also combine those fields when you process the result.
- Train a custom model: If the extraction is still not accurate, you can train a custom model in Azure Document Intelligence to improve the accuracy of the extraction. You can use the labeling tool to label the data in your PDF document and train a custom model to extract the data based on the labeled data.
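As a rough illustration of the confidence threshold and field handling mentioned in the steps above, here is a minimal Python sketch. It does not call the Azure SDK. It assumes you have already copied the extracted fields into a plain dictionary of (text, confidence) pairs. The function name `merge_fields`, the field names, the sample values, and the 0.8 threshold are all hypothetical and should be adapted to your own model output.

```python
from typing import Dict, List, Optional, Tuple

# Assumed, simplified shape (not the SDK's own types):
# field name -> (extracted text, confidence between 0 and 1).
Field = Tuple[str, float]


def merge_fields(
    fields: Dict[str, Field],
    parts: List[str],
    min_confidence: float = 0.8,  # hypothetical threshold, tune for your data
) -> Optional[str]:
    """Join several split fields into one value.

    Returns None if any part is missing or falls below the threshold,
    so the caller can route the document to manual review.
    """
    values = []
    for name in parts:
        field = fields.get(name)
        if field is None:
            return None
        text, confidence = field
        if confidence < min_confidence:
            return None
        values.append(text.strip())
    # Skip empty strings so the result has no doubled spaces.
    return " ".join(v for v in values if v)


# Made-up sample values for illustration only.
extracted = {
    "FirstName": ("Ana Maria", 0.97),
    "LastName": ("Lopez Garcia", 0.93),
    "DocumentNumber": ("12345", 0.55),
}

full_name = merge_fields(extracted, ["FirstName", "LastName"])
print(full_name)
```

In this sketch a value is only accepted when every part meets the threshold. Anything that returns None can be sent for manual review instead of being stored with a low-confidence value.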
If you continue to experience difficulties, please feel free to reach out and we will escalate the issue to the appropriate team to ensure it is resolved promptly.
Hope this helps. Do let us know if you have any further queries.
If this answers your query, please click "Accept Answer" and select "Yes" for "Was this answer helpful".