How can I make the prebuilt layout model detect tables with missing top and bottom border lines in a PDF?

CHONG Jun Kit 0 Reputation points
2024-09-23T17:55:19.58+00:00

I have a PDF whose tables have no top or bottom border lines, so the prebuilt layout model does not detect them as tables and returns them as normal paragraphs instead. I need to detect a total of 2 tables, one on each PDF page.
User's image

Issue:

  • Tables like these are not detected; they are returned as normal paragraphs, with no table structure.

Ideally Expected:

  • Detect a total of 2 tables, one on each PDF page.
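
To make the gap between the actual and expected result concrete, a check like the one below could be run on the JSON returned by the layout model. This is only a minimal sketch: it assumes the result follows the REST response shape (`pages[].pageNumber` and `tables[].boundingRegions[].pageNumber`), and the helper names and sample data are hypothetical, not part of any SDK.

```python
def tables_per_page(analyze_result: dict) -> dict:
    """Count the tables detected on each page of a layout analysis result."""
    # Start every page at zero so pages with no detected table are still reported.
    counts = {page["pageNumber"]: 0 for page in analyze_result.get("pages", [])}
    for table in analyze_result.get("tables", []):
        # A table that spans pages has one bounding region per page;
        # the set makes sure it is counted once per page.
        pages = {r["pageNumber"] for r in table.get("boundingRegions", [])}
        for page_number in pages:
            counts[page_number] = counts.get(page_number, 0) + 1
    return counts


def pages_missing_tables(analyze_result: dict, expected_per_page: int = 1) -> list:
    """Return the page numbers that have fewer tables than expected."""
    counts = tables_per_page(analyze_result)
    return sorted(p for p, n in counts.items() if n < expected_per_page)


# Hypothetical sample (assumption): two pages, the tables came back as paragraphs only.
sample = {
    "pages": [{"pageNumber": 1}, {"pageNumber": 2}],
    "paragraphs": [{"content": "Item Qty Price"}],
    "tables": [],
}

print(pages_missing_tables(sample))  # [1, 2]
```

With the PDF described here, both pages would be listed as missing a table, whereas the expected outcome is an empty list.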

I suspect this would require fine-tuning the existing prebuilt layout model, but I don't see any option to do that.

What is the ideal way to overcome this issue?

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.