How to fix "Contains duplicated BoundingBoxes instances." error on Document Intelligence Studio model training

Jérémy 0 Reputation points
2024-01-05T11:47:27.5+00:00

Hello,

I'm trying to build a model based on a static template.

I'd like to build using the "Template" mode, not the neural one.

I labeled 1 PDF and then I get the error "You need 5 documents to use the Template mode".

So I've labeled 5 documents using only :

  • "Draw regions" to map text areas
  • and clicked on detected checkboxes to map them to a "Selection mark" field.

When I try to train the model, I get an error saying that I have only 1 valid files, because the others contains "Contains duplicated BoundingBoxes instances.".

I've tried cleaning up everything : spacing drawn regions to avoid overflowing, removing auto-labeled texts and using only drawn regions, etc.

And nothing seems to fix the issue.

I can't find any documentation or topics online about this "Contains duplicated BoundingBoxes instances" so I'm kind of blocked as I don't get where is the issue with my labelling.

Thanks ahead for your help.

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,920 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Jérémy 0 Reputation points
    2024-01-05T15:31:22.18+00:00

    It seems like it works when I don't use "auto-label".
    As soon as I use it, the training gets blocked with the "Contains duplicated BoundingBoxes instances" error.

    I'm gonna map everything 5 times by hands and hope it works.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.