Will Documument Intelligence Studio support cross-page labelling anytime soon?

Asmond Loo 0 Reputation points
2024-12-13T01:47:48.75+00:00

Currently. Azure Doc Intelligence does not support cross-page labelling, will there be an update release to support this 'feature'. I need to train a custom model to identify and filter references or other unnecessary similar fields across all pages in a document.

If there are no upcoming updates in the near future, are there any alternatives in the studio that can support what I need to accomplish?

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
2,135 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,671 questions
{count} votes

1 answer

Sort by: Most helpful
  1. navba-MSFT 27,550 Reputation points Microsoft Employee Moderator
    2024-12-13T09:48:16.1933333+00:00

    @Asmond Loo Welcome to Microsoft Q&A Forum, Thank you for posting your query here!

    .

    Currently, Azure Document Intelligence does not support cross-page labeling directly. However, there are ongoing updates and enhancements to the service, and it's always good to keep an eye on the Azure Document Intelligence release notes for the latest features.

    .

    Alternative / workarounds:

    • Tabular fields support cross page tables by default. To label a table that spans multiple pages, label each row of the table across the different pages in the single table. As a best practice, ensure that your dataset contains a few samples of the expected variations. For example, include both samples where an entire table is on a single page and samples of a table spanning two or more pages. More info here.
    • Multi page tables: When tables span multiple pages, label a single table. Add documents to the training dataset with the expected variations represented—documents with the table on a single page only and documents with the table spanning two or more pages with all the rows labeled.
    • Custom Models: You can train custom models to identify and filter references or other fields. Although this might require some manual effort to ensure consistency across pages, it can be a viable solution.

    .

    Hope this helps.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.