Why the confidence score is different in Azure form recognizer and Microsoft Syntex?

Subhash Kumar Mahato 55 Reputation points
2023-06-15T11:33:21.52+00:00

I am doing a comparison between the Microsoft Syntex and Azure Form Recognizer for IDP (Intelligent document processing) on the basis of performance and accuracy.

I have used same 15 training documents for both (Form Recognizer and Microsoft Syntex) to train a custom model. When I have tested both the technologies with same test document, output was same in Form Recognizer and Microsoft Syntex, but there was difference in the confidence score.

In more precise information, I have used custom template model in Azure Form recognizer and Custom Layout method in Microsoft Syntex.

Here, I want to know that why there is difference in the confidence score? How the confidence score is being calculated in the Azure From recognizer and Microsoft Syntex?

Thank you,

Microsoft 365
Microsoft 365
Formerly Office 365, is a line of subscription services offered by Microsoft which adds to and includes the Microsoft Office product line.
3,989 questions
Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,449 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,459 questions
{count} votes

Accepted answer
  1. VasaviLankipalle-MSFT 15,006 Reputation points
    2023-06-15T21:31:43.83+00:00

    Hello @Subhash Kumar Mahato , Thanks for using Microsoft Q&A platform.

    Firstly, coming to Azure Form Recognizer calculates confidence scores based on the probability that the prediction is correct. Confidence scores are estimated for predicted words, key-value pairs, selection marks, regions, and signatures. However, not all document fields return a confidence score.

    As mentioned here in the documentation, Confidence scores generally have two data points. It's important to combine these two confidence scores (field level confidence score and the text extraction confidence score) to generate one overall score, which can help determine whether to automatically accept the prediction or flag it for human review, especially in scenarios where accuracy is critical.

    I don't have much knowledge regarding the Microsoft Syntex. In Microsoft Syntex, the confidence score might be calculated using different techniques or methods. However, it's important to take these into consideration when evaluating the performance of both confidence scores.

    I hope this helps.

    Regards,
    Vasavi

    -Please kindly accept the answer and vote 'yes' if you feel helpful to support the community, thanks.

    1 person found this answer helpful.
    0 comments No comments

0 additional answers

Sort by: Most helpful