Issues with Azure Document Intelligence OCR and Labeling

jonasmil 10 Reputation points
2024-11-26T17:58:23.6433333+00:00
  1. OCR Character Recognition Accuracy:
    • When analyzing the document (see the inserted example), certain letters, such as "Y," are being misinterpreted as "<" or other incorrect characters. This creates inconsistencies in the extracted data.
      User's image
    • Some text elements in the document are not detected at all, even though they are visible and clear.
    • Would focusing the training process on individual table columns improve detection accuracy?
    • What are the recommended approaches for training and testing models on structured documents like financial reports?
    • Are there specific criteria or critical steps to ensure the model generalizes well across similar documents?
  2. Labeling Workflow Limitations:
    • During the labeling process, the platform does not allow me to manually modify incorrect characters or text detected by the OCR engine.
    • Even when I download the labels.json file, make corrections manually, and re-upload it, the changes are not reflected in the system. This hinders the ability to refine the training data effectively.
    • Is there a straightforward method to modify OCR-detected text during the labeling step?
  3. UI Functionality Missing for Comparison:
    • I could not find an option in the UI to visually compare the raw image with the OCR-processed version side by side. Such a feature would make it easier to identify recognition errors and validate corrections.
    • Does Azure Document Intelligence provide a feature to visually compare the raw image with the OCR'd result in the same interface? If not, are there any suggested workflows or tools for doing this?
Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,775 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.