Issues with Azure Document Intelligence OCR and Labeling
jonasmil
10
Reputation points
- OCR Character Recognition Accuracy:
- When analyzing the document (see the inserted example), certain letters, such as "Y," are being misinterpreted as "<" or other incorrect characters. This creates inconsistencies in the extracted data.
- Some text elements in the document are not detected at all, even though they are visible and clear.
- Would focusing the training process on individual table columns improve detection accuracy?
- What are the recommended approaches for training and testing models on structured documents like financial reports?
- Are there specific criteria or critical steps to ensure the model generalizes well across similar documents?
- When analyzing the document (see the inserted example), certain letters, such as "Y," are being misinterpreted as "<" or other incorrect characters. This creates inconsistencies in the extracted data.
- Labeling Workflow Limitations:
- During the labeling process, the platform does not allow me to manually modify incorrect characters or text detected by the OCR engine.
- Even when I download the
labels.json
file, make corrections manually, and re-upload it, the changes are not reflected in the system. This hinders the ability to refine the training data effectively. - Is there a straightforward method to modify OCR-detected text during the labeling step?
- UI Functionality Missing for Comparison:
- I could not find an option in the UI to visually compare the raw image with the OCR-processed version side by side. Such a feature would make it easier to identify recognition errors and validate corrections.
- Does Azure Document Intelligence provide a feature to visually compare the raw image with the OCR'd result in the same interface? If not, are there any suggested workflows or tools for doing this?
Sign in to answer