Azure Document analysis custom extraction model failing to recognize '.'(Period).
When training the custom extraction model within the Document Intelligence Studio, the default Optical Character Recognition (OCR) engine appears to exhibit a limitation in accurately identifying the period character when it precedes numeric digits. To put it simply, I am not able to select the preceding period while labelling the documents.
The image below illustrates the mentioned issue.
Conversely, the OCR engine accurately recognizes and processes the period when it functions as a decimal separator following a numeric value.
This limitation is hindering the accurate labeling of fields containing such instances (e.g., ".789").
Any insights or potential solutions would be greatly appreciated.