Form Recogniser fails to extract text from boxed form
We are trying to use azure-form-recogniser to read text from handwritten scanned forms. We have been training custom form recogniser models. We would like to know if the azure form recogniser is intelligent enough to pickup characters with in the boxes and ignore the boxes around the form? does the azure form recogniser have built in cleansing functions to remove boxes?
Thanks for reaching out to us. Could you please share the version you are using and also the clear picture so that we can investigate?
For your question, the new version Form Recognizer is able to recognizer characters from box.
Sign in to comment
We are using Cognitive Services Form Recognizer v2.1. Please see attached clearer picture of what we are trying to capture from forms.
What is the recommended way of labelling forms with characters in boxes? Is it to label each box as a separate label and the perform post-processing to combine the model inference for each box to form a Telephone number in this case? Please confirm.
Thanks for the waiting. An update here is we have forwarded this issue to PG to see any way we can optimize this better. For you question, I am waiting for a confirmation and will let you know soon.