Before achieving 97% accuracy in our label dataset, we added around 30 additional samples. Here are some key strategies to enhance the overall quality when dealing with forms containing boxes:
- Incorporate Samples with Numerals: Ensure to include samples featuring numbers like '1' to address potential misinterpretations.
- Include Varied Dates and Phone Numbers:
- Example Date: 11/21/1991
- Example Phone: 151-582-51151
- Utilize Samples Starting with Specific Letters: Add samples where inputs begin with 'L' to test initial character recognition.
- Examples: "Lloyd", "llama"
- Prepare Samples with Potential Scanning Issues: Create and include samples that might scan poorly, such as those written in pencil. Pencil marks can often be less distinct, thus providing useful data on recognition accuracy.
If you are receiving poor accuracy order number list item number 4 is what helped us drastically. A pencil may be the key to your accuracy!