File extensions missing in OCR APA despite settings

Question

I'm trying to get an AutoIt3 script to work and I'm just a user of another script, so I don't have a lot of control over how it calls the OCR API.

If I provide a jpg image, the text returned from OCR is missing any strings that also happen to be file extensions (or at least XML, xml, and PDF). It recognizes other text with no issue. It also treats a double underscore "__" as a delimiter and turns the string into a table, jumbling everything up.
I'm trying to parse screens created by an app over which I have absolutely no control, and those screens are chock full of the above strings.

I have File Explorer options set to show extensions, and confirmed that Developer Settings shows the same option.

Are there some buried configuration options for the OCR API?

If I try to recognize a jpg image containing the string "XML hello" I get "hello" as the return value.
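
To pin down exactly which strings get dropped, it can help to compare the text known to be in the image against what OCR returns. The following is only a rough sketch in Python (not AutoIt, and not part of the script in question); `dropped_tokens` is a hypothetical helper name, and it assumes the OCR return value is available as a plain string.

```python
def dropped_tokens(expected, recognized):
    """Return the whitespace-separated tokens of `expected` that do not
    appear in `recognized`, compared case-insensitively."""
    # Collect the recognized tokens once, lowercased, for quick lookup.
    seen = {token.lower() for token in recognized.split()}
    # Keep the expected tokens (in original order) that OCR did not return.
    return [token for token in expected.split() if token.lower() not in seen]


# The case described above: the image reads "XML hello", OCR returns "hello".
print(dropped_tokens("XML hello", "hello"))
```

Running this over a handful of test images would show whether only extension-like strings such as XML and PDF go missing, or other tokens as well.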

Answer

OCR API, not APA :-)

Answer

It's not that simple. "XML Hello" is recognized fine if it comes from a Notepad screenshot. The image from my app isn't recognized correctly, despite enhancing contrast, etc. Not sure how to post images here.

Answer

Hello @Jim Gurley

Have you visited the Autoitscript forums for assistance with this issue? This is not really a Windows issue.

I do hope this answers your question.

Thanks.

