Share via

How to OCR images in Word document

Anonymous
2024-07-07T04:20:49+00:00

How do I OCR a word document made up of images of text? I don't know how to open anything in One Note. I can't print to it either. None of the instructions I've followed work. Windows 10, Microsoft 365

Microsoft 365 and Office | Word | For home | Windows

Locked Question. This question was migrated from the Microsoft Support Community. You can vote on whether it's helpful, but you can't add comments or replies or follow the question.

0 comments No comments

5 answers

Sort by: Most helpful
  1. Suzanne S Barnhill 277.1K Reputation points MVP Volunteer Moderator
    2024-07-07T11:50:19+00:00

    You can select an image in the document and copy it, then paste into OneNote. In OneNote, right-click on the image and choose Copy Text from Picture.

    Then, in another part of the OneNote page or in a Word document, paste the result. Depending on the quality of the image, you may have to do some cleanup, and if the text in the image is highly formatted (columns, tables, etc.), the result may well be disappointing. Here's the result from the above image:

    9 people found this answer helpful.
    0 comments No comments
  2. Charles Kenyon 166.6K Reputation points Volunteer Moderator
    2024-07-07T15:41:19+00:00

    PDF files can be edited in Word, sort of…

    I am unsure which steps you have followed. Suzanne has given specific instructions for using OneNote.

    How was the file created originally, and by which program? It could have been created from a scan or a picture taken by a phone camera. Those are pictures of words saved as pdfs. Just as you can have a picture of a car. You can see the car in the picture, but you can't change the timing of the engine in that picture. You can't change the order of text or otherwise edit it with a picture of text. Word can open such a file, but it can't edit it. You have a Word file that contains a picture of text rather than text.

    In that case, you need to convert the picture to text. This is a process known as optical character recognition. This is built into Adobe Acrobat (but not the free Acrobat Reader) and is also in Office OneNote. Most scanner software comes with an OCR component as well. Word does not have OCR capability. OneNote does.

    Once translated into text, it can be edited in Word but there will still be formatting anomalies.

    If you simply want to write on the document (but not in it) you can add a Text Box floating on top of the document layer, whether or not it has been put through the OCR process.

    Web pages or Word documents that have been saved as PDF will not need the OCR process, they retain their text, although not all their Word structure and formatting. Documents created as PDF from other programs will likely be even more problematic.

    Finally, documents converted from pdf (or really any other format) to Word can be tough to edit because the conversion process never has a one-to-one matching of how formatting is done under the hood. This means that a converted document will seldom be formatted in Word in a way that uses Word features well for that formatting. An example is multiple section breaks to change margins, where in Word you would simply change the paragraph indent. Margins and Indents in Word. Another example is that Word formatting of text is best done using Styles and those will not be used. It will all be direct formatting. That can make a huge difference in how easy it is to edit. The Importance of Styles in Microsoft Word.

    If possible, find the file from which the pdf was created and edit that file, using the program that created it. Then if you need it in Word format and it is not, convert it directly to Word. This will cut out one conversion process and make for fewer editing problems.

    When I really need the document in Word format and intend to do much editing, I create a new Word file and paste the content into it as plain text. Then I format it to match the original using Styles for the formatting as much as possible. This takes time; for me, it is worth it and saves a lot of frustration.

    2 people found this answer helpful.
    0 comments No comments
  3. Anonymous
    2024-07-07T10:24:08+00:00

    Dear BohanBohan,

    Thanks for visiting Microsoft Community. 

    I understand that you are unable to OCR a Word document composed of images, and that the instructions you followed do not work.

    I'm sorry to tell you that there is no ability to OCR images in Word, but there is in OneNote.

    Due to the limited information I have, I am unable to provide you with accurate advice. In order to better understand and solve your problem, I need to confirm some information with you:

    1. What are you referring to when you mention that you don't know how to open any content in OneNote? Can you give me more detailed information?
    2. What are the instructions you are following? Is it the method in this link?

    Copy text from pictures and file printouts using OCR in OneNote - Microsoft Support

    If not, you copied the content from Word to OneNote and then used OCR using the method in this link.

    Please feel free to contact me if you need further assistance.

    Best regards

    Miyeon.S - MSFT |Microsoft Community Support Specialist

    1 person found this answer helpful.
    0 comments No comments
  4. Anonymous
    2024-07-08T01:03:56+00:00

    We are glad to receive your reply. If you need further help, please feel free to contact Microsoft Community.

    Miyeon.S - MSFT |Microsoft Community Support Specialist

    0 comments No comments
  5. Anonymous
    2024-07-07T19:24:10+00:00

    I think I got it now. I couldn't just paste something in One Note, or open anything. I had to go through certain steps to create a notebook. The "Create" button didn't work before because I had to give it a name. I finally was able to make my image text-searchable in One Note and now I'll investigate ways to spell check it.

    0 comments No comments