Convert Word to PDF to HTML

Vini875 0 Reputation points
2024-04-04T02:19:25.34+00:00

Hi,

My team and I have been using MS Word to convert PDF documents to Word format, edit them, and then convert them to HTML using Word's Save As .html option.

Since we convert many files each day, is there an API endpoint available that can take care of these conversions?

We have found that the conversion quality (with respect to content) is much better with the above method than with any other tool available, so it would be easier for us if Microsoft offered endpoints that do the same.

Thanks & Regards

Vinish

Word Management

1 answer

  1. Michael Taylor 48,486 Reputation points
    2024-04-04T02:33:30.1633333+00:00

    Why not just write a PowerShell script that automates the Office app to do the conversion for you? You could have an "input" folder where you drop the PDFs. A PowerShell script (which could be scheduled to run periodically) takes any PDFs in that folder, converts them using Office, and drops the results into a "working" folder that your team can then edit. Once the editing is done, the team puts the files into a "ready" folder. A second PowerShell script (also schedulable) reads from "ready", converts the files to HTML, and places them into a "done" folder.

    Here's a starter link for working with Word in PowerShell.

    There are also third-party products that can do these types of conversions, but you have to buy them. Not sure whether that is an option for you.
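
For illustration, the folder workflow described in the answer could be sketched roughly as follows. The answer suggests PowerShell; this sketch uses Python instead, with the same Word automation (COM) objects a PowerShell script would drive. The function names `convert_folder` and `word_save_as` are hypothetical. The Word part assumes Windows, a locally installed copy of Word, and the third-party `pywin32` package. It also assumes Word's dialogs stay suppressed when opening a PDF, which may vary by Word version.

```python
from pathlib import Path
from typing import Callable, List

# Values from Word's WdSaveFormat enumeration.
WD_FORMAT_DOCX = 16           # wdFormatDocumentDefault
WD_FORMAT_FILTERED_HTML = 10  # wdFormatFilteredHTML


def convert_folder(src_dir, dst_dir, src_ext: str, dst_ext: str,
                   convert: Callable[[Path, Path], None]) -> List[Path]:
    """Convert every *src_ext file in src_dir and write the result to dst_dir.

    Files whose output already exists are skipped, so the function can be
    run repeatedly (for example from a scheduled task).
    """
    src_dir, dst_dir = Path(src_dir), Path(dst_dir)
    dst_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for src in sorted(src_dir.glob(f"*{src_ext}")):
        dst = dst_dir / (src.stem + dst_ext)
        if dst.exists():
            continue  # converted on an earlier run
        convert(src, dst)
        written.append(dst)
    return written


def word_save_as(src: Path, dst: Path, file_format: int) -> None:
    """Open src in Word and save it as dst in the given WdSaveFormat.

    Assumption: Windows + Word + pywin32. Starting Word once per file is
    slow but keeps the sketch short.
    """
    import win32com.client  # imported lazily so the rest runs anywhere

    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    word.DisplayAlerts = 0  # wdAlertsNone
    try:
        # Positional arguments: FileName, ConfirmConversions, ReadOnly
        doc = word.Documents.Open(str(src.resolve()), False, True)
        try:
            doc.SaveAs2(str(dst.resolve()), file_format)
        finally:
            doc.Close(False)  # close without saving changes to the source
    finally:
        word.Quit()
```

The first scheduled job would then call something like `convert_folder("input", "working", ".pdf", ".docx", lambda s, d: word_save_as(s, d, WD_FORMAT_DOCX))`, and the second `convert_folder("ready", "done", ".docx", ".html", lambda s, d: word_save_as(s, d, WD_FORMAT_FILTERED_HTML))`. Passing the converter in as a function keeps the folder handling separate from Word, so a third-party converter could be swapped in later.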