With Cognitive Service - Read - can I extract paragraphs and tables. How do I do that.

Gomes, Sabrina D 25 Reputation points
2023-08-14T23:19:10.66+00:00

I have long documents that need to be re-formated. Sometimes when they are in Word format, they are corrupted. I want to extract entire texts and tables from a pdf version and paste the entire information into a new document under certain rules. I suppose Read would help me extract paragraphs and tables, but how.

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
2,013 questions
{count} votes

Accepted answer
  1. VasaviLankipalle-MSFT 18,656 Reputation points
    2023-08-15T04:37:41.6033333+00:00

    Hello @Gomes, Sabrina D , Thanks for using Microsoft Q&A Platform.

    The Read OCR model is available in Azure AI Vision and Document Intelligence with some common features: https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/overview-ocr#ocr-common-features

    The Document Intelligence Read Optical Character Recognition (OCR) model is a part of Azure's Applied AI Services and is best for extracting text from PDF documents and scanned images. It can extract both printed and handwritten text and can detect paragraphs, text lines, words, locations, and languages. https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept-read?view=doc-intel-3.1.0

    Please refer to this documentation to know more about the Read edition that best fits your scenario: https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/overview-ocr#ocr-read-editions

    You can use layout model to extract the paragraphs, tables, selection marks, lines and words.

    I would suggest visiting this documentation to choose the model that best suits your requirement. https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/choose-model-feature?view=doc-intel-3.1.0

    I hope this helps.

    Regards,
    Vasavi

    -Please kindly accept the answer and vote 'yes' if you feel helpful to support the community, thanks.

    1 person found this answer helpful.
    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.