Creating Quality Document Summaries With Azure

Education is the Key 65 Reputation points
2023-05-02T03:59:31.8233333+00:00

Other than using Cognitive Service for Language, is there a way to get coherent document summaries from unstructured PDFs using Cognitive Search?

Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

Accepted answer
  1. ajkuma 22,416 Reputation points Microsoft Employee
    2023-05-02T19:54:34.15+00:00

    @Education is the Key,

    Based on my understanding of your scenario, here are a few approaches:

    You may use Azure OpenAI: the guide "Query-based document summarization" shows how to perform document summarization with the Azure OpenAI GPT-3 model. Kindly take a look at the architecture and workflow described in that document.
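
For documents too long to summarize in a single request, one common pattern is to summarize in stages: split the text into chunks, summarize each chunk, then merge the partial summaries. The sketch below illustrates that pattern only; it is not taken from the linked guide. The `complete` callable is a hypothetical stand-in for whatever function sends a prompt to your Azure OpenAI deployment and returns the generated text, and the prompt wording and `max_chars` default are assumptions to tune.

```python
from typing import Callable, List


def split_into_chunks(text: str, max_chars: int = 2000) -> List[str]:
    """Group paragraphs into chunks of roughly max_chars characters.

    A single paragraph longer than max_chars is kept whole rather than cut.
    """
    chunks: List[str] = []
    current = ""
    for paragraph in text.split("\n\n"):
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = paragraph
    if current:
        chunks.append(current)
    return chunks


def summarize_document(
    text: str,
    complete: Callable[[str], str],  # hypothetical wrapper around your model call
    max_chars: int = 2000,
) -> str:
    """Summarize each chunk, then merge the partial summaries into one."""
    chunks = split_into_chunks(text, max_chars)
    if not chunks:
        return ""
    partial = [
        complete(f"Summarize the following text in a few sentences:\n\n{chunk}")
        for chunk in chunks
    ]
    if len(partial) == 1:
        return partial[0]
    combined = "\n".join(partial)
    return complete(
        f"Combine these partial summaries into one coherent summary:\n\n{combined}"
    )
```

Passing the model call in as a parameter keeps the chunking logic testable without network access and leaves the choice of SDK, deployment, and model settings to you.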

    You may use Azure Cognitive Search to extract information from unstructured PDFs and create document summaries. Its "Skillsets" feature allows you to define a set of skills that extract information from unstructured data.
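
As a rough illustration, a skillset is defined as a JSON document that lists the skills to run. The sketch below only builds such a definition as a Python dictionary; it does not call the service. The `@odata.type` values and output names are the built-in text skill identifiers as I recall them, so please verify them against the built-in skills reference. The skillset name is hypothetical, and `/document/content` assumes the indexer has already cracked the PDF into text.

```python
import json
from typing import Dict, List


def text_skill(odata_type: str, outputs: List[str]) -> Dict:
    """Build one skill that reads the document text and emits the named outputs."""
    return {
        "@odata.type": odata_type,
        "context": "/document",
        # Assumption: the extracted PDF text is available at /document/content.
        "inputs": [{"name": "text", "source": "/document/content"}],
        "outputs": [{"name": name, "targetName": name} for name in outputs],
    }


def build_skillset(name: str) -> Dict:
    """Assemble a skillset definition with key phrase, sentiment, and entity skills."""
    return {
        "name": name,
        "description": "Extract key phrases, sentiment, and entities from PDFs",
        "skills": [
            text_skill(
                "#Microsoft.Skills.Text.KeyPhraseExtractionSkill", ["keyPhrases"]
            ),
            text_skill("#Microsoft.Skills.Text.V3.SentimentSkill", ["sentiment"]),
            text_skill(
                "#Microsoft.Skills.Text.V3.EntityRecognitionSkill",
                ["organizations", "persons", "locations"],
            ),
        ],
    }


if __name__ == "__main__":
    # "pdf-summary-skillset" is a made-up name for illustration.
    print(json.dumps(build_skillset("pdf-summary-skillset"), indent=2))
```

The resulting JSON is the kind of body you would submit when creating the skillset through the portal, the REST API, or an SDK; long documents may also need to be split into pages before these skills run.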

    To create document summaries, you may use the built-in text skills (backed by Text Analytics), such as "Key Phrase Extraction" and "Sentiment", to extract key phrases and sentiment from the PDFs, and then use the "Entity Recognition" or "Custom Entity Lookup" skill to identify important entities in the text. You can then use this information to generate a summary of the document.
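
One simple way to turn extracted key phrases into a summary is extractive: score each sentence by how many key phrases it contains and keep the top few in their original order. The following is a minimal sketch under that assumption, not a built-in Cognitive Search feature. The function name, the regex-based sentence splitting, and the scoring rule are all illustrative choices.

```python
import re
from typing import Iterable, List, Tuple


def extractive_summary(
    text: str, key_phrases: Iterable[str], max_sentences: int = 3
) -> str:
    """Return the sentences that mention the most key phrases, in document order."""
    # Naive sentence split on ., !, or ? followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    phrases = [p.strip().lower() for p in key_phrases if p.strip()]

    scored: List[Tuple[int, int]] = []
    for index, sentence in enumerate(sentences):
        lowered = sentence.lower()
        score = sum(1 for phrase in phrases if phrase in lowered)
        scored.append((score, index))

    # Highest score first; ties go to the earlier sentence.
    top = sorted(scored, key=lambda item: (-item[0], item[1]))[:max_sentences]
    # Drop sentences with no key phrase and restore document order.
    chosen = sorted(index for score, index in top if score > 0)
    return " ".join(sentences[i] for i in chosen)
```

For example, with key phrases returned by the enrichment output for a document, `extractive_summary(content, key_phrases, max_sentences=2)` keeps the two most phrase-dense sentences. The result is only as coherent as the source sentences, so a generative model may still be preferable when fluent summaries matter.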

    You can find more information on how to create a skillset for document summarization in the Azure Cognitive Search documentation.

    You can also use the Cognitive Search enrichment pipeline to perform additional processing on the extracted content, such as entity recognition or sentiment analysis, to further improve the quality of the document summaries.

    Kindly let us know if you have further questions.
