How to make Azure AI Search return images (in Azure OpenAI Service, from your own data)

Hadjmbarek Nadia 45 Reputation points
2024-09-27T21:15:20.9233333+00:00

Hello, I am working on a RAG application that uses Azure AI Search to index my PDF documents stored in Azure Blob Storage. The end goal is to set up Azure OpenAI Service (from your own data) so that I have a chatbot over my documents. The documents are 1000 pages long and contain figures.

I want to:

  1. have the answers from Azure OpenAI Service include the related figures from the indexed documents when needed;
  2. do variable-size chunking based on the Markdown format, since my documents are so long.

1 answer

  1. Daniel Fang 1,060 Reputation points MVP
    2024-09-28T00:55:32.8233333+00:00

    The original PDF documents are not stored directly in Azure AI Search. They are processed and chunked during the ingestion phase.

    1- The text content of the PDF document is broken down into smaller chunks, which are stored in AI Search as individual records. Embeddings are then created for the text so that it can be used for vector search.

    https://learn.microsoft.com/en-us/azure/architecture/solution-ideas/articles/ai-search-skillsets

    [Diagram: the AI Search architecture to convert unstructured data into structured data]
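For the variable-size chunking asked about in the question, one possible approach is to split on Markdown headings, so each section becomes its own chunk whatever its length, and to split further on paragraph boundaries only when a section is too large. The following is a minimal sketch in plain Python, not an Azure AI Search API: it assumes the PDF has already been converted to Markdown, and the function name `chunk_markdown`, the `max_chars` limit, and the `title`/`content` field names are hypothetical.

```python
import re
from typing import Dict, List

# Matches ATX-style Markdown headings such as "# Title" or "### Subsection".
HEADING = re.compile(r"^(#{1,6})\s+(.*)$")


def chunk_markdown(markdown: str, max_chars: int = 2000) -> List[Dict[str, str]]:
    """Split Markdown into one chunk per heading section.

    Sections longer than max_chars are split again on blank lines.
    A single paragraph longer than max_chars is kept whole.
    Assumption: '#' lines inside code fences are not handled specially.
    """
    sections = []
    title, lines = "", []
    for line in markdown.splitlines():
        match = HEADING.match(line)
        if match:
            # A new heading closes the previous section (if it has any text).
            if any(l.strip() for l in lines):
                sections.append((title, "\n".join(lines).strip()))
            title, lines = match.group(2).strip(), []
        else:
            lines.append(line)
    if any(l.strip() for l in lines):
        sections.append((title, "\n".join(lines).strip()))

    chunks = []
    for title, body in sections:
        current = ""
        for paragraph in re.split(r"\n\s*\n", body):
            # Start a new chunk when adding this paragraph would exceed the limit.
            if current and len(current) + len(paragraph) + 2 > max_chars:
                chunks.append({"title": title, "content": current})
                current = paragraph
            else:
                current = f"{current}\n\n{paragraph}" if current else paragraph
        if current:
            chunks.append({"title": title, "content": current})
    return chunks
```

Each returned dictionary could then be embedded and uploaded as one search record; keeping the heading in `title` gives every chunk some context about where it came from in the document.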

    2- If your requirement is to let users search figures by keyword: because the figures and diagrams are not imported with the text from the PDF, you will need to do additional work to store them and make them searchable in AI Search, i.e. generate image embeddings for the figures (using a vision and embedding model) and enable them for vector search.
    3- If your requirement is just to display the figures together with the returned text: you probably need to store the figures/images separately in a storage account, find a way to reference them from the text documents, and display them in the application.
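Option 3 could be sketched as follows. This is a minimal, hypothetical example and not part of any Azure SDK: it assumes the figures were already extracted and uploaded to a storage account during ingestion, and that each chunk's text carries Markdown image references (`![caption](url)`) pointing at them. The names `attach_figures`, `render_answer`, and the `figures` field are made up for illustration.

```python
import re
from typing import Any, Dict, List

# Matches Markdown image references of the form ![caption](url).
IMAGE_REF = re.compile(r"!\[([^\]]*)\]\(([^)\s]+)\)")


def attach_figures(chunk: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of the chunk with a 'figures' list built from the
    image references found in its 'content' field (run at ingestion time)."""
    content = str(chunk["content"])
    figures = [{"caption": c, "url": u} for c, u in IMAGE_REF.findall(content)]
    return {**chunk, "figures": figures}


def render_answer(answer: str, retrieved: List[Dict[str, Any]]) -> str:
    """Append the figures of the retrieved chunks to the model's answer,
    skipping duplicates (run in the application at query time)."""
    seen = set()
    parts = [answer]
    for chunk in retrieved:
        for fig in chunk.get("figures", []):
            if fig["url"] not in seen:
                seen.add(fig["url"])
                parts.append(f"![{fig['caption']}]({fig['url']})")
    return "\n\n".join(parts)
```

With this layout the search index only needs an extra field holding the figure URLs; the application decides at display time which figures to show next to the generated text.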
