Is it possible to split PDFs into single-page PDFs in Azure Blob Storage as and when they are uploaded?

Sachitha C 20 Reputation points
2023-05-22T11:07:09.37+00:00

I need to run semantic search queries over content (PDFs) in Azure Blob Storage, and I need the PDFs chunked into single pages. Is there any way to automate this?

Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.

Accepted answer
  1. Konstantinos Passadis 17,286 Reputation points
    2023-05-22T11:15:28.09+00:00

    Hello @Sachitha C!

    Welcome to Microsoft QnA!

    I understand you want to split PDF documents into single pages for your semantic search.

    As far as I know, you can do that with third-party libraries such as iText, PyPDF2, PDFBox, and PDFlib.

    There is no native way to do that in Blob Storage.
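As a rough illustration of the library route, here is a minimal Python sketch using PyPDF2 (one of the libraries listed above). It assumes the PyPDF2 2.x/3.x API (`PdfReader`, `PdfWriter`). The helper names `page_blob_name`, `split_pdf_pages`, and `upload_pages`, and the `_page_0001` naming scheme, are hypothetical and chosen only for this example. `upload_pages` assumes it receives something like a `ContainerClient` from the `azure-storage-blob` package. How the code gets triggered on upload is not covered here.

```python
import io
import posixpath


def page_blob_name(source_name: str, page_number: int) -> str:
    """Build a per-page blob name, e.g. 'docs/a.pdf' -> 'docs/a_page_0001.pdf'.

    The naming scheme is an assumption made for this sketch.
    """
    stem, ext = posixpath.splitext(source_name)
    return f"{stem}_page_{page_number:04d}{ext or '.pdf'}"


def split_pdf_pages(pdf_bytes: bytes) -> list[bytes]:
    """Return one single-page PDF (as bytes) for each page of the input PDF."""
    # Third-party dependency (pip install PyPDF2); assumes the 2.x/3.x API.
    # Imported here so the naming helper works without the package installed.
    from PyPDF2 import PdfReader, PdfWriter

    reader = PdfReader(io.BytesIO(pdf_bytes))
    pages = []
    for page in reader.pages:
        writer = PdfWriter()
        writer.add_page(page)  # copy exactly one page into a new document
        buffer = io.BytesIO()
        writer.write(buffer)
        pages.append(buffer.getvalue())
    return pages


def upload_pages(container_client, source_name: str, pdf_bytes: bytes) -> list[str]:
    """Split a PDF and upload each page as its own blob.

    `container_client` is assumed to expose upload_blob(name=..., data=...,
    overwrite=...), as azure.storage.blob.ContainerClient does.
    """
    names = []
    for number, page_bytes in enumerate(split_pdf_pages(pdf_bytes), start=1):
        name = page_blob_name(source_name, number)
        container_client.upload_blob(name=name, data=page_bytes, overwrite=True)
        names.append(name)
    return names
```

Writing the pages to a separate container (or a distinct prefix) from the source PDFs would keep an upload-triggered process from picking up its own output, and would let the search indexer point only at the single-page files.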

    I hope this helps!

    Kindly mark the answer as Accepted and Upvote in case it helped!

    Regards
