Extracting an image from PDF using Azure OpenAI assistants GPT 4o

Question

Mohamed Hussein 710

Hi,

I was trying to extract an image from a PDF file (attached) using the Assistants playground.

When I used the File Search option, it failed, but when I used Code Interpreter, it succeeded.

My question is: when using the Assistants API, if a user uploads a PDF file that is in most cases a mix of text and images, which tool should I assign, File Search or Code Interpreter?

EmbeddedImage.pdf

Accepted answer

Answer 1

Pavankumar Purilla 8,570 Microsoft External Staff Moderator

Hi Mohamed Hussein,
Greetings & Welcome to the Microsoft Q&A forum! Thank you for posting your query!

File Search is intended for extracting text and metadata from documents such as PDFs. It works well for searching and analyzing the text content but may struggle with images and other non-text elements in PDFs.

The Code Interpreter is more versatile and can process complex file types, including PDFs that contain both images and text. It can be used to extract images.

If the document is a heavy mix of images, charts, and text, and you need to extract the images, the Code Interpreter would be the better tool.
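As a rough illustration of the Code Interpreter option, the sketch below builds the arguments for creating an assistant that has the PDF attached to the Code Interpreter tool. It assumes `client` is an `openai.AzureOpenAI` instance that is already configured; the function names, the instruction text, and the deployment name passed in are hypothetical placeholders, not something taken from this thread.

```python
from typing import Any


def build_assistant_kwargs(deployment: str, file_id: str) -> dict[str, Any]:
    """Build keyword arguments for client.beta.assistants.create()."""
    return {
        # Name of your Azure OpenAI model deployment (supplied by the caller).
        "model": deployment,
        # Hypothetical instruction text; adjust to your use case.
        "instructions": "Extract the images embedded in the attached PDF "
                        "and return them as files.",
        # Enable only Code Interpreter and make the uploaded file available to it.
        "tools": [{"type": "code_interpreter"}],
        "tool_resources": {"code_interpreter": {"file_ids": [file_id]}},
    }


def create_image_extraction_assistant(client: Any, deployment: str, pdf_path: str):
    """Upload the PDF, then create an assistant that can open it.

    `client` is assumed to be an openai.AzureOpenAI instance.
    """
    with open(pdf_path, "rb") as fh:
        uploaded = client.files.create(file=fh, purpose="assistants")
    kwargs = build_assistant_kwargs(deployment, uploaded.id)
    return client.beta.assistants.create(**kwargs)
```

Calling `create_image_extraction_assistant(client, "<your-deployment>", "EmbeddedImage.pdf")` would be one way to use it; whether the images come back as output files still depends on what the model decides to run in the sandbox.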

Hope this helps. Do let us know if you have any further queries.

If this answers your query, please click Accept Answer and select Yes for "Was this answer helpful".

Pavankumar Purilla 8,570 Reputation points Microsoft External Staff Moderator

2024-11-13T16:52:25.8033333+00:00

Hi Mohamed Hussein,
Hope you are doing well.
Following up to see if the above answer was helpful. If it answers your query, please click Accept Answer and select Yes for "Was this answer helpful". If you have any further queries, do let us know.

Mohamed Hussein 710 Reputation points

2024-11-13T17:09:19.8066667+00:00

Thank you @Pavankumar Purilla for your answer. Does that mean we should assign both tools to every single file uploaded?

Pavankumar Purilla 8,570 Reputation points Microsoft External Staff Moderator

2024-11-13T23:14:55.5666667+00:00

Hi Mohamed Hussein,
Greetings of the day!
To handle file uploads effectively, you don't need to assign both tools to every file. Instead, use a targeted approach based on the file's content and your needs. If the file mainly contains text, File Search is efficient. For files with images or mixed content, the Code Interpreter is better.
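One possible way to apply this targeted approach is to inspect the PDF before choosing a tool. The sketch below is only a heuristic under stated assumptions: it scans the raw bytes for the `/Subtype /Image` marker that PDF image XObjects carry, so it can miss inline images and will not work on encrypted files. The helper names `pdf_has_images` and `choose_tools` are hypothetical.

```python
import re

# In PDF syntax, an embedded raster image is an XObject whose dictionary
# contains "/Subtype /Image" (the whitespace between the names is optional).
_IMAGE_MARKER = re.compile(rb"/Subtype\s*/Image\b")


def pdf_has_images(pdf_bytes: bytes) -> bool:
    """Return True if the raw PDF bytes appear to contain an image XObject."""
    return _IMAGE_MARKER.search(pdf_bytes) is not None


def choose_tools(pdf_bytes: bytes) -> list[dict[str, str]]:
    """Pick the Assistants tool list based on the file's content."""
    if pdf_has_images(pdf_bytes):
        # Images or mixed content: Code Interpreter, per the answer above.
        return [{"type": "code_interpreter"}]
    # Text-only content: File Search is the efficient choice.
    return [{"type": "file_search"}]
```

For example, `choose_tools(open("EmbeddedImage.pdf", "rb").read())` would return the tool list to pass when creating the assistant or thread for that file.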
