Why is the OCR service not accepting pdf files?

S-A 20 Reputation points
2024-01-22T08:03:47.84+00:00

On the Azure AI Vision Studio, I am trying to test the performance of the OCR service. I want to integrate it with Azure AI Search as the enrichment has limitations. However, whenever I upload a pdf file, it doen't get read, nothing is detected. Why is that and how can I make it work? Note: The portal says it can read pdf files. User's image

Azure AI Search
Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
784 questions
Azure Computer Vision
Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
329 questions
{count} votes

1 answer

Sort by: Most helpful
  1. ajkuma 23,386 Reputation points Microsoft Employee
    2024-01-22T14:06:39.97+00:00

    @S-A , Thanks for posting this question. Do you receive any specific error messages? Is this issue confined only to a few specific PDFs?

    Based on my understanding of your scenario. Just to highlight, OCR skillset is not a free skillset, as outlined in this doc it only supports 20 documents per day.

    AttachCognitive Services to a skillset - Azure Cognitive Search | Microsoft Learn

    -Image extraction is an Azure AI Search operation that occurs when documents are cracked prior to enrichment. Image extraction is billable on all tiers, except for 20 free daily extractions on the free tier.

    Need to use the Microsoft.Skills.Vision.OcrSkill to extract text from image https://docs.microsoft.com/azure/search/cognitive-search-skill-ocr

    Extract text from images - Azure AI Search | Microsoft Learn

    – See this approach from on SO https://stackoverflow.com/a/73973654 and since the OCR skill calls this same API.

    Kindly let us know, I’ll follow up with you further.

    0 comments No comments