Azure Cognitive search , blob metadata versus file metadata (pdf)

Sophie Gonzalez 0 Reputation points
2023-10-24T13:47:00.4233333+00:00

Among other thing, I'm indexing a container of pdf files. According to documentation pdf files have a metadata_title proprety which ACS get at document extraction. The blob of the files also has custom property which ACS get without issues.

The problem is that the blob of the file also has a custom property called metatdata_title my index has a field called metatdata_title, i want this field to contain the blob property not the file property, It would be very troublesome to rename the blob property in my process, is there a way to access blob property data in input field mapping ?

Azure AI Search
Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
741 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,436 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Azar 19,565 Reputation points
    2023-10-24T15:33:18.7433333+00:00

    Hi @Sophie Gonzalez

    Yes you can specify the field mapping to use the custom blob property. Field mapping allows you to extract and map data from your blob properties into your search index fields.

    "fieldMappings": [   
      {         "sourceFieldName": "/metadata_title",  // Specify the path to the custom blob property              		"targetFieldName": "metadata_title"   // Specify the target field in your search index     } ] 
    
    
    

    If this helps kindly accept the answer thanks much.

    0 comments No comments