How can I create a datasource using a wildcard for blob folder property?

Gavris Vancea 5 Reputation points
2024-10-18T07:44:32.13+00:00

Hello,
I am trying to integrate an Azure Datalake Gen 2 with Azure AI Search. The directory structure looks like this: docs/{entity_id}/{type}/file.pdf. I want to create an indexer that index only the files from the directories: docs/{entity_id}/basic.
I have already tried to create a datasource in Azure AI Search with a query(for the directory) which uses wildcard for the entity_id but it didn't work.

What could be the best approach for this kind of structure?

If the wildcards are not allowed, how can we workaround it?

Azure AI Search
Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
1,339 questions
{count} vote

1 answer

Sort by: Most helpful
  1. Laxman Reddy Revuri 5,395 Reputation points Microsoft External Staff Moderator
    2024-10-18T18:33:49.1633333+00:00

    Hi @Gavris Vancea
    Thanks for the question and using MS Q&A platform.
    Connect the key information from your data source to the fields in your search index, including the file content and any related metadata.
    Once your indexer is set up, run it to start indexing the files from the specified folders.
    After running the indexer, review your search index to ensure the correct documents were added. Also, monitor the indexer for any errors or problems.
    Index data from Azure Data Lake Storage Gen2
    Azure Data Lake Storage Gen2 indexer - Azure AI Search | Microsoft Learn
    Simple query syntax - Azure AI Search | Microsoft Learn
    Partial terms, patterns, and special characters - Azure AI Search | Microsoft Learn
    I hope this information is helpful. If you have any further questions, please don't hesitate to reach out.
     If the answer is helpful, please click "Accept Answer" and kindly upvote it.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.