Hello Mansi Gusain,
Welcome to the Microsoft Q&A and thank you for posting your questions here.
Problem
I understand that you would like to use external URL as a data provider for azure ai search.
Solution
To achieve your goal of using a URL as a data provider for Azure AI Search without scraping the data first. I will provide the answer based on your questions:
My use case is that I have my data in the form of a url for accessing the content . Now I wish to make this url as a data provider to my azure ai search how can i do that?
Azure Cognitive Search can ingest data from various sources, but directly indexing content from a URL without scraping is not a built-in feature. However, you can set up a process where the content from the URL is ingested and indexed in a way that suits your needs.
https://docs.microsoft.com/en-us/azure/search/search-create-index-portal and https://docs.microsoft.com/en-us/azure/search/search-indexer-overview
Which type of storge account and provider would be the best fit for this use case?
For storing the data fetched from the URL, Azure Blob Storage is the best fit. It is scalable, cost-effective, and well-integrated with Azure Cognitive Search.
- Storage Account Type: Use a General-purpose v2 (GPv2) storage account.
- Blob Container: Create a container within the storage account to store the fetched data.
https://docs.microsoft.com/en-us/azure/storage/blobs/storage-blobs-introduction and https://docs.microsoft.com/en-us/azure/storage/blobs/storage-quickstart-blobs-portal
Please suggest some method.
Put into consideration the followings:
- Data Ingestion
- Blob Storage Organization
- Indexing Configuration
- Security
Accept Answer
I hope this is helpful! Do not hesitate to let me know if you have any other questions.
** Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful ** so that others in the community facing similar issues can easily find the solution.
Best Regards,
Sina Salam