Why does my search index keep referencing older documents that are no longer in the datastore?

Harsh Bangad 85 Reputation points
2024-10-09T17:46:24.0633333+00:00

I am working in Azure AI Studio with a deployed GPT-4o model to query documents. I have a datastore connection where my documents live and are indexed, and a job in Azure Machine Learning that reindexes the connected datastore every day. I remove old files from the connected datastore and expect the job to pick up only the new data when I ask questions. The new data does appear in the answers, but I still get content from the older documents as well, which makes the answers wrong. I do not want the index to contain the older data, which has already been removed from the datastore. How do I solve this?

Azure Machine Learning
An Azure machine learning service for building and deploying models.
Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

1 answer

  1. AshokPeddakotla-MSFT 35,951 Reputation points
    2024-10-10T02:15:04.4733333+00:00

    Harsh Bangad Greetings!

    Since you mention that the old data has already been removed, one likely cause is that the indexer is not detecting the deleted files, so the corresponding documents remain in the index.

    What kind of data store are you referring to?

    I understand that you have already checked that the data is removed, but deletions are sometimes incomplete. Check the connected datastore directly to confirm that the old files are no longer present.

    If you are referring to Azure Storage, see "Change and delete detection using indexers for Azure Storage in Azure AI Search".
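If an Azure AI Search indexer over Blob Storage is what populates your index (an assumption, since the thread does not say how the Azure Machine Learning job writes to it), a blob that is simply hard-deleted is generally not removed from the index on the next run. The indexer needs a deletion detection policy on its data source. Below is a minimal sketch of such a data source definition, built as a plain Python dictionary that could be sent to the service as JSON. The helper name, the data source name, the container name, and the `IsDeleted` metadata marker are hypothetical placeholders.

```python
import json


def blob_data_source_with_soft_delete(name, container, connection_string,
                                      native_soft_delete=True,
                                      marker_name="IsDeleted",
                                      marker_value="true"):
    """Build a blob data source definition that includes a deletion detection policy.

    All argument values are placeholders chosen for illustration.
    """
    if native_soft_delete:
        # Relies on blob soft delete being enabled on the storage account.
        policy = {
            "@odata.type": "#Microsoft.Azure.Search.NativeBlobSoftDeleteDeletionDetectionPolicy"
        }
    else:
        # Relies on a custom metadata property set on blobs that should be
        # treated as deleted (the property name here is an assumption).
        policy = {
            "@odata.type": "#Microsoft.Azure.Search.SoftDeleteColumnDeletionDetectionPolicy",
            "softDeleteColumnName": marker_name,
            "softDeleteMarkerValue": marker_value,
        }
    return {
        "name": name,
        "type": "azureblob",
        "credentials": {"connectionString": connection_string},
        "container": {"name": container},
        "dataDeletionDetectionPolicy": policy,
    }


if __name__ == "__main__":
    definition = blob_data_source_with_soft_delete(
        "docs-datasource", "documents", "<storage-connection-string>"
    )
    print(json.dumps(definition, indent=2))
```

With either policy, the blob has to be soft-deleted (or marked) first and the indexer run afterwards, so that it can remove the matching search document before the file is purged.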

    Also, see "Update or rebuild an index in Azure AI Search", and let me know if that helps.
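If rebuilding the whole index is too heavy, another option is to delete only the stale documents. The sketch below assumes each search document stores the path of its source file and that the index key field is called `id`; both are assumptions about your schema, and the function names are hypothetical. It compares what is in the index with what is still in the datastore and builds the delete payload in the shape that the Azure AI Search "index documents" REST operation accepts.

```python
import json


def stale_keys(indexed, current_paths):
    """Return keys of indexed documents whose source file no longer exists.

    indexed: mapping of document key -> source file path stored in the index.
    current_paths: iterable of file paths currently present in the datastore.
    """
    current = set(current_paths)
    return sorted(key for key, path in indexed.items() if path not in current)


def delete_batch(keys, key_field="id"):
    """Build a delete batch; key_field must match the key field of your index."""
    return {"value": [{"@search.action": "delete", key_field: key} for key in keys]}


if __name__ == "__main__":
    # Illustrative data only: in practice these would come from querying the
    # index and listing the datastore.
    indexed = {"doc-1": "reports/2023.pdf", "doc-2": "reports/2024.pdf"}
    in_datastore = ["reports/2024.pdf"]

    payload = delete_batch(stale_keys(indexed, in_datastore))
    print(json.dumps(payload, indent=2))
```

The resulting payload would be posted to the index's `docs/index` endpoint, or the same documents could be passed to `SearchClient.delete_documents` in the `azure-search-documents` Python SDK. Running this step before or after the daily reindex job keeps the index aligned with the datastore without a full rebuild.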
