Notifying the Index of Changes (Windows Search)
By using the notification APIs components can notify the indexer that an item has been changed, moved or deleted, and can add search scopes to the Windows Search indexer's queue of URLs that require indexing.
Components can notify the Windows Search indexer that data in their store has changed. Typically, you can rely on the indexer's scheduled crawls. However, providing notifications to the indexer can improve performance by ensuring the indexer doesn't crawl the entire store on incremental indexes. For example, this might recommended if you expect your data store to be exceptionally large and/or exceptionally busy, such as an email data store for example.
- Implementing Indexer-managed Notifications
- Implementing Provider-managed Notifications
- Additional Resources
- Related topics
Implementing Indexer-managed Notifications
Indexer-managed notifications enable you to control access to your data store while freeing you of maintaining the notifications queue throughout the entire indexing process. Your notifications provider must monitor changes to the data store and create a notifications queue. Periodically, your provider sends a batch of change notifications to the indexer. When the indexer receives your notifications, it returns an acknowledgement and you can remove the item(s) from your queue. If after a period of time you do not receive an acknowledgement, you can re-send the notifications. In case of failure, the indexer rebuilds its internal queue of items to crawl or performs an incremental crawl of the store.
To implement indexer-managed notifications, you need to implement the following:
- A mechanism for monitoring changes in your data store.
- A data structure to queue up information (multiple SEARCH_ITEM_PERSISTENT_CHANGE structures) about those changes.
- ISearchPersistentItemsChangedSink interface to send your notifications to the indexer and to get notification acknowledgements from the indexer.
Notifications Queue
You need to monitor and queue up every change in your data store to send to the indexer as a notification. How many notifications you queue up and how frequently you send them to the indexer depends on your circumstance. Perhaps you send a batch of notifications for every n number of changes or after some t time interval, or a combination of the two.
The indexer expects the notifications to come in an array of SEARCH_ITEM_PERSISTENT_CHANGE structures, so you may choose to implement your queue similarly.
ISearchPersistentItemsChangedSink
To access this interface, you first instantiate an ISearchManager object to gain access to an ISearchCatalogManager object. From that ISearchCatalogManager object, you instantiate an ISearchPersistentItemsChangedSink object and notify the indexer of the data changes with a call to the OnItemsChanged method.
In the call to this method, you include the number of changes being reported and an array of SEARCH_ITEM_PERSISTENT_CHANGE structures. You get back an array of HR completion codes indicating whether each URL was accepted for indexing. This is your acknowledgement from the indexer.
Implementing Provider-managed Notifications
Provider-managed notifications enable you to control access to your data store and to monitor the progress of the indexer as it updates the Windows Search catalog. Your provider must monitor changes to the data store and create a notifications queue. Periodically, your provider sends a batch of change notifications to the indexer. When the indexer receives your notifications, it returns an acknowledgement. If after a period of time you do not receive an acknowledgement, you can re-send the notifications. As the indexer crawls the data store and updates the Windows Search catalog, it notifies your provider of each catalog update and you can remove the item(s) from your queue. Your provider maintains its notification queue throughout this process so that in case of failure, you can resend notifications to the indexer.
To implement provider-managed notifications, you need to implement the following:
- A mechanism for monitoring changes in your data store.
- A data structure to queue up information (multiple SEARCH_ITEM_CHANGE structures) about those changes.
- ISearchItemsChangedSink interface to send your notifications to the indexer and to get notification acknowledgements from the indexer.
- ISearchNotifyInlineSite interface to receive updates about the status of indexing.
Notifications Queue
You need to monitor and queue up every change in your data store to send to the indexer as a notification. How many notifications you queue up and how frequently you send them to the indexer depends on your circumstance. Perhaps you send a batch of notifications for every n number of changes or after some t time interval, or a combination of the two.
The indexer expects the notifications to come in an array of SEARCH_ITEM_CHANGE structures, so you may choose to store change information similarly. However, you also need to be able to match the notifications you send with the acknowledgements and updates returned by the indexer. You may also want to be able to detect how long it takes to get acknowledgements, so you can decide if/when to resend notifications.
ISearchItemsChangedSink
To access this interface, you first instantiate an ISearchManager object to gain access to an ISearchCatalogManager object. From that ISearchCatalogManager object, you instantiate an ISearchItemsChangedSink object and notify the indexer of the data changes with a call to the OnItemsChanged method.
In the call to this method, you include the number of changes being reported and an array of SEARCH_ITEM_CHANGE structures. You get back an array of indexer-assigned DocIds that represent each change as well as an array of HR completion codes indicating whether each URL was accepted for indexing. This is your acknowledgement from the indexer that it has received your notifications and is preparing to index the items.
From that point forward the indexer sends updates using the ISearchNotifyInlineSite interface.
ISearchNotifyInlineSite
In order to get updates about the status of both your items and the catalog, you must register your ISearchNotifyInlineSite interface with the indexer so it can send you callbacks. Each update sent using ISearchNotifyInlineSite::OnItemIndexedStatusChange identifies the items by DocId, the status of each item (SEARCH_ITEM_INDEXING_STATUS) and the indexing phase (SEARCH_INDEXING_PHASE) the items are in.
Not only do you receive updates on the status of each item, as described earlier, you also get important information about the status of the catalog itself. The Windows Search service can be interrupted or restarted by the end user, a third party application, or some other failure. When this happens, you need a way to determine what notifications to repush to the indexer.
The ISearchNotifyInlineSite::OnCatalogStatusChange method, called by the Windows Search service, informs clients about the status of the catalog using the parameters described in the following table.
Parameter | Description |
---|---|
guidCatalogResetSignature | A GUID representing the catalog reset. If this GUID changes, all notifications must be resent. |
guidCheckPointSignature | A GUID representing the last checkpoint restored. If this GUID changes, all notifications accumulated since the last saved checkpoint must be resent. |
dwLastCheckPointNumber | A number indicating the last checkpoint saved. |
When a catalog checkpoint occurs, the search service updates the dwLastCheckPointNumber, and all notifications sent prior to that checkpoint are safe and recoverable in the event of a service failure. Notifications providers need to track only those notifications sent between checkpoints and repush in case of a catalog restore or reset.
If a catalog restore occurs, the search service rolls back the catalog to the last saved checkpoint and updates the guidCheckPointSignature. In this situation, notifications providers must repush all notifications accumulated since the most recent saved checkpoint as identified by the dwLastCheckPointNumber.
If a catalog reset should occur, the search service resets the entire catalog and updates the guidCatalogResetSignature. Notifications provider must repush its entire crawl scope again.
Additional Resources
- For an overview of the indexing process, see The Indexing Process.
- For overviews of the Catalog Manager and Catalog Search Manager (CSM), see Using the Catalog Manager and Using the Crawl Scope Manager.
- For information about creating a Shell data store, see Implementing the Basic Folder Object Interfaces.
Related topics
-
Conceptual