Extending Language Resources for Indexing Service
Note
Indexing Service is no longer supported as of Windows XP and is unavailable for use as of Windows 8. Instead, use Windows Search for client side search and Microsoft Search Server Express for server side search.
Indexing Service uses language resources such as word breakers and stemmers to break text in its native locale during index creation and query processing. Microsoft provides word breakers and stemmers for a number of languages.
This section describes how to implement and use custom word breakers and stemmers for languages and locales not provided for by Microsoft. This section contains the following topics:
- About Language Resources describes how the Indexing Service language resources process text and properties in their native locale.
- Applying Language Resources describes how the index building and querying process for Indexing Service determines which language resources to use for a particular file.
- Constructing Language Resource Components describes how to create custom word breakers and stemmers.
- Linguistic and Unicode Considerations describes a variety of linguistic and Unicode character set considerations that may apply to some languages.
- Troubleshooting Language Resources describes some tips on troubleshooting word breaker and stemmer implementations.
- Language Resource Samples presents details about the sample word breaker and stemmer in the Platform Software Development Kit (SDK).