Linguistic and Unicode Considerations
This section contains a list of linguistic and Unicode considerations that might affect word breaker and stemmer implementation. The list is not an exhaustive one.
This section includes the following topics:
Additional Resources
- For a list of lanuages supported by word breakers, see Languages Supported by Windows Search.
- If you need to identify the language of a piece of text, you can use Language Auto-Detection (LAD), which is available in Windows 7 and later. For more information, see Extended Linguistic Services (ELS).
- For applicable reference documentation, see Data Add-in Interfaces.
Related topics
Feedback
https://aka.ms/ContentUserFeedback.
Coming soon: Throughout 2024 we will be phasing out GitHub Issues as the feedback mechanism for content and replacing it with a new feedback system. For more information see:Submit and view feedback for