Linguistic and Unicode Considerations

This section lists linguistic and Unicode considerations that might affect the implementation of a word breaker and stemmer. The list is not exhaustive.

This section includes the following topics:

Additional Resources

Extending Language Resources

Understanding Language Resource Components

Implementing a Word Breaker and Stemmer

Troubleshooting Language Resources and Best Practices