PreTokenizer Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Base class for all pre-tokenizer classes. The PreTokenizer is responsible for the pre-segmentation step, which splits the input text into substrings before tokenization.
C#: public abstract class PreTokenizer
F#: type PreTokenizer = class
VB: Public MustInherit Class PreTokenizer
Inheritance
Object → PreTokenizer
Constructors
PreTokenizer()
Methods
PreTokenize(String)
Splits the given string into multiple substrings at word boundaries, keeping track of each substring's offset in the original string.
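The following is a minimal sketch of what a derived pre-tokenizer might look like. It does not reference the library: this page does not show the return type of PreTokenize(String), so the sketch declares its own hypothetical base class, SketchPreTokenizer, and assumes each substring is reported as an (Offset, Length) pair. The class names and the whitespace-splitting rule are illustrative assumptions, not the library's implementation.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for the abstract base class described on this page.
// ASSUMPTION: the real return type is not shown here, so a list of
// (Offset, Length) tuples is used for illustration only.
public abstract class SketchPreTokenizer
{
    public abstract IReadOnlyList<(int Offset, int Length)> PreTokenize(string text);
}

// Illustrative derived class: splits on whitespace and records where each
// substring sits in the original string.
public sealed class WhitespaceSketchPreTokenizer : SketchPreTokenizer
{
    public override IReadOnlyList<(int Offset, int Length)> PreTokenize(string text)
    {
        var splits = new List<(int Offset, int Length)>();
        if (string.IsNullOrEmpty(text))
        {
            return splits;
        }

        int start = -1; // -1 means "not currently inside a word"
        for (int i = 0; i < text.Length; i++)
        {
            if (char.IsWhiteSpace(text[i]))
            {
                if (start >= 0)
                {
                    // A word just ended: record its offset and length.
                    splits.Add((start, i - start));
                    start = -1;
                }
            }
            else if (start < 0)
            {
                // First character of a new word.
                start = i;
            }
        }

        // Flush the final word if the text does not end in whitespace.
        if (start >= 0)
        {
            splits.Add((start, text.Length - start));
        }

        return splits;
    }
}
```

Under these assumptions, calling PreTokenize("Hello world") on a WhitespaceSketchPreTokenizer yields the pairs (0, 5) and (6, 5), so each substring can be recovered with text.Substring(offset, length). Returning offsets rather than copies of the substrings keeps every piece traceable to its position in the original string, which is the behavior the method description calls out.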