Share via


PreTokenizer Class

Definition

Base class for all pre-tokenizers classes. The PreTokenizer is in charge of doing the pre-segmentation step.

public abstract class PreTokenizer
type PreTokenizer = class
Public MustInherit Class PreTokenizer
Inheritance
PreTokenizer
Derived

Constructors

PreTokenizer()

Methods

PreTokenize(String)

Splits the given string in multiple substrings at the word boundary, keeping track of the offsets of said substrings from the original string.

Applies to