Share via


LexicalTokenizer Class

Definition

Base type for tokenizers. Please note LexicalTokenizer is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ClassicTokenizer, EdgeNGramTokenizer, KeywordTokenizer, KeywordTokenizer, MicrosoftLanguageStemmingTokenizer, MicrosoftLanguageTokenizer, NGramTokenizer, PathHierarchyTokenizer, PatternTokenizer, LuceneStandardTokenizer, LuceneStandardTokenizer and UaxUrlEmailTokenizer.

public class LexicalTokenizer
type LexicalTokenizer = class
Public Class LexicalTokenizer
Inheritance
LexicalTokenizer
Derived

Properties

Name

The name of the tokenizer. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.

Applies to