ClassicTokenizer Class

Definition

Grammar-based tokenizer that is suitable for processing most European-language documents. This tokenizer is implemented using Apache Lucene.

public class ClassicTokenizer : Azure.Search.Documents.Indexes.Models.LexicalTokenizer
type ClassicTokenizer = class
    inherit LexicalTokenizer
Public Class ClassicTokenizer
Inherits LexicalTokenizer
Inheritance
ClassicTokenizer

Constructors

ClassicTokenizer(String)

Initializes a new instance of ClassicTokenizer.

Properties

MaxTokenLength

The maximum token length. Default is 255. Tokens longer than the maximum length are split. The maximum token length that can be used is 300 characters.

Name

The name of the tokenizer. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.

(Inherited from LexicalTokenizer)

Applies to