Delen via


StandardTokenizer Class

Definition

Caution

This type is obsolete. Please use StandardTokenizerV2 instead.

Breaks text following the Unicode Text Segmentation rules. This tokenizer is implemented using Apache Lucene. http://lucene.apache.org/core/4_10_3/analyzers-common/org/apache/lucene/analysis/standard/StandardTokenizer.html

[Newtonsoft.Json.JsonObject("#Microsoft.Azure.Search.StandardTokenizer")]
[System.Obsolete("This type is obsolete. Please use StandardTokenizerV2 instead.")]
public class StandardTokenizer : Microsoft.Azure.Search.Models.Tokenizer
[<Newtonsoft.Json.JsonObject("#Microsoft.Azure.Search.StandardTokenizer")>]
[<System.Obsolete("This type is obsolete. Please use StandardTokenizerV2 instead.")>]
type StandardTokenizer = class
    inherit Tokenizer
Public Class StandardTokenizer
Inherits Tokenizer
Inheritance
StandardTokenizer
Attributes
Newtonsoft.Json.JsonObjectAttribute ObsoleteAttribute

Constructors

StandardTokenizer()

Initializes a new instance of the StandardTokenizer class.

StandardTokenizer(String, Nullable<Int32>)

Initializes a new instance of the StandardTokenizer class.

Properties

MaxTokenLength

Gets or sets the maximum token length. Default is 255. Tokens longer than the maximum length are split

Name

Gets or sets the name of the tokenizer. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.

(Inherited from Tokenizer)

Methods

Validate()

Validate the object.

Applies to