EdgeNGramTokenizer Class
- java.
lang. Object - com.
azure. search. documents. indexes. models. LexicalTokenizer - com.
azure. search. documents. indexes. models. EdgeNGramTokenizer
- com.
- com.
public final class EdgeNGramTokenizer
extends LexicalTokenizer
Tokenizes the input from an edge into n-grams of the given size(s). This tokenizer is implemented using Apache Lucene.
Constructor Summary
| Constructor | Description |
|---|---|
| EdgeNGramTokenizer(String name) |
Creates an instance of Edge |
Method Summary
| Modifier and Type | Method and Description |
|---|---|
|
static
Edge |
fromJson(JsonReader jsonReader)
Reads an instance of Edge |
| Integer |
getMaxGram()
Get the max |
| Integer |
getMinGram()
Get the min |
| String |
getOdataType()
Get the odata |
|
List<Token |
getTokenChars()
Get the token |
|
Edge |
setMaxGram(Integer maxGram)
Set the max |
|
Edge |
setMinGram(Integer minGram)
Set the min |
|
Edge |
setTokenChars(List<TokenCharacterKind> tokenChars)
Set the token |
|
Edge |
setTokenChars(TokenCharacterKind[] tokenChars)
Set the token |
|
Json |
toJson(JsonWriter jsonWriter) |
Methods inherited from LexicalTokenizer
Methods inherited from java.lang.Object
Constructor Details
EdgeNGramTokenizer
public EdgeNGramTokenizer(String name)
Creates an instance of EdgeNGramTokenizer class.
Parameters:
Method Details
fromJson
public static EdgeNGramTokenizer fromJson(JsonReader jsonReader)
Reads an instance of EdgeNGramTokenizer from the JsonReader.
Parameters:
Returns:
Throws:
getMaxGram
public Integer getMaxGram()
Get the maxGram property: The maximum n-gram length. Default is 2. Maximum is 300.
Returns:
getMinGram
public Integer getMinGram()
Get the minGram property: The minimum n-gram length. Default is 1. Maximum is 300. Must be less than the value of maxGram.
Returns:
getOdataType
public String getOdataType()
Get the odataType property: A URI fragment specifying the type of tokenizer.
Overrides:
EdgeNGramTokenizer.getOdataType()Returns:
getTokenChars
public List<TokenCharacterKind> getTokenChars()
Get the tokenChars property: Character classes to keep in the tokens.
Returns:
setMaxGram
public EdgeNGramTokenizer setMaxGram(Integer maxGram)
Set the maxGram property: The maximum n-gram length. Default is 2. Maximum is 300.
Parameters:
Returns:
setMinGram
public EdgeNGramTokenizer setMinGram(Integer minGram)
Set the minGram property: The minimum n-gram length. Default is 1. Maximum is 300. Must be less than the value of maxGram.
Parameters:
Returns:
setTokenChars
public EdgeNGramTokenizer setTokenChars(List<TokenCharacterKind> tokenChars)
Set the tokenChars property: Character classes to keep in the tokens.
Parameters:
Returns:
setTokenChars
public EdgeNGramTokenizer setTokenChars(TokenCharacterKind[] tokenChars)
Set the tokenChars property: Character classes to keep in the tokens.
Parameters:
Returns:
toJson
public JsonWriter toJson(JsonWriter jsonWriter)
Overrides:
EdgeNGramTokenizer.toJson(JsonWriter jsonWriter)Parameters:
Throws: