PatternTokenizer Class
- java.
lang. Object - com.
azure. search. documents. indexes. models. LexicalTokenizer - com.
azure. search. documents. indexes. models. PatternTokenizer
- com.
- com.
public final class PatternTokenizer
extends LexicalTokenizer
Tokenizer that uses regex pattern matching to construct distinct tokens. This tokenizer is implemented using Apache Lucene.
Constructor Summary
| Constructor | Description |
|---|---|
| PatternTokenizer(String name) |
Creates an instance of Pattern |
Method Summary
| Modifier and Type | Method and Description |
|---|---|
|
static
Pattern |
fromJson(JsonReader jsonReader)
Reads an instance of Pattern |
|
List<Regex |
getFlags()
Get the flags property: Regular expression flags. |
| Integer |
getGroup()
Get the group property: The zero-based ordinal of the matching group in the regular expression pattern to extract into tokens. |
| String |
getOdataType()
Get the odata |
| String |
getPattern()
Get the pattern property: A regular expression pattern to match token separators. |
|
Pattern |
setFlags(List<RegexFlags> flags)
Set the flags property: Regular expression flags. |
|
Pattern |
setFlags(RegexFlags[] flags)
Set the flags property: Regular expression flags. |
|
Pattern |
setGroup(Integer group)
Set the group property: The zero-based ordinal of the matching group in the regular expression pattern to extract into tokens. |
|
Pattern |
setPattern(String pattern)
Set the pattern property: A regular expression pattern to match token separators. |
|
Json |
toJson(JsonWriter jsonWriter) |
Methods inherited from LexicalTokenizer
Methods inherited from java.lang.Object
Constructor Details
PatternTokenizer
public PatternTokenizer(String name)
Creates an instance of PatternTokenizer class.
Parameters:
Method Details
fromJson
public static PatternTokenizer fromJson(JsonReader jsonReader)
Reads an instance of PatternTokenizer from the JsonReader.
Parameters:
Returns:
Throws:
getFlags
public List<RegexFlags> getFlags()
Get the flags property: Regular expression flags.
Returns:
getGroup
public Integer getGroup()
Get the group property: The zero-based ordinal of the matching group in the regular expression pattern to extract into tokens. Use -1 if you want to use the entire pattern to split the input into tokens, irrespective of matching groups. Default is -1.
Returns:
getOdataType
public String getOdataType()
Get the odataType property: A URI fragment specifying the type of tokenizer.
Overrides:
PatternTokenizer.getOdataType()Returns:
getPattern
public String getPattern()
Get the pattern property: A regular expression pattern to match token separators. Default is an expression that matches one or more non-word characters.
Returns:
setFlags
public PatternTokenizer setFlags(List<RegexFlags> flags)
Set the flags property: Regular expression flags.
Parameters:
Returns:
setFlags
public PatternTokenizer setFlags(RegexFlags[] flags)
Set the flags property: Regular expression flags.
Parameters:
Returns:
setGroup
public PatternTokenizer setGroup(Integer group)
Set the group property: The zero-based ordinal of the matching group in the regular expression pattern to extract into tokens. Use -1 if you want to use the entire pattern to split the input into tokens, irrespective of matching groups. Default is -1.
Parameters:
Returns:
setPattern
public PatternTokenizer setPattern(String pattern)
Set the pattern property: A regular expression pattern to match token separators. Default is an expression that matches one or more non-word characters.
Parameters:
Returns:
toJson
public JsonWriter toJson(JsonWriter jsonWriter)
Overrides:
PatternTokenizer.toJson(JsonWriter jsonWriter)Parameters:
Throws: