DictionaryDecompounderTokenFilter Class
- java.lang.Object
  - com.azure.search.documents.indexes.models.TokenFilter
    - com.azure.search.documents.indexes.models.DictionaryDecompounderTokenFilter
public final class DictionaryDecompounderTokenFilter
extends TokenFilter
Decomposes compound words found in many Germanic languages. This token filter is implemented using Apache Lucene.
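To make the behavior concrete, here is a minimal plain-Java sketch of dictionary-based decompounding. It is not the Lucene or Azure implementation. The class `DecompounderSketch` and its `decompose` method are hypothetical names, the size bounds are treated as inclusive, and the word list is assumed to hold lowercase entries. The parameters mirror the properties documented on this page.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; not the Lucene or Azure implementation.
final class DecompounderSketch {

    // Returns the original token followed by every dictionary subword found in it.
    static List<String> decompose(String token, Set<String> wordList,
                                  int minWordSize, int minSubwordSize,
                                  int maxSubwordSize, boolean onlyLongestMatched) {
        List<String> output = new ArrayList<>();
        output.add(token); // the original token is always kept

        // Assumption: bounds are inclusive; tokens shorter than minWordSize are skipped.
        if (token.length() < minWordSize) {
            return output;
        }

        // Assumption: wordList entries are lowercase, so match case-insensitively.
        String lower = token.toLowerCase(Locale.ROOT);
        for (int start = 0; start <= lower.length() - minSubwordSize; start++) {
            String longest = null;
            for (int len = minSubwordSize;
                 len <= maxSubwordSize && start + len <= lower.length();
                 len++) {
                String candidate = lower.substring(start, start + len);
                if (!wordList.contains(candidate)) {
                    continue;
                }
                if (onlyLongestMatched) {
                    longest = candidate; // lengths increase, so the last match is the longest
                } else {
                    output.add(candidate);
                }
            }
            if (longest != null) {
                output.add(longest);
            }
        }
        return output;
    }
}
```

Under these assumptions, calling `decompose("Donaudampfschiff", Set.of("donau", "dampf", "schiff"), 5, 2, 15, false)` yields the original token followed by `donau`, `dampf`, and `schiff`.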
Constructor Summary
| Constructor | Description |
|---|---|
| DictionaryDecompounderTokenFilter(String name, List<String> wordList) | Creates an instance of DictionaryDecompounderTokenFilter class. |
Method Summary
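| Modifier and Type | Method | Description |
|---|---|---|
| static DictionaryDecompounderTokenFilter | fromJson(JsonReader jsonReader) | Reads an instance of DictionaryDecompounderTokenFilter from the JsonReader. |
| Integer | getMaxSubwordSize() | Get the maxSubwordSize property: The maximum subword size. |
| Integer | getMinSubwordSize() | Get the minSubwordSize property: The minimum subword size. |
| Integer | getMinWordSize() | Get the minWordSize property: The minimum word size. |
| String | getOdataType() | Get the odataType property: A URI fragment specifying the type of token filter. |
| List<String> | getWordList() | Get the wordList property: The list of words to match against. |
| Boolean | isOnlyLongestMatched() | Get the onlyLongestMatched property: A value indicating whether to add only the longest matching subword to the output. |
| DictionaryDecompounderTokenFilter | setMaxSubwordSize(Integer maxSubwordSize) | Set the maxSubwordSize property: The maximum subword size. |
| DictionaryDecompounderTokenFilter | setMinSubwordSize(Integer minSubwordSize) | Set the minSubwordSize property: The minimum subword size. |
| DictionaryDecompounderTokenFilter | setMinWordSize(Integer minWordSize) | Set the minWordSize property: The minimum word size. |
| DictionaryDecompounderTokenFilter | setOnlyLongestMatched(Boolean onlyLongestMatched) | Set the onlyLongestMatched property: A value indicating whether to add only the longest matching subword to the output. |
| JsonWriter | toJson(JsonWriter jsonWriter) | |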
Methods inherited from TokenFilter
Methods inherited from java.lang.Object
Constructor Details
DictionaryDecompounderTokenFilter
public DictionaryDecompounderTokenFilter(String name, List<String> wordList)
Creates an instance of the DictionaryDecompounderTokenFilter class.
Parameters:
name - the name of the token filter.
wordList - the list of words to match against.
Method Details
fromJson
public static DictionaryDecompounderTokenFilter fromJson(JsonReader jsonReader)
Reads an instance of DictionaryDecompounderTokenFilter from the JsonReader.
Parameters:
Returns:
Throws:
getMaxSubwordSize
public Integer getMaxSubwordSize()
Get the maxSubwordSize property: The maximum subword size. Only subwords shorter than this are output. Default is 15. Maximum is 300.
Returns:
the maxSubwordSize value.
getMinSubwordSize
public Integer getMinSubwordSize()
Get the minSubwordSize property: The minimum subword size. Only subwords longer than this are output. Default is 2. Maximum is 300.
Returns:
the minSubwordSize value.
getMinWordSize
public Integer getMinWordSize()
Get the minWordSize property: The minimum word size. Only words longer than this are processed. Default is 5. Maximum is 300.
Returns:
the minWordSize value.
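The three size getters return boxed `Integer` values. The sketch below assumes a getter returns `null` when the property was never set, in which case the documented default applies. `DecompounderDefaults` and `orDefault` are hypothetical names and not part of the SDK; only the default values come from this page.

```java
// Hypothetical helper; not part of the Azure SDK.
final class DecompounderDefaults {
    // Defaults as documented for the three size properties.
    static final int DEFAULT_MIN_WORD_SIZE = 5;
    static final int DEFAULT_MIN_SUBWORD_SIZE = 2;
    static final int DEFAULT_MAX_SUBWORD_SIZE = 15;

    // Assumption: a null getter result means the property was left unset.
    static int orDefault(Integer configured, int documentedDefault) {
        return configured != null ? configured : documentedDefault;
    }
}
```

For example, `orDefault(filter.getMaxSubwordSize(), DecompounderDefaults.DEFAULT_MAX_SUBWORD_SIZE)` would resolve to 15 for a filter whose maxSubwordSize was never set, given the assumption above.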
getOdataType
public String getOdataType()
Get the odataType property: A URI fragment specifying the type of token filter.
Overrides:
TokenFilter.getOdataType()
Returns:
the odataType value.
getWordList
public List<String> getWordList()
Get the wordList property: The list of words to match against.
Returns:
the wordList value.
isOnlyLongestMatched
public Boolean isOnlyLongestMatched()
Get the onlyLongestMatched property: A value indicating whether to add only the longest matching subword to the output. Default is false.
Returns:
the onlyLongestMatched value.
setMaxSubwordSize
public DictionaryDecompounderTokenFilter setMaxSubwordSize(Integer maxSubwordSize)
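Set the maxSubwordSize property: The maximum subword size. Only subwords shorter than this are output. Default is 15. Maximum is 300.
Parameters:
maxSubwordSize - the maxSubwordSize value to set.
Returns:
the DictionaryDecompounderTokenFilter object itself.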
setMinSubwordSize
public DictionaryDecompounderTokenFilter setMinSubwordSize(Integer minSubwordSize)
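Set the minSubwordSize property: The minimum subword size. Only subwords longer than this are output. Default is 2. Maximum is 300.
Parameters:
minSubwordSize - the minSubwordSize value to set.
Returns:
the DictionaryDecompounderTokenFilter object itself.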
setMinWordSize
public DictionaryDecompounderTokenFilter setMinWordSize(Integer minWordSize)
Set the minWordSize property: The minimum word size. Only words longer than this are processed. Default is 5. Maximum is 300.
Parameters:
minWordSize - the minWordSize value to set.
Returns:
the DictionaryDecompounderTokenFilter object itself.
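All three size setters document a maximum of 300. This page does not say whether the SDK or the service rejects larger values, so the sketch below shows an optional client-side check that could run before calling a setter. `SizeLimitCheck` and `requireWithinLimit` are hypothetical names and not part of the SDK.

```java
// Hypothetical pre-check; not part of the Azure SDK.
final class SizeLimitCheck {
    // Documented maximum for minWordSize, minSubwordSize, and maxSubwordSize.
    static final int MAX_ALLOWED = 300;

    // Returns the value unchanged, or throws if it exceeds the documented maximum.
    // A null value is passed through, on the assumption that null leaves the property unset.
    static Integer requireWithinLimit(String propertyName, Integer value) {
        if (value != null && value > MAX_ALLOWED) {
            throw new IllegalArgumentException(
                propertyName + " must not exceed " + MAX_ALLOWED + " but was " + value);
        }
        return value;
    }
}
```

A call such as `filter.setMinWordSize(SizeLimitCheck.requireWithinLimit("minWordSize", 8))` would then fail fast on an out-of-range value instead of relying on later validation.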
setOnlyLongestMatched
public DictionaryDecompounderTokenFilter setOnlyLongestMatched(Boolean onlyLongestMatched)
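Set the onlyLongestMatched property: A value indicating whether to add only the longest matching subword to the output. Default is false.
Parameters:
onlyLongestMatched - the onlyLongestMatched value to set.
Returns:
the DictionaryDecompounderTokenFilter object itself.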
toJson
public JsonWriter toJson(JsonWriter jsonWriter)
Overrides:
TokenFilter.toJson(JsonWriter jsonWriter)
Parameters:
Throws: