KnownAnalyzerNames enum

Fields

ArLucene

Arabic

ArMicrosoft

Arabic

BgLucene

Bulgarian

BgMicrosoft

Bulgarian

BnMicrosoft

Bangla

CaLucene

Catalan

CaMicrosoft

Catalan

CsLucene

Czech

CsMicrosoft

Czech

DaLucene

Danish

DaMicrosoft

Danish

DeLucene

German

DeMicrosoft

German

ElLucene

Greek

ElMicrosoft

Greek

EnLucene

English

EnMicrosoft

English

EsLucene

Spanish

EsMicrosoft

Spanish

EtMicrosoft

Estonian

EuLucene

Basque

FaLucene

Persian

FiLucene

Finnish

FiMicrosoft

Finnish

FrLucene

French

FrMicrosoft

French

GaLucene

Irish

GlLucene

Galician

GuMicrosoft

Gujarati

HeMicrosoft

Hebrew

HiLucene

Hindi

HiMicrosoft

Hindi

HrMicrosoft

Croatian

HuLucene

Hungarian

HuMicrosoft

Hungarian

HyLucene

Armenian

IdLucene

Indonesian (Bahasa)

IdMicrosoft

Indonesian (Bahasa)

IsMicrosoft

Icelandic

ItLucene

Italian

ItMicrosoft

Italian

JaLucene

Japanese

JaMicrosoft

Japanese

Keyword

Treats the entire content of a field as a single token. This is useful for data like ZIP codes, IDs, and some product names.
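
As a rough illustration of the single-token behavior described above, the sketch below returns the whole field value unchanged. `keywordTokens` is a hypothetical helper written for this example, not part of any SDK, and returning no tokens for an empty value is an assumption.

```typescript
// Hypothetical helper, not part of any SDK: approximates keyword analysis
// by returning the whole field value as one token.
function keywordTokens(fieldValue: string): string[] {
  // Assumption: an empty value yields no tokens.
  return fieldValue.length > 0 ? [fieldValue] : [];
}

// The product name stays intact, including its space and hyphen.
console.log(keywordTokens("AB-1234 Rev C"));
```

Because nothing is split, a query matches only when it equals the full stored value.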

KnMicrosoft

Kannada

KoLucene

Korean

KoMicrosoft

Korean

LtMicrosoft

Lithuanian

LvLucene

Latvian

LvMicrosoft

Latvian

MlMicrosoft

Malayalam

MrMicrosoft

Marathi

MsMicrosoft

Malay (Latin)

NbMicrosoft

Norwegian

NlLucene

Dutch

NlMicrosoft

Dutch

NoLucene

Norwegian

PaMicrosoft

Punjabi

Pattern

Flexibly separates text into terms via a regular expression pattern.
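
A minimal sketch of regex-based splitting, assuming the pattern describes the separators between terms. `patternTokens` is a hypothetical helper and the separator is supplied by the caller; the service's default pattern and any additional options are not stated here.

```typescript
// Hypothetical helper, not part of any SDK: splits text wherever the
// caller-supplied separator pattern matches and drops empty pieces.
function patternTokens(text: string, separator: RegExp): string[] {
  return text.split(separator).filter((term) => term.length > 0);
}

// Treat commas and semicolons as separators.
console.log(patternTokens("red,green;blue", /[,;]/));
```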

PlLucene

Polish

PlMicrosoft

Polish

PtBRLucene

Portuguese (Brazil)

PtBRMicrosoft

Portuguese (Brazil)

PtPTLucene

Portuguese (Portugal)

PtPTMicrosoft

Portuguese (Portugal)

RoLucene

Romanian

RoMicrosoft

Romanian

RuLucene

Russian

RuMicrosoft

Russian

Simple

Divides text at non-letters and converts the resulting tokens to lowercase.
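
The behavior can be approximated as follows. This is a sketch, not the service's implementation: `simpleTokens` is a hypothetical helper, and treating "letter" as the Unicode `\p{L}` class is an assumption.

```typescript
// Hypothetical helper, not part of any SDK: splits at runs of non-letter
// characters, then lowercases each remaining token.
function simpleTokens(text: string): string[] {
  // Assumption: "letter" means any character in the Unicode \p{L} class.
  const nonLetters = new RegExp("[^\\p{L}]+", "u");
  return text
    .split(nonLetters)
    .filter((token) => token.length > 0)
    .map((token) => token.toLowerCase());
}

// Digits and punctuation act as separators and are discarded.
console.log(simpleTokens("Hello, World-2024!"));
```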

SkMicrosoft

Slovak

SlMicrosoft

Slovenian

SrCyrillicMicrosoft

Serbian (Cyrillic)

SrLatinMicrosoft

Serbian (Latin)

StandardAsciiFoldingLucene

See https://lucene.apache.org/core/6_6_1/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html

StandardLucene

See https://lucene.apache.org/core/6_6_1/core/org/apache/lucene/analysis/standard/StandardAnalyzer.html

Stop

Divides text at non-letters, then applies the lowercase and stopword token filters.
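
A sketch of the three steps (split at non-letters, lowercase, remove stopwords). `stopTokens` and `ILLUSTRATIVE_STOPWORDS` are hypothetical names, and the short stopword list is invented for illustration; it is not the list the service uses.

```typescript
// Assumption: a tiny illustrative stopword list, not the service's list.
const ILLUSTRATIVE_STOPWORDS = new Set(["a", "an", "and", "of", "the", "to"]);

// Hypothetical helper, not part of any SDK.
function stopTokens(
  text: string,
  stopwords: Set<string> = ILLUSTRATIVE_STOPWORDS
): string[] {
  // Assumption: "letter" means any character in the Unicode \p{L} class.
  const nonLetters = new RegExp("[^\\p{L}]+", "u");
  return text
    .split(nonLetters)                          // divide at non-letters
    .filter((token) => token.length > 0)        // drop empty pieces
    .map((token) => token.toLowerCase())        // lowercase filter
    .filter((token) => !stopwords.has(token));  // stopword filter
}

console.log(stopTokens("The quick fox and the dog"));
```

Lowercasing runs before the stopword check so that "The" and "the" are both removed.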

SvLucene

Swedish

SvMicrosoft

Swedish

TaMicrosoft

Tamil

TeMicrosoft

Telugu

ThLucene

Thai

ThMicrosoft

Thai

TrLucene

Turkish

TrMicrosoft

Turkish

UkMicrosoft

Ukrainian

UrMicrosoft

Urdu

ViMicrosoft

Vietnamese

Whitespace

An analyzer that uses the whitespace tokenizer.
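
For illustration, whitespace tokenization can be approximated by splitting on runs of whitespace and leaving case and punctuation untouched. `whitespaceTokens` is a hypothetical helper, not the service's implementation.

```typescript
// Hypothetical helper, not part of any SDK: splits on runs of whitespace
// and keeps each piece exactly as written.
function whitespaceTokens(text: string): string[] {
  return text.split(/\s+/).filter((token) => token.length > 0);
}

// Case and punctuation are preserved inside each token.
console.log(whitespaceTokens("Hello,  World-2024!"));
```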

ZhHansLucene

Chinese Simplified

ZhHansMicrosoft

Chinese Simplified

ZhHantLucene

Chinese Traditional

ZhHantMicrosoft

Chinese Traditional