Stemmer Token Filter
editStemmer Token Filter
editA filter that provides access to (almost) all of the available stemming token filters through a single unified interface. For example:
PUT /my_index
{
"settings": {
"analysis" : {
"analyzer" : {
"my_analyzer" : {
"tokenizer" : "standard",
"filter" : ["standard", "lowercase", "my_stemmer"]
}
},
"filter" : {
"my_stemmer" : {
"type" : "stemmer",
"name" : "light_german"
}
}
}
}
}
The language/name parameter controls the stemmer with the following
available values (the preferred filters are marked in bold):
|
Arabic |
|
|
Armenian |
|
|
Basque |
|
|
Bengali |
|
|
Brazilian Portuguese |
|
|
Bulgarian |
|
|
Catalan |
|
|
Czech |
|
|
Danish |
|
|
Dutch |
|
|
English |
|
|
Finnish |
|
|
French |
|
|
Galician |
|
|
German |
|
|
Greek |
|
|
Hindi |
|
|
Hungarian |
|
|
Indonesian |
|
|
Irish |
|
|
Italian |
|
|
Kurdish (Sorani) |
|
|
Latvian |
|
|
Lithuanian |
|
|
Norwegian (Bokmål) |
|
|
Norwegian (Nynorsk) |
|
|
Portuguese |
|
|
Romanian |
|
|
Russian |
|
|
Spanish |
|
|
Swedish |
|
|
Turkish |