Stop token filter
A token filter of type stop that removes stop words from token
streams.
A stop token filter accepts the following settings:
A list of stop words to use. Defaults to ...
A path (either relative to ...
Set to ...
Set to ...
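As a rough sketch of how several settings might be combined in one filter definition, the request below pairs a custom word list with two further options. The names ignore_case and remove_trailing do not appear in the text above; they are assumed from the Elasticsearch reference for this filter and should be checked against the version in use. The index name my_index and filter name my_stop follow the other examples on this page.

# ignore_case and remove_trailing are assumed setting names, not taken from the text above
PUT /my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_stop": {
          "type": "stop",
          "stopwords": ["and", "is", "the"],
          "ignore_case": true,
          "remove_trailing": false
        }
      }
    }
  }
}

Under these assumptions, ignore_case would let capitalized forms such as "The" match the lowercase entries, and setting remove_trailing to false would keep a stop word that is the last token of the stream.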
The stopwords parameter accepts either an array of stopwords:
PUT /my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_stop": {
          "type": "stop",
          "stopwords": ["and", "is", "the"]
        }
      }
    }
  }
}
or a predefined language-specific list:
PUT /my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_stop": {
          "type": "stop",
          "stopwords": "_english_"
        }
      }
    }
  }
}
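To check what a configured filter does to a piece of text, the analyze API can be run against the index. The request below is a minimal sketch that assumes my_index was created with the my_stop filter as shown above; the standard tokenizer and the sample sentence are illustrative choices, not part of the original examples.

GET /my_index/_analyze
{
  "tokenizer": "standard",
  "filter": ["my_stop"],
  "text": "the quick fox is brown and fast"
}

With either of the stop word lists shown above, the tokens the, is and and should be dropped, leaving quick, fox, brown and fast.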
Elasticsearch provides predefined stop word lists for the following languages:
_arabic_, _armenian_, _basque_, _bengali_, _brazilian_, _bulgarian_,
_catalan_, _czech_, _danish_, _dutch_, _english_, _finnish_,
_french_, _galician_, _german_, _greek_, _hindi_, _hungarian_,
_indonesian_, _irish_, _italian_, _latvian_, _norwegian_, _persian_,
_portuguese_, _romanian_, _russian_, _sorani_, _spanish_,
_swedish_, _thai_, _turkish_.
To use an empty stop word list (that is, to disable stop words), specify _none_.
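As a minimal sketch, a filter definition that disables stop word removal might look like the following. The index name my_index and filter name my_stop are reused from the earlier examples for illustration only.

PUT /my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "my_stop": {
          "type": "stop",
          "stopwords": "_none_"
        }
      }
    }
  }
}

A filter configured this way should pass every token through unchanged. That can be useful when an analyzer definition requires a stop filter in its chain but no words are meant to be removed.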