Apostrophe token filteredit
Strips all characters after an apostrophe, including the apostrophe itself.
This filter is included in Elasticsearch’s built-in Turkish language analyzer. It uses Lucene’s ApostropheFilter, which was built for the Turkish language.
Exampleedit
The following analyze API request demonstrates how the apostrophe token filter works.
response = client.indices.analyze( body: { tokenizer: 'standard', filter: [ 'apostrophe' ], text: "Istanbul'a veya Istanbul'dan" } ) puts response
GET /_analyze { "tokenizer" : "standard", "filter" : ["apostrophe"], "text" : "Istanbul'a veya Istanbul'dan" }
The filter produces the following tokens:
[ Istanbul, veya, Istanbul ]
Add to an analyzeredit
The following create index API request uses the apostrophe token filter to configure a new custom analyzer.
response = client.indices.create( index: 'apostrophe_example', body: { settings: { analysis: { analyzer: { standard_apostrophe: { tokenizer: 'standard', filter: [ 'apostrophe' ] } } } } } ) puts response
PUT /apostrophe_example { "settings": { "analysis": { "analyzer": { "standard_apostrophe": { "tokenizer": "standard", "filter": [ "apostrophe" ] } } } } }