WARNING: Version 1.7 of Elasticsearch has passed its EOL date.

This documentation is no longer being maintained and may be removed. If you are running this version, we strongly advise you to upgrade. For the latest information, see the current release documentation.

« Normalization Token Filter CJK Bigram Token Filter »

› › ›

CJK Width Token Filter

edit

IMPORTANT: This documentation is no longer updated. Refer to Elastic's version policy and the latest documentation.

CJK Width Token Filter

edit

The cjk_width token filter normalizes CJK width differences:

Folds fullwidth ASCII variants into the equivalent basic Latin
Folds halfwidth Katakana variants into the equivalent Kana

This token filter can be viewed as a subset of NFKC/NFKD Unicode normalization. See the ICU Analysis Plugin for full normalization support.

« Normalization Token Filter CJK Bigram Token Filter »