You are looking at preliminary documentation for a future release. Not what you want? See the current release documentation.
cjk_width token filter normalizes CJK width differences:
- Folds fullwidth ASCII variants into the equivalent basic Latin
- Folds halfwidth Katakana variants into the equivalent Kana
This token filter can be viewed as a subset of NFKC/NFKD
Unicode normalization. See the
for full normalization support.