IMPORTANT: No additional bug fixes or documentation updates
will be released for this version. For the latest information, see the
current release documentation.
nori analyzer
edit
A newer version is available. Check out the latest documentation.
nori analyzer
editThe nori analyzer consists of the following tokenizer and token filters:
-
nori_tokenizer -
nori_part_of_speechtoken filter -
nori_readingformtoken filter -
lowercasetoken filter
It supports the decompound_mode and user_dictionary settings from
nori_tokenizer and the stoptags setting from
nori_part_of_speech.