You are looking at documentation for an older release. Not what you want? See the current release documentation.
Elasticsearch ships with a wide range of built-in analyzers, which can be used in any index without further configuration:
- Standard Analyzer
standardanalyzer divides text into terms on word boundaries, as defined by the Unicode Text Segmentation algorithm. It removes most punctuation, lowercases terms, and supports removing stop words.
- Simple Analyzer
simpleanalyzer divides text into terms whenever it encounters a character which is not a letter. It lowercases all terms.
- Whitespace Analyzer
whitespaceanalyzer divides text into terms whenever it encounters any whitespace character. It does not lowercase terms.
- Stop Analyzer
stopanalyzer is like the
simpleanalyzer, but also supports removal of stop words.
- Keyword Analyzer
keywordanalyzer is a “noop” analyzer that accepts whatever text it is given and outputs the exact same text as a single term.
- Pattern Analyzer
patternanalyzer uses a regular expression to split the text into terms. It supports lower-casing and stop words.
- Language Analyzers
- Elasticsearch provides many language-specific analyzers like
- Fingerprint Analyzer
fingerprintanalyzer is a specialist analyzer which creates a fingerprint which can be used for duplicate detection.