Simple Analyzer

The simple analyzer breaks text into terms whenever it encounters a character which is not a letter. All terms are lower cased.

Example output

POST _analyze
{
  "analyzer": "simple",
  "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."
}

The above sentence would produce the following terms:

[ the, quick, brown, foxes, jumped, over, the, lazy, dog, s, bone ]

Configuration

The simple analyzer is not configurable.

Definition

The simple analzyer consists of:

If you need to customize the simple analyzer then you need to recreate it as a custom analyzer and modify it, usually by adding token filters. This would recreate the built-in simple analyzer and you can use it as a starting point for further customization:

PUT /simple_example
{
  "settings": {
    "analysis": {
      "analyzer": {
        "rebuilt_simple": {
          "tokenizer": "lowercase",
          "filter": [         
          ]
        }
      }
    }
  }
}

You’d add any token filters here.