Character Filtersedit

Character filters are used to preprocess the string of characters before it is passed to the tokenizer. A character filter may be used to strip out HTML markup, , or to convert "&" characters to the word "and".

Elasticsearch has built in characters filters which can be used to build custom analyzers.