Improved Text Scoring with BM25
Today the default scoring algorithm in Elasticsearch is TF/IDF. This default will change to BM25 once Elasticsearch switches to Lucene 6. In this talk, Britta will tell you all about BM25 – what it is, how it differs from TF/IDF and other scoring techniques, and why it might be the better default going forward.
Britta Weber was lured away from a career in academic research on image processing by all that’s awesome in Elasticsearch. She joined the company’s Berlin office in May 2013 as a software engineer. When not writing code and thinking about machine learning, she enjoys singing Renaissance madrigals.