Improved Text Scoring with BM25
Today the default scoring algorithm in Elasticsearch is TF/IDF. This default will change to BM25 once Elasticsearch switches to Lucene 6. In this talk, Britta will tell you all about BM25 – what it is, how it differs from TF/IDF and other scoring techniques, and why it might be the better default going forward.
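As a rough illustration of the difference the talk refers to, the sketch below compares a per-term BM25 score with a simplified TF/IDF score. It is not taken from the talk. The function names are hypothetical, the parameter values k1 = 1.2 and b = 0.75 are the commonly used BM25 defaults, and the TF/IDF variant (square-root term frequency times a logarithmic inverse document frequency) leaves out the length normalization and query-level factors that a full implementation applies.

```python
import math


def bm25_term_score(freq, doc_len, avg_doc_len, n_docs, doc_freq, k1=1.2, b=0.75):
    """Score one query term against one document with BM25 (illustrative sketch)."""
    # Rare terms get a higher inverse document frequency.
    idf = math.log(1 + (n_docs - doc_freq + 0.5) / (doc_freq + 0.5))
    # Documents longer than average are penalized; b controls how strongly.
    length_norm = k1 * (1 - b + b * doc_len / avg_doc_len)
    # The term-frequency part saturates: it can never exceed k1 + 1.
    return idf * freq * (k1 + 1) / (freq + length_norm)


def simple_tfidf_term_score(freq, n_docs, doc_freq):
    """Simplified TF/IDF: sqrt(tf) * idf, with no length normalization (assumption)."""
    tf = math.sqrt(freq)
    idf = 1 + math.log(n_docs / (doc_freq + 1))
    # Unlike BM25, this keeps growing as the term frequency grows.
    return tf * idf


if __name__ == "__main__":
    # Hypothetical corpus: 1000 documents, the term appears in 10 of them.
    for freq in (1, 5, 25, 125):
        bm25 = bm25_term_score(freq, doc_len=100, avg_doc_len=100, n_docs=1000, doc_freq=10)
        tfidf = simple_tfidf_term_score(freq, n_docs=1000, doc_freq=10)
        print(f"freq={freq:>3}  bm25={bm25:.3f}  tfidf={tfidf:.3f}")
```

Running the loop shows the main behavioral difference under these assumptions: the BM25 score levels off as a term repeats, while the simplified TF/IDF score keeps increasing.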
Britta Weber was lured away from a career in academic research on image processing by all that’s awesome in Elasticsearch. She joined the company in May 2013 as a software engineer in its Berlin office. When not writing code and thinking about machine learning, she enjoys singing Renaissance madrigals.