ML Research

Int4: More Scalar Quantization in Lucene

LuceneML Research

•Apr 17, 2024

Int4: More Scalar Quantization in Lucene

Optimizing scalar quantization in Lucene and adding int4 support.

Benjamin Trent

Thomas Veasey

All Articles

Scalar Quantization Optimized for Vector Databases

ML Research

•Apr 3, 2024

Scalar Quantization Optimized for Vector Databases

Optimizing scalar quantization for the vector database use case allows us to achieve significantly better performance for the same retrieval quality at high compression ratios.

Thomas Veasey

Benjamin Trent

Evaluating RAG: A journey through metrics

ML Research

•Dec 1, 2023

Evaluating RAG: A journey through metrics

Learn how Elastic is evaluating RAG.

Quentin Herreros

Thomas Veasey

Thanos Papaoikonomou

Introducing Scalar Quantization in Lucene

LuceneML Research

•Nov 11, 2023

Introducing Scalar Quantization in Lucene

How did we introduce scalar quantization into Lucene

Benjamin Trent

LuceneML Research

•Oct 25, 2023

Scalar quantization 101

What is scalar quantization and how does it work?

Benjamin Trent

Improving information retrieval in the Elastic Stack: Improved inference performance with ELSER v2

ML Research

•Oct 17, 2023

Improving information retrieval in the Elastic Stack: Improved inference performance with ELSER v2

Learn about the improvements we've made to the inference performance of ELSER v2.

Thomas Veasey

Quentin Herreros

Valeriy Khakhutskyy

Improving information retrieval in the Elastic Stack: Optimizing retrieval with ELSER v2

ML Research

•Oct 17, 2023

Improving information retrieval in the Elastic Stack: Optimizing retrieval with ELSER v2

Learn about how we're reducing retrieval costs for ELSER v2.

Thomas Veasey

Quentin Herreros

Valeriy Khakhutskyy

Generative AI architectures with transformers explained from the ground up

ML ResearchGenerative AI

•Sep 29, 2023

Generative AI architectures with transformers explained from the ground up

This long-form article explains how generative AI works, from the ground all the way up to generative transformer architectures with a focus on intuitions.

Aris Papadopoulos

Vector search in Elasticsearch: The rationale behind the design

Vector SearchML Research

•Jul 24, 2023

Vector search in Elasticsearch: The rationale behind the design

There are different ways to implement a vector database, which have different trade-offs. In this blog, you'll learn more about how vector search has been integrated into Elastisearch and the trade-offs that we made.

Adrien Grand

Open-sourcing sysgrok — An AI assistant for analyzing, understanding, and optimizing systems

ML Research

•Jun 28, 2023

Open-sourcing sysgrok — An AI assistant for analyzing, understanding, and optimizing systems

Sysgrok is an experimental proof-of-concept, intended to demonstrate how LLMs can be used to help SWEs and SREs understand systems, debug issues, and optimize performance.

Sean Heelan