Vector Database

How we doubled vector search throughput on Elasticsearch Serverless

How we brought Elasticsearch's native SIMD scoring engine to serverless, and why serverless is where vector search innovation happens next.

How we doubled vector search throughput on Elasticsearch Serverless
Cutting Elasticsearch DiskBBQ query quantization time by 5x

May 27, 2026

Cutting Elasticsearch DiskBBQ query quantization time by 5x

See how asymmetric quantization cuts DiskBBQ query quantization overhead from about 20% to 4% with little recall impact.

Up to 3x faster stored-vector queries in Elasticsearch

May 21, 2026

Up to 3x faster stored-vector queries in Elasticsearch

Elasticsearch 9.4 provides a simpler way to search with vectors stored in an Elasticsearch index, with up to 3x lower latency.

12x faster Elasticsearch vector indexing: deploying NVIDIA cuVS with GPU and CPU tiers

12x faster Elasticsearch vector indexing: deploying NVIDIA cuVS with GPU and CPU tiers

Two patterns for deploying NVIDIA cuVS GPU-accelerated HNSW indexing in Elasticsearch: combined build-and-serve nodes for small clusters and a dedicated GPU ingest tier with ILM handoff to CPU for production at scale.

Elasticsearch Vector DiskBBQ filter search is now 3–5x faster

May 13, 2026

Elasticsearch Vector DiskBBQ filter search is now 3–5x faster

Learn how Elasticsearch 9.4 makes restrictive filtered DiskBBQ vector search 3–5x faster and more stable by avoiding wasted centroid and postings-list work when selectivity is high.

Elasticsearch's BBQ vs. TurboQuant: 10–40× faster on CPU and lower ranking noise

May 6, 2026

Elasticsearch's BBQ vs. TurboQuant: 10–40× faster on CPU and lower ranking noise

A head-to-head look at Elasticsearch BBQ and TurboQuant, including throughput, ranking accuracy, and why uniform quantization wins for CPU vector search with up to 40× faster comparisons and smaller ranking noise.

How to measure and improve Elasticsearch search recall: from 0.43 to 0.75 with hybrid search

How to measure and improve Elasticsearch search recall: from 0.43 to 0.75 with hybrid search

Learn how to measure and improve search recall in Elasticsearch by combining BM25 lexical search with Jina AI vector embeddings, using the rank_eval API to validate the improvement with real numbers.

Preconditioning Vectors: Making Elasticsearch VectorDB Better Binary Quantization work for every vector

Preconditioning Vectors: Making Elasticsearch VectorDB Better Binary Quantization work for every vector

Modern quantization techniques can hurt recall when using older models or embeddings that aren’t normally distributed. Learn how preconditioning fixes these vectors through random orthogonal projection, making BBQ more effective and recovering recall.

How we built Elasticsearch simdvec to make vector search one of the fastest in the world

How we built Elasticsearch simdvec to make vector search one of the fastest in the world

How we built Elasticsearch simdvec, the hand-tuned SIMD kernel library behind every vector search query in Elasticsearch.

Ready to build state of the art search experiences?

Sufficiently advanced search isn’t achieved with the efforts of one. Elasticsearch is powered by data scientists, ML ops, engineers, and many more who are just as passionate about search as you are. Let’s connect and work together to build the magical search experience that will get you the results you want.

Try it yourself