Build AI search into your applications

The Elasticsearch Relevance Engine™ (ESRE) is designed to power artificial intelligence-based search applications. Use ESRE to apply semantic search with superior relevance out of the box (without domain adaptation), integrate with external large language models (LLMs), implement hybrid search, and use third-party or your own transformer models.

Start free trial

See how easy it is to get started with setting up the Elasticsearch Relevance Engine.

See quick start video

Build advanced RAG-based applications using ESRE.

Use private, internal data as context with the capabilities of generative AI models to provide up-to-date, reliable responses to user inquiries.

Watch video

AI for all developers

Elevate search with AI

Deliver advanced AI relevance capabilities to your application with ESRE no matter your expertise level. ESRE has a suite of features to help you get started or build upon your experience with AI. You have the flexibility and control to deploy machine learning and Generative AI search apps however you see fit.

Try self-paced hands-on learning

New to semantic search?
Delivering semantic search to your application shouldn’t require deep expertise. Get best-in-class semantic search right out of the box with the Elastic Learned Sparse Encoder model. With a simplified deployment, you can start delivering semantic search quickly, without the heavy lifting of training and maintaining a machine learning model.
Familiar with embeddings & search vectors?
Convert unstructured data into vector embeddings and search them efficiently using approximate nearest neighbor search. Combine your own domain-specific data in context windows to improve the relevance of LLMs' human-like output.
Bring your own transformer models
Use the Eland Python library to bring your own trained machine learning model, or use a third-party model from a public repository like HuggingFace. Pick from a wide variety of supported architectures that fit your needs.

Elasticsearch Relevance Engine

Elasticsearch - All-in-one vector search powerhouse

Generate embeddings. Store, search and manage vectors. Get semantic search with Elastic’s own Learned Sparse Encoder machine learning model. Ingest all data types. Integrate with rapidly evolving large language models.

RRF hybrid ranking
RRF (Reciprocal Rank Fusion) is a method for combining document rankings from multiple retrieval systems. In the near future, RRF will support blending results from sparse vector models such as BM25 and Elastic's retrieval model, yielding a best-in-class zero-shot ranking method. Hybrid ranking with RRF lets you tune search results from multiple retrievers with less effort.
Learn to use RRF for hybrid search
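
To make the idea concrete, here is a minimal sketch of reciprocal rank fusion in plain Python. It is not Elastic's implementation: the function name, the document IDs, and the two hit lists are hypothetical, and the constant k=60 is a commonly used value that we assume here. Each document scores 1 / (k + rank) in every list where it appears, and the scores are summed.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists of document IDs (rank 1 is the best position)."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # A document earns more the closer it sits to the top of a list.
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword retriever and a vector retriever.
bm25_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_c", "doc_a", "doc_d"]

fused = reciprocal_rank_fusion([bm25_hits, vector_hits])
print(fused)
```

Because only rank positions are used, the retrievers' raw scores never need to be normalized against each other, which is why this kind of fusion takes little tuning.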
Elastic Learned Sparse Encoder
Our new model delivers highly relevant semantic search out of the box, without domain adaptation. It's available with one click while configuring your search application. Elastic Learned Sparse Encoder expands queries with related keywords and relevance scores, so they’re easily interpretable and ready for use right away.
Read the blog
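
As a rough illustration of why weighted keyword expansion is easy to interpret, the sketch below scores documents against an expanded query with a simple dot product over shared terms. Everything here is an assumption for illustration: the terms, the weights, and the scoring function are made up and are not output from, or the internals of, Elastic Learned Sparse Encoder.

```python
def sparse_score(query_weights, doc_weights):
    """Dot product over the terms that the query and document share."""
    return sum(
        weight * doc_weights.get(term, 0.0)
        for term, weight in query_weights.items()
    )


# Hypothetical expansion of the query "laptop" into weighted related terms.
expanded_query = {"laptop": 1.5, "notebook": 1.0, "computer": 0.5}

# Hypothetical weighted terms stored for two documents.
docs = {
    "doc_1": {"notebook": 2.0, "computer": 1.0},
    "doc_2": {"laptop": 1.0, "bag": 2.0},
}

ranked = sorted(
    docs,
    key=lambda doc_id: sparse_score(expanded_query, docs[doc_id]),
    reverse=True,
)
print(ranked)
```

Every contribution to a score can be traced back to a readable term and its weight, so you can see why a document matched even when it never contains the literal query word.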
Retrieval Augmented Generation
Give LLMs business-specific information using your private data (not just public training data). Use Elasticsearch for high-relevance context windows that draw on your proprietary data to improve LLM output and relevance. Access Generative AI with APIs and plugins integrated with the LLM of your choice.
Learn more about Retrieval Augmented Generation
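
The sketch below shows one way retrieved passages might be assembled into a context window before calling an LLM. It is a simplified assumption, not a prescribed Elastic workflow: the function name, the prompt wording, and the sample passages are hypothetical, and the search and LLM calls themselves are left out.

```python
def build_rag_prompt(question, passages, max_passages=3):
    """Place the top retrieved passages ahead of the user's question."""
    context = "\n\n".join(
        f"[{number}] {text}"
        for number, text in enumerate(passages[:max_passages], start=1)
    )
    return (
        "Answer the question using only the context below. "
        "If the context is insufficient, say so.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical passages, as if returned by a search over private documents.
passages = [
    "Refunds are processed within 5 business days.",
    "Refund requests must include the order number.",
    "Gift cards are not refundable.",
    "Shipping fees are refunded only for damaged items.",
]

prompt = build_rag_prompt("How long do refunds take?", passages)
print(prompt)
```

Limiting the number of passages keeps the prompt within the model's context window, so the relevance of the top search hits directly shapes the quality of the answer.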
Vector database
Get a full vector search experience at scale – don’t just store and search embeddings, create them! Capture the meaning and context of your unstructured data, including text and images, with embeddings for dense retrieval. Secure your embeddings at the document level to ensure data is in the right hands.
Learn about vector databases
Bring your own transformer models
Bring your own proprietary transformer model into Elastic, or upload pretrained models from third-party repositories like the HuggingFace model hub, with support for a variety of architectures such as BERT, BART, ELECTRA, and more.
Read BYOM blog
Data integrations & ingestion libraries
Use familiar tools such as Elastic Agent or Logstash to index your data. Choose from an ever-expanding list of integrations (such as Confluence, S3, or Google Drive), native database connectors (such as MySQL or MongoDB), and a web crawler for online sources. For custom app data, use Kibana APIs or build your own connector with familiar frameworks.
Watch the ingestion webinar

"I’m thrilled about the benefits we can bring to customers through our investments to harness Elasticsearch within RelativityOne. We're experimenting with ESRE right now and are excited about its potential to deliver powerful, AI-augmented search results to our customers."

Chris Brown, Chief Product Officer, Relativity

Code samples

Start building vector search

Use a single API to import an embedding model, generate embeddings, and write search queries at scale using approximate nearest neighbor search.

See vector search guide

Import a HuggingFace model into Elasticsearch
Generate embeddings in Elasticsearch
Index embeddings in Elasticsearch
Semantic search at scale

docker run -it --rm elastic/eland \
eland_import_hub_model \
--cloud-id $CLOUD_ID \
-u <username> -p <password> \
--hub-model-id sentence-transformers/msmarco-MiniLM-L-12-v3 \
--task-type text_embedding \
--start
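
Once a model is imported and embeddings are indexed, a semantic search request might look roughly like the sketch below, which builds the body of an approximate nearest neighbor (kNN) search in Python. The helper name, the field name "text_embedding", and the three-value query vector are hypothetical placeholders. The body is meant for the _search API, and the options available can vary by Elasticsearch version, so treat this as a starting point rather than a reference.

```python
import json


def build_knn_search(field, query_vector, k=10, num_candidates=100):
    """Build a request body for an approximate kNN search on a vector field."""
    if num_candidates < k:
        raise ValueError("num_candidates must be at least k")
    return {
        "knn": {
            "field": field,                      # vector field to search
            "query_vector": list(query_vector),  # embedding of the query text
            "k": k,                              # nearest neighbors to return
            "num_candidates": num_candidates,    # candidates considered per shard
        }
    }


# Hypothetical field name and a toy vector; real embeddings are much longer.
body = build_knn_search("text_embedding", [0.12, -0.45, 0.91], k=5, num_candidates=50)
print(json.dumps(body, indent=2))
```

Raising num_candidates generally improves recall at the cost of slower queries, so it is a natural setting to experiment with.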

Frequently asked questions

What is Elasticsearch Relevance Engine?

Elasticsearch Relevance Engine is a set of features that help developers build AI search applications and includes:

Industry-leading advanced relevance ranking features, including traditional keyword search with BM25, a foundation of relevant hybrid search for all domains.
Full vector database capabilities – including the ability to create embeddings, in addition to storage and retrieval of vectors.
Elastic Learned Sparse Encoder – our new machine learning model for semantic search across a range of domains.
Hybrid ranking (RRF) for pairing vector and textual search capabilities for optimal search relevance across a variety of domains.
Support to integrate 3rd-party transformer models such as OpenAI GPT-3 and 4 via APIs
A full suite of data ingestion tools such as database connectors, 3rd-party data integrations, web crawler, and APIs to create custom connectors
Developer tools to build search applications across all types of data: text, images, time-series, geo, multimedia, and more.

What can I build with Elasticsearch Relevance Engine?

Elasticsearch is a leading search technology for websites (like ecommerce product search and discovery) and internal information (such as customer success knowledge bases and enterprise search). With ESRE, we're providing a toolkit to build AI-powered search experiences. Enable users to express their queries in natural language, in the form of a question or a description of the kind of information they seek. Combine this natural language capability with Generative AI to further enhance these models’ abilities with context from your own private or proprietary data.

Are Elasticsearch and Elasticsearch Relevance Engine the same thing?

Yes. The capabilities included with Elasticsearch Relevance Engine are designed and integrated at the _search API within Elasticsearch. Developers can use the Elastic API or familiar tools, such as Kibana, to work with these capabilities together with Elasticsearch for a seamless experience.

What is Elastic Learned Sparse Encoder?

Elastic Learned Sparse Encoder is a model built by Elastic for high-relevance semantic search across a variety of domains. Currently an English-only machine learning model, it captures the relationships between meanings and words for information retrieval. Interested in benchmark tests with our new retrieval model? Read this blog to learn more.

What is a transformer, and is Elastic Learned Sparse Encoder a transformer model?

A transformer is a deep neural network architecture that serves as the basis for LLMs. Transformers consist of various components and can be composed of encoders, decoders, and many “deep” neural network layers with many millions (or even billions) of parameters. They are typically trained on very large corpora of text, such as data on the Internet, and can be fine-tuned to perform a variety of NLP tasks. Our new retrieval model uses a transformer architecture but consists only of an encoder designed specifically for semantic search across a wide variety of domains.

How do I get started with Elasticsearch Relevance Engine? Do I need to purchase Elasticsearch Relevance Engine separately?

All of Elasticsearch Relevance Engine’s capabilities come with Elasticsearch Platinum and Enterprise plans, as part of the 8.8 release. You can easily get started with embeddings and vector search, and try out the retrieval model. Check out a demo of Elastic Learned Sparse Encoder's capabilities. If you have an Elasticsearch license, Elasticsearch Relevance Engine is included as part of your purchase.
