Glossary

This glossary describes essential terms and concepts to help you understand Elasticsearch and its related technologies.

Rate Limit

A restriction on the number of API requests within a given time period. Rate limits protect the service from overload and ensure fair usage. You need to account for them when planning how to integrate an embedding API into production workflows.

Recall@k

The fraction of all relevant items that appear among the top-k results. Recall@10 of 0.8 means 80% of relevant items were found in the top 10. Recall is the primary metric for first-stage retrieval, where the goal is to catch as many relevant items as possible.

Relevance Score

A numerical score assigned by a reranker to each query-document pair, indicating how relevant the document is to the query. Results are sorted by relevance score to produce the final ranked list. Unlike cosine similarity scores, relevance scores are not normalized: they are meaningful for ranking candidates within a single query but cannot be compared across different queries or models, and do not represent absolute measures of semantic similarity

Representation Learning

A field of machine learning focused on automatically learning data representations that make downstream tasks easier, rather than relying on hand-engineered features. Methods range from autoencoders to contrastive learning, with embedding models being a relevant application. The key insight is that useful structure, such as semantic meaning or visual similarity, can be discovered directly from data without explicit feature specification.

Reranker

A model that takes a list of search results and reorders them by more carefully evaluating each result's relevance to the query. Rerankers are used after first-stage retrieval to improve precision. Because they evaluate each query-document pair individually, they make more nuanced relevance judgments than bi-encoder models. Jina Reranker is designed for this purpose.

Reranking

Re-scoring and reordering an initial set of search results to improve their quality. This is the second stage in a typical two-stage retrieval pipeline: the first stage finds candidates quickly; the reranker ensures the most relevant ones appear at the top.

Retrieval

Finding and returning relevant information from a database or index in response to a query. In the embedding world, retrieval means converting the query into a vector and searching for the most similar document vectors.

Retrieval-Augmented Generation (RAG)

A technique that improves the accuracy of large language models by first retrieving relevant information from a knowledge base and then providing that information to the model along with the user's question. Instead of relying solely on what the model learned during training, RAG grounds the response in retrieved, up-to-date data. Embedding models and rerankers are core components of RAG pipelines.

최첨단 검색 환경을 구축할 준비가 되셨나요?

충분히 고급화된 검색은 한 사람의 노력만으로는 달성할 수 없습니다. Elasticsearch는 여러분과 마찬가지로 검색에 대한 열정을 가진 데이터 과학자, ML 운영팀, 엔지니어 등 많은 사람들이 지원합니다. 서로 연결하고 협력하여 원하는 결과를 얻을 수 있는 마법 같은 검색 환경을 구축해 보세요.

직접 사용해 보세요