Glossary

This glossary describes essential terms and concepts to help you understand Elasticsearch and its related technologies.

k-Nearest Neighbors (kNN)

A search method that finds the k most similar vectors to a query by comparing it against every item in the dataset using a chosen distance metric. kNN is exhaustive and exact, guaranteeing optimal results, but scales as O(n) per query, making it impractical for large datasets. In practice it is used for small datasets or as a ground truth baseline for measuring the recall of approximate nearest neighbor methods.

Knowledge Base

A collection of documents or data that a search or RAG system retrieves from at query time. It can contain structured or unstructured content such as company documents, product manuals, or research papers. In embedding-based systems, the knowledge base is chunked, embedded, and indexed into a vector store. Its quality directly determines the quality of retrieval and generation: incomplete or noisy content yields unreliable results regardless of model quality.

Knowledge Distillation

Training a smaller model (the student) to replicate the behavior of a larger, more capable model (the teacher). The student learns not just from raw data but from the teacher's outputs, which contain richer information about relationships between examples. The result is a compact model that performs closer to the large model than if trained on its own.

准备好打造最先进的搜索体验了吗?

足够先进的搜索不是一个人的努力就能实现的。Elasticsearch 由数据科学家、ML 操作员、工程师以及更多和您一样对搜索充满热情的人提供支持。让我们联系起来,共同打造神奇的搜索体验,让您获得想要的结果。

亲自试用