Glossary

This glossary describes essential terms and concepts to help you understand Elasticsearch and its related technologies.

NDCG (Normalized Discounted Cumulative Gain)

A ranking metric that measures how well a system places relevant results near the top of a ranked list. It accumulates relevance gains across positions, applies a logarithmic discount to results appearing lower in the ranking, and normalizes the score against an ideal ordering, hence normalized discounted cumulative gain. Unlike binary metrics such as recall, NDCG supports graded relevance where some results are more relevant than others. Scores range from 0 to 1, with 1 representing a perfect ranking, making it the standard metric for retrieval benchmarks like MTEB.

Negative Pair

Two pieces of content that should be treated as dissimilar during training. For example, a question is paired with an unrelated passage. The model learns to produce embeddings that are far apart for negative pairs.

Negative Sampling

The process of selecting negative examples for contrastive training. The quality of negatives significantly affects model performance. Random negatives are easy for the model and provide little learning signal; carefully selected hard negatives drive meaningful improvement.

NLP (Natural Language Processing)

A field of artificial intelligence concerned with enabling machines to understand, process, and generate human language. NLP underpins most tasks in text-based AI systems, from tokenization and parsing to semantic understanding and generation. Modern NLP is dominated by transformer-based models, which have largely replaced earlier rule-based and statistical approaches.

Normalization

Scaling a vector so its length equals 1, producing a unit vector. After normalization, cosine similarity and dot product become equivalent, which simplifies computation. Most embedding models output normalized vectors by default.

准备好打造最先进的搜索体验了吗?

足够先进的搜索不是一个人的努力就能实现的。Elasticsearch 由数据科学家、ML 操作员、工程师以及更多和您一样对搜索充满热情的人提供支持。让我们联系起来,共同打造神奇的搜索体验,让您获得想要的结果。

亲自试用