type
status
date
slug
summary
tags
category
icon
password
Sparse search
TF-IDF
- TF:
- IDF:
- Output:
BM25
- The TF-IDF score increases linearly with the number of relevant tokens. So, if the frequency doubles — so does the TF-IDF score.
The IDF part of BM25 (left) compared to
the IDF of TF-IDF (right).
Dense Search
Sentence-Bert
- cos similarity
- Use Hugging face transformer