Back to library

🔎Combine BM25 and Semantic Search (Hybrid Search)

Build hybrid search layer by layer — BM25 alone, vectors alone, then RRF fusion — so you can debug retrieval failures and predict which query types each layer fixes before you ship.

Applied14 drops~2-week path · 5–8 min/daytechnology

Phase 1Why One Retriever Is Always Wrong About Half Your Queries

Watch BM25 and vectors fail at opposite query shapes

4 drops
  1. Vectors forget the literal token — that's the entire problem

    6 min

    Vectors forget the literal token — that's the entire problem

  2. BM25 is a thirty-year-old formula your vector DB is hiding from you

    6 min

    BM25 is a thirty-year-old formula your vector DB is hiding from you

  3. Vector search is fuzzy intent matching dressed as math

    6 min

    Vector search is fuzzy intent matching dressed as math

  4. Two retrievers, one ranking — the simplest fix nobody runs first

    6 min

    Two retrievers, one ranking — the simplest fix nobody runs first

Phase 2Running BM25 and Vectors Side by Side, Then Fusing Them

Run both retrievers on the same query and fuse the lists

5 drops
  1. Twenty docs is the smallest corpus that teaches you everything

    5 min

    Twenty docs is the smallest corpus that teaches you everything

  2. Run BM25 first and watch the literal queries snap into place

    6 min

    Run BM25 first and watch the literal queries snap into place

  3. Run vectors next and feel the paraphrase query come unstuck

    6 min

    Run vectors next and feel the paraphrase query come unstuck

  4. Reciprocal Rank Fusion in five lines — the part everyone overthinks

    7 min

    Reciprocal Rank Fusion in five lines — the part everyone overthinks

  5. Print three columns and watch hybrid pick up both halves

    6 min

    Print three columns and watch hybrid pick up both halves

Phase 3Where Hybrid Earns Its Keep — and Where It Doesn't

Map where hybrid earns its keep and where it doesn't

4 drops
  1. Hybrid wins on the queries you didn't think to test

    6 min

    Hybrid wins on the queries you didn't think to test

  2. Hybrid wins more on heterogeneous corpora than uniform ones

    6 min

    Hybrid wins more on heterogeneous corpora than uniform ones

  3. Sometimes hybrid is the wrong answer — and that's the most useful insight

    6 min

    Sometimes hybrid is the wrong answer — and that's the most useful insight

  4. Three production patterns cover almost every hybrid deployment

    7 min

    Three production patterns cover almost every hybrid deployment

Phase 4Ship Hybrid Retrieval and Predict the Lift Per Query Type

Add hybrid retrieval to a docs RAG and predict the lift

1 drop
  1. Wire hybrid into a docs RAG and predict the lift by query type

    8 min

    Wire hybrid into a docs RAG and predict the lift by query type

Frequently asked questions

What is hybrid search and how does it combine BM25 with vector search?
This is covered in the “Combine BM25 and Semantic Search (Hybrid Search)” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
Why does pure semantic search miss exact-keyword queries like error codes and SKUs?
This is covered in the “Combine BM25 and Semantic Search (Hybrid Search)” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
What is reciprocal rank fusion (RRF) and why is it the default fusion method?
This is covered in the “Combine BM25 and Semantic Search (Hybrid Search)” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
When should I weight BM25 over vectors in a hybrid retriever?
This is covered in the “Combine BM25 and Semantic Search (Hybrid Search)” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
How do I tell if my RAG actually needs hybrid search or just better embeddings?
This is covered in the “Combine BM25 and Semantic Search (Hybrid Search)” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.