Why does my RAG system miss obvious matches near boundaries?

This is covered in the "Learn Chunking Strategies for RAG" learning path on Droplet. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.

What's the difference between fixed, recursive, and semantic chunking?

This is covered in the "Learn Chunking Strategies for RAG" learning path on Droplet. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.

How do I pick a chunk size for my documents?

This is covered in the "Learn Chunking Strategies for RAG" learning path on Droplet. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.

When should I use parent-child chunks instead of bigger chunks?

This is covered in the "Learn Chunking Strategies for RAG" learning path on Droplet. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.

Does chunking matter more than the embedding model I choose?

This is covered in the "Learn Chunking Strategies for RAG" learning path on Droplet. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.

Back to library

🧩Learn Chunking Strategies for RAG

Compare fixed, recursive, semantic, and document-aware chunking on the same source so trade-offs become visible — then pick a chunking strategy for one of your own document types and defend the choice.

Applied14 drops~2-week path · 5–8 min/daytechnology

Phase 1Why Chunking Decides Your Retrieval Quality

See why chunk size and boundaries beat fancier embedding tricks

4 drops

Bad chunks beat a great embedding model every time
6 min
Bad chunks beat a great embedding model every time
The 500-token chunk is a hunch, not a strategy
6 min
The 500-token chunk is a hunch, not a strategy
Retrieval doesn't fail loudly — it fails by half-answer
7 min
Retrieval doesn't fail loudly — it fails by half-answer
Four chunking strategies, one decision space
6 min
Four chunking strategies, one decision space

Phase 2Fixed, Recursive, and Semantic on One Doc

Run fixed, recursive, and semantic splits on one document

5 drops

Pick one document and three queries — that's your test rig
7 min
Pick one document and three queries — that's your test rig
Run fixed-size first — the baseline that exposes everything else
7 min
Run fixed-size first — the baseline that exposes everything else
Recursive splitting respects what your document already tells it
7 min
Recursive splitting respects what your document already tells it
Semantic chunking splits where the topic actually shifts
8 min
Semantic chunking splits where the topic actually shifts
Three strategies, three queries — read the diff before you decide
8 min
Three strategies, three queries — read the diff before you decide

Phase 3Overlap, Parent-Child, and Structure-Aware Splits

Handle overlap, parent-child chunks, and code, tables, PDFs

4 drops

Overlap, parent-child, or structure-aware — pick the fix at the scale of the failure
8 min
Overlap, parent-child, or structure-aware — pick the fix at the scale of the failure
Embed small for precision, return big for context — that's parent-child
8 min
Embed small for precision, return big for context — that's parent-child
Code and tables fail because their boundaries aren't punctuation
8 min
Code and tables fail because their boundaries aren't punctuation
With PDFs, the first chunking decision is which extractor to trust
8 min
With PDFs, the first chunking decision is which extractor to trust

Phase 4Pick and Defend a Strategy for Your Docs

Pick and defend a chunking strategy for your own document

1 drop

Pick a chunking strategy for your own document type — and defend it
18 min
Pick a chunking strategy for your own document type — and defend it

Frequently asked questions

Why does my RAG system miss obvious matches near boundaries?: This is covered in the “Learn Chunking Strategies for RAG” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
What's the difference between fixed, recursive, and semantic chunking?: This is covered in the “Learn Chunking Strategies for RAG” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
How do I pick a chunk size for my documents?: This is covered in the “Learn Chunking Strategies for RAG” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
When should I use parent-child chunks instead of bigger chunks?: This is covered in the “Learn Chunking Strategies for RAG” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
Does chunking matter more than the embedding model I choose?: This is covered in the “Learn Chunking Strategies for RAG” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.

🧩Learn Chunking Strategies for RAG

Phase 1Why Chunking Decides Your Retrieval Quality

Bad chunks beat a great embedding model every time

The 500-token chunk is a hunch, not a strategy

Retrieval doesn't fail loudly — it fails by half-answer

Four chunking strategies, one decision space

Phase 2Fixed, Recursive, and Semantic on One Doc

Pick one document and three queries — that's your test rig

Run fixed-size first — the baseline that exposes everything else

Recursive splitting respects what your document already tells it

Semantic chunking splits where the topic actually shifts

Three strategies, three queries — read the diff before you decide

Phase 3Overlap, Parent-Child, and Structure-Aware Splits

Overlap, parent-child, or structure-aware — pick the fix at the scale of the failure

Embed small for precision, return big for context — that's parent-child

Code and tables fail because their boundaries aren't punctuation

With PDFs, the first chunking decision is which extractor to trust

Phase 4Pick and Defend a Strategy for Your Docs

Pick a chunking strategy for your own document type — and defend it

Frequently asked questions

🐍Python Decorators Introduction

🦀Rust Lifetimes Explained

☸️Kubernetes Core Concepts

📈Big O Intuition

Phase 1Why Chunking Decides Your Retrieval Quality

Bad chunks beat a great embedding model every time

The 500-token chunk is a hunch, not a strategy

Retrieval doesn't fail loudly — it fails by half-answer

Four chunking strategies, one decision space

Phase 2Fixed, Recursive, and Semantic on One Doc

Pick one document and three queries — that's your test rig

Run fixed-size first — the baseline that exposes everything else

Recursive splitting respects what your document already tells it

Semantic chunking splits where the topic actually shifts

Three strategies, three queries — read the diff before you decide

Phase 3Overlap, Parent-Child, and Structure-Aware Splits

Overlap, parent-child, or structure-aware — pick the fix at the scale of the failure

Embed small for precision, return big for context — that's parent-child

Code and tables fail because their boundaries aren't punctuation

With PDFs, the first chunking decision is which extractor to trust

Phase 4Pick and Defend a Strategy for Your Docs

Pick a chunking strategy for your own document type — and defend it

Frequently asked questions

Related paths

🐍Python Decorators Introduction

🦀Rust Lifetimes Explained

☸️Kubernetes Core Concepts

📈Big O Intuition