Back to library

📉Detect Drift in LLM and ML Apps

Stop confusing 'data drift' and 'concept drift' — they need different fixes. Walk one feature through both kinds of drift on a real-shaped dataset, then design a drift dashboard for an LLM app where ground truth is delayed by 7 days.

Advanced14 drops~2-week path · 5–8 min/daytechnology

Phase 1Three drifts, three fixes

Three drifts, three fixes — don't mix them up

4 drops
  1. Data drift, concept drift, and label drift are three different bugs

    7 min

    Data drift, concept drift, and label drift are three different bugs

  2. Same feature, two stories — when input shift causes the metric drop and when it doesn't

    7 min

    Same feature, two stories — when input shift causes the metric drop and when it doesn't

  3. LLM apps add prompt drift and judge drift on top of the classic three

    8 min

    LLM apps add prompt drift and judge drift on top of the classic three

  4. Ground truth is delayed — and that lag is the central monitoring problem

    7 min

    Ground truth is delayed — and that lag is the central monitoring problem

Phase 2Rolling windows on a pricing feature

Rolling windows on a real-shaped pricing feature

5 drops
  1. Pick a window size that matches your signal's expected drift rate

    7 min

    Pick a window size that matches your signal's expected drift rate

  2. The reference window is the model your monitor is defending — pick it deliberately

    7 min

    The reference window is the model your monitor is defending — pick it deliberately

  3. Pick a test that matches your data type, not the test you happen to remember

    7 min

    Pick a test that matches your data type, not the test you happen to remember

  4. Inject a synthetic drift and confirm your monitor catches it

    8 min

    Inject a synthetic drift and confirm your monitor catches it

  5. Calibrate the threshold to your false-alarm tolerance, not to a textbook p-value

    7 min

    Calibrate the threshold to your false-alarm tolerance, not to a textbook p-value

Phase 3DDM, ADWIN, Page-Hinkley vs metric gates

DDM, ADWIN, Page-Hinkley — and metric gates

4 drops
  1. Your teammate proposes 'just alert on a 10% accuracy drop'

    7 min

    Your teammate proposes 'just alert on a 10% accuracy drop'

  2. DDM, ADWIN, Page-Hinkley — three change-detectors for three problems

    8 min

    DDM, ADWIN, Page-Hinkley — three change-detectors for three problems

  3. Statistical test fires but metric is fine — who do you trust?

    8 min

    Statistical test fires but metric is fine — who do you trust?

  4. Retrain or re-monitor? Use the drift type to decide, not the alarm severity

    8 min

    Retrain or re-monitor? Use the drift type to decide, not the alarm severity

Phase 4Design a dashboard for delayed truth

Design a drift dashboard for delayed ground truth

1 drop
  1. Design a drift dashboard for an LLM app with 7-day-delayed ground truth

    10 min

    Design a drift dashboard for an LLM app with 7-day-delayed ground truth

Frequently asked questions

What's the difference between data drift, concept drift, and label drift?
This is covered in the “Detect Drift in LLM and ML Apps” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
When should I use a statistical drift test like KS vs a metric-based gate?
This is covered in the “Detect Drift in LLM and ML Apps” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
How do you detect drift in an LLM app when ground truth is delayed by days?
This is covered in the “Detect Drift in LLM and ML Apps” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
What window size should a rolling drift monitor use?
This is covered in the “Detect Drift in LLM and ML Apps” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
When does drift mean 'retrain' vs 'just re-monitor'?
This is covered in the “Detect Drift in LLM and ML Apps” learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.