
🪄 Understand Emergent Capabilities in LLMs

See past the 'mystical jump' headlines: read the original emergence papers next to the 'mirage' rebuttal, watch the same task switch from jumpy to smooth when you swap the metric, and finish able to predict whether the capability you actually care about will scale in steps or in slopes.

Applied · 14 drops · ~2-week path · 5–8 min/day · technology

Phase 1: What Emergence Meant

Read the original emergence papers and see why they caused so much excitement

4 drops
  1. Emergence is a curve that looks like a cliff (6 min)
  2. If you can't extrapolate, you can't plan (6 min)
  3. Three tasks that defined the emergence canon (6 min)
  4. Scale isn't one number: it's at least three (6 min)

Phase 2: Watch the Jump Disappear

Switch metrics and watch the jump disappear

5 drops
  1. A linear x-axis turns every curve into a wall (6 min)
  2. Exact-match accuracy is a step function (7 min)
  3. Replot 3-digit addition and watch the cliff melt (7 min)
  4. Why composition turns slopes into cliffs (7 min)
  5. Different runs, different thresholds (6 min)
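
The exact-match and composition drops in this phase rest on a small piece of arithmetic, sketched here in Python. Everything numeric is an assumption made up for illustration, not data from any paper: `per_token_accuracy` is a hypothetical curve that rises linearly in log10(parameters), `ANSWER_LENGTH` is an arbitrary 10 tokens, and tokens are assumed to be right or wrong independently. Under those assumptions, an answer counts as an exact match only if every token is correct, so exact-match accuracy is roughly the per-token accuracy raised to the answer length.

```python
import math

# Hypothetical model sizes (parameter counts) and answer length, for illustration only.
MODEL_SIZES = [1e8, 1e9, 1e10, 1e11]
ANSWER_LENGTH = 10


def per_token_accuracy(params: float) -> float:
    """Made-up smooth curve: linear in log10(params), 0.50 at 1e8 up to 0.98 at 1e11."""
    low, high = 8.0, 11.0
    frac = (math.log10(params) - low) / (high - low)
    frac = min(max(frac, 0.0), 1.0)  # clamp outside the illustrated range
    return 0.50 + 0.48 * frac


def exact_match(p: float, k: int) -> float:
    """Probability that all k tokens are correct, assuming independent errors."""
    return p ** k


def scaling_table(sizes=MODEL_SIZES, k=ANSWER_LENGTH):
    """Return (params, per-token accuracy, exact-match accuracy) for each size."""
    rows = []
    for n in sizes:
        p = per_token_accuracy(n)
        rows.append((n, p, exact_match(p, k)))
    return rows


if __name__ == "__main__":
    for n, p, em in scaling_table():
        print(f"{n:.0e} params  per-token={p:.2f}  exact-match={em:.3f}")
```

With these invented numbers, the per-token column climbs by the same amount at every tenfold increase in size, while the exact-match column stays near zero and then shoots up at the largest size. The underlying model is the same in both columns; only the metric differs.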

Phase 3: Mirage, Real, or Both?

Reconcile the mirage rebuttal with what's still real

4 drops
  1. The rebuttal: emergence is a metric artifact (7 min)
  2. Both papers can be right at once (7 min)
  3. Some capabilities get worse with scale (7 min)
  4. The 2026 consensus is calmer than the 2022 hype (7 min)

Phase 4: Predict Your Capability's Curve

Predict whether your capability will jump or slope

1 drop
  1. Predict the curve for one capability you actually care about (8 min)

Frequently asked questions

What are emergent abilities in large language models?
This is covered in the "Understand Emergent Capabilities in LLMs" learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
Are emergent abilities in LLMs real or just a measurement artifact?
This is covered in the "Understand Emergent Capabilities in LLMs" learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
What did the 'Emergent Abilities Are a Mirage' paper actually show?
This is covered in the "Understand Emergent Capabilities in LLMs" learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
How does the choice of metric affect whether a capability looks emergent?
This is covered in the "Understand Emergent Capabilities in LLMs" learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.
Can you predict in advance which LLM capabilities will be emergent?
This is covered in the "Understand Emergent Capabilities in LLMs" learning path. Start with daily 5-minute micro-lessons that build from fundamentals to hands-on application.