Library

Page 3 of 14

🎧Understand Audio Embeddings

Stop forcing audio through text. Drops show the audio-native path — wav2vec, CLAP, MERT — and when it beats transcribe-then-embed for music search, speaker ID, and sound classification. By the end you can plan a 'find similar drums' search over a sample library.

Foundations~2-week path · 5-8 min/day

🧭Understand ANN Algorithms: HNSW, IVF, PQ

Stop tuning ef and M by trial and error — see HNSW, IVF, and PQ as physical structures (a multilayer skip graph, a coarse cluster index, and a vector compressor) so you can predict which one fits a 100M-vector workload before you benchmark anything.

Advanced~2-week path · 5-8 min/day

🧬Sentence vs Token Embeddings

Stop grabbing BERT's [CLS] token and calling it a sentence embedding. By the end you'll know exactly when token, pooled, and contrastively-trained vectors each win — and design a 100K-doc semantic search you can defend.

Applied~2-week path · 5-8 min/day

📝Generate Commit Messages with AI

Stop letting your git history decay into 'fix stuff' two weeks after you adopt Conventional Commits. By the end you'll have an AI commit hook reading your diff and producing a compliant message every time — and a team convention doc that makes it stick.

Foundations~2-week path · 5-8 min/day

📉Detect Drift in LLM and ML Apps

Stop confusing 'data drift' and 'concept drift' — they need different fixes. Walk one feature through both kinds of drift on a real-shaped dataset, then design a drift dashboard for an LLM app where ground truth is delayed by 7 days.

Advanced~2-week path · 5-8 min/day

🗺️Choose an LLM Deployment Topology

Stop choosing between 'just call OpenAI' and 'self-host on H100s' — there are four real LLM topologies in between. By the end you can sketch a 12-month plan that survives 10x traffic growth.

Applied~2-week path · 5-8 min/day

🌫️Build Intuition for Diffusion Models

Stop reading 'noise to image' as magic and start seeing it as a learned vector field that pulls samples toward the data. By the end you can sketch one denoising step and explain how classifier-free guidance bends the field toward 'a cat in a hat.'

Applied~2-week path · 5-8 min/day

🔬Build an LLM Eval Harness for Production

Stop running eval notebooks once and forgetting them. Build a three-layer harness — pre-merge CI, pre-deploy gate, online sampling — with the right cadence, budget, and judge calibration for a production RAG app.

Advanced~2-week path · 5-8 min/day

⚖️Audit AI Models for Bias

Three fairness metrics. One model. They disagree. Walk a synthetic loan classifier through demographic parity, equalized odds, and calibration; see where they conflict; then outline a regulator-defensible audit plan for a resume screener.

Foundations~2-week path · 5-8 min/day

🚦Version and A/B Test Prompts in Production

Stop shipping prompt edits like config tweaks and start treating them like code with versions, canaries, and kill switches. By the end you can write a one-page rollout plan with success criteria, sample size, and a rollback trigger that someone else could execute.

Advanced~2-week path · 5-8 min/day

🧾Use Vision-Language Models for OCR and Document Extraction

Stop gluing Tesseract to brittle regex parsers. Design VLM-based document extraction pipelines that return typed JSON with confidence scores — and know exactly when classical OCR still wins on cost.

Applied~2-week path · 5-8 min/day

🧪Use AI to Generate Tests

Turn AI from a happy-path test generator into a real partner that probes boundaries, error paths, and oracle gaps — so the suite catches bugs instead of memorizing them.

Foundations~2-week path · 5-8 min/day

📊Use AI for Spreadsheet Workflows

Stop pasting your sheet into ChatGPT and hoping. Learn four reusable patterns — formula generation, bulk row processing, cleanup, summary — that keep your spreadsheet as the source of truth and let you ship a workflow that cleans, classifies, and summarizes a 200-row dataset.

Foundations~2-week path · 5-8 min/day

📝Use AI for Meeting Notes That You'll Actually Read

Stop treating AI meeting notes as a dumping ground nobody reads. Build a per-meeting-type workflow that ends in shared decisions and assigned actions — not another inbox full of ignored summaries.

Foundations~2-week path · 5-8 min/day

🔍Use AI for Code Review

Stop accepting every AI review comment uncritically — and stop ignoring them all. By the end you'll know exactly what AI catches reliably, what it misses, and how to write a review prompt your team actually trusts.

Foundations~2-week path · 5-8 min/day

🖼️Understand Vision Transformers (ViT)

Walk one 224x224 image through patching, embedding, and attention until ViT stops feeling like a magic trick — then predict where the heads attend on a cat-and-person photo before the demo confirms it.

Applied~2-week path · 5-8 min/day

🔊Understand Text-to-Speech Quality Dimensions

Build a five-axis TTS scorecard — naturalness, prosody, latency, consistency, controllability — that replaces demo-vibe-checks with a defensible audit you can take into any voice-agent vendor meeting.

Applied~2-week path · 5-8 min/day

✂️Understand Image Segmentation with SAM

Separate semantic, instance, and promptable segmentation so you can pick the right tool — then plan a tiny SAM-powered pipeline that crops product photos for an ecommerce catalog before you write a line of code.

Applied~2-week path · 5-8 min/day

📡Stream LLM Responses for Snappy UX

Stop shipping six-second blank screens — switch to SSE streaming and watch perceived latency collapse from seconds to milliseconds. By the end you'll add a stop button and graceful retry to a streamed chat without dropping tokens.

Applied~2-week path · 5-8 min/day

📉PCA: Dimensionality Reduction from Eigenvectors

Connect PCA to the eigenvectors of the covariance matrix, then compress a 50-feature dataset to 5 components and defend exactly how much information you kept.

Applied~2-week path · 5-8 min/day

💰Optimize Cost in LLM Applications

Stop watching your LLM bill scale linearly with traffic. By the end you can take any feature, name three cost cuts with dollar estimates, and defend the tradeoffs to your team.

Applied~2-week path · 5-8 min/day

🎨Master Text-to-Image Prompt Craft

Build an internal recipe for prompting diffusion models — subject, medium, style, lighting, weight, negative — so you can generate brand-aligned images on demand instead of copying random prompts from marketplaces.

Foundations~2-week path · 5-8 min/day

🔭Learn LLM Observability Fundamentals

Stop finding out about LLM regressions from angry user emails. By the end you'll know what to log on every call, which tools fit which signal, and how to sketch one dashboard an on-call engineer can read at 3am.

Applied~2-week path · 5-8 min/day

🧱Get Structured Output with Pydantic and JSON Schema

Replace markdown-fenced near-JSON and regex band-aids with a Pydantic schema the API enforces for you. By the end you can convert any ad-hoc prompt to typed output and measure how many parse failures you just deleted.

Applied~2-week path · 5-8 min/day