Skip to content

rohitg00/ai-engineering-from-scratch

AI Engineering from Scratch — reference manual banner

MIT License 416 lessons 20 phases GitHub stars Website

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

84% of students already use AI tools. Only 18% feel prepared to use them professionally. This curriculum closes that gap.

416 lessons. 20 phases. ~320 hours. Python, TypeScript, Rust, Julia. Every lesson ships a reusable artifact: a prompt, a skill, an agent, an MCP server. Free, open source, MIT.

You don't just learn AI. You build it. End-to-end. By hand.

How this works

Most AI material teaches in scattered pieces. A paper here, a fine-tuning post there, a flashy agent demo somewhere else. The pieces rarely line up. You ship a chatbot but can't explain its loss curve. You hook a function to an agent but can't say what attention does inside the model that's calling it.

This curriculum is the spine. 20 phases, 416 lessons, four languages: Python, TypeScript, Rust, Julia. Linear algebra at one end, autonomous swarms at the other. Every algorithm gets built from raw math first. Backprop. Tokenizer. Attention. Agent loop. By the time PyTorch shows up, you already know what it's doing under the hood.

Each lesson runs the same loop: read the problem, derive the math, write the code, run the test, keep the artifact. No five-minute videos, no copy-paste deploys, no hand-holding. Free, open source, and built to run on your own laptop.

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

The shape of the curriculum

Twenty phases stack on top of each other. Math is the floor. Agents and production are the roof. Skip ahead if you already know the lower layers, but don't skip and then wonder why something at the top is breaking.

%%{init: {'theme':'base','themeVariables':{'primaryColor':'#fafaf5','primaryTextColor':'#1a1a1a','primaryBorderColor':'#3553ff','lineColor':'#3553ff','fontFamily':'JetBrains Mono','fontSize':'12px'}}}%%
flowchart TB
  P0["Phase 0 — Setup & Tooling"] --> P1["Phase 1 — Math Foundations"]
  P1 --> P2["Phase 2 — ML Fundamentals"]
  P2 --> P3["Phase 3 — Deep Learning Core"]
  P3 --> P4["Phase 4 — Vision"]
  P3 --> P5["Phase 5 — NLP"]
  P3 --> P6["Phase 6 — Speech & Audio"]
  P3 --> P9["Phase 9 — RL"]
  P5 --> P7["Phase 7 — Transformers"]
  P7 --> P8["Phase 8 — GenAI"]
  P7 --> P10["Phase 10 — LLMs from Scratch"]
  P10 --> P11["Phase 11 — LLM Engineering"]
  P10 --> P12["Phase 12 — Multimodal"]
  P11 --> P13["Phase 13 — Tools & Protocols"]
  P13 --> P14["Phase 14 — Agent Engineering"]
  P14 --> P15["Phase 15 — Autonomous Systems"]
  P15 --> P16["Phase 16 — Multi-Agent & Swarms"]
  P14 --> P17["Phase 17 — Infrastructure & Production"]
  P15 --> P18["Phase 18 — Ethics & Alignment"]
  P16 --> P19["Phase 19 — Capstone Projects"]
  P17 --> P19
  P18 --> P19
Loading
░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

The shape of a lesson

Each lesson lives in its own folder, with the same structure across the entire curriculum:

phases/<NN>-<phase-name>/<NN>-<lesson-name>/
├── code/      runnable implementations (Python, TypeScript, Rust, Julia)
├── docs/
│   └── en.md  lesson narrative
└── outputs/   prompts, skills, agents, or MCP servers this lesson produces

Every lesson follows six beats. The Build It / Use It split is the spine — you implement the algorithm from scratch first, then run the same thing through the production library. You understand what the framework is doing because you wrote the smaller version yourself.

%%{init: {'theme':'base','themeVariables':{'primaryColor':'#fafaf5','primaryTextColor':'#1a1a1a','primaryBorderColor':'#3553ff','lineColor':'#3553ff','fontFamily':'JetBrains Mono','fontSize':'13px'}}}%%
flowchart LR
  M["MOTTO<br/><sub>one-line core idea</sub>"] --> Pr["PROBLEM<br/><sub>concrete pain</sub>"]
  Pr --> C["CONCEPT<br/><sub>diagrams &amp; intuition</sub>"]
  C --> B["BUILD IT<br/><sub>raw math, no frameworks</sub>"]
  B --> U["USE IT<br/><sub>same thing in PyTorch / sklearn</sub>"]
  U --> S["SHIP IT<br/><sub>prompt · skill · agent · MCP</sub>"]
Loading

Getting started

Three ways in. Pick one.

Option A — read. Open any completed lesson on aiengineeringfromscratch.com or expand a phase under Contents. No setup, no cloning.

Option B — clone and run.

git clone https://github.com/rohitg00/ai-engineering-from-scratch.git
cd ai-engineering-from-scratch
python phases/01-math-foundations/01-linear-algebra-intuition/code/vectors.py

Option C — find your level (recommended). Skip ahead intelligently. Inside Claude Code:

/find-your-level

Ten questions. Maps your knowledge to a starting phase, builds a personalized path with hour estimates. After each phase:

/check-understanding 3        # quiz yourself on phase 3
ls phases/03-deep-learning-core/05-loss-functions/outputs/
# ├── prompt-loss-function-selector.md
# └── prompt-loss-debugger.md

Prerequisites

  • You can write code (any language; Python helps).
  • You want to understand how AI actually works, not just call APIs.

Built-in Claude Code skills

Skill What it does
/find-your-level Ten-question placement quiz. Maps your knowledge to a starting phase and produces a personalized path with hour estimates.
/check-understanding <phase> Per-phase quiz, eight questions, with feedback and specific lessons to review.
░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Every lesson ships something

Other curricula end with "congratulations, you learned X." Each lesson here ends with a reusable tool you can install or paste into your daily workflow.

FIG_001 · A
PROMPTS
FIG_001 · B
SKILLS
FIG_001 · C
AGENTS
FIG_001 · D
MCP SERVERS
Paste into any AI assistant for expert-level help on a narrow task. Drop into Claude Code, Cursor, Codex, or any agent that reads SKILL.md. Deploy as autonomous workers — you wrote the loop yourself in Phase 14. Plug into any MCP-compatible client. Built end-to-end in Phase 13.

Install the lot with SkillKit. Real tools, not homework. By the end of the curriculum, you have a portfolio of 416 artifacts you actually understand because you built them.

FIG_002 · A worked sample

Phase 14, lesson 1: the agent loop. ~120 lines of pure Python, no dependencies.

code/agent_loop.py   build it

def run(query, tools):
    history = [user(query)]
    for step in range(MAX_STEPS):
        msg = llm(history)
        if msg.tool_calls:
            for call in msg.tool_calls:
                result = tools[call.name](**call.args)
                history.append(tool_result(call.id, result))
            continue
        return msg.content
    raise StepLimitExceeded

outputs/skill-agent-loop.md   ship it

---
name: agent-loop
description: ReAct-style loop for any tool list
phase: 14
lesson: 01
---

Implement a minimal agent loop that...

outputs/prompt-debug-agent.md

You are an agent debugger. Given the trace
of an agent run, identify the step where
the agent went wrong and explain why...
░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Contents

Twenty phases. Click any phase to expand its lesson list.

Phase 0: Setup & Tooling 12 lessons

Get your environment ready for everything that follows.

# Lesson Type Lang
01 Dev Environment Build Python, TypeScript, Rust
02 Git & Collaboration Learn
03 GPU Setup & Cloud Build Python
04 APIs & Keys Build Python, TypeScript
05 Jupyter Notebooks Build Python
06 Python Environments Build Python
07 Docker for AI Build Python
08 Editor Setup Build
09 Data Management Build Python
10 Terminal & Shell Learn
11 Linux for AI Learn
12 Debugging & Profiling Build Python
Phase 1 — Math Foundations  22 lessons  The intuition behind every AI algorithm, through code.
# Lesson Type Lang
01 Linear Algebra Intuition Learn Python, Julia
02 Vectors, Matrices & Operations Build Python, Julia
03 Matrix Transformations & Eigenvalues Build Python, Julia
04 Calculus for ML: Derivatives & Gradients Learn Python
05 Chain Rule & Automatic Differentiation Build Python
06 Probability & Distributions Learn Python
07 Bayes' Theorem & Statistical Thinking Build Python
08 Optimization: Gradient Descent Family Build Python
09 Information Theory: Entropy, KL Divergence Learn Python
10 Dimensionality Reduction: PCA, t-SNE, UMAP Build Python
11 Singular Value Decomposition Build Python, Julia
12 Tensor Operations Build Python
13 Numerical Stability Build Python
14 Norms & Distances Build Python
15 Statistics for ML Build Python
16 Sampling Methods Build Python
17 Linear Systems Build Python
18 Convex Optimization Build Python
19 Complex Numbers for AI Learn Python
20 The Fourier Transform Build Python
21 Graph Theory for ML Build Python
22 Stochastic Processes Learn Python
Phase 2 — ML Fundamentals  18 lessons  Classical ML — still the backbone of most production AI.
# Lesson Type Lang
01 What Is Machine Learning Learn Python
02 Linear Regression from Scratch Build Python
03 Logistic Regression & Classification Build Python
04 Decision Trees & Random Forests Build Python
05 Support Vector Machines Build Python
06 KNN & Distance Metrics Build Python
07 Unsupervised Learning: K-Means, DBSCAN Build Python
08 Feature Engineering & Selection Build Python
09 Model Evaluation: Metrics, Cross-Validation Build Python
10 Bias, Variance & the Learning Curve Learn Python
11 Ensemble Methods: Boosting, Bagging, Stacking Build Python
12 Hyperparameter Tuning Build Python
13 ML Pipelines & Experiment Tracking Build Python
14 Naive Bayes Build Python
15 Time Series Fundamentals Build Python
16 Anomaly Detection Build Python
17 Handling Imbalanced Data Build Python
18 Feature Selection Build Python
Phase 3 — Deep Learning Core  13 lessons  Neural networks from first principles. No frameworks until you build one.
# Lesson Type Lang
01 The Perceptron: Where It All Started Build Python
02 Multi-Layer Networks & Forward Pass Build Python
03 Backpropagation from Scratch Build Python
04 Activation Functions: ReLU, Sigmoid, GELU & Why Build Python
05 Loss Functions: MSE, Cross-Entropy, Contrastive Build Python
06 Optimizers: SGD, Momentum, Adam, AdamW Build Python
07 Regularization: Dropout, Weight Decay, BatchNorm Build Python
08 Weight Initialization & Training Stability Build Python
09 Learning Rate Schedules & Warmup Build Python
10 Build Your Own Mini Framework Build Python
11 Introduction to PyTorch Build Python
12 Introduction to JAX Build Python
13 Debugging Neural Networks Build Python
Phase 4 — Computer Vision  28 lessons  From pixels to understanding — image, video, 3D, VLMs, and world models.
# Lesson Type Lang
01 Image Fundamentals: Pixels, Channels, Color Spaces Learn Python
02 Convolutions from Scratch Build Python
03 CNNs: LeNet to ResNet Build Python
04 Image Classification Build Python
05 Transfer Learning & Fine-Tuning Build Python
06 Object Detection — YOLO from Scratch Build Python
07 Semantic Segmentation — U-Net Build Python
08 Instance Segmentation — Mask R-CNN Build Python
09 Image Generation — GANs Build Python
10 Image Generation — Diffusion Models Build Python
11 Stable Diffusion — Architecture & Fine-Tuning Build Python
12 Video Understanding — Temporal Modeling Build Python
13 3D Vision: Point Clouds, NeRFs Build Python
14 Vision Transformers (ViT) Build Python
15 Real-Time Vision: Edge Deployment Build Python, Rust
16 Build a Complete Vision Pipeline Build Python
17 Self-Supervised Vision — SimCLR, DINO, MAE Build Python
18 Open-Vocabulary Vision — CLIP Build Python
19 OCR & Document Understanding Build Python
20 Image Retrieval & Metric Learning Build Python
21 Keypoint Detection & Pose Estimation Build Python
22 3D Gaussian Splatting from Scratch Build Python
23 Diffusion Transformers & Rectified Flow Build Python
24 SAM 3 & Open-Vocabulary Segmentation Build Python
25 Vision-Language Models (ViT-MLP-LLM) Build Python
26 Monocular Depth & Geometry Estimation Build Python
27 Multi-Object Tracking & Video Memory Build Python
28 World Models & Video Diffusion Build Python
Phase 5 — NLP: Foundations to Advanced  29 lessons  Language is the interface to intelligence.
# Lesson Type Lang
01 Text Processing: Tokenization, Stemming, Lemmatization Build Python
02 Bag of Words, TF-IDF & Text Representation Build Python
03 Word Embeddings: Word2Vec from Scratch Build Python
04 GloVe, FastText & Subword Embeddings Build Python
05 Sentiment Analysis Build Python
06 Named Entity Recognition (NER) Build Python
07 POS Tagging & Syntactic Parsing Build Python
08 Text Classification — CNNs & RNNs for Text Build Python
09 Sequence-to-Sequence Models Build Python
10 Attention Mechanism — The Breakthrough Build Python
11 Machine Translation Build Python
12 Text Summarization Build Python
13 Question Answering Systems Build Python
14 Information Retrieval & Search Build Python
15 Topic Modeling: LDA, BERTopic Build Python
16 Text Generation Build Python
17 Chatbots: Rule-Based to Neural Build Python
18 Multilingual NLP Build Python
19 Subword Tokenization: BPE, WordPiece, Unigram, SentencePiece Learn Python
20 Structured Outputs & Constrained Decoding Build Python
21 NLI & Textual Entailment Learn Python
22 Embedding Models Deep Dive Learn Python
23 Chunking Strategies for RAG Build Python
24 Coreference Resolution Learn Python
25 Entity Linking & Disambiguation Build Python
26 Relation Extraction & Knowledge Graph Construction Build Python
27 LLM Evaluation: RAGAS, DeepEval, G-Eval Build Python
28 Long-Context Evaluation: NIAH, RULER, LongBench, MRCR Learn Python
29 Dialogue State Tracking Build Python
Phase 6 — Speech & Audio  17 lessons  Hear, understand, speak.
# Lesson Type Lang
01 Audio Fundamentals: Waveforms, Sampling, FFT Learn Python
02 Spectrograms, Mel Scale & Audio Features Build Python
03 Audio Classification Build Python
04 Speech Recognition (ASR) Build Python
05 Whisper: Architecture & Fine-Tuning Build Python
06 Speaker Recognition & Verification Build Python
07 Text-to-Speech (TTS) Build Python
08 Voice Cloning & Voice Conversion Build Python
09 Music Generation Build Python
10 Audio-Language Models Build Python
11 Real-Time Audio Processing Build Python, Rust
12 Build a Voice Assistant Pipeline Build Python
13 Neural Audio Codecs — EnCodec, SNAC, Mimi, DAC Learn Python
14 Voice Activity Detection & Turn-Taking Build Python
15 Streaming Speech-to-Speech — Moshi, Hibiki Learn Python
16 Voice Anti-Spoofing & Audio Watermarking Build Python
17 Audio Evaluation — WER, MOS, MMAU, Leaderboards Learn Python
Phase 7 — Transformers Deep Dive  14 lessons  The architecture that changed everything.
# Lesson Type Lang
01 Why Transformers: The Problems with RNNs Learn Python
02 Self-Attention from Scratch Build Python
03 Multi-Head Attention Build Python
04 Positional Encoding: Sinusoidal, RoPE, ALiBi Build Python
05 The Full Transformer: Encoder + Decoder Build Python
06 BERT — Masked Language Modeling Build Python
07 GPT — Causal Language Modeling Build Python
08 T5, BART — Encoder-Decoder Models Learn Python
09 Vision Transformers (ViT) Build Python
10 Audio Transformers — Whisper Architecture Learn Python
11 Mixture of Experts (MoE) Build Python
12 KV Cache, Flash Attention & Inference Optimization Build Python
13 Scaling Laws Learn Python
14 Build a Transformer from Scratch Build Python
Phase 8 — Generative AI  14 lessons  Create images, video, audio, 3D, and more.
# Lesson Type Lang
01 Generative Models: Taxonomy & History Learn Python
02 Autoencoders & VAE Build Python
03 GANs: Generator vs Discriminator Build Python
04 Conditional GANs & Pix2Pix Build Python
05 StyleGAN Build Python
06 Diffusion Models — DDPM from Scratch Build Python
07 Latent Diffusion & Stable Diffusion Build Python
08 ControlNet, LoRA & Conditioning Build Python
09 Inpainting, Outpainting & Editing Build Python
10 Video Generation Build Python
11 Audio Generation Build Python
12 3D Generation Build Python
13 Flow Matching & Rectified Flows Build Python
14 Evaluation: FID, CLIP Score Build Python
Phase 9 — Reinforcement Learning  12 lessons  The foundation of RLHF and game-playing AI.
# Lesson Type Lang
01 MDPs, States, Actions & Rewards Learn Python
02 Dynamic Programming Build Python
03 Monte Carlo Methods Build Python
04 Q-Learning, SARSA Build Python
05 Deep Q-Networks (DQN) Build Python
06 Policy Gradients — REINFORCE Build Python
07 Actor-Critic — A2C, A3C Build Python
08 PPO Build Python
09 Reward Modeling & RLHF Build Python
10 Multi-Agent RL Build Python
11 Sim-to-Real Transfer Build Python
12 RL for Games Build Python
Phase 10 — LLMs from Scratch  22 lessons  Build, train, and understand large language models.
# Lesson Type Lang
01 Tokenizers: BPE, WordPiece, SentencePiece Build Python
02 Building a Tokenizer from Scratch Build Python
03 Data Pipelines for Pre-Training Build Python
04 Pre-Training a Mini GPT (124M) Build Python
05 Distributed Training, FSDP, DeepSpeed Build Python
06 Instruction Tuning — SFT Build Python
07 RLHF — Reward Model + PPO Build Python
08 DPO — Direct Preference Optimization Build Python
09 Constitutional AI & Self-Improvement Build Python
10 Evaluation — Benchmarks, Evals Build Python
11 Quantization: INT8, GPTQ, AWQ, GGUF Build Python, Rust
12 Inference Optimization Build Python
13 Building a Complete LLM Pipeline Build Python
14 Open Models: Architecture Walkthroughs Learn Python
15 Speculative Decoding and EAGLE-3 Build Python
16 Differential Attention (V2) Build Python
17 Native Sparse Attention (DeepSeek NSA) Build Python
18 Multi-Token Prediction (MTP) Build Python
19 DualPipe Parallelism Learn Python
20 DeepSeek-V3 Architecture Walkthrough Learn Python
21 Jamba — Hybrid SSM-Transformer Learn Python
22 Async and Hogwild! Inference Build Python
Phase 11 — LLM Engineering  15 lessons  Put LLMs to work in production.
# Lesson Type Lang
01 Prompt Engineering: Techniques & Patterns Build Python
02 Few-Shot, CoT, Tree-of-Thought Build Python
03 Structured Outputs Build Python, TypeScript
04 Embeddings & Vector Representations Build Python
05 Context Engineering Build Python, TypeScript
06 RAG: Retrieval-Augmented Generation Build Python, TypeScript
07 Advanced RAG: Chunking, Reranking Build Python
08 Fine-Tuning with LoRA & QLoRA Build Python
09 Function Calling & Tool Use Build Python
10 Evaluation & Testing Build Python
11 Caching, Rate Limiting & Cost Build Python
12 Guardrails & Safety Build Python
13 Building a Production LLM App Build Python
14 Model Context Protocol (MCP) Build Python
15 Prompt Caching & Context Caching Build Python
Phase 12 — Multimodal AI  25 lessons  See, hear, read, and reason across modalities — from ViT patches to computer-use agents.
# Lesson Type Lang
01 Vision Transformers and the Patch-Token Primitive Learn Python
02 CLIP and Contrastive Vision-Language Pretraining Build Python
03 BLIP-2 Q-Former as Modality Bridge Build Python
04 Flamingo and Gated Cross-Attention Learn Python
05 LLaVA and Visual Instruction Tuning Build Python
06 Any-Resolution Vision — Patch-n'-Pack and NaFlex Build Python
07 Open-Weight VLM Recipes: What Actually Matters Learn Python
08 LLaVA-OneVision: Single, Multi, Video Build Python
09 Qwen-VL Family and Dynamic-FPS Video Learn Python
10 InternVL3 Native Multimodal Pretraining Learn Python
11 Chameleon Early-Fusion Token-Only Build Python
12 Emu3 Next-Token Prediction for Generation Learn Python
13 Transfusion Autoregressive + Diffusion Build Python
14 Show-o Discrete-Diffusion Unified Learn Python
15 Janus-Pro Decoupled Encoders Build Python
16 MIO Any-to-Any Streaming Learn Python
17 Video-Language Temporal Grounding Build Python
18 Long-Video at Million-Token Context Build Python
19 Audio-Language Models: Whisper to AF3 Build Python
20 Omni Models: Thinker-Talker Streaming Build Python
21 Embodied VLAs: RT-2, OpenVLA, π0, GR00T Learn Python
22 Document and Diagram Understanding Build Python
23 ColPali Vision-Native Document RAG Build Python
24 Multimodal RAG and Cross-Modal Retrieval Build Python
25 Multimodal Agents and Computer-Use (Capstone) Build Python
Phase 13 — Tools & Protocols  23 lessons  The interfaces between AI and the real world.
# Lesson Type Lang
01 The Tool Interface Learn Python
02 Function Calling Deep Dive Build Python
03 Parallel and Streaming Tool Calls Build Python
04 Structured Output Build Python
05 Tool Schema Design Learn Python
06 MCP Fundamentals Learn Python
07 Building an MCP Server Build Python
08 Building an MCP Client Build Python
09 MCP Transports Learn Python
10 MCP Resources and Prompts Build Python
11 MCP Sampling Build Python
12 MCP Roots and Elicitation Build Python
13 MCP Async Tasks Build Python
14 MCP Apps Build Python
15 MCP Security I — Tool Poisoning Learn Python
16 MCP Security II — OAuth 2.1 Build Python
17 MCP Gateways and Registries Learn Python
18 MCP Auth in Production — DCR + JWKS on iii Build Python
19 A2A Protocol Build Python
20 OpenTelemetry GenAI Build Python
21 LLM Routing Layer Learn Python
22 Skills and Agent SDKs Learn Python
23 Capstone — Tool Ecosystem Build Python
Phase 14 — Agent Engineering  30 lessons  Build agents from first principles — loop, memory, planning, frameworks, benchmarks, production.
# Lesson Type Lang
01 The Agent Loop Build Python
02 ReWOO and Plan-and-Execute Build Python
03 Reflexion and Verbal Reinforcement Learning Build Python
04 Tree of Thoughts and LATS Build Python
05 Self-Refine and CRITIC Build Python
06 Tool Use and Function Calling Build Python
07 Memory — Virtual Context and MemGPT Build Python
08 Memory Blocks and Sleep-Time Compute Build Python
09 Hybrid Memory — Mem0 Vector + Graph + KV Build Python
10 Skill Libraries and Lifelong Learning — Voyager Build Python
11 Planning with HTN and Evolutionary Search Build Python
12 Anthropic's Workflow Patterns Build Python
13 LangGraph — Stateful Graphs and Durable Execution Build Python
14 AutoGen v0.4 — Actor Model Build Python
15 CrewAI — Role-Based Crews and Flows Build Python
16 OpenAI Agents SDK — Handoffs, Guardrails, Tracing Build Python
17 Claude Agent SDK — Subagents and Session Store Build Python
18 Agno and Mastra — Production Runtimes Learn Python, TypeScript
19 Benchmarks — SWE-bench, GAIA, AgentBench Learn Python
20 Benchmarks — WebArena and OSWorld Learn Python
21 Computer Use — Claude, OpenAI CUA, Gemini Build Python
22 Voice Agents — Pipecat and LiveKit Build Python
23 OpenTelemetry GenAI Semantic Conventions Build Python
24 Agent Observability — Langfuse, Phoenix, Opik Learn Python
25 Multi-Agent Debate and Collaboration Build Python
26 Failure Modes — Why Agents Break Build Python
27 Prompt Injection and the PVE Defense Build Python
28 Orchestration Patterns — Supervisor, Swarm, Hierarchical Build Python
29 Production Runtimes — Queue, Event, Cron Learn Python
30 Eval-Driven Agent Development Build Python
Phase 15 — Autonomous Systems  22 lessons  Long-horizon agents, self-improvement, and the 2026 safety stack.
# Lesson Type Lang
01 From Chatbots to Long-Horizon Agents (METR) Learn Python
02 STaR, V-STaR, Quiet-STaR: Self-Taught Reasoning Learn Python
03 AlphaEvolve: Evolutionary Coding Agents Learn Python
04 Darwin Gödel Machine: Self-Modifying Agents Learn Python
05 AI Scientist v2: Workshop-Level Research Learn Python
06 Automated Alignment Research (Anthropic AAR) Learn Python
07 Recursive Self-Improvement: Capability vs Alignment Learn Python
08 Bounded Self-Improvement Designs Learn Python
09 Autonomous Coding Agent Landscape (SWE-bench, CodeAct) Learn Python
10 Claude Code Permission Modes and Auto Mode Learn Python
11 Browser Agents and Indirect Prompt Injection Learn Python
12 Durable Execution for Long-Running Agents Learn Python
13 Action Budgets, Iteration Caps, Cost Governors Learn Python
14 Kill Switches, Circuit Breakers, Canary Tokens Learn Python
15 HITL: Propose-Then-Commit Learn Python
16 Checkpoints and Rollback Learn Python
17 Constitutional AI and Rule Overrides Learn Python
18 Llama Guard and Input/Output Classification Learn Python
19 Anthropic Responsible Scaling Policy v3.0 Learn Python
20 OpenAI Preparedness Framework and DeepMind FSF Learn Python
21 METR Time Horizons and External Evaluation Learn Python
22 CAIS, CAISI, and Societal-Scale Risk Learn Python
Phase 16 — Multi-Agent & Swarms  25 lessons  Coordination, emergence, and collective intelligence.
# Lesson Type Lang
01 Why Multi-Agent Learn TypeScript
02 FIPA-ACL Heritage and Speech Acts Learn Python
03 Communication Protocols Build TypeScript
04 The Multi-Agent Primitive Model Learn Python
05 Supervisor / Orchestrator-Worker Pattern Build Python
06 Hierarchical Architecture and Decomposition Drift Learn Python
07 Society of Mind and Multi-Agent Debate Build Python
08 Role Specialization — Planner / Critic / Executor / Verifier Build Python
09 Parallel Swarm and Networked Architectures Build Python
10 Group Chat and Speaker Selection Build Python
11 Handoffs and Routines (Stateless Orchestration) Build Python
12 A2A — The Agent-to-Agent Protocol Build Python
13 Shared Memory and Blackboard Patterns Build Python
14 Consensus and Byzantine Fault Tolerance Build Python
15 Voting, Self-Consistency, and Debate Topology Build Python
16 Negotiation and Bargaining Build Python
17 Generative Agents and Emergent Simulation Build Python
18 Theory of Mind and Emergent Coordination Build Python
19 Swarm Optimization (PSO, ACO) Build Python
20 MARL — MADDPG, QMIX, MAPPO Learn Python
21 Agent Economies, Token Incentives, Reputation Learn Python
22 Production Scaling — Queues, Checkpoints, Durability Build Python
23 Failure Modes — MAST, Groupthink, Monoculture Learn Python
24 Evaluation and Coordination Benchmarks Learn Python
25 Case Studies and 2026 State of the Art Learn Python
Phase 17 — Infrastructure & Production  28 lessons  Ship AI to the real world.
# Lesson Type Lang
01 Managed LLM Platforms — Bedrock, Azure OpenAI, Vertex AI Learn Python
02 Inference Platform Economics — Fireworks, Together, Baseten, Modal Learn Python
03 GPU Autoscaling on Kubernetes — Karpenter, KAI Scheduler Learn Python
04 vLLM Serving Internals — PagedAttention, Continuous Batching, Chunked Prefill Learn Python
05 EAGLE-3 Speculative Decoding in Production Learn Python
06 SGLang and RadixAttention for Prefix-Heavy Workloads Learn Python
07 TensorRT-LLM on Blackwell with FP8 and NVFP4 Learn Python
08 Inference Metrics — TTFT, TPOT, ITL, Goodput, P99 Learn Python
09 Production Quantization — AWQ, GPTQ, GGUF, FP8, NVFP4 Learn Python
10 Cold Start Mitigation for Serverless LLMs Learn Python
11 Multi-Region LLM Serving and KV Cache Locality Learn Python
12 Edge Inference — ANE, Hexagon, WebGPU, Jetson Learn Python
13 LLM Observability Stack Selection Learn Python
14 Prompt Caching and Semantic Caching Economics Learn Python
15 Batch APIs — the 50% Discount as Industry Standard Learn Python
16 Model Routing as a Cost-Reduction Primitive Learn Python
17 Disaggregated Prefill/Decode — NVIDIA Dynamo and llm-d Learn Python
18 vLLM Production Stack with LMCache KV Offloading Learn Python
19 AI Gateways — LiteLLM, Portkey, Kong, Bifrost Learn Python
20 Shadow, Canary, and Progressive Deployment Learn Python
21 A/B Testing LLM Features — GrowthBook and Statsig Learn Python
22 Load Testing LLM APIs — k6, LLMPerf, GenAI-Perf Build Python
23 SRE for AI — Multi-Agent Incident Response Learn Python
24 Chaos Engineering for LLM Production Learn Python
25 Security — Secrets, PII Scrubbing, Audit Logs Learn Python
26 Compliance — SOC 2, HIPAA, GDPR, EU AI Act, ISO 42001 Learn Python
27 FinOps for LLMs — Unit Economics and Multi-Tenant Attribution Learn Python
28 Self-Hosted Serving Selection — llama.cpp, Ollama, TGI, vLLM, SGLang Learn Python
Phase 18 — Ethics, Safety & Alignment  30 lessons  Build AI that helps humanity. Not optional.
# Lesson Type Lang
01 Instruction-Following as Alignment Signal Learn Python
02 Reward Hacking & Goodhart's Law Learn Python
03 Direct Preference Optimization Family Learn Python
04 Sycophancy as RLHF Amplification Learn Python
05 Constitutional AI & RLAIF Learn Python
06 Mesa-Optimization & Deceptive Alignment Learn Python
07 Sleeper Agents — Persistent Deception Learn Python
08 In-Context Scheming in Frontier Models Learn Python
09 Alignment Faking Learn Python
10 AI Control — Safety Despite Subversion Learn Python
11 Scalable Oversight & Weak-to-Strong Learn Python
12 Red-Teaming: PAIR & Automated Attacks Build Python
13 Many-Shot Jailbreaking Learn Python
14 ASCII Art & Visual Jailbreaks Build Python
15 Indirect Prompt Injection Build Python
16 Red-Team Tooling: Garak, Llama Guard, PyRIT Build Python
17 WMDP & Dual-Use Capability Evaluation Learn Python
18 Frontier Safety Frameworks — RSP, PF, FSF Learn
19 Model Welfare Research Learn Python
20 Bias & Representational Harm Build Python
21 Fairness Criteria: Group, Individual, Counterfactual Learn Python
22 Differential Privacy for LLMs Build Python
23 Watermarking: SynthID, Stable Signature, C2PA Build Python
24 Regulatory Frameworks: EU, US, UK, Korea Learn
25 EchoLeak & CVEs for AI Learn Python
26 Model, System & Dataset Cards Build Python
27 Data Provenance & Training-Data Governance Learn Python
28 Alignment Research Ecosystem: MATS, Redwood, Apollo, METR Learn
29 Moderation Systems: OpenAI, Perspective, Llama Guard Build Python
30 Dual-Use Risk: Cyber, Bio, Chem, Nuclear Learn
Phase 19 — Capstone Projects  17 projects  2026 end-to-end shippable products, 20-40 hours each.
# Project Combines Lang
01 Terminal-Native Coding Agent P0 P5 P7 P10 P11 P13 P14 P15 P17 P18 TypeScript, Python
02 RAG over Codebase (Cross-Repo Semantic Search) P5 P7 P11 P13 P17 Python, TypeScript
03 Real-Time Voice Assistant (ASR → LLM → TTS) P6 P7 P11 P13 P14 P17 Python, TypeScript
04 Multimodal Document QA (Vision-First) P4 P5 P7 P11 P12 P17 Python, TypeScript
05 Autonomous Research Agent (AI-Scientist Class) P0 P2 P3 P7 P10 P14 P15 P16 P18 Python
06 DevOps Troubleshooting Agent for Kubernetes P11 P13 P14 P15 P17 P18 Python, TypeScript
07 End-to-End Fine-Tuning Pipeline P2 P3 P7 P10 P11 P17 P18 Python
08 Production RAG Chatbot (Regulated Vertical) P5 P7 P11 P12 P17 P18 Python, TypeScript
09 Code Migration Agent (Repo-Level Upgrade) P5 P7 P11 P13 P14 P15 P17 Python, TypeScript
10 Multi-Agent Software Engineering Team P11 P13 P14 P15 P16 P17 Python, TypeScript
11 LLM Observability & Eval Dashboard P11 P13 P17 P18 TypeScript, Python
12 Video Understanding Pipeline (Scene → QA) P4 P6 P7 P11 P12 P17 Python, TypeScript
13 MCP Server with Registry and Governance P11 P13 P14 P17 P18 Python, TypeScript
14 Speculative-Decoding Inference Server P3 P7 P10 P17 Python
15 Constitutional Safety Harness + Red-Team Range P10 P11 P13 P14 P18 Python
16 GitHub Issue-to-PR Autonomous Agent P11 P13 P14 P15 P17 Python, TypeScript
17 Personal AI Tutor (Adaptive, Multimodal) P5 P6 P11 P12 P14 P17 P18 Python, TypeScript
░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

The toolkit

Every lesson produces a reusable artifact. By the end you have:

outputs/
├── prompts/      prompt templates for every AI task
├── skills/       SKILL.md files for AI coding agents
├── agents/       agent definitions ready to deploy
└── mcp-servers/  MCP servers built during the course

Install them with SkillKit. Plug them into Claude Code, Cursor, or any MCP-compatible agent. Real tools, not homework.

Where to start

Background Start at Estimated time
New to programming and AI Phase 0 — Setup ~306 hours
Know Python, new to ML Phase 1 — Math Foundations ~270 hours
Know ML, new to deep learning Phase 3 — Deep Learning Core ~200 hours
Know deep learning, want LLMs and agents Phase 10 — LLMs from Scratch ~100 hours
Senior engineer, only want agent engineering Phase 14 — Agent Engineering ~60 hours
░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Why this matters now

FIG_003 · A
THE INDUSTRY SIGNAL
FIG_003 · B
FOUNDATIONAL PAPERS COVERED

"The hottest new programming language is English."
Andrej Karpathy (tweet)

"Software engineering is being remade in front of our eyes."
Boris Cherny, creator of Claude Code

"Models will keep getting better. The skill that compounds is knowing what to build."
— Industry consensus, 2026

  • Attention Is All You Need — Vaswani et al., 2017 → Phase 7
  • Language Models are Few-Shot Learners (GPT-3) → Phase 10
  • Denoising Diffusion Probabilistic ModelsPhase 8
  • InstructGPT / RLHFPhase 10
  • Direct Preference OptimizationPhase 10
  • Chain-of-Thought PromptingPhase 11
  • ReAct: Reasoning + Acting in LLMsPhase 14
  • Model Context Protocol — Anthropic → Phase 13
░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Contributing

Goal Read
Contribute a lesson or fix CONTRIBUTING.md
Fork for your team or school FORKING.md
Lesson template LESSON_TEMPLATE.md
Track progress ROADMAP.md
Glossary glossary/terms.md
Code of conduct CODE_OF_CONDUCT.md
░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Star history

Star history

If this manual helped you, star the repo. It keeps the project alive.

License

MIT. Use it however you want — fork it, teach it, sell it, ship it. Attribution appreciated, not required.

Maintained by Rohit Ghumare and the community.

@ghumare64  ·  aiengineeringfromscratch.com  ·  Report / Suggest