GitHub - rohitg00/ai-engineering-from-scratch: Learn it. Build it. Ship it for others.

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

84% of students already use AI tools. Only 18% feel prepared to use them professionally. This curriculum closes that gap.

416 lessons. 20 phases. ~320 hours. Python, TypeScript, Rust, Julia. Every lesson ships a reusable artifact: a prompt, a skill, an agent, an MCP server. Free, open source, MIT.

You don't just learn AI. You build it. End-to-end. By hand.

How this works

Most AI material teaches in scattered pieces. A paper here, a fine-tuning post there, a flashy agent demo somewhere else. The pieces rarely line up. You ship a chatbot but can't explain its loss curve. You hook a function to an agent but can't say what attention does inside the model that's calling it.

This curriculum is the spine. 20 phases, 416 lessons, four languages: Python, TypeScript, Rust, Julia. Linear algebra at one end, autonomous swarms at the other. Every algorithm gets built from raw math first. Backprop. Tokenizer. Attention. Agent loop. By the time PyTorch shows up, you already know what it's doing under the hood.

Each lesson runs the same loop: read the problem, derive the math, write the code, run the test, keep the artifact. No five-minute videos, no copy-paste deploys, no hand-holding. Free, open source, and built to run on your own laptop.

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

The shape of the curriculum

Twenty phases stack on top of each other. Math is the floor. Agents and production are the roof. Skip ahead if you already know the lower layers, but don't skip and then wonder why something at the top is breaking.

%%{init: {'theme':'base','themeVariables':{'primaryColor':'#fafaf5','primaryTextColor':'#1a1a1a','primaryBorderColor':'#3553ff','lineColor':'#3553ff','fontFamily':'JetBrains Mono','fontSize':'12px'}}}%%
flowchart TB
  P0["Phase 0 — Setup &amp; Tooling"] --> P1["Phase 1 — Math Foundations"]
  P1 --> P2["Phase 2 — ML Fundamentals"]
  P2 --> P3["Phase 3 — Deep Learning Core"]
  P3 --> P4["Phase 4 — Vision"]
  P3 --> P5["Phase 5 — NLP"]
  P3 --> P6["Phase 6 — Speech &amp; Audio"]
  P3 --> P9["Phase 9 — RL"]
  P5 --> P7["Phase 7 — Transformers"]
  P7 --> P8["Phase 8 — GenAI"]
  P7 --> P10["Phase 10 — LLMs from Scratch"]
  P10 --> P11["Phase 11 — LLM Engineering"]
  P10 --> P12["Phase 12 — Multimodal"]
  P11 --> P13["Phase 13 — Tools &amp; Protocols"]
  P13 --> P14["Phase 14 — Agent Engineering"]
  P14 --> P15["Phase 15 — Autonomous Systems"]
  P15 --> P16["Phase 16 — Multi-Agent &amp; Swarms"]
  P14 --> P17["Phase 17 — Infrastructure &amp; Production"]
  P15 --> P18["Phase 18 — Ethics &amp; Alignment"]
  P16 --> P19["Phase 19 — Capstone Projects"]
  P17 --> P19
  P18 --> P19

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

The shape of a lesson

Each lesson lives in its own folder, with the same structure across the entire curriculum:

phases/<NN>-<phase-name>/<NN>-<lesson-name>/
├── code/      runnable implementations (Python, TypeScript, Rust, Julia)
├── docs/
│   └── en.md  lesson narrative
└── outputs/   prompts, skills, agents, or MCP servers this lesson produces

Every lesson follows six beats. The Build It / Use It split is the spine — you implement the algorithm from scratch first, then run the same thing through the production library. You understand what the framework is doing because you wrote the smaller version yourself.

%%{init: {'theme':'base','themeVariables':{'primaryColor':'#fafaf5','primaryTextColor':'#1a1a1a','primaryBorderColor':'#3553ff','lineColor':'#3553ff','fontFamily':'JetBrains Mono','fontSize':'13px'}}}%%
flowchart LR
  M["MOTTO<br/><sub>one-line core idea</sub>"] --> Pr["PROBLEM<br/><sub>concrete pain</sub>"]
  Pr --> C["CONCEPT<br/><sub>diagrams &amp; intuition</sub>"]
  C --> B["BUILD IT<br/><sub>raw math, no frameworks</sub>"]
  B --> U["USE IT<br/><sub>same thing in PyTorch / sklearn</sub>"]
  U --> S["SHIP IT<br/><sub>prompt · skill · agent · MCP</sub>"]

Getting started

Three ways in. Pick one.

Option A — read. Open any completed lesson on aiengineeringfromscratch.com or expand a phase under Contents. No setup, no cloning.

Option B — clone and run.

git clone https://github.com/rohitg00/ai-engineering-from-scratch.git
cd ai-engineering-from-scratch
python phases/01-math-foundations/01-linear-algebra-intuition/code/vectors.py

Option C — find your level (recommended). Skip ahead intelligently. Inside Claude Code:

/find-your-level

Ten questions. Maps your knowledge to a starting phase, builds a personalized path with hour estimates. After each phase:

/check-understanding 3        # quiz yourself on phase 3
ls phases/03-deep-learning-core/05-loss-functions/outputs/
# ├── prompt-loss-function-selector.md
# └── prompt-loss-debugger.md

Prerequisites

You can write code (any language; Python helps).
You want to understand how AI actually works, not just call APIs.

Built-in Claude Code skills

Skill	What it does
`/find-your-level`	Ten-question placement quiz. Maps your knowledge to a starting phase and produces a personalized path with hour estimates.
`/check-understanding <phase>`	Per-phase quiz, eight questions, with feedback and specific lessons to review.

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Every lesson ships something

Other curricula end with "congratulations, you learned X." Each lesson here ends with a reusable tool you can install or paste into your daily workflow.

_{FIG_001 · A} PROMPTS	_{FIG_001 · B} SKILLS	_{FIG_001 · C} AGENTS	_{FIG_001 · D} MCP SERVERS
Paste into any AI assistant for expert-level help on a narrow task.	Drop into Claude Code, Cursor, Codex, or any agent that reads `SKILL.md`.	Deploy as autonomous workers — you wrote the loop yourself in Phase 14.	Plug into any MCP-compatible client. Built end-to-end in Phase 13.

Install the lot with SkillKit. Real tools, not homework. By the end of the curriculum, you have a portfolio of 416 artifacts you actually understand because you built them.

FIG_002 · A worked sample

Phase 14, lesson 1: the agent loop. ~120 lines of pure Python, no dependencies.

code/agent_loop.py _{build it}

def run(query, tools):
    history = [user(query)]
    for step in range(MAX_STEPS):
        msg = llm(history)
        if msg.tool_calls:
            for call in msg.tool_calls:
                result = tools[call.name](**call.args)
                history.append(tool_result(call.id, result))
            continue
        return msg.content
    raise StepLimitExceeded

outputs/skill-agent-loop.md _{ship it}

---
name: agent-loop
description: ReAct-style loop for any tool list
phase: 14
lesson: 01
---

Implement a minimal agent loop that...

outputs/prompt-debug-agent.md

You are an agent debugger. Given the trace
of an agent run, identify the step where
the agent went wrong and explain why...

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

#	Lesson	Type	Lang
01	Dev Environment	Build	Python, TypeScript, Rust
02	Git & Collaboration	Learn	—
03	GPU Setup & Cloud	Build	Python
04	APIs & Keys	Build	Python, TypeScript
05	Jupyter Notebooks	Build	Python
06	Python Environments	Build	Python
07	Docker for AI	Build	Python
08	Editor Setup	Build	—
09	Data Management	Build	Python
10	Terminal & Shell	Learn	—
11	Linux for AI	Learn	—
12	Debugging & Profiling	Build	Python

Phase 1 — Math Foundations 22 lessons The intuition behind every AI algorithm, through code.

#	Lesson	Type	Lang
01	Linear Algebra Intuition	Learn	Python, Julia
02	Vectors, Matrices & Operations	Build	Python, Julia
03	Matrix Transformations & Eigenvalues	Build	Python, Julia
04	Calculus for ML: Derivatives & Gradients	Learn	Python
05	Chain Rule & Automatic Differentiation	Build	Python
06	Probability & Distributions	Learn	Python
07	Bayes' Theorem & Statistical Thinking	Build	Python
08	Optimization: Gradient Descent Family	Build	Python
09	Information Theory: Entropy, KL Divergence	Learn	Python
10	Dimensionality Reduction: PCA, t-SNE, UMAP	Build	Python
11	Singular Value Decomposition	Build	Python, Julia
12	Tensor Operations	Build	Python
13	Numerical Stability	Build	Python
14	Norms & Distances	Build	Python
15	Statistics for ML	Build	Python
16	Sampling Methods	Build	Python
17	Linear Systems	Build	Python
18	Convex Optimization	Build	Python
19	Complex Numbers for AI	Learn	Python
20	The Fourier Transform	Build	Python
21	Graph Theory for ML	Build	Python
22	Stochastic Processes	Learn	Python

Phase 2 — ML Fundamentals 18 lessons Classical ML — still the backbone of most production AI.

#	Lesson	Type	Lang
01	What Is Machine Learning	Learn	Python
02	Linear Regression from Scratch	Build	Python
03	Logistic Regression & Classification	Build	Python
04	Decision Trees & Random Forests	Build	Python
05	Support Vector Machines	Build	Python
06	KNN & Distance Metrics	Build	Python
07	Unsupervised Learning: K-Means, DBSCAN	Build	Python
08	Feature Engineering & Selection	Build	Python
09	Model Evaluation: Metrics, Cross-Validation	Build	Python
10	Bias, Variance & the Learning Curve	Learn	Python
11	Ensemble Methods: Boosting, Bagging, Stacking	Build	Python
12	Hyperparameter Tuning	Build	Python
13	ML Pipelines & Experiment Tracking	Build	Python
14	Naive Bayes	Build	Python
15	Time Series Fundamentals	Build	Python
16	Anomaly Detection	Build	Python
17	Handling Imbalanced Data	Build	Python
18	Feature Selection	Build	Python

Phase 3 — Deep Learning Core 13 lessons Neural networks from first principles. No frameworks until you build one.

#	Lesson	Type	Lang
01	The Perceptron: Where It All Started	Build	Python
02	Multi-Layer Networks & Forward Pass	Build	Python
03	Backpropagation from Scratch	Build	Python
04	Activation Functions: ReLU, Sigmoid, GELU & Why	Build	Python
05	Loss Functions: MSE, Cross-Entropy, Contrastive	Build	Python
06	Optimizers: SGD, Momentum, Adam, AdamW	Build	Python
07	Regularization: Dropout, Weight Decay, BatchNorm	Build	Python
08	Weight Initialization & Training Stability	Build	Python
09	Learning Rate Schedules & Warmup	Build	Python
10	Build Your Own Mini Framework	Build	Python
11	Introduction to PyTorch	Build	Python
12	Introduction to JAX	Build	Python
13	Debugging Neural Networks	Build	Python

Phase 4 — Computer Vision 28 lessons From pixels to understanding — image, video, 3D, VLMs, and world models.

#	Lesson	Type	Lang
01	Image Fundamentals: Pixels, Channels, Color Spaces	Learn	Python
02	Convolutions from Scratch	Build	Python
03	CNNs: LeNet to ResNet	Build	Python
04	Image Classification	Build	Python
05	Transfer Learning & Fine-Tuning	Build	Python
06	Object Detection — YOLO from Scratch	Build	Python
07	Semantic Segmentation — U-Net	Build	Python
08	Instance Segmentation — Mask R-CNN	Build	Python
09	Image Generation — GANs	Build	Python
10	Image Generation — Diffusion Models	Build	Python
11	Stable Diffusion — Architecture & Fine-Tuning	Build	Python
12	Video Understanding — Temporal Modeling	Build	Python
13	3D Vision: Point Clouds, NeRFs	Build	Python
14	Vision Transformers (ViT)	Build	Python
15	Real-Time Vision: Edge Deployment	Build	Python, Rust
16	Build a Complete Vision Pipeline	Build	Python
17	Self-Supervised Vision — SimCLR, DINO, MAE	Build	Python
18	Open-Vocabulary Vision — CLIP	Build	Python
19	OCR & Document Understanding	Build	Python
20	Image Retrieval & Metric Learning	Build	Python
21	Keypoint Detection & Pose Estimation	Build	Python
22	3D Gaussian Splatting from Scratch	Build	Python
23	Diffusion Transformers & Rectified Flow	Build	Python
24	SAM 3 & Open-Vocabulary Segmentation	Build	Python
25	Vision-Language Models (ViT-MLP-LLM)	Build	Python
26	Monocular Depth & Geometry Estimation	Build	Python
27	Multi-Object Tracking & Video Memory	Build	Python
28	World Models & Video Diffusion	Build	Python

Phase 5 — NLP: Foundations to Advanced 29 lessons Language is the interface to intelligence.

#	Lesson	Type	Lang
01	Text Processing: Tokenization, Stemming, Lemmatization	Build	Python
02	Bag of Words, TF-IDF & Text Representation	Build	Python
03	Word Embeddings: Word2Vec from Scratch	Build	Python
04	GloVe, FastText & Subword Embeddings	Build	Python
05	Sentiment Analysis	Build	Python
06	Named Entity Recognition (NER)	Build	Python
07	POS Tagging & Syntactic Parsing	Build	Python
08	Text Classification — CNNs & RNNs for Text	Build	Python
09	Sequence-to-Sequence Models	Build	Python
10	Attention Mechanism — The Breakthrough	Build	Python
11	Machine Translation	Build	Python
12	Text Summarization	Build	Python
13	Question Answering Systems	Build	Python
14	Information Retrieval & Search	Build	Python
15	Topic Modeling: LDA, BERTopic	Build	Python
16	Text Generation	Build	Python
17	Chatbots: Rule-Based to Neural	Build	Python
18	Multilingual NLP	Build	Python
19	Subword Tokenization: BPE, WordPiece, Unigram, SentencePiece	Learn	Python
20	Structured Outputs & Constrained Decoding	Build	Python
21	NLI & Textual Entailment	Learn	Python
22	Embedding Models Deep Dive	Learn	Python
23	Chunking Strategies for RAG	Build	Python
24	Coreference Resolution	Learn	Python
25	Entity Linking & Disambiguation	Build	Python
26	Relation Extraction & Knowledge Graph Construction	Build	Python
27	LLM Evaluation: RAGAS, DeepEval, G-Eval	Build	Python
28	Long-Context Evaluation: NIAH, RULER, LongBench, MRCR	Learn	Python
29	Dialogue State Tracking	Build	Python

Phase 6 — Speech & Audio 17 lessons Hear, understand, speak.

#	Lesson	Type	Lang
01	Audio Fundamentals: Waveforms, Sampling, FFT	Learn	Python
02	Spectrograms, Mel Scale & Audio Features	Build	Python
03	Audio Classification	Build	Python
04	Speech Recognition (ASR)	Build	Python
05	Whisper: Architecture & Fine-Tuning	Build	Python
06	Speaker Recognition & Verification	Build	Python
07	Text-to-Speech (TTS)	Build	Python
08	Voice Cloning & Voice Conversion	Build	Python
09	Music Generation	Build	Python
10	Audio-Language Models	Build	Python
11	Real-Time Audio Processing	Build	Python, Rust
12	Build a Voice Assistant Pipeline	Build	Python
13	Neural Audio Codecs — EnCodec, SNAC, Mimi, DAC	Learn	Python
14	Voice Activity Detection & Turn-Taking	Build	Python
15	Streaming Speech-to-Speech — Moshi, Hibiki	Learn	Python
16	Voice Anti-Spoofing & Audio Watermarking	Build	Python
17	Audio Evaluation — WER, MOS, MMAU, Leaderboards	Learn	Python

Phase 7 — Transformers Deep Dive 14 lessons The architecture that changed everything.

#	Lesson	Type	Lang
01	Why Transformers: The Problems with RNNs	Learn	Python
02	Self-Attention from Scratch	Build	Python
03	Multi-Head Attention	Build	Python
04	Positional Encoding: Sinusoidal, RoPE, ALiBi	Build	Python
05	The Full Transformer: Encoder + Decoder	Build	Python
06	BERT — Masked Language Modeling	Build	Python
07	GPT — Causal Language Modeling	Build	Python
08	T5, BART — Encoder-Decoder Models	Learn	Python
09	Vision Transformers (ViT)	Build	Python
10	Audio Transformers — Whisper Architecture	Learn	Python
11	Mixture of Experts (MoE)	Build	Python
12	KV Cache, Flash Attention & Inference Optimization	Build	Python
13	Scaling Laws	Learn	Python
14	Build a Transformer from Scratch	Build	Python

Phase 8 — Generative AI 14 lessons Create images, video, audio, 3D, and more.

#	Lesson	Type	Lang
01	Generative Models: Taxonomy & History	Learn	Python
02	Autoencoders & VAE	Build	Python
03	GANs: Generator vs Discriminator	Build	Python
04	Conditional GANs & Pix2Pix	Build	Python
05	StyleGAN	Build	Python
06	Diffusion Models — DDPM from Scratch	Build	Python
07	Latent Diffusion & Stable Diffusion	Build	Python
08	ControlNet, LoRA & Conditioning	Build	Python
09	Inpainting, Outpainting & Editing	Build	Python
10	Video Generation	Build	Python
11	Audio Generation	Build	Python
12	3D Generation	Build	Python
13	Flow Matching & Rectified Flows	Build	Python
14	Evaluation: FID, CLIP Score	Build	Python

Phase 9 — Reinforcement Learning 12 lessons The foundation of RLHF and game-playing AI.

#	Lesson	Type	Lang
01	MDPs, States, Actions & Rewards	Learn	Python
02	Dynamic Programming	Build	Python
03	Monte Carlo Methods	Build	Python
04	Q-Learning, SARSA	Build	Python
05	Deep Q-Networks (DQN)	Build	Python
06	Policy Gradients — REINFORCE	Build	Python
07	Actor-Critic — A2C, A3C	Build	Python
08	PPO	Build	Python
09	Reward Modeling & RLHF	Build	Python
10	Multi-Agent RL	Build	Python
11	Sim-to-Real Transfer	Build	Python
12	RL for Games	Build	Python

Phase 10 — LLMs from Scratch 22 lessons Build, train, and understand large language models.

#	Lesson	Type	Lang
01	Tokenizers: BPE, WordPiece, SentencePiece	Build	Python
02	Building a Tokenizer from Scratch	Build	Python
03	Data Pipelines for Pre-Training	Build	Python
04	Pre-Training a Mini GPT (124M)	Build	Python
05	Distributed Training, FSDP, DeepSpeed	Build	Python
06	Instruction Tuning — SFT	Build	Python
07	RLHF — Reward Model + PPO	Build	Python
08	DPO — Direct Preference Optimization	Build	Python
09	Constitutional AI & Self-Improvement	Build	Python
10	Evaluation — Benchmarks, Evals	Build	Python
11	Quantization: INT8, GPTQ, AWQ, GGUF	Build	Python, Rust
12	Inference Optimization	Build	Python
13	Building a Complete LLM Pipeline	Build	Python
14	Open Models: Architecture Walkthroughs	Learn	Python
15	Speculative Decoding and EAGLE-3	Build	Python
16	Differential Attention (V2)	Build	Python
17	Native Sparse Attention (DeepSeek NSA)	Build	Python
18	Multi-Token Prediction (MTP)	Build	Python
19	DualPipe Parallelism	Learn	Python
20	DeepSeek-V3 Architecture Walkthrough	Learn	Python
21	Jamba — Hybrid SSM-Transformer	Learn	Python
22	Async and Hogwild! Inference	Build	Python

Phase 11 — LLM Engineering 15 lessons Put LLMs to work in production.

#	Lesson	Type	Lang
01	Prompt Engineering: Techniques & Patterns	Build	Python
02	Few-Shot, CoT, Tree-of-Thought	Build	Python
03	Structured Outputs	Build	Python, TypeScript
04	Embeddings & Vector Representations	Build	Python
05	Context Engineering	Build	Python, TypeScript
06	RAG: Retrieval-Augmented Generation	Build	Python, TypeScript
07	Advanced RAG: Chunking, Reranking	Build	Python
08	Fine-Tuning with LoRA & QLoRA	Build	Python
09	Function Calling & Tool Use	Build	Python
10	Evaluation & Testing	Build	Python
11	Caching, Rate Limiting & Cost	Build	Python
12	Guardrails & Safety	Build	Python
13	Building a Production LLM App	Build	Python
14	Model Context Protocol (MCP)	Build	Python
15	Prompt Caching & Context Caching	Build	Python

Phase 12 — Multimodal AI 25 lessons See, hear, read, and reason across modalities — from ViT patches to computer-use agents.

#	Lesson	Type	Lang
01	Vision Transformers and the Patch-Token Primitive	Learn	Python
02	CLIP and Contrastive Vision-Language Pretraining	Build	Python
03	BLIP-2 Q-Former as Modality Bridge	Build	Python
04	Flamingo and Gated Cross-Attention	Learn	Python
05	LLaVA and Visual Instruction Tuning	Build	Python
06	Any-Resolution Vision — Patch-n'-Pack and NaFlex	Build	Python
07	Open-Weight VLM Recipes: What Actually Matters	Learn	Python
08	LLaVA-OneVision: Single, Multi, Video	Build	Python
09	Qwen-VL Family and Dynamic-FPS Video	Learn	Python
10	InternVL3 Native Multimodal Pretraining	Learn	Python
11	Chameleon Early-Fusion Token-Only	Build	Python
12	Emu3 Next-Token Prediction for Generation	Learn	Python
13	Transfusion Autoregressive + Diffusion	Build	Python
14	Show-o Discrete-Diffusion Unified	Learn	Python
15	Janus-Pro Decoupled Encoders	Build	Python
16	MIO Any-to-Any Streaming	Learn	Python
17	Video-Language Temporal Grounding	Build	Python
18	Long-Video at Million-Token Context	Build	Python
19	Audio-Language Models: Whisper to AF3	Build	Python
20	Omni Models: Thinker-Talker Streaming	Build	Python
21	Embodied VLAs: RT-2, OpenVLA, π0, GR00T	Learn	Python
22	Document and Diagram Understanding	Build	Python
23	ColPali Vision-Native Document RAG	Build	Python
24	Multimodal RAG and Cross-Modal Retrieval	Build	Python
25	Multimodal Agents and Computer-Use (Capstone)	Build	Python

Phase 13 — Tools & Protocols 23 lessons The interfaces between AI and the real world.

#	Lesson	Type	Lang
01	The Tool Interface	Learn	Python
02	Function Calling Deep Dive	Build	Python
03	Parallel and Streaming Tool Calls	Build	Python
04	Structured Output	Build	Python
05	Tool Schema Design	Learn	Python
06	MCP Fundamentals	Learn	Python
07	Building an MCP Server	Build	Python
08	Building an MCP Client	Build	Python
09	MCP Transports	Learn	Python
10	MCP Resources and Prompts	Build	Python
11	MCP Sampling	Build	Python
12	MCP Roots and Elicitation	Build	Python
13	MCP Async Tasks	Build	Python
14	MCP Apps	Build	Python
15	MCP Security I — Tool Poisoning	Learn	Python
16	MCP Security II — OAuth 2.1	Build	Python
17	MCP Gateways and Registries	Learn	Python
18	MCP Auth in Production — DCR + JWKS on iii	Build	Python
19	A2A Protocol	Build	Python
20	OpenTelemetry GenAI	Build	Python
21	LLM Routing Layer	Learn	Python
22	Skills and Agent SDKs	Learn	Python
23	Capstone — Tool Ecosystem	Build	Python

Phase 14 — Agent Engineering 30 lessons Build agents from first principles — loop, memory, planning, frameworks, benchmarks, production.

#	Lesson	Type	Lang
01	The Agent Loop	Build	Python
02	ReWOO and Plan-and-Execute	Build	Python
03	Reflexion and Verbal Reinforcement Learning	Build	Python
04	Tree of Thoughts and LATS	Build	Python
05	Self-Refine and CRITIC	Build	Python
06	Tool Use and Function Calling	Build	Python
07	Memory — Virtual Context and MemGPT	Build	Python
08	Memory Blocks and Sleep-Time Compute	Build	Python
09	Hybrid Memory — Mem0 Vector + Graph + KV	Build	Python
10	Skill Libraries and Lifelong Learning — Voyager	Build	Python
11	Planning with HTN and Evolutionary Search	Build	Python
12	Anthropic's Workflow Patterns	Build	Python
13	LangGraph — Stateful Graphs and Durable Execution	Build	Python
14	AutoGen v0.4 — Actor Model	Build	Python
15	CrewAI — Role-Based Crews and Flows	Build	Python
16	OpenAI Agents SDK — Handoffs, Guardrails, Tracing	Build	Python
17	Claude Agent SDK — Subagents and Session Store	Build	Python
18	Agno and Mastra — Production Runtimes	Learn	Python, TypeScript
19	Benchmarks — SWE-bench, GAIA, AgentBench	Learn	Python
20	Benchmarks — WebArena and OSWorld	Learn	Python
21	Computer Use — Claude, OpenAI CUA, Gemini	Build	Python
22	Voice Agents — Pipecat and LiveKit	Build	Python
23	OpenTelemetry GenAI Semantic Conventions	Build	Python
24	Agent Observability — Langfuse, Phoenix, Opik	Learn	Python
25	Multi-Agent Debate and Collaboration	Build	Python
26	Failure Modes — Why Agents Break	Build	Python
27	Prompt Injection and the PVE Defense	Build	Python
28	Orchestration Patterns — Supervisor, Swarm, Hierarchical	Build	Python
29	Production Runtimes — Queue, Event, Cron	Learn	Python
30	Eval-Driven Agent Development	Build	Python

Phase 15 — Autonomous Systems 22 lessons Long-horizon agents, self-improvement, and the 2026 safety stack.

#	Lesson	Type	Lang
01	From Chatbots to Long-Horizon Agents (METR)	Learn	Python
02	STaR, V-STaR, Quiet-STaR: Self-Taught Reasoning	Learn	Python
03	AlphaEvolve: Evolutionary Coding Agents	Learn	Python
04	Darwin Gödel Machine: Self-Modifying Agents	Learn	Python
05	AI Scientist v2: Workshop-Level Research	Learn	Python
06	Automated Alignment Research (Anthropic AAR)	Learn	Python
07	Recursive Self-Improvement: Capability vs Alignment	Learn	Python
08	Bounded Self-Improvement Designs	Learn	Python
09	Autonomous Coding Agent Landscape (SWE-bench, CodeAct)	Learn	Python
10	Claude Code Permission Modes and Auto Mode	Learn	Python
11	Browser Agents and Indirect Prompt Injection	Learn	Python
12	Durable Execution for Long-Running Agents	Learn	Python
13	Action Budgets, Iteration Caps, Cost Governors	Learn	Python
14	Kill Switches, Circuit Breakers, Canary Tokens	Learn	Python
15	HITL: Propose-Then-Commit	Learn	Python
16	Checkpoints and Rollback	Learn	Python
17	Constitutional AI and Rule Overrides	Learn	Python
18	Llama Guard and Input/Output Classification	Learn	Python
19	Anthropic Responsible Scaling Policy v3.0	Learn	Python
20	OpenAI Preparedness Framework and DeepMind FSF	Learn	Python
21	METR Time Horizons and External Evaluation	Learn	Python
22	CAIS, CAISI, and Societal-Scale Risk	Learn	Python

Phase 16 — Multi-Agent & Swarms 25 lessons Coordination, emergence, and collective intelligence.

#	Lesson	Type	Lang
01	Why Multi-Agent	Learn	TypeScript
02	FIPA-ACL Heritage and Speech Acts	Learn	Python
03	Communication Protocols	Build	TypeScript
04	The Multi-Agent Primitive Model	Learn	Python
05	Supervisor / Orchestrator-Worker Pattern	Build	Python
06	Hierarchical Architecture and Decomposition Drift	Learn	Python
07	Society of Mind and Multi-Agent Debate	Build	Python
08	Role Specialization — Planner / Critic / Executor / Verifier	Build	Python
09	Parallel Swarm and Networked Architectures	Build	Python
10	Group Chat and Speaker Selection	Build	Python
11	Handoffs and Routines (Stateless Orchestration)	Build	Python
12	A2A — The Agent-to-Agent Protocol	Build	Python
13	Shared Memory and Blackboard Patterns	Build	Python
14	Consensus and Byzantine Fault Tolerance	Build	Python
15	Voting, Self-Consistency, and Debate Topology	Build	Python
16	Negotiation and Bargaining	Build	Python
17	Generative Agents and Emergent Simulation	Build	Python
18	Theory of Mind and Emergent Coordination	Build	Python
19	Swarm Optimization (PSO, ACO)	Build	Python
20	MARL — MADDPG, QMIX, MAPPO	Learn	Python
21	Agent Economies, Token Incentives, Reputation	Learn	Python
22	Production Scaling — Queues, Checkpoints, Durability	Build	Python
23	Failure Modes — MAST, Groupthink, Monoculture	Learn	Python
24	Evaluation and Coordination Benchmarks	Learn	Python
25	Case Studies and 2026 State of the Art	Learn	Python

Phase 17 — Infrastructure & Production 28 lessons Ship AI to the real world.

#	Lesson	Type	Lang
01	Managed LLM Platforms — Bedrock, Azure OpenAI, Vertex AI	Learn	Python
02	Inference Platform Economics — Fireworks, Together, Baseten, Modal	Learn	Python
03	GPU Autoscaling on Kubernetes — Karpenter, KAI Scheduler	Learn	Python
04	vLLM Serving Internals — PagedAttention, Continuous Batching, Chunked Prefill	Learn	Python
05	EAGLE-3 Speculative Decoding in Production	Learn	Python
06	SGLang and RadixAttention for Prefix-Heavy Workloads	Learn	Python
07	TensorRT-LLM on Blackwell with FP8 and NVFP4	Learn	Python
08	Inference Metrics — TTFT, TPOT, ITL, Goodput, P99	Learn	Python
09	Production Quantization — AWQ, GPTQ, GGUF, FP8, NVFP4	Learn	Python
10	Cold Start Mitigation for Serverless LLMs	Learn	Python
11	Multi-Region LLM Serving and KV Cache Locality	Learn	Python
12	Edge Inference — ANE, Hexagon, WebGPU, Jetson	Learn	Python
13	LLM Observability Stack Selection	Learn	Python
14	Prompt Caching and Semantic Caching Economics	Learn	Python
15	Batch APIs — the 50% Discount as Industry Standard	Learn	Python
16	Model Routing as a Cost-Reduction Primitive	Learn	Python
17	Disaggregated Prefill/Decode — NVIDIA Dynamo and llm-d	Learn	Python
18	vLLM Production Stack with LMCache KV Offloading	Learn	Python
19	AI Gateways — LiteLLM, Portkey, Kong, Bifrost	Learn	Python
20	Shadow, Canary, and Progressive Deployment	Learn	Python
21	A/B Testing LLM Features — GrowthBook and Statsig	Learn	Python
22	Load Testing LLM APIs — k6, LLMPerf, GenAI-Perf	Build	Python
23	SRE for AI — Multi-Agent Incident Response	Learn	Python
24	Chaos Engineering for LLM Production	Learn	Python
25	Security — Secrets, PII Scrubbing, Audit Logs	Learn	Python
26	Compliance — SOC 2, HIPAA, GDPR, EU AI Act, ISO 42001	Learn	Python
27	FinOps for LLMs — Unit Economics and Multi-Tenant Attribution	Learn	Python
28	Self-Hosted Serving Selection — llama.cpp, Ollama, TGI, vLLM, SGLang	Learn	Python

Phase 18 — Ethics, Safety & Alignment 30 lessons Build AI that helps humanity. Not optional.

#	Lesson	Type	Lang
01	Instruction-Following as Alignment Signal	Learn	Python
02	Reward Hacking & Goodhart's Law	Learn	Python
03	Direct Preference Optimization Family	Learn	Python
04	Sycophancy as RLHF Amplification	Learn	Python
05	Constitutional AI & RLAIF	Learn	Python
06	Mesa-Optimization & Deceptive Alignment	Learn	Python
07	Sleeper Agents — Persistent Deception	Learn	Python
08	In-Context Scheming in Frontier Models	Learn	Python
09	Alignment Faking	Learn	Python
10	AI Control — Safety Despite Subversion	Learn	Python
11	Scalable Oversight & Weak-to-Strong	Learn	Python
12	Red-Teaming: PAIR & Automated Attacks	Build	Python
13	Many-Shot Jailbreaking	Learn	Python
14	ASCII Art & Visual Jailbreaks	Build	Python
15	Indirect Prompt Injection	Build	Python
16	Red-Team Tooling: Garak, Llama Guard, PyRIT	Build	Python
17	WMDP & Dual-Use Capability Evaluation	Learn	Python
18	Frontier Safety Frameworks — RSP, PF, FSF	Learn	—
19	Model Welfare Research	Learn	Python
20	Bias & Representational Harm	Build	Python
21	Fairness Criteria: Group, Individual, Counterfactual	Learn	Python
22	Differential Privacy for LLMs	Build	Python
23	Watermarking: SynthID, Stable Signature, C2PA	Build	Python
24	Regulatory Frameworks: EU, US, UK, Korea	Learn	—
25	EchoLeak & CVEs for AI	Learn	Python
26	Model, System & Dataset Cards	Build	Python
27	Data Provenance & Training-Data Governance	Learn	Python
28	Alignment Research Ecosystem: MATS, Redwood, Apollo, METR	Learn	—
29	Moderation Systems: OpenAI, Perspective, Llama Guard	Build	Python
30	Dual-Use Risk: Cyber, Bio, Chem, Nuclear	Learn	—

Phase 19 — Capstone Projects 17 projects 2026 end-to-end shippable products, 20-40 hours each.

#	Project	Combines	Lang
01	Terminal-Native Coding Agent	P0 P5 P7 P10 P11 P13 P14 P15 P17 P18	TypeScript, Python
02	RAG over Codebase (Cross-Repo Semantic Search)	P5 P7 P11 P13 P17	Python, TypeScript
03	Real-Time Voice Assistant (ASR → LLM → TTS)	P6 P7 P11 P13 P14 P17	Python, TypeScript
04	Multimodal Document QA (Vision-First)	P4 P5 P7 P11 P12 P17	Python, TypeScript
05	Autonomous Research Agent (AI-Scientist Class)	P0 P2 P3 P7 P10 P14 P15 P16 P18	Python
06	DevOps Troubleshooting Agent for Kubernetes	P11 P13 P14 P15 P17 P18	Python, TypeScript
07	End-to-End Fine-Tuning Pipeline	P2 P3 P7 P10 P11 P17 P18	Python
08	Production RAG Chatbot (Regulated Vertical)	P5 P7 P11 P12 P17 P18	Python, TypeScript
09	Code Migration Agent (Repo-Level Upgrade)	P5 P7 P11 P13 P14 P15 P17	Python, TypeScript
10	Multi-Agent Software Engineering Team	P11 P13 P14 P15 P16 P17	Python, TypeScript
11	LLM Observability & Eval Dashboard	P11 P13 P17 P18	TypeScript, Python
12	Video Understanding Pipeline (Scene → QA)	P4 P6 P7 P11 P12 P17	Python, TypeScript
13	MCP Server with Registry and Governance	P11 P13 P14 P17 P18	Python, TypeScript
14	Speculative-Decoding Inference Server	P3 P7 P10 P17	Python
15	Constitutional Safety Harness + Red-Team Range	P10 P11 P13 P14 P18	Python
16	GitHub Issue-to-PR Autonomous Agent	P11 P13 P14 P15 P17	Python, TypeScript
17	Personal AI Tutor (Adaptive, Multimodal)	P5 P6 P11 P12 P14 P17 P18	Python, TypeScript

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

The toolkit

Every lesson produces a reusable artifact. By the end you have:

outputs/
├── prompts/      prompt templates for every AI task
├── skills/       SKILL.md files for AI coding agents
├── agents/       agent definitions ready to deploy
└── mcp-servers/  MCP servers built during the course

Install them with SkillKit. Plug them into Claude Code, Cursor, or any MCP-compatible agent. Real tools, not homework.

Where to start

Background	Start at	Estimated time
New to programming and AI	Phase 0 — Setup	~306 hours
Know Python, new to ML	Phase 1 — Math Foundations	~270 hours
Know ML, new to deep learning	Phase 3 — Deep Learning Core	~200 hours
Know deep learning, want LLMs and agents	Phase 10 — LLMs from Scratch	~100 hours
Senior engineer, only want agent engineering	Phase 14 — Agent Engineering	~60 hours

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Why this matters now

_{FIG_003 · A}
THE INDUSTRY SIGNAL

_{FIG_003 · B}
FOUNDATIONAL PAPERS COVERED

"The hottest new programming language is English."
— Andrej Karpathy (tweet)

"Software engineering is being remade in front of our eyes."
— Boris Cherny, creator of Claude Code

"Models will keep getting better. The skill that compounds is knowing what to build."
— Industry consensus, 2026

Attention Is All You Need — Vaswani et al., 2017 → Phase 7
Language Models are Few-Shot Learners (GPT-3) → Phase 10
Denoising Diffusion Probabilistic Models → Phase 8
InstructGPT / RLHF → Phase 10
Direct Preference Optimization → Phase 10
Chain-of-Thought Prompting → Phase 11
ReAct: Reasoning + Acting in LLMs → Phase 14
Model Context Protocol — Anthropic → Phase 13

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Contributing

Goal	Read
Contribute a lesson or fix	CONTRIBUTING.md
Fork for your team or school	FORKING.md
Lesson template	LESSON_TEMPLATE.md
Track progress	ROADMAP.md
Glossary	glossary/terms.md
Code of conduct	CODE_OF_CONDUCT.md

░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒░░░▒▒▒

Star history

If this manual helped you, star the repo. It keeps the project alive.

License

MIT. Use it however you want — fork it, teach it, sell it, ship it. Attribution appreciated, not required.

Maintained by Rohit Ghumare and the community.

_{@ghumare64 ·
aiengineeringfromscratch.com ·
Report / Suggest}

Name		Name	Last commit message	Last commit date
Latest commit History 922 Commits
.claude/skills		.claude/skills
.github		.github
assets		assets
glossary		glossary
outputs		outputs
phases		phases
projects		projects
scripts		scripts
site		site
web		web
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
FORKING.md		FORKING.md
LESSON_TEMPLATE.md		LESSON_TEMPLATE.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
requirements.txt		requirements.txt
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

How this works

The shape of the curriculum

The shape of a lesson

Getting started

Prerequisites

Built-in Claude Code skills

Every lesson ships something

FIG_002 · A worked sample

Contents

Phase 0: Setup & Tooling `12 lessons`

The toolkit

Where to start

Why this matters now

Contributing

Star history

License

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

How this works

The shape of the curriculum

The shape of a lesson

Getting started

Prerequisites

Built-in Claude Code skills

Every lesson ships something

FIG_002 · A worked sample

Contents

Phase 0: Setup & Tooling 12 lessons

The toolkit

Where to start

Why this matters now

Contributing

Star history

License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Phase 0: Setup & Tooling `12 lessons`

Packages