~37 min read

AI 时代专业课程设置参考资料

调研范围：全球独立大牛 / 个人 IP / 研究机构 / 公司 / 大学开源 / 工程社区开设的 AI 课程。

只列课表（每门课讲什么、有几节、时长、是否有 capstone），不写分析、评价、价格比较。

所有人名 / 课程名保留英文。

A 类 — 神级个人课程#

A1. Neural Networks: Zero to Hero — Andrej Karpathy#

平台：YouTube + GitHub
URL：https://karpathy.ai/zero-to-hero.html
总长度：8 个视频，约 13 小时
覆盖范围：从 micrograd 手搓反向传播一路到从零构建 GPT 和 BPE Tokenizer

完整视频清单：

1. The spelled-out intro to neural networks and backpropagation: building micrograd（145 分钟）
2. The spelled-out intro to language modeling: building makemore（117 分钟）— bigram 字符级语言模型，PyTorch tensors 入门
3. Building makemore Part 2: MLP（75 分钟）— 多层感知机、学习率、超参、过拟合
4. Building makemore Part 3: Activations & Gradients, BatchNorm（115 分钟）
5. Building makemore Part 4: Becoming a Backprop Ninja（56 分钟）— 不用 autograd 手动反传
6. Building makemore Part 5: Building a WaveNet（56 分钟）
7. Let's build GPT: from scratch, in code, spelled out（116 分钟）— 跟着 "Attention is All You Need" 和 GPT-2/GPT-3 paper
8. Let's build the GPT Tokenizer（133 分钟）— BPE 实现

Capstone / 作业：无正式作业，每个 video 末尾有 suggested exercises。

A2. Software Is Changing (Again) — Andrej Karpathy（Y Combinator 2025 keynote）#

平台：YouTube / YC Startup Library
URL：https://www.ycombinator.com/library/MW-andrej-karpathy-software-is-changing-again
总长度：1 个 keynote 演讲（不是课程）

章节：

1. Software Evolution: From 1.0 to 3.0
2. Programming in English: Rise of Software 3.0
3. LLMs as Infrastructure（LLM OS, fabs, utilities）
4. Psychology of LLMs（"people spirits", jagged intelligence, hallucinations）
5. Partial Autonomy & Human-AI Collaboration
6. Tesla Autopilot & Autonomy Sliders
7. Vibe Coding（MenuGen demo）
8. Building for Agents
9. Future Outlook（mainframe era of LLMs）

A3. Stanford CS231n — Convolutional Neural Networks for Visual Recognition（Karpathy 早期版本，Winter 2016）#

URL：https://cs231n.stanford.edu/2016/syllabus.html
总长度：15 个 lecture

Lecture 清单：

1. Intro to Computer Vision, historical context
2. Image Classification — k-NN, Linear Classification
3. Linear classification II, optimization, SGD
4. Backpropagation, Intro to Neural Networks
5. Training Neural Networks Part 1（activations, init, batch norm）
6. Training Neural Networks Part 2（parameter updates, dropout）+ CNN intro
7. CNN architectures, convolution/pooling, ImageNet case studies
8. Localization and Detection
9. Visualization, DeepDream, Style Transfer, Adversarial Examples
10. RNN, LSTM — Language modeling, Image captioning
11. Practical ConvNet training（augmentation, transfer learning, distributed）
12. Frameworks（Caffe, Torch, Theano, TensorFlow）
13. Segmentation, Soft Attention, Spatial Transformer Networks
14. Videos and Unsupervised Learning
15. Invited talks / Conclusions

Lecture Notes（学生自学路径）：

Module 0: Preparation（Python/Numpy）
Module 1: Neural Networks（图像分类、线性分类、optimization、backprop、neural network 三部曲、case study）
Module 2: Convolutional Neural Networks（架构、可视化、transfer learning）

Capstone：3 个 Assignment（kNN/SVM/Softmax/2-layer NN；CNN/Dropout/BN；RNN/LSTM/Captioning）+ Final Project。

A4. Stanford CS231n — Deep Learning for Computer Vision（最新 Spring 2026 版）#

URL：https://cs231n.stanford.edu/schedule.html
总长度：18 个 lecture + Midterm + Final Project poster

Lecture 清单（按模块分）：

Module 1 — Deep Learning Basics

1. Introduction
2. Image Classification with Linear Classifiers
3. Regularization and Optimization
4. Neural Networks and Backpropagation

Module 2 — Perceiving and Understanding the Visual World

5. Image Classification with CNNs
6. CNN Architectures
7. Recurrent Neural Networks
8. Attention and Transformers
9. Object Detection, Image Segmentation
10. Video Understanding
11. Large Scale Distributed Training

Module 3 — Generative and Interactive Visual Intelligence

12. Self-supervised Learning
13. Generative Models 1（VAEs, GANs, autoregressive）
14. Generative Models 2（Diffusion）
15. 3D Vision
16. Vision and Language
17. World Modeling
18. Human-Centered AI

Capstone：3 个 Assignments + Project Proposal + Final Project + Poster Session。

A5. Practical Deep Learning for Coders — Fast.ai Part 1（Jeremy Howard）#

URL：https://course.fast.ai/
总长度：9 个 lesson，每个约 90 分钟

Lesson 清单：

1. Getting started
2. Deployment
3. Neural net foundations
4. Natural Language (NLP)
5. From-scratch model
6. Random forests
7. Collaborative filtering
8. Convolutions (CNNs)
9. Bonus: Data ethics

Capstone：无正式 capstone，每 lesson 有 summary 文档 + Kaggle / 自建项目。

工具栈：PyTorch, fastai, Hugging Face Transformers, Gradio。

A6. Fast.ai Part 2 — Deep Learning from the Foundations / Stable Diffusion（Jeremy Howard）#

URL：https://course19.fast.ai/part2 + https://course.fast.ai/Lessons/part2.html
总长度：17 个 lesson（最新版包含 Stable Diffusion）

Lesson 清单（早期 "Deep Learning from the Foundations" 版本，lesson 8-14）：

8. Matrix multiplication; forward and backward passes
9. Loss functions, optimizers, and the training loop
10. Looking inside the model（callbacks, hooks, batchnorm）
11. Data Block API, and generic optimizer
12. Advanced training techniques; ULMFiT from scratch
13. Basics of Swift for Deep Learning
14. C interop; Protocols; Putting it all together

最新版（"Deep Learning Foundations to Stable Diffusion"）扩展内容：matrix multiplication, clustering, backpropagation, autoencoders, attention, transformers, latent diffusion models。

A7. Deep Learning Specialization — Andrew Ng（DeepLearning.AI）#

URL：https://www.coursera.org/specializations/deep-learning
总长度：5 门课，约 129 小时

课程清单：

1. Neural Networks and Deep Learning（25h）— 全连接网络、向量化、架构参数
2. Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization（24h）— Momentum, RMSprop, Adam, TensorFlow
3. Structuring Machine Learning Projects（7h）— 错误诊断、bias/variance、transfer / multi-task learning
4. Convolutional Neural Networks（36h）— CNN、neural style transfer、2D/3D 视觉
5. Sequence Models（37h）— RNN/LSTM/GRU、HuggingFace tokenizers、Transformer、NER、QA

Capstone：每门课有 programming assignments，无单独 capstone。

A8. Machine Learning Specialization — Andrew Ng（DeepLearning.AI / Stanford Online，2022 重制版）#

URL：https://www.coursera.org/specializations/machine-learning-introduction
总长度：3 门课，约 95 小时

课程清单：

1. Supervised Machine Learning: Regression and Classification（33h）— 线性回归、逻辑回归、特征工程、训练优化
2. Advanced Learning Algorithms（34h）— Neural networks with TensorFlow、多分类、决策树、ensemble、transfer learning
3. Unsupervised Learning, Recommenders, Reinforcement Learning（28h）— 聚类、异常检测、降维、推荐系统、deep RL

A9. Machine Learning Engineering for Production (MLOps) Specialization — Andrew Ng / Robert Crowe（DeepLearning.AI）#

URL：https://www.coursera.org/specializations/machine-learning-engineering-for-production-mlops
总长度：4 门课的专项；其中 Machine Learning in Production 单课为 3 周，~15 小时

Machine Learning in Production 单课课表：

Week 1: Overview of the ML Lifecycle and Deployment（8 videos, 2 assignments, 2 labs）
Week 2: Modeling Challenges and Strategies（16 videos, 2 assignments, 1 lab）
Week 3: Data Definition and Baseline（17 videos, 2 assignments, 2 labs）

后续课程涵盖 ML Data Lifecycle, ML Modeling Pipelines, Deploying ML Models。

A10. Generative AI with Large Language Models — Andrew Ng / AWS（DeepLearning.AI）#

URL：https://www.coursera.org/learn/generative-ai-with-llms
总长度：3 周，约 17 小时

Week 清单：

Week 1: Generative AI Use Cases, Project Lifecycle, and Model Pre-training（17 videos / 116 min, 7 readings, 1 quiz, Lab 1: Summarize Dialogue）— Transformer 架构、pre-training、scaling laws
Week 2: Fine-tuning and Evaluating Large Language Models（10 videos / 78 min, 1 quiz, Lab 2: Fine-tune for dialogue summarization）— Instruction fine-tuning、PEFT（LoRA, soft prompts）
Week 3: Reinforcement Learning and LLM-powered Applications（21 videos / 141 min, 1 quiz, Lab 3: Fine-tune FLAN-T5 with RLHF）— RLHF、deployment、CoT、responsible AI

Capstone：3 个 AWS labs（无独立项目）。

A11. DeepLearning.AI 其他 Specializations / Professional Certificates#

URL：https://www.deeplearning.ai/courses/

完整列表：

1. Machine Learning Specialization
2. Deep Learning Specialization
3. Natural Language Processing Specialization
4. TensorFlow Developer Professional Certificate
5. PyTorch for Deep Learning
6. Generative AI for Software Development
7. Data Analytics
8. Mathematics for Machine Learning and Data Science
9. AI for Good
10. AI for Medicine
11. Machine Learning Engineering for Production (MLOps)

典型 short courses（每个 1-2 小时）：

AI Prompting for Everyone（Andrew Ng）
Agentic AI（Andrew Ng）
AI for Everyone（Andrew Ng）
Generative AI for Everyone（Andrew Ng）
LangChain for LLM Application Development（Andrew Ng + Harrison Chase）
AI Agents in LangGraph（Harrison Chase + Rotem Weiss）
Multi AI Agent Systems with crewAI
Retrieval Augmented Generation (RAG)
Building with Llama 4（Amit Sangani / Meta）
AI Agents for Image and Video Generation（Google）
Build Interactive Agents with Generative UI（CopilotKit）
Building Multimodal Data Pipelines（Snowflake）
Spec-Driven Development with Coding Agents（JetBrains）
Efficient Inference with SGLang
Agent Memory（Oracle）
Build and Train an LLM with JAX（Google）

A12. Build a Large Language Model From Scratch — Sebastian Raschka#

URL：https://github.com/rasbt/LLMs-from-scratch
总长度：7 章 + 5 附录，528 页（Manning, 2024）

章节清单：

1. Understanding Large Language Models（无 code）
2. Working with Text Data — tokenization、data loading、preprocessing
3. Coding Attention Mechanisms — single-head 和 multi-head attention
4. Implementing a GPT Model from Scratch
5. Pretraining on Unlabeled Data
6. Finetuning for Text Classification
7. Finetuning to Follow Instructions（instruction tuning, alignment）

附录：

A. Introduction to PyTorch（含分布式训练）
B. References
C. Exercise Solutions
D. Adding Bells and Whistles to the Training Loop
E. Parameter-efficient Finetuning with LoRA

Bonus 材料：tokenizer 实现变种、attention 优化、Llama/Qwen/Gemma 架构、KV caching、MoE、preference tuning、evaluation frameworks。

配套课程：Manning Live Video Course "Master and Build Large Language Models" + 免费 YouTube playlist。

Capstone：每章末尾习题；完整端到端 GPT 实现就是项目。

A13. 其他 Sebastian Raschka 课程 / 书籍#

Build a Reasoning Model (From Scratch)（in progress, 528 页）— reasoning LLMs、inference-time scaling、RL、distillation
Machine Learning Q and AI（264 页, No Starch, 2024）— Q&A 形式：multi-GPU training、fine-tuning transformers、vision transformers
Machine Learning with PyTorch and Scikit-Learn（770 页, Packt, 2022）— ML 基础到 deep learning

A14. Designing Machine Learning Systems — Chip Huyen#

URL：https://huyenchip.com/books / O'Reilly 出版
总长度：11 章 + Epilogue

章节清单：

1. Overview of Machine Learning Systems
2. Introduction to Machine Learning Systems Design — business/ML objectives, reliability, scalability, maintainability, adaptability, ML problem framing
3. Data Engineering Fundamentals — sources, formats, models, dataflow, batch vs stream
4. Training Data — sampling, labeling, class imbalance, augmentation
5. Feature Engineering — scaling, encoding, crossing, data leakage, generalization
6. Model Development and Offline Evaluation — ensembles, tracking, distributed training, AutoML, baselines
7. Model Deployment and Prediction Service — batch vs online, compression（factorization, distillation, pruning, quantization）, edge
8. Data Distribution Shifts and Monitoring — shift detection、observability
9. Continual Learning and Test in Production — stateless vs stateful、shadow、A/B、canary、interleaving、bandits
10. Infrastructure and Tooling for MLOps — compute、dev env、containers、orchestrators、ML platform、feature store
11. The Human Side of Machine Learning — UX、team structure、responsible AI

配套课程：Stanford CS329S（见 C 类）。

A15. AI Engineering — Chip Huyen#

URL：https://github.com/chiphuyen/aie-book
总长度：10 章（O'Reilly, 2025）

章节清单：

1. Introduction to Building AI Applications with Foundation Models — The Rise of AI Engineering, FM Use Cases, Planning AI Applications, The AI Engineering Stack
2. Understanding Foundation Models — Training Data, Modeling, Post-Training, Sampling
3. Evaluation Methodology — Challenges, Language Modeling Metrics, Exact Evaluation, AI as a Judge, Comparative Evaluation
4. Evaluate AI Systems — Evaluation Criteria, Model Selection, Design Your Evaluation Pipeline
5. Prompt Engineering — Introduction, Best Practices, Defensive Prompt Engineering
6. RAG and Agents — RAG, Agents, Memory
7. Finetuning — Overview, When to Finetune, Memory Bottlenecks, Techniques
8. Dataset Engineering — Data Curation, Augmentation/Synthesis, Processing
9. Inference Optimization — Understanding, Optimization
10. AI Engineering Architecture and User Feedback — Architecture, User Feedback

A16. Eugene Yan — 自学路径文章系列#

URL：https://eugeneyan.com/writing/

不是正式课程，但被广泛当成结构化自学材料，按主题：

LLMs：

Prompting Fundamentals and How to Apply them Effectively
Task-Specific LLM Evals that Do & Don't Work
Evaluating the Effectiveness of LLM-Evaluators
Patterns for Building LLM-based Systems & Products
What We've Learned From A Year of Building with LLMs
Evaluating Long-Context Question & Answer Systems

ML Systems：

A Practical Guide to Maintaining Machine Learning in Production
6 Little-Known Challenges After Deploying Machine Learning
Design Patterns in Machine Learning Code and Systems
More Design Patterns For Machine Learning Systems
How to Test Machine Learning Code and Systems
Writing Robust Tests for Data & Machine Learning Pipelines

Recommender Systems：

System Design for Recommendations and Search
Patterns for Personalization in Recommendations and Search
Real-time Machine Learning For Recommendations
Improving Recommendation Systems & Search in the Age of LLMs

Data Science：

How to get started in Data Science
What does a Data Scientist really do?
What I Do Before a Data Science Project to Ensure Success

A17. Mastering LLMs For Developers & Data Scientists — Hamel Husain + Dan Becker#

URL：https://hamel.dev/blog/posts/course/ + https://parlance-labs.com/education/
总长度：40+ 小时开源（原 Maven 4 工作坊）

5 大主题（共 23+ sessions）：

1. Evals（3 sessions）— LLM evaluation 框架与方法
2. RAG（3 talks）— RAG 基础到系统改进
3. Building Applications（4 resources）— 各种工具
4. Prompt Engineering（1 session）
5. Fine-Tuning（12+ sessions, 4 子分类）— Decision-making、Tools/Techniques（含 Wing Lian 讲 Axolotl、Zach Mueller 讲 HF Accelerate）、Deployment、Advanced

A18. AI Evals For Engineers & PMs — Hamel Husain + Shreya Shankar（Maven）#

URL：https://maven.com/parlance-labs/evals
总长度：4 周，3-5 小时/周，77 lessons，10+ office hours，150+ 页 course reader

周清单（部分公开）：

Week 1：Fundamentals & Lifecycle of Application-Centric Evals（Lesson 1）+ Systematic Error Analysis（Lesson 2-3）
Week 2：Automated Evaluators（Lesson 4-5）
Week 3-4：（完整 syllabus 课内）

Capstone：4 个 homework assignments + Recipe Bot Workflow 案例。

A19. The LLM Course — Maxime Labonne#

URL：https://github.com/mlabonne/llm-course
总长度：3 轨道，几十个章节 + 大量 hands-on notebooks

轨道 1：🧩 LLM Fundamentals

1. Mathematics for Machine Learning（线代、微积分、概率统计）
2. Python for Machine Learning（基础、NumPy/Pandas/Matplotlib/Seaborn、预处理、Scikit-learn）
3. Neural Networks（基础、训练优化、过拟合正则、MLP 实现）
4. NLP（预处理、特征提取、Word Embeddings、RNN/LSTM/GRU）

轨道 2：🧑‍🔬 The LLM Scientist

1. The LLM Architecture（encoder-decoder→decoder-only、tokenization、attention 变种、sampling）
2. Pre-Training Models（数据准备、distributed training: DP/PP/TP、训练优化、监控）
3. Post-Training Datasets（ShareGPT/ChatML/Alpaca、合成数据、增强、质量过滤）
4. Supervised Fine-Tuning（full FT、LoRA、QLoRA、DeepSpeed、FSDP）
5. Preference Alignment（rejection sampling、DPO、reward model、GRPO、PPO）
6. Evaluation（自动 benchmarks、人评、model-based、反馈信号）
7. Quantization（FP32/16/INT8、absmax、GGUF、llama.cpp、GPTQ、AWQ、SmoothQuant、ZeroQuant）
8. New Trends（SLERP/DARE/TIES merging、CLIP/SD/LLaVA 多模态、可解释性 SAE/abliteration、test-time compute）

轨道 3：👷 The LLM Engineer

1. Running LLMs（API: OpenAI/Google/Anthropic/OpenRouter/HF；开源: Ollama/LM Studio/llama.cpp；prompt engineering；结构化输出）
2. Building a Vector Storage（loading, splitting, embeddings, Chroma/Pinecone/Milvus/FAISS/Annoy）
3. RAG（LangChain/LlamaIndex/MCP、retriever 策略、记忆、Ragas/DeepEval）
4. Advanced RAG（SQL/Cypher、tools、re-ranking、RAG-fusion、DSPy）
5. Agents（基础、协议 MCP/A2A、OpenAI/Google ADK/Claude SDK、LangGraph/LlamaIndex/CrewAI/AutoGen）
6. Inference Optimization（Flash Attention、KV cache、MQA/GQA、EAGLE-3 speculative decoding）
7. Deploying LLMs（local、Gradio/Streamlit/HF Spaces、TGI/vLLM/SkyPilot、MLC LLM/mnn-llm）
8. Securing LLMs（prompt hacking、backdoor、red teaming、garak、langfuse）

Tools：LLM AutoEval, LazyMergekit, LazyAxolotl, AutoQuant, Model Family Tree, ZeroSpace, AutoAbliteration, AutoDedup

Notebooks：Fine-tune Llama 3.1 with Unsloth、Llama 3 with ORPO、Mistral-7b with DPO/QLoRA、CodeLlama with Axolotl、Llama 2 with QLoRA、4-bit GPTQ Quantization、GGUF 量化、ExLlamaV2、MergeKit MoE、abliteration、Knowledge Graphs、Decoding Strategies。

B 类 — 研究机构 / 公司课程#

B1. Hugging Face — 全部课程清单#

平台：https://huggingface.co/learn

课程	URL slug	简介
LLM Course（原 NLP Course）	/llm-course	用 HF 生态学 LLM/NLP
Context Course	/context-course	code agents 的 context engineering
Robotics Course	/robotics-course	LeRobot
smol course	/smol-course	post-training 极简
Agents Course	/agents-course	构建 agents
Deep RL Course	/deep-rl-course	深度强化学习
Computer Vision Course	/computer-vision-course	计算机视觉
Audio Course	/audio-course	transformer 处理音频
Open-Source AI Cookbook	/cookbook	开源 notebook 合集
ML for Games Course	/ml-games-course	游戏中的 AI
Diffusion Course	/diffusion-course	扩散模型
ML for 3D Course	/ml-for-3d-course	3D ML

B2. Hugging Face LLM Course（NLP Course → LLM Course）#

URL：https://huggingface.co/learn/llm-course
总长度：12 章，每章约 1 周 / 6-8 小时

章节清单：

Chapter 1：NLP and LLMs intro、Transformer 基础（pipeline 用法、架构原理、encoder/decoder/encoder-decoder 适用任务、scaling laws、attention 变种、pretraining + instruction tuning）
Chapter 2：Using Transformers — Tokenizers、Models、pipeline 内部、tokenization 算法
Chapter 3：Fine-tuning a pretrained model — Trainer API、数据处理、整合
Chapter 4：Sharing models and tokenizers — HF Hub
Chapter 5：The 🤗 Datasets library — big data 处理
Chapter 6：The 🤗 Tokenizers library — 从头训练 tokenizer
Chapter 7：Main NLP Tasks — token classification、masked LM、translation、summarization、causal LM from scratch、QA、Mastering LLMs
Chapter 8：How to ask for help
Chapter 9：Building and sharing demos — Gradio
Chapter 10：Curate high-quality datasets
Chapter 11：Fine-tune LLMs
Chapter 12：Build reasoning models

Capstone：每章末尾 quizzes；建议项目在 forum 上有清单。

B3. Hugging Face Agents Course#

URL：https://huggingface.co/learn/agents-course
总长度：5 chapters + 3 bonus units，每章约 1 周 / 3-4 小时

Chapter 清单：

Chapter	Topic
0	Onboarding
1	Agent Fundamentals — Tools/Thoughts/Actions/Observations、LLMs、messages、special tokens、chat templates、Python 函数作为 tools
2	Frameworks — smolagents、LangGraph、LlamaIndex
3	Use Cases — 真实场景实战
4	Final Assignment — 在 benchmark 上构建 agent 并在 leaderboard 上 PK

Bonus Units：

Bonus 1：Fine-tuning an LLM for Function-calling
Bonus 2：Agent Observability and Evaluation
Bonus 3：Agents in Games with Pokemon

Capstone：Final benchmark agent + student leaderboard；有 certificate of fundamentals + certificate of completion 两种。

B4. Hugging Face Deep RL Course#

URL：https://huggingface.co/learn/deep-rl-course
总长度：约 9 单元，每单元 3-4 小时/周

Unit 概要：

工具栈：Stable Baselines3、RL Baselines3 Zoo、Sample Factory、CleanRL
环境：SnowballFight、Huggy the Doggo、VizDoom、Space Invaders、PyBullet
包含 AI vs AI 挑战（Unit 7，目前停维护）
Certificate of Completion（80% 作业）/ Certificate of Honors（100%）

B5. Hugging Face Audio Course#

URL：https://huggingface.co/learn/audio-course
总长度：8 单元

Unit 清单：

Unit 1：音频数据基础、音频处理、数据准备
Unit 2：Audio pipelines for transformer 应用（分类、ASR）
Unit 3：Audio transformer 架构
Unit 4：Build your own music genre classifier
Unit 5：Speech recognition — 转录会议录音
Unit 6：Text-to-Speech 生成
Unit 7：构建真实音频应用
Unit 8：（高级 / 最终单元）

Capstone：每章有 hands-on 项目；80% / 100% 作业完成获 certificate。

B6. Hugging Face Diffusion Models Course#

URL：https://huggingface.co/learn/diffusion-course
总长度：4 units，每单元约 1 周 / 6-8 小时

Unit 清单：

Unit 1：Introduction to diffusion models — 🤗 Diffusers + 从零实现
Unit 2：Finetuning and guidance — 微调 + guidance
Unit 3：Stable Diffusion — text-conditioned latent diffusion
Unit 4：Doing more with diffusion — 高级技术

每单元含 theory section + 2 notebooks。

B7. Hugging Face Community Computer Vision Course#

URL：https://huggingface.co/learn/computer-vision-course
总长度：13 units

Unit 清单：

1. Fundamentals of Computer Vision — 图像基础、形成、预处理、特征提取
2. Convolutional Neural Networks (CNNs) — 架构、迁移学习
3. Vision Transformers — Swin、DETR、CVT、transfer learning
4. Multimodal Models — CLIP、GroupViT、BLIP、Owl-VIT
5. Generative Models — GANs、VAEs、Diffusion、text-to-image、inpainting
6. Basic Computer Vision Tasks — classification、object detection、segmentation、YOLO、SAM
7. Video and Video Processing — temporal continuity、motion estimation
8. 3D Vision, Scene Rendering, and Reconstruction — NeRF、GQN
9. Model Optimization — compression、distillation、pruning、TinyML
10. Synthetic Data Creation — point clouds、diffusion 合成数据
11. Zero Shot Computer Vision
12. Ethics and Biases in Computer Vision
13. Outlook and Emerging Trends — Retentive Network、Hiera、Hyena、I-JEPA

B8. Cohere LLM University#

URL：https://cohere.com/llmu
总长度：~7+ 模块，持续更新

Module 清单：

1. What are Large Language Models? — LLM 基础、embeddings、attention、transformer 架构、semantic search、实战
2. Text Representation with Cohere Endpoints — Classification / Embeddings / Semantic Search endpoints
3. Text Generation with Cohere Endpoints — generative learning、generated endpoint、prompt engineering
4. Deployment — AWS SageMaker、Streamlit、FastAPI
5. Semantic Search（2023 新增）
6. Prompt Engineering（2023 新增）
7. The Cohere Platform（2023 新增）

Tool Use 章节（2024-2025）：

From RAG to Tool Use（Part 1）— 构建查询销售/产品数据的 RAG assistant
Tool Use Anatomy（Part 2）— 关键组件 + 代码示例
Multi-Step Tool Use（Part 4）

AWS 集成模块：

Introduction to Cohere on Amazon Bedrock
Tool Use and Agents on Amazon Bedrock（Command R+）

单独 chapter：

Introduction to Text Generation（Command + chatbot 工作机制 + Chat endpoint）
Constructing Prompts
Validating Outputs

B9. Anthropic Academy — 全部课程清单#

URL：https://anthropic.skilljar.com/

完整课程列表：

1. Claude 101
2. Claude Code 101
3. Introduction to Claude Cowork
4. Claude Code in Action
5. AI Fluency: Framework & Foundations
6. Building with the Claude API
7. Introduction to Model Context Protocol
8. AI Fluency for Educators
9. AI Fluency for Students
10. Model Context Protocol: Advanced Topics
11. Claude with Amazon Bedrock
12. Claude with Google Cloud's Vertex AI
13. Teaching AI Fluency
14. AI Fluency for Nonprofits
15. Introduction to Agent Skills
16. Introduction to Subagents
17. AI Capabilities and Limitations
18. AI Fluency for Small Businesses

B10. Anthropic AI Fluency: Framework & Foundations（与 Joseph Feller / UCC 和 Rick Dakan / Ringling 合作）#

URL：https://anthropic.skilljar.com/ai-fluency-framework-foundations
AI Fluency Framework 站点：https://aifluencyframework.org/
核心 4D 框架：Delegation、Description、Discernment、Diligence

Module 清单：

1. Introduction to AI Fluency
2. The AI Fluency Framework — Why do we need AI Fluency? + The 4D Framework
3. Deep Dive 1: What is Generative AI? (Part 1) — fundamentals、capabilities & limitations
4. Delegation — A closer look + Project planning and Delegation
5. Description — A closer look + Deep Dive 2: Effective prompting techniques
6. Discernment — A closer look + Description-Discernment loop
7. Diligence — A closer look
8. Conclusion & Certificate — 总结 + 证书 + 额外活动

衍生课程：

AI Fluency for Students：5 lessons，约 3 小时（含 30 分钟视频）
AI Fluency for Educators：4 lessons，约 3 小时（含 35 分钟视频）

所有材料以 CC BY-NC-SA 许可发布，由爱尔兰 Higher Education Authority 通过 National Forum for Teaching and Learning 资助。

B11. OpenAI Academy#

URL：https://academy.openai.com/

Collections：

ChatGPT Fundamentals（resource）
Prompting（resource）
Codex 系列：Codex for Beginners（1h video, Aaron Wilkowitz）、Codex on Campus、Codex Fundamentals、Introduction to Codex、Codex for Admins and IT

Sector Collections：

K-12 Education
Higher Education（11 items）
ChatGPT at Work
Professors Teaching with OpenAI（83 items）
Small Business（10+ items）
Nonprofits（5 items）
Government（12 items）
Science（7 items）
Healthcare

Special Topics：

Prompt Packs、Custom GPTs、Workspace Agents、Advanced Features、Foundational AI Literacy

Stories & Case Studies：41 items

2025-2026 即将推出 AI fluency certifications（prompt engineering → AI-enabled work）。

B12. Google Machine Learning Crash Course#

URL：https://developers.google.com/machine-learning/crash-course

模块清单：

ML Models

1. Linear Regression — 线性模型、loss、SGD、超参
2. Logistic Regression
3. Classification — 二分类、thresholding、confusion matrix、accuracy/precision/recall/AUC

Data

1. Working with Numerical Data
2. Working with Categorical Data — one-hot、feature hashing、mean encoding、feature crosses
3. Datasets, Generalization, and Overfitting

Advanced ML Models

1. Neural Networks — perceptron、hidden layers、activation
2. Embeddings
3. Intro to Large Language Models — tokens、Transformer、训练

Real-World ML

1. Production ML Systems
2. AutoML
3. ML Fairness

B13. Google Generative AI Learning Path#

URL：https://www.skills.google/paths/118
5 activities

整体覆盖：generative AI 基础、LLMs、responsible AI。具体课程清单需登录 Google Skills 查看（搜索结果不公开详尽 list）。

B14. Microsoft AI for Beginners#

URL：https://microsoft.github.io/AI-For-Beginners/ + https://github.com/microsoft/AI-For-Beginners
总长度：12 周，24 lessons

Section 清单：

I. Introduction to AI

Lesson 1: Introduction and History of AI

II. Symbolic AI

Lesson 2: Knowledge Representation and Expert Systems

III. Introduction to Neural Networks

Lesson 3: Perceptron
Lesson 4: Multi-Layered Perceptron and Creating our own Framework
Lesson 5: Intro to Frameworks (PyTorch/TensorFlow) and Overfitting

IV. Computer Vision

Lesson 6: Intro to Computer Vision. OpenCV
Lesson 7: Convolutional Neural Networks & CNN Architectures
Lesson 8: Pre-trained Networks and Transfer Learning
Lesson 9: Autoencoders and VAEs
Lesson 10: Generative Adversarial Networks & Artistic Style Transfer
Lesson 11: Object Detection
Lesson 12: Semantic Segmentation. U-Net

V. Natural Language Processing

Lesson 13: Text Representation. Bow/TF-IDF
Lesson 14: Semantic word embeddings. Word2Vec and GloVe
Lesson 15: Language Modeling. Training your own embeddings
Lesson 16: Recurrent Neural Networks
Lesson 17: Generative Recurrent Networks
Lesson 18: Transformers. BERT
Lesson 19: Named Entity Recognition
Lesson 20: Large Language Models, Prompt Programming and Few-Shot Tasks

VI. Other AI Techniques

Lesson 21: Genetic Algorithms
Lesson 22: Deep Reinforcement Learning
Lesson 23: Multi-Agent Systems

VII. AI Ethics

Lesson 24: AI Ethics and Responsible AI

B15. Microsoft Generative AI for Beginners#

URL：https://microsoft.github.io/generative-ai-for-beginners/
总长度：22 lessons

Lesson 清单：

0. Course Setup
1. Introduction to Generative AI and LLMs
2. Exploring and Comparing Different LLMs
3. Using Generative AI Responsibly
4. Understanding Prompt Engineering Fundamentals
5. Creating Advanced Prompts
6. Building Text Generation Applications
7. Building Chat Applications
8. Building Search Apps with Vector Databases
9. Building Image Generation Applications
10. Building Low Code AI Applications
11. Integrating External Applications with Function Calling
12. Designing UX for AI Applications
13. Securing Your Generative AI Applications
14. The Generative AI Application Lifecycle（LLMOps）
15. Retrieval Augmented Generation and Vector Databases
16. Open Source Models and Hugging Face
17. AI Agents
18. Fine-Tuning LLMs
19. Building with SLMs
20. Building with Mistral Models
21. Building with Meta Models

B16. Microsoft Machine Learning for Beginners#

URL：https://github.com/microsoft/ML-For-Beginners
总长度：25 lessons + 2 postscript

Lesson 清单：

1. Introduction（4 lessons）

01: Introduction to machine learning
02: The History of machine learning
03: Fairness and machine learning
04: Techniques for machine learning

2. Regression（4 lessons）

05: Introduction to regression
06: North American pumpkin prices 🎃
07: North American pumpkin prices 🎃 (continued)
08: North American pumpkin prices 🎃 (logistic regression)

3. Web App

09: A Web App 🔌

4. Classification（4 lessons）

10: Introduction to classification
11: Delicious Asian and Indian cuisines 🍜
12: Delicious Asian and Indian cuisines 🍜 (continued)
13: Delicious Asian and Indian cuisines 🍜 (web app)

5. Clustering（2 lessons）

14: Introduction to clustering
15: Exploring Nigerian Musical Tastes 🎧

6. NLP（5 lessons）

16: Introduction to natural language processing
17: Common NLP Tasks
18: Translation and sentiment analysis
19: Romantic hotels of Europe
20: Romantic hotels of Europe (continued)

7. Time Series（3 lessons）

21: Introduction to time series forecasting
22: World Power Usage - ARIMA
23: World Power Usage - SVR

8. Reinforcement Learning（2 lessons）

24: Introduction to reinforcement learning
25: Help Peter avoid the wolf!

9. Real-World（2 postscript）

Real-World ML scenarios
Model Debugging using RAI dashboard

B17. Meta AI / Llama Cookbook（公开教程）#

URL：https://github.com/meta-llama/llama-recipes

不是正式课程，但是结构化教程集合：

Top-Level Categories：

1. Getting Started — Inference、Fine-tuning、RAG
2. End-to-End Use Cases — WhatsApp bot with Llama 4、Research paper analysis、Book character mind mapping
3. Third-Party Integrations
4. Latest Llama 4 Recipes — Llama API、Long context（5M tokens with Scout）、Maverick advanced analysis

主要为 Jupyter notebooks（93.7% 内容）。

与 DeepLearning.AI 合作课程：

Building with Llama 4（Amit Sangani / Meta，DeepLearning.AI 平台）

B18. NVIDIA Deep Learning Institute（DLI）#

URL：https://www.nvidia.com/en-us/training/self-paced-courses/
课程目录 PDF：https://cdn.dli.learn.nvidia.com/web-asset/nvidia-learning-training%20course-catalog.pdf

Tracks：

1. Accelerated Computing
2. Data Science
3. Deep Learning
4. Generative AI / LLM
5. Infrastructure
6. Simulation and Physical AI

课程形式：

2 小时短课（聚焦特定技术）
8 小时项目型课程（含 certificate）

已知具体课程：

Fundamentals of Deep Learning（8h）
Building Transformer-Based Natural Language Processing
Generative AI with Diffusion Models
Building RAG Agents with LLMs
Efficient LLM Customization
CUDA / Accelerated Computing 系列

（完整 catalog 需查 PDF 或 NVIDIA 在线目录）

C 类 — 大学开源 / 顶级开课#

C1. Stanford CS224N — Natural Language Processing with Deep Learning#

URL：https://web.stanford.edu/class/cs224n/
总长度：10 周 / 约 20 lectures

Winter 2026 Schedule：

Week 1: History of NLP / Word Vectors / Python Review
Week 2: Backpropagation and Neural Network Basics / Language Models and RNNs / PyTorch Tutorial
Week 3: Transformers / Final Projects: Custom and Default
Week 4: Pretraining (Scaling, Systems, Data) / Post-training (RLHF, SFT, DPO)
Week 5: Efficient Adaptation (Prompting + PEFT) / Agents, Tool Use, and RAG / Hugging Face Tutorial
Week 6: Benchmarking and Evaluation / Reasoning 1
Week 7: Reasoning 2 / Guest: Tokenization and Multilinguality (Julie Kallini)
Week 8: Guest: Interpretability (Been Kim) / Social and Broader Impacts of NLP
Week 9: Guest: Multimodality (Luke Zettlemoyer) / Guest: Tinker and LoRA Without Regret (John Schulman)
Week 10: Open Questions in NLP 2026

Assignments：

A1 (6%): Word Vectors
A2 (14%): Neural networks, derivatives, dependency parsing
A3 (14%): Self-attention and Transformers
A4 (14%): LLM benchmarking and evaluation
Final Project（Custom 或 Default）

C2. Stanford CS224U — Natural Language Understanding#

URL：https://web.stanford.edu/class/cs224u/
总长度：~10 周

Spring 2023 Schedule：

Apr 3: Course Introduction
Apr 5-12: Contextual Word Representations & Language Models（Transformers、GPT、BERT、RoBERTa、ELECTRA、T5、BART、DistilBERT）
Apr 17-24: Retrieval-Augmented In-Context Learning（neural IR、LLM prompting）
Apr 26-May 3: Advanced Behavioral Evaluation
May 8-15: Analysis Methods & NLP Evaluation（interpretability）
May 31-Jun 7: Research Presentation & Project Completion

Assignments：

A1: Domain adaptation sentiment classification
A2: Open-domain question answering
A3: ReCOGS compositional generalization
4 Quizzes（0-4）
Literature Review
Experimental Protocol
Final Paper

C3. Stanford CS224W — Machine Learning with Graphs#

URL：https://web.stanford.edu/class/cs224w/
总长度：19 lectures（Fall 2025）

Lecture 清单：

1. Introduction
2. Node Embeddings
3. Graph Neural Networks
4. A General Perspective on GNNs
5. GNN Augmentation and Training
6. Theory of GNNs
7. Designing Powerful Graph Encoders
8. Graph Transformers
9. Heterogeneous Graphs
10. Knowledge Graphs
11. GNNs for Recommender Systems
12. Relational Deep Learning
13. Advanced Architectures in RDL
14. Advanced Topics in GNNs
15. Towards Foundation Models for Knowledge Graphs
16. LLM + GNN
17. Agents + Graphs
18. Deep Generative Models for Graphs
19. Conclusion

C4. Stanford CS234 — Reinforcement Learning#

URL：https://web.stanford.edu/class/cs234/
总长度：11 周

Winter 2026 Schedule：

Week 1: Introduction to RL / Tabular MDP Planning（A1 released）
Week 2: Policy Evaluation / Q-learning and function approximation
Week 3: Policy Search
Week 4: Policy Search / Offline RL, Imitation Learning
Week 5: Offline RL, RLHF / Midterm Exam（A3 released, Project Proposal due）
Week 6: Offline RL / Bandits / Strategic data gathering / Exploration
Week 7: Exploration
Week 8: Exploration / RL and MCTS（Project Milestone due）
Week 9: Guest Lecture / In-Class Quiz
Week 10: Poster Session
Week 11: Final Project Writeup

Capstone：3 个 Assignments + Final Project（proposal、milestone、poster、writeup）。

C5. Stanford CS330 — Deep Multi-Task and Meta Learning#

URL：https://cs330.stanford.edu/
总长度：11 周（Fall 2023）

Week 清单：

Week 1: Course introduction & multi-task learning fundamentals + PyTorch tutorial
Week 2: Multi-task learning methods / Transfer learning & fine-tuning
Week 3: Black-box meta-learning & in-context learning / Optimization-based meta-learning + MAML review
Week 4: Few-shot learning via metric learning / Unsupervised pre-training contrastive
Week 5: Unsupervised pre-training generative / Advanced meta-learning task construction
Week 6: Variational inference / Bayesian meta-learning
Week 7: Large-scale meta-optimization / Learned optimizers (guest)
Week 8: Lifelong learning
Week 10: LM advances (guest) / Domain adaptation and generalization
Week 11: Frontiers / Project poster session

Grading：4 HW（HW0 5%, HW1-3 15% each）+ Project 50%。

C6. Stanford CS25 — Transformers United（seminar 系列）#

URL：https://web.stanford.edu/class/cs25/

所有 Season + 嘉宾：

V1（Fall 2021）：

1. Introduction to Transformers（instructors）
2. Mark Chen (OpenAI) — "Transformers in Language: GPT-3, Codex"
3. Lucas Beyer (Google Brain) — "Applications in Vision"
4. Aditya Grover (FAIR) — "Transformers in RL & Universal Compute Engines"
5. Barret Zoph + Irwan Bello + Liam Fedus (Google Brain) — "Scaling transformers"
6. Andrew Jaegle (DeepMind) — "Perceiver"
7. Aidan Gomez (Oxford) — "Self Attention & Non-Parametric Transformers"
8. Geoffrey Hinton (Toronto) — "GLOM"
9. Chris Olah (Anthropic) — "Interpretability with transformers"
10. Prateek Verma (Stanford) — "Transformers for Audio, Speech and Music"

V2（Winter 2023）：

Jan 10: Andrej Karpathy — Introduction to Transformers
Jan 17: Jan Leike (OpenAI) — Language and Human Alignment
Jan 24: Jason Wei (Google Brain) — Emergent Abilities and Scaling in LLMs
Jan 31: Noam Brown (FAIR) — Strategic Games
Feb 7: Ted Xiao (Google Brain) — Robotics and Imitation Learning
Feb 14: Yejin Choi (UW / AI2) — Common Sense Reasoning
Feb 21: Vivek Natarajan (Google Health AI) — Biomedical Transformers
Feb 28: Stephanie Chan + Antonia Creswell (DeepMind) — In-Context Learning & Faithful Reasoning
Mar 7: Trenton Bricken + Will Dorrell — Neuroscience-Inspired AI

V3（Fall 2023）：

Oct 3: Sharan Narang (Meta AI) — Llama 2
Oct 10: Fei Xia (Google DeepMind) — Low-level Embodied Intelligence with Foundation Models
Oct 24: Jim Fan (NVIDIA) — Generalist Agents in Open-Ended Worlds
Oct 31: Nazneen Rajani (HF) — Recipe for Training Helpful Chatbots
Nov 7: Ashish Vaswani — How I Learned to Stop Worrying and Love the Transformer
Nov 14: Angela Fan (Meta AI) — No Language Left Behind
Nov 28: Instructors — Going Beyond LLMs: Agents, Emergent Abilities, BabyLM
Dec 5: Douwe Kiela (Contextual AI) — Retrieval Augmented Language Models

V4（Spring 2024）：

1. Instructors — Overview of Transformers
2. Jason Wei (OpenAI) — Intuitions on Language Models
3. Hyung Won Chung (OpenAI) — Shaping the Future of AI from the History of Transformer
4. Nathan Lambert (AI2) — Aligning Open Language Models
5. Albert Jiang (Mistral / Cambridge) — Demystifying Mixtral of Experts
6. Jake Williams (Drexel) — Transformers that Transform Well Enough to Support Near-Shallow Architectures
7. Ming Ding (Zhipu AI) — From LLMs to Large Multimodal Models
8. Edward Hu (ex-OpenAI) — New training objective for LLMs
9. Loubna Ben Allal (HF) — Behind the Scenes of LLM Pre-training: StarCoder

V6（Spring 2026）：

1. Apr 9: Hazel Nam & Lucas Maes (Brown) — From Representation Learning to World Modeling through JEPAs
2. Apr 16: Albert Gu (CMU, Cartesia AI) — Tradeoffs of State Space Models and Transformers
3. Apr 23: Nouamane Tazi (HF) — Ultra-Scale Talk: Scaling Training to Thousands of GPUs
4. Apr 30: Shrimai Prabhumoye (Mistral AI) — From Next-Token Prediction to Next-Generation Intelligence
5. May 7: Andrew Lampinen (Anthropic) — Distinct Modes of Generalization from Parameters and Context
6. May 14: Vivek Natarajan (DeepMind) — Advancing Science and Medicine with Collaborative AI Agents
7. May 21: Victoria Lin (Thinking Machines) — From LMs to Native Multimodal Intelligence
8. May 28: Charles Frye (Modal) — Serving Transformers: Lessons from the Trenches of Production Inference

C7. Stanford CS329S — Machine Learning Systems Design#

URL：https://stanford-cs329s.github.io/
总长度：10 周（Winter 2022）

Schedule：

Week 1: Understanding ML production / ML and Data Systems Fundamentals
Week 2: Training Data / Feature engineering
Week 3: Model selection, development, and training
Week 4: Offline evaluation / Model evaluation tutorial (Goku Mohandas)
Week 5: Deployment / Deployment tutorials (MLOps tools & Ray Serve)
Week 6: Diagnosis of ML system failures, distribution shifts & monitoring / Monitoring & Continual Learning（streaming guest）
Week 7: Model Deployment case study & experiment tracking / Monitoring tutorials (WhyLogs, Evidently)
Week 8: Time series forecasting & fairness/security/governance (guests)
Week 9: ML Infrastructure and Platform / Final project workshop
Week 10: Integrating ML into business (guests) / Final project demo day

C8. Stanford CS336 — Language Modeling from Scratch（2025 新）#

URL：http://cs336.stanford.edu/spring2025/（讲师：Percy Liang + Tatsu Hashimoto）
总长度：19 lectures

Lecture 清单：

1. Overview, tokenization（Percy）
2. PyTorch, resource accounting（Percy）
3. Architectures, hyperparameters（Tatsu）
4. Mixture of experts（Tatsu）
5. GPUs（Tatsu）
6. Kernels, Triton（Tatsu）
7. Parallelism（Tatsu）
8. Parallelism（Percy）
9. Scaling laws（Tatsu）
10. Inference（Percy）
11. Scaling laws（Tatsu）
12. Evaluation（Percy）
13. Data（Percy）
14. Data（Percy）
15. Alignment - SFT/RLHF（Tatsu）
16. Alignment - RL（Tatsu）
17. Alignment - RL（Percy）
18. Guest: Junyang Lin
19. Guest: Mike Lewis

Assignments：

A1: Basics
A2: Systems
A3: Scaling
A4: Data
A5: Alignment and Reasoning RL

C9. MIT 6.S191 — Introduction to Deep Learning#

URL：http://introtodeeplearning.com/
总长度：9 lectures + 3 software labs + Final Project（2026 版）

Lecture 清单：

1. Intro to Deep Learning
2. Deep Sequence Modeling
3. Deep Computer Vision
4. Deep Generative Modeling
5. Deep Reinforcement Learning
6. New Frontiers
7. The Three Laws of AI
8. AI for Science
9. Secrets to Massively Parallel Training

Software Labs：

1. Deep Learning in Python; Music Generation
2. Facial Detection Systems
3. Fine-Tune an LLM, You Must!

Capstone：Final Project + Project Pitch + 颁奖。

C10. MIT 6.5940 — TinyML and Efficient Deep Learning Computing#

URL：https://hanlab.mit.edu/courses/2024-fall-65940
总长度：26 lectures + 6 labs + Final Project

Lecture 清单（Fall 2024）：

1. Introduction
2. Basics of Deep Learning
3. Pruning and Sparsity (Part I)
4. Pruning and Sparsity (Part II)
5. Quantization (Part I)
6. Quantization (Part II)
7. Neural Architecture Search (Part I)
8. Neural Architecture Search (Part II)
9. Knowledge Distillation
10. MCUNet: TinyML on Microcontrollers
11. TinyEngine and Parallel Processing
12. Transformer and LLM
13. Efficient LLM Deployment
14. LLM Post Training
15. Long Context LLM
16. Vision Transformer
17. GAN, Video, and Point Cloud
18. Diffusion Model
19. Distributed Training (Part I)
20. Distributed Training (Part II)
21. On-Device Training and Transfer Learning
22. Course Summary + Quantum ML I
23. Quantum ML II

24-26. Final Project Presentations

Labs：

Lab 0: Basics
Lab 1: Pruning
Lab 2: Quantization
Lab 3: Neural Architecture Search
Lab 4: LLM Compression
Lab 5: LLM Deployment

Capstone：Final Project（4-5 人 / proposal + final report）。

C11. CMU 10-714 — Deep Learning Systems#

URL：https://dlsyscourse.org/
总长度：每周 2 lectures（TR 11:00-12:20）

5 Homework Assignments：

HW0: Background review and softmax regression
HW1: Automatic differentiation framework
HW2: Neural network modules and optimizers
HW3: NDArray library with CPU/CUDA backends
HW4: Convolutions, RNNs, and advanced architectures

Capstone：Final Project（在 Needle 框架中实现新 feature + 模型）。

C12. CMU 11-711 — Advanced NLP（Graham Neubig，Spring 2025 / Fall 2024）#

URL：https://cmu-l3.github.io/anlp-spring2025/ + https://phontron.com/class/anlp-fall2024/
总长度：27 lectures

Spring 2025 Lecture 清单：

1. Introduction and Basics
2. Word Representation and Text Classification
3. Language Modeling Fundamentals
4. Recurrent Neural Networks
5. Attention and Transformers
6. Pretraining
7. Decoding and Generation（Guest: Amanda Bertsch）
8. Prompting
9. Fine-Tuning
10. Retrieval and RAG（Guest: Akari Asai）
11. Reinforcement Learning
12. Evaluating Language Generators（Guest: Seungone Kim）
13. Experimental Design
14. Agents
15. Quantization（Guest: Tim Dettmers）
16. Advanced Pretraining: Parallelism and Scaling
17. Project Discussion
18. Long Sequence Models
19. Advanced Inference Strategies
20. Efficient Inference
21. Advanced Post-Training
22. Multimodal Models I
23. AI for Mathematics
24. Multimodal Models II

25-26. Poster Presentations

Assignments：

A1: 个人（01/28-02/14）
A2: 团队（02/13-03/14）
A3: 团队 — Literature survey & baseline reproduction（02/20-04/07）
A4: 团队 — Final project（02/20-04/27）

Grading：Quizzes 20%（drop 3）/ A1 15% / A2 15% / A3 20% / A4 (Project) 30%。

C13. CMU 17-645 — Machine Learning in Production / AI Engineering（Christian Kästner）#

URL：https://mlip-cmu.github.io/s2025/ + https://github.com/mlip-cmu
配套教材：Machine Learning in Production: From Models to Products

Spring 2025 Schedule（14 周）：

Week 1: Introduction and Motivation / From Models to AI-Enabled Systems / Lab 1: Flask APIs
Week 2: Gathering Requirements / Lab 2: Kafka stream processing（I1 ML Product due）
Week 3: Planning for Mistakes / Model Quality / Lab 3: git collaboration（I2 Requirements due）
Week 4: Fostering Interdisciplinary Teams / Behavioral Model Testing / Lab 4: Model testing
Week 5: Toward Architecture and Design / Deploying a Model / Lab 5: Docker containers
Week 6: Testing and Experimenting in Production / Data Quality / Lab 6: Jenkins CI（M1 Modeling and First Deployment due）
Week 7: Automating and Testing ML Pipelines / Midterm 1
Week 8: Scaling the System / Planning for Operations / Lab 7: Prometheus + Grafana
Week 9: Versioning / Process & Technical Debt / Lab 8: MLflow（M2 Infrastructure Quality due; I3 MLOps Tools due）
Week 10: Intro to Ethics + Fairness / Measuring Fairness / Lab 9: Kubernetes
Week 11: Building Fairer Systems / Guest Lecture
Week 12: Explainability / Transparency & Accountability / Lab 10: Model Explainability（M3 Monitoring and CD due）
Week 13: Safety / Security and Privacy / Lab 11: LLM Jailbreaking（I4 Explainability due）
Week 14: Discussion / Summary / Review / Midterm 2（M4 Fairness, Security and Feedback Loops due）
Final: Final report and video（May 5）

Grading：Individual 35% / Group project 30% / Midterms 15% / Participation 5% / Labs 10% / Reading quizzes 5%。

C14. Berkeley CS285 — Deep Reinforcement Learning（Sergey Levine）#

URL：https://rail.eecs.berkeley.edu/deeprlcourse/
总长度：25 lectures

Lecture 模块：

Weeks 1-2: Introduction / Behavioral Cloning / RL Basics
Weeks 3-4: Policy Gradients / Actor Critic / Value-Based RL / Q-learning in Practice
Week 5: Advanced Policy Gradients（2 部分）
Weeks 6-7: Variational Inference / VI in RL / Control as Inference / LLM RL
Week 8: Model-Based RL（2 部分）
Week 9: Offline RL（2 部分）
Week 10: Exploration / RL Theory
Weeks 11-12: Midterm review / Advanced Exploration / Multi-task RL
Weeks 13-14: Challenges and Open Problems / Final guest

5 Homework Assignments：

1. Imitation Learning
2. Policy Gradients
3. Q-Learning and Actor Critic
4. LLM RL
5. Offline RL

Final Project Options：Offline-to-Online RL / LLM RL。

C15. Princeton COS 597G — Understanding Large Language Models（Danqi Chen，Fall 2022）#

URL：https://www.cs.princeton.edu/courses/archive/fall22/cos597G/
总长度：~23 lectures

Lecture 清单：

Date	Topic	Primary Papers
Sep 7	Introduction to LLMs	Human Language Understanding; Attention Is All You Need
Sep 12	BERT (encoder-only)	BERT
Sep 14	T5 (encoder-decoder)	T5
Sep 19	GPT-3 (decoder-only)	Language Models are Few-Shot Learners
Sep 21	Few-shot prompting	LM-BFF; How Many Data Points is a Prompt Worth
Sep 26	Parameter-efficient adaptation	Prefix-Tuning; Prompt Tuning
Sep 28	In-context learning	Rethinking Demonstrations; Implicit Bayesian Inference
Oct 3	Calibration	Calibrate Before Use; Surface Form Competition
Oct 5	Reasoning	Chain of Thought; Zero-Shot Reasoners
Oct 10	Knowledge	LM as Knowledge Bases; How Much Knowledge in Parameters
Oct 12	Training data	Documenting Webtext; The Pile
Oct 24	Scaling laws	Chinchilla; Scaling Laws for LMs
Oct 26	Privacy	Extracting Training Data; Quantifying Memorization
Oct 31	Bias & toxicity eval	RealToxicityPrompts; OPT Section 4
Nov 2	Bias & toxicity mitigation	Self-Diagnosis and Self-Debiasing
Nov 7	Sparse / mixture-of-experts	Switch Transformers
Nov 9	Retrieval-based models	RETRO
Nov 14	Human feedback training	InstructGPT
Nov 16	Code language models	Codex
Nov 21	Multimodal LMs	Flamingo
Nov 28	Guest: Alexander Rush — T0 / zero-shot generalization
Nov 30	AI alignment & open discussion
Dec 5	Final project presentations

Grading：Participation 25%（pre-lecture readings + 3 questions）/ Presentations 30%（60-min student-led）/ Lecture feedback 5% / Final project 40%。

Project Options：Fine-tune medium LMs（BERT/RoBERTa/T5/GPT-2 via HF）or 评估大模型（GPT-3, Codex）。

C16. Princeton COS 597R — Deep Dive into Large Language Models（Danqi Chen + Sanjeev Arora，Fall 2024）#

URL：https://princeton-cos597r.github.io/
总长度：~23 lectures

Lecture 清单：

Date	Instructor	Topic	Key Papers
Sep 4	Sanjeev	Introduction	—
Sep 9	Danqi	Pretraining 1	GPT-3, Transformers
Sep 11	Danqi	Pretraining 2	GPT-3 续 + Llama 3 Herd
Sep 16	Sanjeev	Scaling laws	Chinchilla, Data-Constrained Models
Sep 18	Sanjeev	Emergent behavior	Emergent Abilities; Theory for Emergence
Sep 23	Danqi	Data curation	Dolma, FineWeb, RefinedWeb
Sep 25	Danqi	Post-training: Instruction tuning	Scaling Instruction-Finetuned LMs
Sep 30	Danqi	Post-training: Preferences	RLHF, DPO
Oct 2	Sanjeev	Alignment	General Language Assistant
Oct 7	Sanjeev	Constitutional AI	Constitutional AI
Oct 9	Sanjeev	LLM Metacognition	Metacognitive Capabilities
Oct 21	Tianyu Gao	Long-context models	How to Train Long-Context LMs
Oct 23	Sanjeev	Advanced alignment	OpenAI o1 System Card; Weak-to-Strong
Oct 28	Danqi/Sanjeev	LLM Reasoning 1	Let's Verify Step by Step
Oct 30	Danqi	LLM Reasoning 2	Scaling LLM Test-Time Compute
Nov 4	Mengzhou Xia	Small models	Sheared LLaMA, Gemma 2
Nov 6	Danqi	Retrieval-augmented LMs	RETRO
Nov 11	Yu Su (OSU)	Language Agents	HippoRAG
Nov 13	Danqi	RAG	REALM, kNN-LM, FLARE, Self-RAG
Nov 18	Tri Dao	Hardware-aware algorithms	FlashAttention, Mamba
Nov 20	Saining Xie (NYU)	Multimodal LLMs	LLaVA, Cambrian-1, Molmo, MM1
Nov 25 / Dec 2 / Dec 4	Students	Project presentations	—

Grading：Participation 30% / Debate 15% / Lecture scribing 10% / Final project 35% / Project presentations 10%。

Debate Panels：Weeks 4-9 共 12 debate panels（每 5 个学生为一组，每 paper 有 presenter / critic / proponent）。

D 类 — 实操 / 工程社区课程#

D1. Full Stack Deep Learning（Sergey Karayev / Pieter Abbeel / Josh Tobin）#

URL：https://fullstackdeeplearning.com/course/

FSDL 2022 (Spring) — 9 Lectures + 6 Labs：

Lectures：

1. Course Vision and When to Use ML
2. Development Infrastructure & Tooling
3. Troubleshooting & Testing
4. Data Management
5. Deployment
6. Continual Learning
7. Foundation Models
8. ML Teams and Project Management
9. Ethics

Labs：

Lab 0: Overview
Lab 4: Experiment Management
Lab 5: Troubleshooting & Testing
Lab 6: Data Annotation
Lab 7: Web Deployment
Lab 8: Model Monitoring

外加 Project Showcase。

早期版本：FSDL 2021（Berkeley + online）/ 2020（UW）/ 2019 / 2018。

D2. Full Stack LLM Bootcamp（Spring 2023）#

URL：https://fullstackdeeplearning.com/llm-bootcamp/spring-2023/

8 个 Lectures：

1. Learn to Spell: Prompt Engineering
2. LLMOps
3. UX for Language User Interfaces
4. Augmented Language Models
5. Launch an LLM App in One Hour
6. LLM Foundations
7. Project Walkthrough: askFSDL
8. What's Next?

3 个 Invited Talks：

Reza Shabani — Training custom LLMs
Harrison Chase — Agents
Peter Welinder (OpenAI VP) — Fireside Chat

Capstone：项目 askFSDL（Q&A 系统，Python/Modal/Gantry）。

D3. MLOps Zoomcamp（DataTalks.Club）#

URL：https://github.com/DataTalksClub/mlops-zoomcamp

7 个 Module：

1. Introduction — MLOps 定义、maturity model、NY Taxi 数据集、环境搭建 + Homework
2. Experiment Tracking & Model Management — MLflow 基础、模型保存/加载/注册 + Homework
3. Orchestration & ML Pipelines + Homework
4. Model Deployment — online (web/streaming) vs offline (batch)、Flask、AWS Kinesis + Lambda、batch scoring + Homework
5. Model Monitoring — Prometheus + Evidently + Grafana、Prefect + MongoDB + Homework
6. Best Practices — 单元/集成测试、linting/pre-commit、GitHub Actions CI/CD、Terraform IaC + Homework
7. Final Project — 端到端整合所有概念

D4. LLM Zoomcamp（DataTalks.Club）#

URL：https://github.com/DataTalksClub/llm-zoomcamp

7 个 Module + Workshop：

1. Introduction to LLMs & RAG — 基础 RAG pipeline + text search
2. Vector Search — semantic embeddings 索引/检索
3. Agents — 自治 tool use 和 function calling
4. (Workshop) Data Ingestion — dlt 从外部源接入 RAG
5. Evaluation — 离线 + 在线 retrieval / answer 质量
6. Monitoring — 用户反馈 + live dashboards
7. Best Practices — LangChain、hybrid search（vector+keyword）、reranking
8. End-to-End Project — 健身助手 with LLMs

Capstone：完整 RAG 应用（知识库 + retrieval + LLM 集成 + evaluation + UI + 监控反馈）。Certification 需提交项目并 peer review 3 个其他学生项目。

D5. AI Engineer World's Fair 工作坊#

URL：https://www.ai.engineer/

每年 6 月在旧金山，workshop 由各家公司主讲（Anthropic、OpenAI、LangChain、LlamaIndex、Modal、Weaviate、Pinecone、Together AI 等）。典型 workshop 包括：

LLMs for the Working Programmer
2-hour workshop on Gemini 2.5
Hamel Husain — How to Construct Domain Specific LLM Evaluation Systems

完整 schedule 见每年事件页（2024 / 2025）。

D6. Modular Mojo AI Engineer 培训#

官方文档：https://mojolang.org/docs/manual/
MAX 平台：https://www.modular.com/max/mojo

不是正式课程，而是 docs + 工具链：

Mojo 语法、static class system（structs、fn）
Modular Engine（LLVM + MLIR）
Modular Playground（Jupyter-style IDE）

第三方 Udemy 课程：

Mojo Programming Training: Powering High-Performance AI
Mojo Programming: Basic to Expert with AI Use cases

D7. Replit Learn#

URL：https://learn.replit.com/

目前提供：

1. AI foundations（30 min, 7 lessons）— 用 AI 构建 app
2. Enterprise foundations（30 min, 6 lessons）
3. Intro to Replit（45 min, 5 lessons）— coming soon
4. Advanced vibes（30 min, 5 lessons）— coming soon

Replit 100 Days of Python（讲师：David Morgan）：

100 天 / 每天 15 分钟
覆盖：变量、math 函数、for/while/while-true 循环、libraries、subroutines、lists/2D lists、字典、文件读写、eval()、try/except、csv 操作、os library、recursion、debugging、数据库函数、HTML/CSS Crash Course、Flask
项目：tip calculator、grade calculator、multiplayer games、login systems、web scraping

D8. The AI Engineering Bootcamp（Greg Loughnane + Chris Alexiuk）#

URL：https://maven.com/aimakerspace/ai-eng-bootcamp
总长度：10 周

涵盖 AI 原则、Python、NLP、LLMs、Transformers、LangChain orchestration。Build, ship, and share。

附录 — 总览整理#

类别	课程数	共同特点（不做评价，只记观察）
A 神级个人	19	单人 IP 主导，重 "从零搓" 哲学
B 公司机构	18	教学 = 推广 SDK / 平台
C 大学开源	16	论文密度高，每周 paper 阅读 + project
D 工程社区	8	每年迭代，end-to-end capstone

所有课程汇总：~61 门具体课程 + N 个 specialization / track。

重复出现的结构模式（仅观察，不做评价）：

时长：单课程多在 3-12 周
Capstone：约 70% 课程有 final project 或 end-to-end capstone
Lecture 数：单课程多在 9-25 个 lecture
作业密度：高质量课程多有 3-5 个 assignment 横跨学期

主要信息源

karpathy.ai / cs231n.stanford.edu
course.fast.ai / course19.fast.ai
coursera.org/specializations/deep-learning
coursera.org/specializations/machine-learning-introduction
coursera.org/learn/generative-ai-with-llms
deeplearning.ai/courses
github.com/rasbt/LLMs-from-scratch
github.com/chiphuyen/aie-book
github.com/mlabonne/llm-course
eugeneyan.com
hamel.dev/talks.html
parlance-labs.com/education/
maven.com/parlance-labs/evals
huggingface.co/learn （NLP/LLM/Agents/RL/Audio/Diffusion/CV）
cohere.com/llmu
anthropic.skilljar.com / aifluencyframework.org
academy.openai.com
developers.google.com/machine-learning/crash-course
skills.google
github.com/microsoft/AI-For-Beginners
github.com/microsoft/generative-ai-for-beginners
github.com/microsoft/ML-For-Beginners
github.com/meta-llama/llama-recipes
nvidia.com/en-us/training
web.stanford.edu/class/cs224n + cs224u + cs224w + cs234 + cs330 + cs25 + cs329s
cs336.stanford.edu/spring2025
introtodeeplearning.com
hanlab.mit.edu/courses/2024-fall-65940
dlsyscourse.org
cmu-l3.github.io/anlp-spring2025
phontron.com/class/anlp-fall2024
mlip-cmu.github.io
rail.eecs.berkeley.edu/deeprlcourse
cs.princeton.edu/courses/archive/fall22/cos597G
princeton-cos597r.github.io
fullstackdeeplearning.com
github.com/DataTalksClub/mlops-zoomcamp + llm-zoomcamp
ai.engineer
modular.com/max/mojo
learn.replit.com