~37 min read

AI 时代专业课程设置参考资料

调研范围:全球独立大牛 / 个人 IP / 研究机构 / 公司 / 大学开源 / 工程社区开设的 AI 课程。

只列课表(每门课讲什么、有几节、时长、是否有 capstone),不写分析、评价、价格比较。

所有人名 / 课程名保留英文。


A 类 — 神级个人课程#


A1. Neural Networks: Zero to Hero — Andrej Karpathy#

完整视频清单:

  1. 1. The spelled-out intro to neural networks and backpropagation: building micrograd(145 分钟)
  2. 2. The spelled-out intro to language modeling: building makemore(117 分钟)— bigram 字符级语言模型,PyTorch tensors 入门
  3. 3. Building makemore Part 2: MLP(75 分钟)— 多层感知机、学习率、超参、过拟合
  4. 4. Building makemore Part 3: Activations & Gradients, BatchNorm(115 分钟)
  5. 5. Building makemore Part 4: Becoming a Backprop Ninja(56 分钟)— 不用 autograd 手动反传
  6. 6. Building makemore Part 5: Building a WaveNet(56 分钟)
  7. 7. Let's build GPT: from scratch, in code, spelled out(116 分钟)— 跟着 "Attention is All You Need" 和 GPT-2/GPT-3 paper
  8. 8. Let's build the GPT Tokenizer(133 分钟)— BPE 实现

Capstone / 作业:无正式作业,每个 video 末尾有 suggested exercises。


A2. Software Is Changing (Again) — Andrej Karpathy(Y Combinator 2025 keynote)#

章节:

  1. 1. Software Evolution: From 1.0 to 3.0
  2. 2. Programming in English: Rise of Software 3.0
  3. 3. LLMs as Infrastructure(LLM OS, fabs, utilities)
  4. 4. Psychology of LLMs("people spirits", jagged intelligence, hallucinations)
  5. 5. Partial Autonomy & Human-AI Collaboration
  6. 6. Tesla Autopilot & Autonomy Sliders
  7. 7. Vibe Coding(MenuGen demo)
  8. 8. Building for Agents
  9. 9. Future Outlook(mainframe era of LLMs)

A3. Stanford CS231n — Convolutional Neural Networks for Visual Recognition(Karpathy 早期版本,Winter 2016)#

Lecture 清单:

  1. 1. Intro to Computer Vision, historical context
  2. 2. Image Classification — k-NN, Linear Classification
  3. 3. Linear classification II, optimization, SGD
  4. 4. Backpropagation, Intro to Neural Networks
  5. 5. Training Neural Networks Part 1(activations, init, batch norm)
  6. 6. Training Neural Networks Part 2(parameter updates, dropout)+ CNN intro
  7. 7. CNN architectures, convolution/pooling, ImageNet case studies
  8. 8. Localization and Detection
  9. 9. Visualization, DeepDream, Style Transfer, Adversarial Examples
  10. 10. RNN, LSTM — Language modeling, Image captioning
  11. 11. Practical ConvNet training(augmentation, transfer learning, distributed)
  12. 12. Frameworks(Caffe, Torch, Theano, TensorFlow)
  13. 13. Segmentation, Soft Attention, Spatial Transformer Networks
  14. 14. Videos and Unsupervised Learning
  15. 15. Invited talks / Conclusions

Lecture Notes(学生自学路径)

Capstone:3 个 Assignment(kNN/SVM/Softmax/2-layer NN;CNN/Dropout/BN;RNN/LSTM/Captioning)+ Final Project。


A4. Stanford CS231n — Deep Learning for Computer Vision(最新 Spring 2026 版)#

Lecture 清单(按模块分):

Module 1 — Deep Learning Basics

  1. 1. Introduction
  2. 2. Image Classification with Linear Classifiers
  3. 3. Regularization and Optimization
  4. 4. Neural Networks and Backpropagation

Module 2 — Perceiving and Understanding the Visual World

  1. 5. Image Classification with CNNs
  2. 6. CNN Architectures
  3. 7. Recurrent Neural Networks
  4. 8. Attention and Transformers
  5. 9. Object Detection, Image Segmentation
  6. 10. Video Understanding
  7. 11. Large Scale Distributed Training

Module 3 — Generative and Interactive Visual Intelligence

  1. 12. Self-supervised Learning
  2. 13. Generative Models 1(VAEs, GANs, autoregressive)
  3. 14. Generative Models 2(Diffusion)
  4. 15. 3D Vision
  5. 16. Vision and Language
  6. 17. World Modeling
  7. 18. Human-Centered AI

Capstone:3 个 Assignments + Project Proposal + Final Project + Poster Session。


A5. Practical Deep Learning for Coders — Fast.ai Part 1(Jeremy Howard)#

Lesson 清单:

  1. 1. Getting started
  2. 2. Deployment
  3. 3. Neural net foundations
  4. 4. Natural Language (NLP)
  5. 5. From-scratch model
  6. 6. Random forests
  7. 7. Collaborative filtering
  8. 8. Convolutions (CNNs)
  9. 9. Bonus: Data ethics

Capstone:无正式 capstone,每 lesson 有 summary 文档 + Kaggle / 自建项目。

工具栈:PyTorch, fastai, Hugging Face Transformers, Gradio。


A6. Fast.ai Part 2 — Deep Learning from the Foundations / Stable Diffusion(Jeremy Howard)#

Lesson 清单(早期 "Deep Learning from the Foundations" 版本,lesson 8-14):

  1. 8. Matrix multiplication; forward and backward passes
  2. 9. Loss functions, optimizers, and the training loop
  3. 10. Looking inside the model(callbacks, hooks, batchnorm)
  4. 11. Data Block API, and generic optimizer
  5. 12. Advanced training techniques; ULMFiT from scratch
  6. 13. Basics of Swift for Deep Learning
  7. 14. C interop; Protocols; Putting it all together

最新版("Deep Learning Foundations to Stable Diffusion")扩展内容:matrix multiplication, clustering, backpropagation, autoencoders, attention, transformers, latent diffusion models。


A7. Deep Learning Specialization — Andrew Ng(DeepLearning.AI)#

课程清单:

  1. 1. Neural Networks and Deep Learning(25h)— 全连接网络、向量化、架构参数
  2. 2. Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization(24h)— Momentum, RMSprop, Adam, TensorFlow
  3. 3. Structuring Machine Learning Projects(7h)— 错误诊断、bias/variance、transfer / multi-task learning
  4. 4. Convolutional Neural Networks(36h)— CNN、neural style transfer、2D/3D 视觉
  5. 5. Sequence Models(37h)— RNN/LSTM/GRU、HuggingFace tokenizers、Transformer、NER、QA

Capstone:每门课有 programming assignments,无单独 capstone。


A8. Machine Learning Specialization — Andrew Ng(DeepLearning.AI / Stanford Online,2022 重制版)#

课程清单:

  1. 1. Supervised Machine Learning: Regression and Classification(33h)— 线性回归、逻辑回归、特征工程、训练优化
  2. 2. Advanced Learning Algorithms(34h)— Neural networks with TensorFlow、多分类、决策树、ensemble、transfer learning
  3. 3. Unsupervised Learning, Recommenders, Reinforcement Learning(28h)— 聚类、异常检测、降维、推荐系统、deep RL

A9. Machine Learning Engineering for Production (MLOps) Specialization — Andrew Ng / Robert Crowe(DeepLearning.AI)#

Machine Learning in Production 单课课表:

后续课程涵盖 ML Data Lifecycle, ML Modeling Pipelines, Deploying ML Models。


A10. Generative AI with Large Language Models — Andrew Ng / AWS(DeepLearning.AI)#

Week 清单:

Capstone:3 个 AWS labs(无独立项目)。


A11. DeepLearning.AI 其他 Specializations / Professional Certificates#

完整列表:

  1. 1. Machine Learning Specialization
  2. 2. Deep Learning Specialization
  3. 3. Natural Language Processing Specialization
  4. 4. TensorFlow Developer Professional Certificate
  5. 5. PyTorch for Deep Learning
  6. 6. Generative AI for Software Development
  7. 7. Data Analytics
  8. 8. Mathematics for Machine Learning and Data Science
  9. 9. AI for Good
  10. 10. AI for Medicine
  11. 11. Machine Learning Engineering for Production (MLOps)

典型 short courses(每个 1-2 小时):


A12. Build a Large Language Model From Scratch — Sebastian Raschka#

章节清单:

  1. 1. Understanding Large Language Models(无 code)
  2. 2. Working with Text Data — tokenization、data loading、preprocessing
  3. 3. Coding Attention Mechanisms — single-head 和 multi-head attention
  4. 4. Implementing a GPT Model from Scratch
  5. 5. Pretraining on Unlabeled Data
  6. 6. Finetuning for Text Classification
  7. 7. Finetuning to Follow Instructions(instruction tuning, alignment)

附录:

Bonus 材料:tokenizer 实现变种、attention 优化、Llama/Qwen/Gemma 架构、KV caching、MoE、preference tuning、evaluation frameworks。

配套课程:Manning Live Video Course "Master and Build Large Language Models" + 免费 YouTube playlist。

Capstone:每章末尾习题;完整端到端 GPT 实现就是项目。


A13. 其他 Sebastian Raschka 课程 / 书籍#


A14. Designing Machine Learning Systems — Chip Huyen#

章节清单:

  1. 1. Overview of Machine Learning Systems
  2. 2. Introduction to Machine Learning Systems Design — business/ML objectives, reliability, scalability, maintainability, adaptability, ML problem framing
  3. 3. Data Engineering Fundamentals — sources, formats, models, dataflow, batch vs stream
  4. 4. Training Data — sampling, labeling, class imbalance, augmentation
  5. 5. Feature Engineering — scaling, encoding, crossing, data leakage, generalization
  6. 6. Model Development and Offline Evaluation — ensembles, tracking, distributed training, AutoML, baselines
  7. 7. Model Deployment and Prediction Service — batch vs online, compression(factorization, distillation, pruning, quantization), edge
  8. 8. Data Distribution Shifts and Monitoring — shift detection、observability
  9. 9. Continual Learning and Test in Production — stateless vs stateful、shadow、A/B、canary、interleaving、bandits
  10. 10. Infrastructure and Tooling for MLOps — compute、dev env、containers、orchestrators、ML platform、feature store
  11. 11. The Human Side of Machine Learning — UX、team structure、responsible AI

配套课程:Stanford CS329S(见 C 类)。


A15. AI Engineering — Chip Huyen#

章节清单:

  1. 1. Introduction to Building AI Applications with Foundation Models — The Rise of AI Engineering, FM Use Cases, Planning AI Applications, The AI Engineering Stack
  2. 2. Understanding Foundation Models — Training Data, Modeling, Post-Training, Sampling
  3. 3. Evaluation Methodology — Challenges, Language Modeling Metrics, Exact Evaluation, AI as a Judge, Comparative Evaluation
  4. 4. Evaluate AI Systems — Evaluation Criteria, Model Selection, Design Your Evaluation Pipeline
  5. 5. Prompt Engineering — Introduction, Best Practices, Defensive Prompt Engineering
  6. 6. RAG and Agents — RAG, Agents, Memory
  7. 7. Finetuning — Overview, When to Finetune, Memory Bottlenecks, Techniques
  8. 8. Dataset Engineering — Data Curation, Augmentation/Synthesis, Processing
  9. 9. Inference Optimization — Understanding, Optimization
  10. 10. AI Engineering Architecture and User Feedback — Architecture, User Feedback

A16. Eugene Yan — 自学路径文章系列#

不是正式课程,但被广泛当成结构化自学材料,按主题:

LLMs:

ML Systems:

Recommender Systems:

Data Science:


A17. Mastering LLMs For Developers & Data Scientists — Hamel Husain + Dan Becker#

5 大主题(共 23+ sessions):

  1. 1. Evals(3 sessions)— LLM evaluation 框架与方法
  2. 2. RAG(3 talks)— RAG 基础到系统改进
  3. 3. Building Applications(4 resources)— 各种工具
  4. 4. Prompt Engineering(1 session)
  5. 5. Fine-Tuning(12+ sessions, 4 子分类)— Decision-making、Tools/Techniques(含 Wing Lian 讲 Axolotl、Zach Mueller 讲 HF Accelerate)、Deployment、Advanced

A18. AI Evals For Engineers & PMs — Hamel Husain + Shreya Shankar(Maven)#

周清单(部分公开):

Capstone:4 个 homework assignments + Recipe Bot Workflow 案例。


A19. The LLM Course — Maxime Labonne#

轨道 1:🧩 LLM Fundamentals

  1. 1. Mathematics for Machine Learning(线代、微积分、概率统计)
  2. 2. Python for Machine Learning(基础、NumPy/Pandas/Matplotlib/Seaborn、预处理、Scikit-learn)
  3. 3. Neural Networks(基础、训练优化、过拟合正则、MLP 实现)
  4. 4. NLP(预处理、特征提取、Word Embeddings、RNN/LSTM/GRU)

轨道 2:🧑‍🔬 The LLM Scientist

  1. 1. The LLM Architecture(encoder-decoder→decoder-only、tokenization、attention 变种、sampling)
  2. 2. Pre-Training Models(数据准备、distributed training: DP/PP/TP、训练优化、监控)
  3. 3. Post-Training Datasets(ShareGPT/ChatML/Alpaca、合成数据、增强、质量过滤)
  4. 4. Supervised Fine-Tuning(full FT、LoRA、QLoRA、DeepSpeed、FSDP)
  5. 5. Preference Alignment(rejection sampling、DPO、reward model、GRPO、PPO)
  6. 6. Evaluation(自动 benchmarks、人评、model-based、反馈信号)
  7. 7. Quantization(FP32/16/INT8、absmax、GGUF、llama.cpp、GPTQ、AWQ、SmoothQuant、ZeroQuant)
  8. 8. New Trends(SLERP/DARE/TIES merging、CLIP/SD/LLaVA 多模态、可解释性 SAE/abliteration、test-time compute)

轨道 3:👷 The LLM Engineer

  1. 1. Running LLMs(API: OpenAI/Google/Anthropic/OpenRouter/HF;开源: Ollama/LM Studio/llama.cpp;prompt engineering;结构化输出)
  2. 2. Building a Vector Storage(loading, splitting, embeddings, Chroma/Pinecone/Milvus/FAISS/Annoy)
  3. 3. RAG(LangChain/LlamaIndex/MCP、retriever 策略、记忆、Ragas/DeepEval)
  4. 4. Advanced RAG(SQL/Cypher、tools、re-ranking、RAG-fusion、DSPy)
  5. 5. Agents(基础、协议 MCP/A2A、OpenAI/Google ADK/Claude SDK、LangGraph/LlamaIndex/CrewAI/AutoGen)
  6. 6. Inference Optimization(Flash Attention、KV cache、MQA/GQA、EAGLE-3 speculative decoding)
  7. 7. Deploying LLMs(local、Gradio/Streamlit/HF Spaces、TGI/vLLM/SkyPilot、MLC LLM/mnn-llm)
  8. 8. Securing LLMs(prompt hacking、backdoor、red teaming、garak、langfuse)

Tools:LLM AutoEval, LazyMergekit, LazyAxolotl, AutoQuant, Model Family Tree, ZeroSpace, AutoAbliteration, AutoDedup

Notebooks:Fine-tune Llama 3.1 with Unsloth、Llama 3 with ORPO、Mistral-7b with DPO/QLoRA、CodeLlama with Axolotl、Llama 2 with QLoRA、4-bit GPTQ Quantization、GGUF 量化、ExLlamaV2、MergeKit MoE、abliteration、Knowledge Graphs、Decoding Strategies。


B 类 — 研究机构 / 公司课程#


B1. Hugging Face — 全部课程清单#

平台:https://huggingface.co/learn

课程URL slug简介
LLM Course(原 NLP Course)/llm-course用 HF 生态学 LLM/NLP
Context Course/context-coursecode agents 的 context engineering
Robotics Course/robotics-courseLeRobot
smol course/smol-coursepost-training 极简
Agents Course/agents-course构建 agents
Deep RL Course/deep-rl-course深度强化学习
Computer Vision Course/computer-vision-course计算机视觉
Audio Course/audio-coursetransformer 处理音频
Open-Source AI Cookbook/cookbook开源 notebook 合集
ML for Games Course/ml-games-course游戏中的 AI
Diffusion Course/diffusion-course扩散模型
ML for 3D Course/ml-for-3d-course3D ML

B2. Hugging Face LLM Course(NLP Course → LLM Course)#

章节清单:

Capstone:每章末尾 quizzes;建议项目在 forum 上有清单。


B3. Hugging Face Agents Course#

Chapter 清单:

ChapterTopic
0Onboarding
1Agent Fundamentals — Tools/Thoughts/Actions/Observations、LLMs、messages、special tokens、chat templates、Python 函数作为 tools
2Frameworks — smolagents、LangGraph、LlamaIndex
3Use Cases — 真实场景实战
4Final Assignment — 在 benchmark 上构建 agent 并在 leaderboard 上 PK

Bonus Units:

Capstone:Final benchmark agent + student leaderboard;有 certificate of fundamentals + certificate of completion 两种。


B4. Hugging Face Deep RL Course#

Unit 概要:


B5. Hugging Face Audio Course#

Unit 清单:

Capstone:每章有 hands-on 项目;80% / 100% 作业完成获 certificate。


B6. Hugging Face Diffusion Models Course#

Unit 清单:

每单元含 theory section + 2 notebooks。


B7. Hugging Face Community Computer Vision Course#

Unit 清单:

  1. 1. Fundamentals of Computer Vision — 图像基础、形成、预处理、特征提取
  2. 2. Convolutional Neural Networks (CNNs) — 架构、迁移学习
  3. 3. Vision Transformers — Swin、DETR、CVT、transfer learning
  4. 4. Multimodal Models — CLIP、GroupViT、BLIP、Owl-VIT
  5. 5. Generative Models — GANs、VAEs、Diffusion、text-to-image、inpainting
  6. 6. Basic Computer Vision Tasks — classification、object detection、segmentation、YOLO、SAM
  7. 7. Video and Video Processing — temporal continuity、motion estimation
  8. 8. 3D Vision, Scene Rendering, and Reconstruction — NeRF、GQN
  9. 9. Model Optimization — compression、distillation、pruning、TinyML
  10. 10. Synthetic Data Creation — point clouds、diffusion 合成数据
  11. 11. Zero Shot Computer Vision
  12. 12. Ethics and Biases in Computer Vision
  13. 13. Outlook and Emerging Trends — Retentive Network、Hiera、Hyena、I-JEPA

B8. Cohere LLM University#

Module 清单:

  1. 1. What are Large Language Models? — LLM 基础、embeddings、attention、transformer 架构、semantic search、实战
  2. 2. Text Representation with Cohere Endpoints — Classification / Embeddings / Semantic Search endpoints
  3. 3. Text Generation with Cohere Endpoints — generative learning、generated endpoint、prompt engineering
  4. 4. Deployment — AWS SageMaker、Streamlit、FastAPI
  5. 5. Semantic Search(2023 新增)
  6. 6. Prompt Engineering(2023 新增)
  7. 7. The Cohere Platform(2023 新增)

Tool Use 章节(2024-2025):

AWS 集成模块:

单独 chapter:


B9. Anthropic Academy — 全部课程清单#

完整课程列表:

  1. 1. Claude 101
  2. 2. Claude Code 101
  3. 3. Introduction to Claude Cowork
  4. 4. Claude Code in Action
  5. 5. AI Fluency: Framework & Foundations
  6. 6. Building with the Claude API
  7. 7. Introduction to Model Context Protocol
  8. 8. AI Fluency for Educators
  9. 9. AI Fluency for Students
  10. 10. Model Context Protocol: Advanced Topics
  11. 11. Claude with Amazon Bedrock
  12. 12. Claude with Google Cloud's Vertex AI
  13. 13. Teaching AI Fluency
  14. 14. AI Fluency for Nonprofits
  15. 15. Introduction to Agent Skills
  16. 16. Introduction to Subagents
  17. 17. AI Capabilities and Limitations
  18. 18. AI Fluency for Small Businesses

B10. Anthropic AI Fluency: Framework & Foundations(与 Joseph Feller / UCC 和 Rick Dakan / Ringling 合作)#

Module 清单:

  1. 1. Introduction to AI Fluency
  2. 2. The AI Fluency Framework — Why do we need AI Fluency? + The 4D Framework
  3. 3. Deep Dive 1: What is Generative AI? (Part 1) — fundamentals、capabilities & limitations
  4. 4. Delegation — A closer look + Project planning and Delegation
  5. 5. Description — A closer look + Deep Dive 2: Effective prompting techniques
  6. 6. Discernment — A closer look + Description-Discernment loop
  7. 7. Diligence — A closer look
  8. 8. Conclusion & Certificate — 总结 + 证书 + 额外活动

衍生课程:

所有材料以 CC BY-NC-SA 许可发布,由爱尔兰 Higher Education Authority 通过 National Forum for Teaching and Learning 资助。


B11. OpenAI Academy#

Collections:

Sector Collections:

Special Topics:

Stories & Case Studies:41 items

2025-2026 即将推出 AI fluency certifications(prompt engineering → AI-enabled work)。


B12. Google Machine Learning Crash Course#

模块清单:

ML Models

  1. 1. Linear Regression — 线性模型、loss、SGD、超参
  2. 2. Logistic Regression
  3. 3. Classification — 二分类、thresholding、confusion matrix、accuracy/precision/recall/AUC

Data

  1. 1. Working with Numerical Data
  2. 2. Working with Categorical Data — one-hot、feature hashing、mean encoding、feature crosses
  3. 3. Datasets, Generalization, and Overfitting

Advanced ML Models

  1. 1. Neural Networks — perceptron、hidden layers、activation
  2. 2. Embeddings
  3. 3. Intro to Large Language Models — tokens、Transformer、训练

Real-World ML

  1. 1. Production ML Systems
  2. 2. AutoML
  3. 3. ML Fairness

B13. Google Generative AI Learning Path#

整体覆盖:generative AI 基础、LLMs、responsible AI。具体课程清单需登录 Google Skills 查看(搜索结果不公开详尽 list)。


B14. Microsoft AI for Beginners#

Section 清单:

I. Introduction to AI

II. Symbolic AI

III. Introduction to Neural Networks

IV. Computer Vision

V. Natural Language Processing

VI. Other AI Techniques

VII. AI Ethics


B15. Microsoft Generative AI for Beginners#

Lesson 清单:

  1. 0. Course Setup
  2. 1. Introduction to Generative AI and LLMs
  3. 2. Exploring and Comparing Different LLMs
  4. 3. Using Generative AI Responsibly
  5. 4. Understanding Prompt Engineering Fundamentals
  6. 5. Creating Advanced Prompts
  7. 6. Building Text Generation Applications
  8. 7. Building Chat Applications
  9. 8. Building Search Apps with Vector Databases
  10. 9. Building Image Generation Applications
  11. 10. Building Low Code AI Applications
  12. 11. Integrating External Applications with Function Calling
  13. 12. Designing UX for AI Applications
  14. 13. Securing Your Generative AI Applications
  15. 14. The Generative AI Application Lifecycle(LLMOps)
  16. 15. Retrieval Augmented Generation and Vector Databases
  17. 16. Open Source Models and Hugging Face
  18. 17. AI Agents
  19. 18. Fine-Tuning LLMs
  20. 19. Building with SLMs
  21. 20. Building with Mistral Models
  22. 21. Building with Meta Models

B16. Microsoft Machine Learning for Beginners#

Lesson 清单:

1. Introduction(4 lessons)

2. Regression(4 lessons)

3. Web App

4. Classification(4 lessons)

5. Clustering(2 lessons)

6. NLP(5 lessons)

7. Time Series(3 lessons)

8. Reinforcement Learning(2 lessons)

9. Real-World(2 postscript)


B17. Meta AI / Llama Cookbook(公开教程)#

不是正式课程,但是结构化教程集合:

Top-Level Categories:

  1. 1. Getting Started — Inference、Fine-tuning、RAG
  2. 2. End-to-End Use Cases — WhatsApp bot with Llama 4、Research paper analysis、Book character mind mapping
  3. 3. Third-Party Integrations
  4. 4. Latest Llama 4 Recipes — Llama API、Long context(5M tokens with Scout)、Maverick advanced analysis

主要为 Jupyter notebooks(93.7% 内容)。

与 DeepLearning.AI 合作课程:


B18. NVIDIA Deep Learning Institute(DLI)#

Tracks:

  1. 1. Accelerated Computing
  2. 2. Data Science
  3. 3. Deep Learning
  4. 4. Generative AI / LLM
  5. 5. Infrastructure
  6. 6. Simulation and Physical AI

课程形式:

已知具体课程:

(完整 catalog 需查 PDF 或 NVIDIA 在线目录)


C 类 — 大学开源 / 顶级开课#


C1. Stanford CS224N — Natural Language Processing with Deep Learning#

Winter 2026 Schedule:

Assignments:


C2. Stanford CS224U — Natural Language Understanding#

Spring 2023 Schedule:

Assignments:


C3. Stanford CS224W — Machine Learning with Graphs#

Lecture 清单:

  1. 1. Introduction
  2. 2. Node Embeddings
  3. 3. Graph Neural Networks
  4. 4. A General Perspective on GNNs
  5. 5. GNN Augmentation and Training
  6. 6. Theory of GNNs
  7. 7. Designing Powerful Graph Encoders
  8. 8. Graph Transformers
  9. 9. Heterogeneous Graphs
  10. 10. Knowledge Graphs
  11. 11. GNNs for Recommender Systems
  12. 12. Relational Deep Learning
  13. 13. Advanced Architectures in RDL
  14. 14. Advanced Topics in GNNs
  15. 15. Towards Foundation Models for Knowledge Graphs
  16. 16. LLM + GNN
  17. 17. Agents + Graphs
  18. 18. Deep Generative Models for Graphs
  19. 19. Conclusion

C4. Stanford CS234 — Reinforcement Learning#

Winter 2026 Schedule:

Capstone:3 个 Assignments + Final Project(proposal、milestone、poster、writeup)。


C5. Stanford CS330 — Deep Multi-Task and Meta Learning#

Week 清单:

Grading:4 HW(HW0 5%, HW1-3 15% each)+ Project 50%。


C6. Stanford CS25 — Transformers United(seminar 系列)#

所有 Season + 嘉宾:

V1(Fall 2021):

  1. 1. Introduction to Transformers(instructors)
  2. 2. Mark Chen (OpenAI) — "Transformers in Language: GPT-3, Codex"
  3. 3. Lucas Beyer (Google Brain) — "Applications in Vision"
  4. 4. Aditya Grover (FAIR) — "Transformers in RL & Universal Compute Engines"
  5. 5. Barret Zoph + Irwan Bello + Liam Fedus (Google Brain) — "Scaling transformers"
  6. 6. Andrew Jaegle (DeepMind) — "Perceiver"
  7. 7. Aidan Gomez (Oxford) — "Self Attention & Non-Parametric Transformers"
  8. 8. Geoffrey Hinton (Toronto) — "GLOM"
  9. 9. Chris Olah (Anthropic) — "Interpretability with transformers"
  10. 10. Prateek Verma (Stanford) — "Transformers for Audio, Speech and Music"

V2(Winter 2023):

V3(Fall 2023):

V4(Spring 2024):

  1. 1. Instructors — Overview of Transformers
  2. 2. Jason Wei (OpenAI) — Intuitions on Language Models
  3. 3. Hyung Won Chung (OpenAI) — Shaping the Future of AI from the History of Transformer
  4. 4. Nathan Lambert (AI2) — Aligning Open Language Models
  5. 5. Albert Jiang (Mistral / Cambridge) — Demystifying Mixtral of Experts
  6. 6. Jake Williams (Drexel) — Transformers that Transform Well Enough to Support Near-Shallow Architectures
  7. 7. Ming Ding (Zhipu AI) — From LLMs to Large Multimodal Models
  8. 8. Edward Hu (ex-OpenAI) — New training objective for LLMs
  9. 9. Loubna Ben Allal (HF) — Behind the Scenes of LLM Pre-training: StarCoder

V6(Spring 2026):

  1. 1. Apr 9: Hazel Nam & Lucas Maes (Brown) — From Representation Learning to World Modeling through JEPAs
  2. 2. Apr 16: Albert Gu (CMU, Cartesia AI) — Tradeoffs of State Space Models and Transformers
  3. 3. Apr 23: Nouamane Tazi (HF) — Ultra-Scale Talk: Scaling Training to Thousands of GPUs
  4. 4. Apr 30: Shrimai Prabhumoye (Mistral AI) — From Next-Token Prediction to Next-Generation Intelligence
  5. 5. May 7: Andrew Lampinen (Anthropic) — Distinct Modes of Generalization from Parameters and Context
  6. 6. May 14: Vivek Natarajan (DeepMind) — Advancing Science and Medicine with Collaborative AI Agents
  7. 7. May 21: Victoria Lin (Thinking Machines) — From LMs to Native Multimodal Intelligence
  8. 8. May 28: Charles Frye (Modal) — Serving Transformers: Lessons from the Trenches of Production Inference

C7. Stanford CS329S — Machine Learning Systems Design#

Schedule:


C8. Stanford CS336 — Language Modeling from Scratch(2025 新)#

Lecture 清单:

  1. 1. Overview, tokenization(Percy)
  2. 2. PyTorch, resource accounting(Percy)
  3. 3. Architectures, hyperparameters(Tatsu)
  4. 4. Mixture of experts(Tatsu)
  5. 5. GPUs(Tatsu)
  6. 6. Kernels, Triton(Tatsu)
  7. 7. Parallelism(Tatsu)
  8. 8. Parallelism(Percy)
  9. 9. Scaling laws(Tatsu)
  10. 10. Inference(Percy)
  11. 11. Scaling laws(Tatsu)
  12. 12. Evaluation(Percy)
  13. 13. Data(Percy)
  14. 14. Data(Percy)
  15. 15. Alignment - SFT/RLHF(Tatsu)
  16. 16. Alignment - RL(Tatsu)
  17. 17. Alignment - RL(Percy)
  18. 18. Guest: Junyang Lin
  19. 19. Guest: Mike Lewis

Assignments:


C9. MIT 6.S191 — Introduction to Deep Learning#

Lecture 清单:

  1. 1. Intro to Deep Learning
  2. 2. Deep Sequence Modeling
  3. 3. Deep Computer Vision
  4. 4. Deep Generative Modeling
  5. 5. Deep Reinforcement Learning
  6. 6. New Frontiers
  7. 7. The Three Laws of AI
  8. 8. AI for Science
  9. 9. Secrets to Massively Parallel Training

Software Labs:

  1. 1. Deep Learning in Python; Music Generation
  2. 2. Facial Detection Systems
  3. 3. Fine-Tune an LLM, You Must!

Capstone:Final Project + Project Pitch + 颁奖。


C10. MIT 6.5940 — TinyML and Efficient Deep Learning Computing#

Lecture 清单(Fall 2024):

  1. 1. Introduction
  2. 2. Basics of Deep Learning
  3. 3. Pruning and Sparsity (Part I)
  4. 4. Pruning and Sparsity (Part II)
  5. 5. Quantization (Part I)
  6. 6. Quantization (Part II)
  7. 7. Neural Architecture Search (Part I)
  8. 8. Neural Architecture Search (Part II)
  9. 9. Knowledge Distillation
  10. 10. MCUNet: TinyML on Microcontrollers
  11. 11. TinyEngine and Parallel Processing
  12. 12. Transformer and LLM
  13. 13. Efficient LLM Deployment
  14. 14. LLM Post Training
  15. 15. Long Context LLM
  16. 16. Vision Transformer
  17. 17. GAN, Video, and Point Cloud
  18. 18. Diffusion Model
  19. 19. Distributed Training (Part I)
  20. 20. Distributed Training (Part II)
  21. 21. On-Device Training and Transfer Learning
  22. 22. Course Summary + Quantum ML I
  23. 23. Quantum ML II

24-26. Final Project Presentations

Labs:

Capstone:Final Project(4-5 人 / proposal + final report)。


C11. CMU 10-714 — Deep Learning Systems#

5 Homework Assignments:

Capstone:Final Project(在 Needle 框架中实现新 feature + 模型)。


C12. CMU 11-711 — Advanced NLP(Graham Neubig,Spring 2025 / Fall 2024)#

Spring 2025 Lecture 清单:

  1. 1. Introduction and Basics
  2. 2. Word Representation and Text Classification
  3. 3. Language Modeling Fundamentals
  4. 4. Recurrent Neural Networks
  5. 5. Attention and Transformers
  6. 6. Pretraining
  7. 7. Decoding and Generation(Guest: Amanda Bertsch)
  8. 8. Prompting
  9. 9. Fine-Tuning
  10. 10. Retrieval and RAG(Guest: Akari Asai)
  11. 11. Reinforcement Learning
  12. 12. Evaluating Language Generators(Guest: Seungone Kim)
  13. 13. Experimental Design
  14. 14. Agents
  15. 15. Quantization(Guest: Tim Dettmers)
  16. 16. Advanced Pretraining: Parallelism and Scaling
  17. 17. Project Discussion
  18. 18. Long Sequence Models
  19. 19. Advanced Inference Strategies
  20. 20. Efficient Inference
  21. 21. Advanced Post-Training
  22. 22. Multimodal Models I
  23. 23. AI for Mathematics
  24. 24. Multimodal Models II

25-26. Poster Presentations

Assignments:

Grading:Quizzes 20%(drop 3)/ A1 15% / A2 15% / A3 20% / A4 (Project) 30%。


C13. CMU 17-645 — Machine Learning in Production / AI Engineering(Christian Kästner)#

Spring 2025 Schedule(14 周):

Grading:Individual 35% / Group project 30% / Midterms 15% / Participation 5% / Labs 10% / Reading quizzes 5%。


C14. Berkeley CS285 — Deep Reinforcement Learning(Sergey Levine)#

Lecture 模块:

5 Homework Assignments:

  1. 1. Imitation Learning
  2. 2. Policy Gradients
  3. 3. Q-Learning and Actor Critic
  4. 4. LLM RL
  5. 5. Offline RL

Final Project Options:Offline-to-Online RL / LLM RL。


C15. Princeton COS 597G — Understanding Large Language Models(Danqi Chen,Fall 2022)#

Lecture 清单:

DateTopicPrimary Papers
Sep 7Introduction to LLMsHuman Language Understanding; Attention Is All You Need
Sep 12BERT (encoder-only)BERT
Sep 14T5 (encoder-decoder)T5
Sep 19GPT-3 (decoder-only)Language Models are Few-Shot Learners
Sep 21Few-shot promptingLM-BFF; How Many Data Points is a Prompt Worth
Sep 26Parameter-efficient adaptationPrefix-Tuning; Prompt Tuning
Sep 28In-context learningRethinking Demonstrations; Implicit Bayesian Inference
Oct 3CalibrationCalibrate Before Use; Surface Form Competition
Oct 5ReasoningChain of Thought; Zero-Shot Reasoners
Oct 10KnowledgeLM as Knowledge Bases; How Much Knowledge in Parameters
Oct 12Training dataDocumenting Webtext; The Pile
Oct 24Scaling lawsChinchilla; Scaling Laws for LMs
Oct 26PrivacyExtracting Training Data; Quantifying Memorization
Oct 31Bias & toxicity evalRealToxicityPrompts; OPT Section 4
Nov 2Bias & toxicity mitigationSelf-Diagnosis and Self-Debiasing
Nov 7Sparse / mixture-of-expertsSwitch Transformers
Nov 9Retrieval-based modelsRETRO
Nov 14Human feedback trainingInstructGPT
Nov 16Code language modelsCodex
Nov 21Multimodal LMsFlamingo
Nov 28Guest: Alexander Rush — T0 / zero-shot generalization
Nov 30AI alignment & open discussion
Dec 5Final project presentations

Grading:Participation 25%(pre-lecture readings + 3 questions)/ Presentations 30%(60-min student-led)/ Lecture feedback 5% / Final project 40%。

Project Options:Fine-tune medium LMs(BERT/RoBERTa/T5/GPT-2 via HF)or 评估大模型(GPT-3, Codex)。


C16. Princeton COS 597R — Deep Dive into Large Language Models(Danqi Chen + Sanjeev Arora,Fall 2024)#

Lecture 清单:

DateInstructorTopicKey Papers
Sep 4SanjeevIntroduction
Sep 9DanqiPretraining 1GPT-3, Transformers
Sep 11DanqiPretraining 2GPT-3 续 + Llama 3 Herd
Sep 16SanjeevScaling lawsChinchilla, Data-Constrained Models
Sep 18SanjeevEmergent behaviorEmergent Abilities; Theory for Emergence
Sep 23DanqiData curationDolma, FineWeb, RefinedWeb
Sep 25DanqiPost-training: Instruction tuningScaling Instruction-Finetuned LMs
Sep 30DanqiPost-training: PreferencesRLHF, DPO
Oct 2SanjeevAlignmentGeneral Language Assistant
Oct 7SanjeevConstitutional AIConstitutional AI
Oct 9SanjeevLLM MetacognitionMetacognitive Capabilities
Oct 21Tianyu GaoLong-context modelsHow to Train Long-Context LMs
Oct 23SanjeevAdvanced alignmentOpenAI o1 System Card; Weak-to-Strong
Oct 28Danqi/SanjeevLLM Reasoning 1Let's Verify Step by Step
Oct 30DanqiLLM Reasoning 2Scaling LLM Test-Time Compute
Nov 4Mengzhou XiaSmall modelsSheared LLaMA, Gemma 2
Nov 6DanqiRetrieval-augmented LMsRETRO
Nov 11Yu Su (OSU)Language AgentsHippoRAG
Nov 13DanqiRAGREALM, kNN-LM, FLARE, Self-RAG
Nov 18Tri DaoHardware-aware algorithmsFlashAttention, Mamba
Nov 20Saining Xie (NYU)Multimodal LLMsLLaVA, Cambrian-1, Molmo, MM1
Nov 25 / Dec 2 / Dec 4StudentsProject presentations

Grading:Participation 30% / Debate 15% / Lecture scribing 10% / Final project 35% / Project presentations 10%。

Debate Panels:Weeks 4-9 共 12 debate panels(每 5 个学生为一组,每 paper 有 presenter / critic / proponent)。


D 类 — 实操 / 工程社区课程#


D1. Full Stack Deep Learning(Sergey Karayev / Pieter Abbeel / Josh Tobin)#

FSDL 2022 (Spring) — 9 Lectures + 6 Labs:

Lectures:

  1. 1. Course Vision and When to Use ML
  2. 2. Development Infrastructure & Tooling
  3. 3. Troubleshooting & Testing
  4. 4. Data Management
  5. 5. Deployment
  6. 6. Continual Learning
  7. 7. Foundation Models
  8. 8. ML Teams and Project Management
  9. 9. Ethics

Labs:

外加 Project Showcase。

早期版本:FSDL 2021(Berkeley + online)/ 2020(UW)/ 2019 / 2018。


D2. Full Stack LLM Bootcamp(Spring 2023)#

8 个 Lectures:

  1. 1. Learn to Spell: Prompt Engineering
  2. 2. LLMOps
  3. 3. UX for Language User Interfaces
  4. 4. Augmented Language Models
  5. 5. Launch an LLM App in One Hour
  6. 6. LLM Foundations
  7. 7. Project Walkthrough: askFSDL
  8. 8. What's Next?

3 个 Invited Talks:

Capstone:项目 askFSDL(Q&A 系统,Python/Modal/Gantry)。


D3. MLOps Zoomcamp(DataTalks.Club)#

7 个 Module:

  1. 1. Introduction — MLOps 定义、maturity model、NY Taxi 数据集、环境搭建 + Homework
  2. 2. Experiment Tracking & Model Management — MLflow 基础、模型保存/加载/注册 + Homework
  3. 3. Orchestration & ML Pipelines + Homework
  4. 4. Model Deployment — online (web/streaming) vs offline (batch)、Flask、AWS Kinesis + Lambda、batch scoring + Homework
  5. 5. Model Monitoring — Prometheus + Evidently + Grafana、Prefect + MongoDB + Homework
  6. 6. Best Practices — 单元/集成测试、linting/pre-commit、GitHub Actions CI/CD、Terraform IaC + Homework
  7. 7. Final Project — 端到端整合所有概念

D4. LLM Zoomcamp(DataTalks.Club)#

7 个 Module + Workshop:

  1. 1. Introduction to LLMs & RAG — 基础 RAG pipeline + text search
  2. 2. Vector Search — semantic embeddings 索引/检索
  3. 3. Agents — 自治 tool use 和 function calling
  4. 4. (Workshop) Data Ingestion — dlt 从外部源接入 RAG
  5. 5. Evaluation — 离线 + 在线 retrieval / answer 质量
  6. 6. Monitoring — 用户反馈 + live dashboards
  7. 7. Best Practices — LangChain、hybrid search(vector+keyword)、reranking
  8. 8. End-to-End Project — 健身助手 with LLMs

Capstone:完整 RAG 应用(知识库 + retrieval + LLM 集成 + evaluation + UI + 监控反馈)。Certification 需提交项目并 peer review 3 个其他学生项目。


D5. AI Engineer World's Fair 工作坊#

每年 6 月在旧金山,workshop 由各家公司主讲(Anthropic、OpenAI、LangChain、LlamaIndex、Modal、Weaviate、Pinecone、Together AI 等)。典型 workshop 包括:

完整 schedule 见每年事件页(2024 / 2025)。


D6. Modular Mojo AI Engineer 培训#

不是正式课程,而是 docs + 工具链:

第三方 Udemy 课程:


D7. Replit Learn#

目前提供:

  1. 1. AI foundations(30 min, 7 lessons)— 用 AI 构建 app
  2. 2. Enterprise foundations(30 min, 6 lessons)
  3. 3. Intro to Replit(45 min, 5 lessons)— coming soon
  4. 4. Advanced vibes(30 min, 5 lessons)— coming soon

Replit 100 Days of Python(讲师:David Morgan):


D8. The AI Engineering Bootcamp(Greg Loughnane + Chris Alexiuk)#

涵盖 AI 原则、Python、NLP、LLMs、Transformers、LangChain orchestration。Build, ship, and share。


附录 — 总览整理#

类别课程数共同特点(不做评价,只记观察)
A 神级个人19单人 IP 主导,重 "从零搓" 哲学
B 公司机构18教学 = 推广 SDK / 平台
C 大学开源16论文密度高,每周 paper 阅读 + project
D 工程社区8每年迭代,end-to-end capstone

所有课程汇总:~61 门具体课程 + N 个 specialization / track。

重复出现的结构模式(仅观察,不做评价):


主要信息源