计算图、反向传播与梯度下降：深度学习核心数学基础

计算图、反向传播与梯度下降：深度学习核心数学基础#

深度学习的核心问题是什么？让机器从数据中学习规律。

但如何做到呢？本章将揭示答案背后的数学机制：

我们将用 MNIST 手写数字识别作为贯穿例子，从概念到代码，一步步拆解这些技术如何让神经网络"学会"知识。

学习目标

完成本章后，你将能够：

本章是整个系列的理论基础层：

核心认知：我们不涉及复杂的网络架构，而是聚焦于让神经网络"工作"起来的核心数学机制——每个概念都有对应的代码实现。

本章聚焦于让神经网络"工作"起来的五大核心机制：

学习路径：建立直觉 → 理解原理 → 动手实践 → 为后续章节铺垫

学习本章前，请确保你已经掌握

本章是深度学习系列的理论基础，不需要深度学习前置知识，但需要：

环境准备

如果你还没有配置 Python 环境，可以参考环境配置番外篇中的安装指南。

本章为整个系列奠定基础：

下一章神经网络基础：从理论到架构将基于这些理论，用 PyTorch 搭建和训练实际的神经网络。

贡献者与修订历史

查看详细修订记录

bba351e 2026-04-29 - Heyan Zhu: docs: update chapter summaries and learning paths for consistency
0cdb1e4 2026-04-29 - Heyan Zhu: feat: add model-serving chapter and update related content
59126f4 2026-04-26 - Heyan Zhu: docs(math-fundamentals): update content structure and add citations
ae2053f 2026-04-26 - Heyan Zhu: docs(math-fundamentals): add task-formulations and update related content
756a793 2026-04-25 - Heyan Zhu: docs(math-fundamentals): update content structure and improve explanations
0c291d7 2025-12-10 - Heyan Zhu: docs: restructure course materials and add new content