强化学习的数学原理 - 西湖大学赵世钰老师

方浩坤Harriet

于 2024-09-26 19:14:08 发布

阅读量557

点赞数 9

本文链接：https://blog.csdn.net/gitblog_06610/article/details/142570750

版权

本仓库提供了一个资源文件的下载，该资源文件为西湖大学赵世钰老师的《强化学习的数学原理》。这本书从零开始，通过数学角度，结合大量例子，循序渐进地揭示了强化学习的本质原理。该书为纯英文书籍，适合对强化学习有深入学习需求的研究者和学生。

该书包含以下章节：

Chapter 1 Basic Concepts - 基本概念
Chapter 2 State Values and Bellman Equation - 状态值与贝尔曼方程
Chapter 3 Optimal State Values and Bellman Optimality Equation - 最优状态值与贝尔曼最优方程
Chapter 4 Value Iteration and Policy Iteration - 值迭代与策略迭代
Chapter 5 Monte Carlo Methods - 蒙特卡洛方法
Chapter 6 Stochastic Approximation - 随机逼近
Chapter 7 Temporal-Difference Methods - 时序差分方法
Chapter 8 Value Function Approximation - 值函数逼近
Chapter 9 Policy Gradient Methods - 策略梯度方法
Chapter 10 Actor-Critic Methods - 演员-评论家方法
Appendix C Convergence of Sequences - 序列的收敛性