好长时间没跟新了,这期间有好多事情(华为、微博、算法课),现在把最后几节课拾起来。
上节课内容和本节课内容
1)Model-Free和Model-Based的区别:
Model-Free RL:
No model
Learn value function (and/or policy) from experience
Model-Based RL:
Learn a model from experience
Plan value function (and/or policy) from model
Model-Based Reinforcement Learning
1)整体流程如下,包括两大部分:model learning、知道model后如何做planning!
Advantages:
Can efficiently learn model by supervised learning met