What is the difference between model-based and model-free reinforcement learning?
To answer this question, let's revisit the components of a Markov decision process (MDP), the most typical decision-making framework for RL.
An MDP is typically defined by a 4-tuple (S, A, R, T) where
S is the state/o...
Original · 2018-12-21 19:50:38