RL | DQN
CatalogueDQN FrameworkApplication1.1 Cartpole Introduction1.2 CodeReference
DQN Framework
The agent interacts with the environment to generate next state, reward and termination information, which will be stored in a replay buffer.
Agent与环境交互,产生下一个状态、奖
原创
2020-09-04 22:27:10 ·
289 阅读 ·
0 评论