Reinforcement Learning
JSerenity
Key Concepts in RL
Part 1: Key Concepts in RL (review notes; needs updating). In a nutshell, RL is the study of agents and how they learn by trial and error. It formalizes the idea that rewarding or punishing an agent for i...
Original · 2019-05-19 00:11:41 · 162 views · 0 comments
Spinning Up USER DOCUMENTATION
Spinning Up USER DOCUMENTATION (for review). Environments: Spinning Up requires Python 3, OpenAI Gym, and OpenMPI; MuJoCo is optional but preferred. Algorithms: VPG, TRPO, PPO, DDPG, TD3, SAC. The On-Policy ...
Original · 2019-05-17 23:37:57 · 239 views · 0 comments
Learn to play Pong with PG from scratch and pixels
Learn to play Pong with PG from scratch and pixels: http://karpathy.github.io/2016/05/31/rl/ Policy Gradients (PG) is the default choice for attacking RL problems. DQN changed Q-Learning. PG is preferred be...
Original · 2019-06-08 15:15:42 · 301 views · 0 comments