强化学习-经典论文

*Major*

已于 2023-10-09 08:39:40 修改

阅读量309

点赞数

文章标签：开发语言

于 2020-07-25 18:51:47 首次发布

本文链接：https://blog.csdn.net/qq_41375318/article/details/107583096

版权

在这里插入图片描述

Q Learning
Sarsa
DQN
Policy Gradients
Actor Critic
Playing Atari with Deep Reinforcement Learning
Deep Reinforcement Learning with Double Q-learning
Continuous control with deep reinforcement learning
Asynchronous Methods for Deep Reinforcement Learning
Proximal Policy Optimization Algorithms
Hindsight Experience Replay
Emergence of Locomotion Behaviours in Rich Environments
ImplicitQuantile Networks for Distributional Reinforcement Learning
Imagination-Augmented Agents for Deep Reinforcement Learning
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Model-based value estimation for efficient model-free reinforcement learning
Model-ensemble trust-region policy optimization
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning