Reinforcement Learning: Classic Papers
- Choosing actions through values: Q-Learning, Sarsa, DQN
- Choosing actions directly: Policy Gradients
- Imagining the environment and learning from it: Model-based RL
- Q-Learning
- Sarsa
- DQN
- Policy Gradients
- Actor Critic
- Playing Atari with Deep Reinforcement Learning
- Deep Reinforcement Learning with Double Q-learning
- Continuous control with deep reinforcement learning
- Asynchronous Methods for Deep Reinforcement Learning
- Proximal Policy Optimization Algorithms
- Hindsight Experience Replay
- Emergence of Locomotion Behaviours in Rich Environments
- Implicit Quantile Networks for Distributional Reinforcement Learning
- Imagination-Augmented Agents for Deep Reinforcement Learning
- Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
- Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
- Model-Ensemble Trust-Region Policy Optimization
- Dynamic Horizon Value Estimation for Model-based Reinforcement Learning