Neural Network Dynamics for Model-Based Deep Reinforcement Learniing with Model-Free Fine-Tuning

Goal

  1. 怎样在model-based reinforcement learning中使用neural-network创建system dynamics
  2. 怎样使用model-based reinforcement learning来加速model-free reinforcement learning

Related Work

The most efficient model-based algorithms have used relatively simple function approximators, such as Gaussian processes, time-varying linear models, and mixtures of Gaussians.

Contribution

  1. demonstrate effective model-based reinforcement learning with neural network models for several contact-rich simulated locomotion tasks from standard deep reinforcement learning benchmarks
  2. evaluate a number of design decisions for neural network dynamics model learning
  3. show how a model-based learner can be used to initialized a model-free learner to achieve high rewards while drastically reducing sample complexity

The learned model-based controller provides good rollouts, which enable supervised initialization of a policy that can then be fine-tuned with model-free algorithms, such as policy gradients.

Code(这篇文章的github repository的结构还可以)

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值