Transfer Value or Policy? A Value-Centric Framework Towards Transferrable Continuous Reinforcement Learning

Latest recommended article published on 2024-04-25 09:38:32

Adam婷

Categories: Deep Learning, Machine Learning, Algorithms, Deep Reinforcement Learning, Reinforcement Learning, Paper Reading, ICLR

Article link: https://blog.csdn.net/weixin_41697507/article/details/94294350

TRANSFER VALUE OR POLICY? A VALUE-CENTRIC FRAMEWORK TOWARDS TRANSFERRABLE CONTINUOUS REINFORCEMENT LEARNING

ABSTRACT

Transferring learned knowledge from one environment to another is an important step towards practical reinforcement learning (RL). In this paper, we investigate the problem of transfer learning across environments with different dynamics while accomplishing the same task in the continuous control domain. We start by illustrating the limitations of policy-centric methods (policy gradient, actor-critic, etc.) when transferring knowledge across environments. We then propose a general model-based value-centric (MVC) framework for continuous RL. MVC learns a dynamics approximator and a value approximator simultaneously in the source domain, and makes decisions based on both of them. We evaluate MVC against po
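To make the idea of deciding "based on both" a dynamics approximator and a value approximator more concrete, here is a minimal sketch of one-step lookahead action selection. It is not the paper's implementation: the toy one-dimensional state, the hand-written `dynamics_model`, `value_model`, and `reward` functions, the candidate action grid, and the discount `gamma` are all assumptions made purely for illustration. In practice both models would be learned from data.

```python
def dynamics_model(state, action):
    """Hypothetical stand-in for a learned dynamics approximator f(s, a) -> s'."""
    return state + action


def value_model(state):
    """Hypothetical stand-in for a learned value approximator V(s); best at s = 0."""
    return -state * state


def reward(state, action):
    """Assumed reward: a small penalty on action magnitude."""
    return -0.1 * action * action


def select_action(state, candidates, gamma=0.99):
    """Pick the candidate action maximizing r(s, a) + gamma * V(f(s, a))."""
    def score(action):
        next_state = dynamics_model(state, action)
        return reward(state, action) + gamma * value_model(next_state)
    return max(candidates, key=score)


# Candidate actions on a coarse grid in [-1, 1] (an assumption of this sketch).
candidates = [i / 10 for i in range(-10, 11)]
state = 0.5
action = select_action(state, candidates)
next_state = dynamics_model(state, action)
print(action, next_state)
```

The point of the sketch is that no explicit policy is stored: the action comes from querying the dynamics model and the value model together. Under this view, a change in environment dynamics would only require updating `dynamics_model`, while the value estimate could in principle be reused as a starting point.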
