Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
- Anirudh Goyal1, Shagun Sodhani1, Jonathan Binas1, Xue Bin Peng2
- Sergey Levine2, Yoshua Bengio1†
- 1Mila, Université de Montréal
- 2University of California, Berkeley
- †CIFAR Senior Fellow
Abstract
Reinforcement learning agents that operate in diverse and complex environments can benefit from a structured decomposition of their behavior. Often, this is addressed in the context of hierarchical reinforcement learning, where the aim is to decompose a policy into lower-level primitives or options, and a higher-level meta-policy that triggers the appropriate behaviors for a given situation. However, the meta-policy must still produce appropriate decisions in all states. In this work, we propose a policy design that decomposes into primitives, similarly to hierarchical reinforcement learning, but without a high-level meta-policy. Instead, each primitive can decide for itself