[NOTE] Advice and Perspectives on RL Research Frontiers - Rich Sutton in DLRLSS 2019

tangwing

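Copyright notice: This is an original article by the author, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.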

Article link: https://blog.csdn.net/tangwing/article/details/107483590

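As is my habit, resources come first: slides, video. This is a lecture that Sutton gave at the DLRLSS 2019 summer school, in which he shares his own view of the RL field, his current research directions, and the research frontier. Some of his reflections are quite thought-provoking. I have excerpted a few key points here; for the details, read the slides or watch the video.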

Developing your own research thoughts

There are no authorities in science. Be ambitious but also humble. Your own thought is of great value.
One of the best ways to train yourself is to write for yourself and discuss with others.
When thinking about big questions, it's easy to get stuck. Some ways to get unstuck:
1. Define your own terms
2. Go multiple: think about alternatives
3. Go meta: what are the properties that the solution should have
4. Retreat to a clearer question
The most important insight you will ever contribute is probably too obvious to see (e.g., the discovery of gravity).

“Completing the square” for doing RL research

Research that Sutton is doing

It is worth understanding the connections and differences between prediction and control more deeply.
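To make the distinction concrete, here is a minimal sketch of my own (not taken from the lecture). It assumes a hypothetical 5-state deterministic chain in which the only reward is 1.0 for entering the last state. Prediction is illustrated with tabular TD(0), which estimates the value of a fixed policy; control is illustrated with tabular Q-learning, which learns action values and from them a greedy policy. The environment, the constants, and the function names are all assumptions chosen for illustration.

```python
import random

N_STATES = 5          # hypothetical chain: states 0..4, state 4 is terminal
GAMMA = 0.9           # discount factor (assumed)
ALPHA = 0.5           # step size (assumed)
LEFT, RIGHT = 0, 1


def step(state, action):
    """Deterministic chain: reward 1.0 only on entering the terminal state."""
    next_state = max(state - 1, 0) if action == LEFT else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def td0_prediction(policy, episodes=200):
    """Prediction: estimate V for a FIXED policy with tabular TD(0)."""
    V = [0.0] * N_STATES
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            s2, r, done = step(s, policy(s))
            target = r + (0.0 if done else GAMMA * V[s2])
            V[s] += ALPHA * (target - V[s])
            s = s2
    return V


def q_learning_control(episodes=500, seed=0):
    """Control: learn Q with tabular Q-learning, then act greedily on it.

    The behaviour policy here is uniformly random, which is allowed
    because Q-learning is off-policy.
    """
    rng = random.Random(seed)
    Q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            a = rng.choice((LEFT, RIGHT))
            s2, r, done = step(s, a)
            # The max over next actions is what turns prediction into control.
            target = r + (0.0 if done else GAMMA * max(Q[s2]))
            Q[s][a] += ALPHA * (target - Q[s][a])
            s = s2
    return Q


V = td0_prediction(lambda s: RIGHT)      # evaluate "always go right"
Q = q_learning_control()
greedy = [max((LEFT, RIGHT), key=lambda a: Q[s][a]) for s in range(N_STATES - 1)]
print("V under always-right:", [round(v, 3) for v in V])
print("greedy policy from Q:", greedy)
```

The two update rules differ only in the bootstrap target: TD(0) uses the value of the next state under the given policy, while Q-learning takes a max over next actions. In this toy chain, the policy being evaluated happens to be the optimal one, so the two estimates should agree at the start state.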

In short, Sutton is working on subproblems. The world is often too complex to learn as a whole, so it is natural to have multiple components, like the different parts of a body. I think this is a bit like the multi-agent setting, where the goals of individual agents may not relate directly to the global reward.

The part about permanent memory leaves a lot of room for imagination.
