【HRL】option-critic & FeUdal

好大米

Last modified 2022-03-21 20:31:36

Tags: deep learning

First published 2022-03-21 20:21:07

Article link: https://blog.csdn.net/weixin_39632924/article/details/123642833

Overview

This is the first day I am writing down my ideas as I read papers. Why do this? It marks the start of a change in my career goals. On this page I will think through two papers from the field I have recently been focusing on. Let's begin.

The Option-Critic Architecture

This paper lays the foundation for deriving the hierarchical structure, that is, the high-level options and their termination functions.
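To make "option" and "termination function" concrete, here is a minimal sketch of how a single option might be executed. It is not the paper's method: in Option-Critic both the intra-option policy and the termination function are learned, whereas here they are hand-written lambdas. The toy chain environment, the `step` helper, and the names `Option`, `run_option`, `go_right`, and `go_left` are all hypothetical, and the initiation set from the general options framework is left out for brevity.

```python
import random
from dataclasses import dataclass
from typing import Callable, List

State = int
Action = int


@dataclass
class Option:
    """One option: an intra-option policy plus a termination function."""
    policy: Callable[[State], Action]       # action to take in state s
    termination: Callable[[State], float]   # probability of stopping in state s


def step(state: State, action: Action, n_states: int = 6) -> State:
    """Hypothetical toy dynamics: a chain of states 0..n_states-1, action is -1 or +1."""
    return max(0, min(n_states - 1, state + action))


def run_option(option: Option, state: State, rng: random.Random,
               max_steps: int = 100) -> List[State]:
    """Follow the option's policy until its termination function fires."""
    trajectory = [state]
    for _ in range(max_steps):
        state = step(state, option.policy(state))
        trajectory.append(state)
        # Sample the stop decision from the termination probability.
        if rng.random() < option.termination(state):
            break
    return trajectory


# Hand-written options (assumption: a learned version would replace these lambdas).
go_right = Option(policy=lambda s: +1, termination=lambda s: 1.0 if s == 5 else 0.0)
go_left = Option(policy=lambda s: -1, termination=lambda s: 1.0 if s == 0 else 0.0)

path = run_option(go_right, 2, random.Random(0))
print(path)
```

A higher-level policy would pick which option to run next each time one terminates. This is what makes the actions "temporally extended": one high-level decision covers several primitive steps.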

Abstract

Temporal abstraction is key to scaling up learning and planning in reinforcement learning.
Note: the abstract opens by introducing the RL setting directly.
While planning with temporally extended actions is well understood, creating such abstractions autonomously from data has remained challenging.
Note: this sentence states the problem.
We tackle this problem in the framework of opti
