入门强化学习

1、基础理论知识

书籍:《Reinforcement Learning:An Introduction》、《深入浅出强化学习》

视频课程:https://edu.csdn.net/course/detail/4916

2、小实验

http://gym.openai.com/envs/#algorithmic

https://github.com/xiaoqian19940510?tab=repositories(我的github,暂时还没上传我做的一些小实验,这几天会上传)

3、经典论文和最新论文

经典论文:围棋三算法(alphago,alphazero,alphago zero),后面可以想象在我的GitHub上详细解析每篇论文及代码(https://github.com/xiaoqian19940510?tab=repositories

      CVPR 2017 papers

1Deep Reinforcement Learning-Based Image Captioning With Embedding Reward

Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li

2Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning

Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi

3Attention-Aware Face Hallucination via Deep Reinforcement Learning

Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li

4PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother

5A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

6Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection

Xiaodan Liang, Lisa Lee, Eric P. Xing

7A Reinforcement Learning Approach to the View Planning Problem

Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim

8Collaborative Deep Reinforcement Learning for Joint Object Search

Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua

      ICCV 2017 papers

1Tracking as Online Decision-Making: Learning a Policy From Streaming Videos With Reinforcement Learning

James Supančič, III, Deva Ramanan

2Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning

Abhishek Das, Satwik Kottur, José, M. F. Moura, Stefan Lee, Dhruv Batra

3、First-Person Activity Forecasting With Online Inverse Reinforcement Learning

Nicholas RhinehartKris M. Kitani

4、Attention-Aware Deep Reinforcement Learning for Video Face Recognition

Yongming Rao, Jiwen Lu, Jie Zhou

5、3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds

Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu

      Nature

1Vector-based navigation using grid-like representations in artificial agents

2、Reinforcement determines the timing dependence of corticostriatal synaptic plasticity in vivo

3Drive and Reinforcement Circuitry in the Brain: Origins, Neurotransmitters, and Projection Fields

4Mastering the game of Go without human knowledge

5A hippocampo-cerebellar centred network for the learning and execution of sequence-based navigation

6Reinforcement learning improves behaviour from evaluative feedback

7Human-level control through deep reinforcement learning

8Adaptation to criticality through organizational invariance in embodied agents

9Human-level control through deep reinforcement learning(凭借深度强化学习达到人类水平的操控,深度Q网络,将近60%游戏超过人类选手)

10Deep learning (深度学习)

11Mastering the game of Go with deep neural networks and tree search(利用深度神经网络和树搜索征服围棋游戏)

     Science

1Soft humanoid motor learning

2Scientists imbue robots with curiosity

3Artificial intelligence bests humans at classic arcade games

4Solving the quantum many-body problem with artificial neural networks

5、A Global Geometric Framework for Nonlinear Dimensionality Reduction(一种用于非线性降维的全局几何框架)

6Nonlinear Dimensionality Reduction by Locally Linear Embedding(通过局部线性嵌入进行非线性降维)

7Reducing the Dimensionality of Data with Neural Networks(利用神经元网络降低数据的维度)

8Machine learning. Clustering by fast search and find of density peaks.(通过快速查找和发现密度峰值进行聚类)

9Human-level concept learning through probabilistic program induction(凭借概率规划归纳法进行人类层级的概念学习)

 

 

 

 

 

 

 

 

 

 

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值