1、基础理论知识
书籍:《Reinforcement Learning:An Introduction》、《深入浅出强化学习》
视频课程:https://edu.csdn.net/course/detail/4916
2、小实验
http://gym.openai.com/envs/#algorithmic
https://github.com/xiaoqian19940510?tab=repositories(我的github,暂时还没上传我做的一些小实验,这几天会上传)
3、经典论文和最新论文
经典论文:围棋三算法(alphago,alphazero,alphago zero),后面可以想象在我的GitHub上详细解析每篇论文及代码(https://github.com/xiaoqian19940510?tab=repositories)
CVPR 2017 papers
1、Deep Reinforcement Learning-Based Image Captioning With Embedding Reward
Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li
2、Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning
Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi
3、Attention-Aware Face Hallucination via Deep Reinforcement Learning
Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li
4、PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning
Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother
5、A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg
6、Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection
Xiaodan Liang, Lisa Lee, Eric P. Xing
7、A Reinforcement Learning Approach to the View Planning Problem
Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim
8、Collaborative Deep Reinforcement Learning for Joint Object Search
Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua
ICCV 2017 papers
1、Tracking as Online Decision-Making: Learning a Policy From Streaming Videos With Reinforcement Learning
James Supančič, III, Deva Ramanan
2、Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning
Abhishek Das, Satwik Kottur, José, M. F. Moura, Stefan Lee, Dhruv Batra
3、First-Person Activity Forecasting With Online Inverse Reinforcement Learning
Nicholas Rhinehart, Kris M. Kitani
4、Attention-Aware Deep Reinforcement Learning for Video Face Recognition
Yongming Rao, Jiwen Lu, Jie Zhou
5、3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds
Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu
Nature
1、Vector-based navigation using grid-like representations in artificial agents
2、Reinforcement determines the timing dependence of corticostriatal synaptic plasticity in vivo
3、Drive and Reinforcement Circuitry in the Brain: Origins, Neurotransmitters, and Projection Fields
4、Mastering the game of Go without human knowledge
5、A hippocampo-cerebellar centred network for the learning and execution of sequence-based navigation
6、Reinforcement learning improves behaviour from evaluative feedback
7、Human-level control through deep reinforcement learning
8、Adaptation to criticality through organizational invariance in embodied agents
9、Human-level control through deep reinforcement learning(凭借深度强化学习达到人类水平的操控,深度Q网络,将近60%游戏超过人类选手)
10、Deep learning (深度学习)
11、Mastering the game of Go with deep neural networks and tree search(利用深度神经网络和树搜索征服围棋游戏)
Science
1、Soft humanoid motor learning
2、Scientists imbue robots with curiosity
3、Artificial intelligence bests humans at classic arcade games
4、Solving the quantum many-body problem with artificial neural networks
5、A Global Geometric Framework for Nonlinear Dimensionality Reduction(一种用于非线性降维的全局几何框架)
6、Nonlinear Dimensionality Reduction by Locally Linear Embedding(通过局部线性嵌入进行非线性降维)
7、Reducing the Dimensionality of Data with Neural Networks(利用神经元网络降低数据的维度)
8、Machine learning. Clustering by fast search and find of density peaks.(通过快速查找和发现密度峰值进行聚类)
9、Human-level concept learning through probabilistic program induction(凭借概率规划归纳法进行人类层级的概念学习)