ICLR
Adam婷
笔者在人工智能/机器学习领域中默默探索,时而迷惘,时而欣喜。
展开
-
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Averaged-DQN:深度强化学习的方差减少和稳定性AbstractInstability and variability of Deep Reinforcement Learning (DRL) algorithms tend to adversely af-fect their performance. Averaged-DQN is a sim-ple extension to th...原创 2019-07-01 19:40:39 · 1986 阅读 · 0 评论 -
POLICY GENERALIZATION IN CAPACITY-LIMITED REINFORCEMENT LEARNING
能力有限的加强学习中的政策一般化ABSTRACTMotivated by the study of generalization in biological intelligence, we examine reinforcement learning (RL) in settings where there are information-theoretic constraints plac...原创 2019-06-30 12:53:32 · 525 阅读 · 0 评论 -
转移价值?还是 策略? 一个可转移的连续强化学习的中心框架
TRANSFER VALUE OR POLICY? A AVALUE-CENTRIC FRAMEWORK TOWARDS TRANSFERRABLE CONTINUOUS REINFORCEMENT LEARNINGABSTRACTTransferring learned knowledge from one environment to another is an important ...原创 2019-06-30 11:14:59 · 3928 阅读 · 0 评论 -
学习控制深度加固学习中结构探索的视觉抽象
LEARNING TO CONTROL VISUAL ABSTRACTIONS FOR STRUCTURED EXPLORATION IN DEEP REINFORCEMENT LEARNINGABSTRACTExploration in environments with sparse rewards is a key challenge for reinforcement learn...原创 2019-06-30 09:59:03 · 1363 阅读 · 0 评论 -
LEARNING GOAL-CONDITIONED VALUE FUNCTIONS WITH ONE-STEP PATH REWARDS RATHER THAN GOAL- REWARDS
ABSTRACTMulti-goal reinforcement learning (MGRL) addresses tasks where the desired goal state can change for every trial. State-of-the-art algorithms model these problems such that the reward formula...原创 2019-06-30 08:22:47 · 808 阅读 · 0 评论 -
TRAJECTORY VAE FOR MULTI-MODAL IMITATION(用于多模态模拟的轨迹VAE)
ABSTRACTWe address the problem of imitating multi-modal expert demonstrations in sequential decision making problems. In many practical applications, for example video games, behavioural demonstratio...原创 2019-06-30 00:08:04 · 1032 阅读 · 0 评论 -
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
利用信息约束基元的竞争集合强化学习Anirudh Goyal1, Shagun Sodhani1, Jonathan Binas1, Xue Bin Peng2Sergey Levine2, Yoshua Bengio1y1Mila, Université de Montréal2University of California, Berkeley yCIFAR Senior Fello...原创 2019-06-27 19:50:29 · 1652 阅读 · 0 评论 -
THE WISDOM OF THE CROWD: RELIABLE DEEP REINFORCEMENT LEARNING THROUGH ENSEMBLES OF Q--FUNCTIONS
ABSTRACTReinforcement learning agents learn by exploring the environment and then ex-ploiting what they have learned. This frees the human trainers from having to know the preferred action or intrins...原创 2019-06-27 11:20:52 · 1150 阅读 · 0 评论 -
TARMAC: TARGETED MULTI-AGENT COMMUNICATION(TARMAC:目标多代理通信)
ABSTRACTWe explore a collaborative multi-agent reinforcement learning setting where a team of agents attempts to solve cooperative tasks in partially-observable environ-ments. In this scenario, learn...原创 2019-06-27 10:16:27 · 1971 阅读 · 1 评论 -
UNIVERSAL SUCCESSOR FEATURES FOR TRANSFER REINFORCEMENT LEARNING(转移强化学习的通用后继特征)
ABSTRACTTransfer in Reinforcement Learning (RL) refers to the idea of applying knowledge gained from previous tasks to solving related tasks. Learning a universal value function (Schaul et al., 2015)...原创 2019-06-27 08:23:17 · 1683 阅读 · 0 评论 -
THE BODY IS NOT A GIVEN: JOINT AGENT POLICY LEARNING AND MORPHOLOGY EVOLUTION
ABSTRACTReinforcement learning (RL) has proven to be a powerful paradigm for deriving complex behaviors from simple reward signals in a wide range of environments. When applying RL to continuous cont...原创 2019-06-30 17:16:12 · 1193 阅读 · 0 评论