Paper Reading
Adam婷
The author quietly explores the field of artificial intelligence and machine learning, sometimes lost, sometimes delighted.
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Abstract: We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment, while polic...
Original · 2020-09-28 16:18:27 · 2674 views · 0 comments
Large Scale Evolving Graphs with Burst Detection
Abstract: Analyzing large-scale evolving graphs is crucial for understanding the dynamic and evolutionary nature of social networks. Most existing works focus on discovering repeated and consistent temporal...
Original · 2020-07-23 22:39:30 · 746 views · 0 comments
Controllable Multi-Interest Framework for Recommendation
Abstract: Recently, neural networks have been widely used in e-commerce recommender systems, owing to the rapid development of deep learning. We formalize the recommender system as a sequential recom...
Original · 2020-07-23 21:54:26 · 3384 views · 0 comments
ArnetMiner: Extraction and Mining of Academic Social Networks
Abstract: This paper addresses several key issues in the ArnetMiner system, which aims at extracting and mining academic social networks. Specifically, the system focuses on: 1) Extracting resear...
Original · 2020-07-23 20:23:52 · 2957 views · 0 comments
Distributed Stochastic Gradient Method for Non-Convex Problems with Applications in Supervised Learning
Abstract: We develop a distributed stochastic gradient descent algorithm for solving non-convex optimization problems under the assumption that the local...
Original · 2020-07-22 16:35:00 · 715 views · 0 comments
Distributed Stochastic Gradient Descent with Event-Triggered Communication
Abstract: We develop a Distributed Event-Triggered Stochastic GRAdient Descent (DETSGRAD) algorithm for solving non-convex optimization problems typically encountered in distributed...
Original · 2020-07-29 22:06:42 · 1284 views · 1 comment
Deep Learning Meets SAR
Abstract: Deep learning in remote sensing has become an international hype, but it is mostly limited to the evaluation of optical data. Although deep learning has been introduced in SAR data processing, despite successful first attemp...
Original · 2020-07-16 22:08:17 · 7301 views · 2 comments
Semantic Flow for Fast and Accurate Scene Parsing
Contents: Abstract · 1. Introduction · 2. Related Work · 3. Method · 3.1. Preliminary · 3.2. Flow Alignment Module · 3.3. Network Architectures · 4. Experiment · 4.1. Experiments on Cit...
Original · 2020-03-29 17:35:03 · 5801 views · 0 comments
Circle Loss: A Unified Perspective of Pair Similarity Optimization
Contents: Abstract · 1. Introduction · 2. A Unified Perspective · 3. A New Loss Function · 3.1. Self-paced Weighting · 3.2. Within-class and Between-class Marg...
Original · 2020-03-29 16:44:20 · 2129 views · 0 comments
ReZero is All You Need: Fast Convergence at Large Depth
Abstract: Deep networks have enabled significant performance gains across domains, but they often suffer from vanishing/exploding gradients. Thi...
Original · 2020-03-29 10:17:40 · 1539 views · 1 comment
Deep Snake for Real-Time Instance Segmentation
Contents: Abstract · 1. Introduction · Figure 1. The basic idea of deep snake. Given an initial contour, image features are extracted at each vertex (a). Since the contou...
Original · 2020-03-29 09:04:34 · 2107 views · 0 comments
Graph Convolutional Neural Networks for Web-Scale Recommender Systems
Abstract: Recent advancements in deep neural networks for graph-structured data have led to state-of-the-art pe...
Original · 2019-07-07 09:52:26 · 4095 views · 0 comments
XGBoost: Scalable GPU Accelerated Learning
Abstract: We describe the multi-GPU gradient boosting algorithm implemented in the XGBoost library. Our algorithm allows fast, scalable training on multi-GPU...
Original · 2019-07-16 10:11:00 · 728 views · 0 comments
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou, Ganqu Cui, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, Maosong Sun. Abstract: Lots of learning tasks req...
Original · 2019-07-06 21:07:17 · 5260 views · 0 comments
A Comprehensive Survey on Graph Neural Networks
Zonghan Wu; Shirui Pan, Member, IEEE; Fengwen Chen; Guodong Long; Chengqi Zhang, Senior Member, IEEE; Philip S. Yu, Fellow, IEEE. Abstract: Deep learning ...
Original · 2019-07-06 18:54:16 · 7301 views · 0 comments
XGBoost: A Scalable Tree Boosting System
Abstract: Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system call...
Original · 2019-07-15 20:42:38 · 2355 views · 0 comments
Hybrid Reward Architecture for Reinforcement Learning
31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA. Abstract: One of the main challenges in reinforcemen...
Original · 2019-07-01 14:21:27 · 1281 views · 0 comments
Relational inductive biases, deep learning, and graph networks
Peter W. Battaglia¹, Jessica B. Hamrick¹, Victor Bapst¹, Alvaro Sanchez-Gonzalez¹, Vinicius Zambaldi¹, Mateusz Malinowski¹, Andrea Tacche...
Original · 2019-07-15 12:29:32 · 4912 views · 0 comments
THE BODY IS NOT A GIVEN: JOINT AGENT POLICY LEARNING AND MORPHOLOGY EVOLUTION
Abstract: Reinforcement learning (RL) has proven to be a powerful paradigm for deriving complex behaviors from simple reward signals in a wide range of environments. When applying RL to continuous cont...
Original · 2019-06-30 17:16:12 · 1145 views · 0 comments
POLICY GENERALIZATION IN CAPACITY-LIMITED REINFORCEMENT LEARNING
Abstract: Motivated by the study of generalization in biological intelligence, we examine reinforcement learning (RL) in settings where there are information-theoretic constraints plac...
Original · 2019-06-30 12:53:32 · 490 views · 0 comments
TRANSFER VALUE OR POLICY? A VALUE-CENTRIC FRAMEWORK TOWARDS TRANSFERRABLE CONTINUOUS REINFORCEMENT LEARNING
Abstract: Transferring learned knowledge from one environment to another is an important ...
Original · 2019-06-30 11:14:59 · 3872 views · 0 comments
AdamTechLouis's talk: Decoding the Best Papers from ICLR 2019 – Neural Networks are Here to Rule
Introduction: I love reading and decoding machine learning research papers. There is so much incredible information to parse through – a goldmine for us data scientists! I was thrilled when the best pa...
Original · 2019-06-03 14:47:23 · 389 views · 0 comments
AdamTechLouis's talk: Deep Learning approaches to understand Human Reasoning
For a doctor who is using Deep Learning to find whether the patient has multiple sclerosis, it is not at all good to get a yes or no answer from the model. For a safety critical application such as a...
Original · 2019-06-03 15:03:04 · 427 views · 0 comments
Game Theory and Multi-agent Reinforcement Learning
Ann Nowé, Peter Vrancx, and Yann-Michaël De Hauwere. Abstract: Reinforcement Learning was originally developed for Markov Decision Processes (MDPs). It allows a single agent to learn a policy that ma...
Original · 2019-06-22 11:22:50 · 12043 views · 2 comments
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Abstract: Instability and variability of Deep Reinforcement Learning (DRL) algorithms tend to adversely affect their performance. Averaged-DQN is a simple extension to th...
Original · 2019-07-01 19:40:39 · 1963 views · 0 comments
REINFORCEMENT LEARNING USING QUANTUM BOLTZMANN MACHINES
Abstract: We investigate whether quantum annealers with select chip layouts can outperform classical computers in reinforcement learning tasks. ...
Original · 2019-07-07 20:43:04 · 1792 views · 0 comments
Finding Options that Minimize Planning Time
Yuu Jinnai¹, David Abel¹, D Ellis Hershkowitz², Michael L. Littman¹, George Konidaris¹. Abstract: We formalize the problem of selecting the optimal set of options for planning as that of computing the ...
Original · 2019-06-26 21:01:12 · 148 views · 0 comments
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Abstract: We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocol...
Original · 2019-06-23 23:59:28 · 2217 views · 0 comments
UNIVERSAL SUCCESSOR FEATURES FOR TRANSFER REINFORCEMENT LEARNING
Abstract: Transfer in Reinforcement Learning (RL) refers to the idea of applying knowledge gained from previous tasks to solving related tasks. Learning a universal value function (Schaul et al., 2015)...
Original · 2019-06-27 08:23:17 · 1642 views · 0 comments
LEARNING TO SCHEDULE COMMUNICATION IN MULTI-AGENT REINFORCEMENT LEARNING
Abstract: Many real-world reinforcement learning tasks require multiple agents to make sequential decisions under the agents’ interaction, where well-coordinated actions among the agents are crucial ...
Original · 2019-06-24 10:56:55 · 4002 views · 0 comments
TARMAC: TARGETED MULTI-AGENT COMMUNICATION
Abstract: We explore a collaborative multi-agent reinforcement learning setting where a team of agents attempts to solve cooperative tasks in partially-observable environments. In this scenario, learn...
Original · 2019-06-27 10:16:27 · 1914 views · 1 comment
THE WISDOM OF THE CROWD: RELIABLE DEEP REINFORCEMENT LEARNING THROUGH ENSEMBLES OF Q-FUNCTIONS
Abstract: Reinforcement learning agents learn by exploring the environment and then exploiting what they have learned. This frees the human trainers from having to know the preferred action or intrins...
Original · 2019-06-27 11:20:52 · 1101 views · 0 comments
When Does Label Smoothing Help?
Rafael Müller, Simon Kornblith, Geoffrey Hinton. Google Brain, Toronto. rafaelmuller@google.com. Abstract: The generalization and learning speed of a multi-class neural netw...
Original · 2019-07-08 20:45:31 · 3211 views · 0 comments
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Anirudh Goyal¹, Shagun Sodhani¹, Jonathan Binas¹, Xue Bin Peng², Sergey Levine², Yoshua Bengio¹†. ¹Mila, Université de Montréal; ²University of California, Berkeley; †CIFAR Senior Fello...
Original · 2019-06-27 19:50:29 · 1598 views · 0 comments
TRAJECTORY VAE FOR MULTI-MODAL IMITATION
Abstract: We address the problem of imitating multi-modal expert demonstrations in sequential decision making problems. In many practical applications, for example video games, behavioural demonstratio...
Original · 2019-06-30 00:08:04 · 997 views · 0 comments
LEARNING GOAL-CONDITIONED VALUE FUNCTIONS WITH ONE-STEP PATH REWARDS RATHER THAN GOAL-REWARDS
Abstract: Multi-goal reinforcement learning (MGRL) addresses tasks where the desired goal state can change for every trial. State-of-the-art algorithms model these problems such that the reward formula...
Original · 2019-06-30 08:22:47 · 766 views · 0 comments
LEARNING TO CONTROL VISUAL ABSTRACTIONS FOR STRUCTURED EXPLORATION IN DEEP REINFORCEMENT LEARNING
Abstract: Exploration in environments with sparse rewards is a key challenge for reinforcement learn...
Original · 2019-06-30 09:59:03 · 1327 views · 0 comments
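Evolutionary Neural AutoML for Deep Learning
Abstract: Deep neural networks (DNNs) have produced state-of-the-art results in many benchmarks and problem domains. However, the success of a DNN depends on the proper configuration of its architecture and hyperparameters. Such configuration is difficult, and as a result DNNs are often not used to their full potential. In addition, DNNs in commercial applications often need to satisfy real-world design constraints such as size or number of parameters. To make configuration easier, automatic machine learning (AutoML) systems for deep learning have been developed, focusing mostly on the optimization of hyperparameters. This paper takes Au...
Original · 2019-05-03 22:58:33 · 1472 views · 0 comments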