【论文理解】Learning Multiagent Communication with Backpropagation

Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar, Arthur Szlam, Rob Fergus 


Many tasks in AI require the collaboration of multiple agents. Typically, the communication protocol between agents is manually specified and not altered during training. In this paper we explore a simple neural model, called CommNet, that uses continuous communication for fully cooperative tasks. The model consists of multiple agents and the communication between them is learned alongside their policy. We apply this model to a diverse set of tasks, demonstrating the ability of the agents to learn to communicate amongst themselves, yielding improved performance over non-communicative agents and baselines. In some cases, it is possible to interpret the language devised by the agents, revealing simple but effective strategies for solving the task at hand.

 

人机对抗中多agent通信的模型,可以做baseline。核心就是上图,右边是整个框架,s1...代表是状态,a1...代表是输出动作,灰色的表示agent,每一层之间有通信;中间是层与层之间的通信,每个agent获得的reward和其它agent不相关,fi指的是通道channel;左边是单个通道对应单个agent的具体输入输出示意图,C指的是concatenation,连接即指的是通信,H指的是hidden state,即状态,输出的是下一层的状态。

转载于:https://www.cnblogs.com/WegZumHimmel/p/7453084.html

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值