很认真的中了两篇AAAI2020的文章:NCC-MARL: Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning.

本文介绍了两篇AAAI2020的文章,第一篇提出NCC-MARL框架,通过邻域认知一致性解决大规模多智能体合作问题,利用VAE和GNN实现;第二篇提出Gated-ACML,采用门控机制动态修剪通信消息,以适应有限带宽下的多智能体协作,侧重于Q-value的优化策略。
摘要由CSDN通过智能技术生成

第一篇:NCC-MARL: Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning. 

  1. NCC-MARL is a general RL framework to handle large-scale multi-agent cooperative problems.
  2. We notice that agents maintain consistent cognitions about their environments are crucial for achieving effective system-level cooperation. In contrast, it is hard to imagine that the agents without consensuses on their situated environments can cooperate well.
  3. NCC-MARL decomposes all agents into much smaller neighborhoods. Furthermore, we assume that each neighborhood has a true hidden cognitive variable, then all neighboring agents learn to align their learned neighborhood-specific cognitive representations with this true hidden cognitive variable by variational inference. As a result, all neighboring agents will eventually form consistent neighborhood cognitions, and thus achieve effective cooperations.
  4. NCC-MARL achieves much better perfor
评论 5
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值