Paper 1: NCC-MARL: Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning.
- NCC-MARL is a general RL framework to handle large-scale multi-agent cooperative problems.
- We observe that it is crucial for agents to maintain consistent cognitions about their environments in order to achieve effective system-level cooperation. In contrast, it is hard to imagine agents cooperating well without a consensus on the environments they are situated in.
- NCC-MARL decomposes all agents into much smaller neighborhoods. We further assume that each neighborhood has a true hidden cognitive variable; all neighboring agents then learn, via variational inference, to align their learned neighborhood-specific cognitive representations with this variable. As a result, the neighboring agents eventually form consistent neighborhood cognitions and thus cooperate effectively.
- NCC-MARL achieves much better perfor
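
The alignment step described above can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes each agent's cognitive representation is a diagonal Gaussian (mean and log-variance) and that the hidden cognitive variable is modeled by a shared Gaussian prior, as is common in variational inference. The function names `kl_diag_gaussians` and `neighborhood_alignment_loss` are hypothetical.

```python
import math


def kl_diag_gaussians(mu_p, logvar_p, mu_q, logvar_q):
    """KL(p || q) for two diagonal Gaussians given as lists of means and log-variances."""
    total = 0.0
    for mp, lp, mq, lq in zip(mu_p, logvar_p, mu_q, logvar_q):
        # Closed-form KL divergence for one dimension, summed over dimensions.
        total += 0.5 * (lq - lp + (math.exp(lp) + (mp - mq) ** 2) / math.exp(lq) - 1.0)
    return total


def neighborhood_alignment_loss(agent_posteriors, prior_mu, prior_logvar):
    """Average KL from each agent's representation to the shared neighborhood prior.

    Assumption: `agent_posteriors` is a list of (mu, logvar) pairs, one per
    agent in the same neighborhood. Driving this loss down pulls every agent's
    representation toward the same distribution, i.e. toward consistent cognition.
    """
    kls = [kl_diag_gaussians(mu, lv, prior_mu, prior_logvar) for mu, lv in agent_posteriors]
    return sum(kls) / len(kls)


# Two agents in one neighborhood, one-dimensional representation, unit-variance prior at 0.
agents = [([1.0], [0.0]), ([0.0], [0.0])]
loss = neighborhood_alignment_loss(agents, prior_mu=[0.0], prior_logvar=[0.0])
```

In a full training loop, a term like this would typically be added to each agent's RL objective, so that agents in the same neighborhood are penalized for diverging from the shared variable.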