NFDI4DS | UHH-SEMS - Publication Details

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

FOS: Computer and information sciences 0202 electrical engineering, electronic engineering, information engineering Computer Science - Multiagent Systems 02 engineering and technology Multiagent Systems (cs.MA)

DOI: 10.48550/arxiv.2002.03939 Publication Date: 2020-01-01

Abstract Supplemental Material References Cited by

AUTHORS (7)

Yang, Yaodong

Hao, Jianye

Liao, Ben

Shao, Kun

Chen, Guangyong

Liu, Wulong

Tang, Hongyao

ABSTRACT

In many real-world tasks, multiple agents must learn to coordinate with each other given their private observations and limited communication ability. Deep multiagent reinforcement learning (Deep-MARL) algorithms have shown superior performance in such challenging settings. One representative class of work is multiagent value decomposition, which decomposes the global shared multiagent Q-value $Q_{tot}$ into individual Q-values $Q^{i}$ to guide individuals' behaviors, i.e. VDN imposing an additive formation and QMIX adopting a monotonic assumption using an implicit mixing method. However, most of the previous efforts impose certain assumptions between $Q_{tot}$ and $Q^{i}$ and lack theoretical groundings. Besides, they do not explicitly consider the agent-level impact of individuals to the whole system when transforming individual $Q^{i}$s into $Q_{tot}$. In this paper, we theoretically derive a general formula of $Q_{tot}$ in terms of $Q^{i}$, based on which we can naturally implement a multi-head attention formation to approximate $Q_{tot}$, resulting in not only a refined representation of $Q_{tot}$ with an agent-level attention mechanism, but also a tractable maximization algorithm of decentralized policies. Extensive experiments demonstrate that our method outperforms state-of-the-art MARL methods on the widely adopted StarCraft benchmark across different scenarios, and attention analysis is further conducted with valuable insights.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products

PlumX Metrics

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....