Performant, Memory Efficient and Scalable Multi-Agent Reinforcement Learning

Mahjoub, Omayma; Abramowitz, Sasha; de Kock, Ruan; Khlifi, Wiem; Toit, Simon du; Daniel, Jemma; Nessir, Louay Ben; Beyers, Louise; Formanek, Claude; Clark, Liam; Pretorius, Arnu


arXiv:2410.01706 (cs)

[Submitted on 2 Oct 2024]



Abstract: As the field of multi-agent reinforcement learning (MARL) progresses towards larger and more complex environments, achieving strong performance while maintaining memory efficiency and scalability to many agents becomes increasingly important. Although recent research has led to several advanced algorithms, to date, none fully address all of these key properties simultaneously. In this work, we introduce Sable, a novel and theoretically sound algorithm that adapts the retention mechanism from Retentive Networks to MARL. Sable's retention-based sequence modelling architecture allows for computationally efficient scaling to a large number of agents, as well as maintaining a long temporal context, making it well-suited for large-scale partially observable environments. Through extensive evaluations across six diverse environments, we demonstrate how Sable is able to significantly outperform existing state-of-the-art methods in the majority of tasks (34 out of 45, roughly 75%). Furthermore, Sable demonstrates stable performance as we scale the number of agents, handling environments with more than a thousand agents while exhibiting a linear increase in memory usage. Finally, we conduct ablation studies to isolate the source of Sable's performance gains and confirm its efficient computational memory usage. Our results highlight Sable's performance and efficiency, positioning it as a leading approach to MARL at scale.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
Cite as:	arXiv:2410.01706 [cs.LG]
	(or arXiv:2410.01706v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.01706

Submission history

From: Ruan De Kock
[v1] Wed, 2 Oct 2024 16:15:26 UTC (1,715 KB)
