Efficient and scalable reinforcement learning for large-scale network control

Chengdong Ma, Aming Li, Yali Du, Hao Dong, Yaodong Yang

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)
47 Downloads (Pure)

Abstract

The primary challenge in the development of large-scale artificial intelligence (AI) systems lies in achieving scalable decision-making—extending the AI models while maintaining sufficient performance. Existing research indicates that distributed AI can improve scalability by decomposing complex tasks and distributing them across collaborative nodes. However, previous technologies suffered from compromised real-world applicability and scalability due to the massive requirement of communication and sampled data. Here we develop a model-based decentralized policy optimization framework, which can be efficiently deployed in multi-agent systems. By leveraging local observation through the agent-level topological decoupling of global dynamics, we prove that this decentralized mechanism achieves accurate estimations of global information. Importantly, we further introduce model learning to reinforce the optimal policy for monotonic improvement with a limited amount of sampled data. Empirical results on diverse scenarios show the superior scalability of our approach, particularly in real-world systems with hundreds of agents, thereby paving the way for scaling up AI systems.

Original languageEnglish
Pages (from-to)1006-1020
Number of pages15
JournalNature Machine Intelligence
Volume6
Issue number9
DOIs
Publication statusPublished - 3 Sept 2024

Fingerprint

Dive into the research topics of 'Efficient and scalable reinforcement learning for large-scale network control'. Together they form a unique fingerprint.

Cite this