Efficient and scalable reinforcement learning for large-scale network control

Chengdong Ma; Aming  Li; Yali Du; Hao Dong; Yaodong Yang

doi:10.1038/s42256-024-00879-7

Efficient and scalable reinforcement learning for large-scale network control

Chengdong Ma, Aming Li, Yali Du, Hao Dong, Yaodong Yang

Informatics

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

47 Downloads (Pure)

Abstract

The primary challenge in the development of large-scale artificial intelligence (AI) systems lies in achieving scalable decision-making—extending the AI models while maintaining sufficient performance. Existing research indicates that distributed AI can improve scalability by decomposing complex tasks and distributing them across collaborative nodes. However, previous technologies suffered from compromised real-world applicability and scalability due to the massive requirement of communication and sampled data. Here we develop a model-based decentralized policy optimization framework, which can be efficiently deployed in multi-agent systems. By leveraging local observation through the agent-level topological decoupling of global dynamics, we prove that this decentralized mechanism achieves accurate estimations of global information. Importantly, we further introduce model learning to reinforce the optimal policy for monotonic improvement with a limited amount of sampled data. Empirical results on diverse scenarios show the superior scalability of our approach, particularly in real-world systems with hundreds of agents, thereby paving the way for scaling up AI systems.

Original language	English
Pages (from-to)	1006-1020
Number of pages	15
Journal	Nature Machine Intelligence
Volume	6
Issue number	9
DOIs	https://doi.org/10.1038/s42256-024-00879-7
Publication status	Published - 3 Sept 2024

Access to Document

10.1038/s42256-024-00879-7

NMI_MARL_final_submission

Cite this

@article{57d250acea2f45ef9266e72694d3c66c,

title = "Efficient and scalable reinforcement learning for large-scale network control",

abstract = "The primary challenge in the development of large-scale artificial intelligence (AI) systems lies in achieving scalable decision-making—extending the AI models while maintaining sufficient performance. Existing research indicates that distributed AI can improve scalability by decomposing complex tasks and distributing them across collaborative nodes. However, previous technologies suffered from compromised real-world applicability and scalability due to the massive requirement of communication and sampled data. Here we develop a model-based decentralized policy optimization framework, which can be efficiently deployed in multi-agent systems. By leveraging local observation through the agent-level topological decoupling of global dynamics, we prove that this decentralized mechanism achieves accurate estimations of global information. Importantly, we further introduce model learning to reinforce the optimal policy for monotonic improvement with a limited amount of sampled data. Empirical results on diverse scenarios show the superior scalability of our approach, particularly in real-world systems with hundreds of agents, thereby paving the way for scaling up AI systems.",

author = "Chengdong Ma and Aming Li and Yali Du and Hao Dong and Yaodong Yang",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",

year = "2024",

month = sep,

day = "3",

doi = "10.1038/s42256-024-00879-7",

language = "English",

volume = "6",

pages = "1006--1020",

journal = "Nature Machine Intelligence",

issn = "2522-5839",

publisher = "Springer Nature Switzerland AG",

number = "9",

}

TY - JOUR

T1 - Efficient and scalable reinforcement learning for large-scale network control

AU - Ma, Chengdong

AU - Li, Aming

AU - Du, Yali

AU - Dong, Hao

AU - Yang, Yaodong

PY - 2024/9/3

Y1 - 2024/9/3

N2 - The primary challenge in the development of large-scale artificial intelligence (AI) systems lies in achieving scalable decision-making—extending the AI models while maintaining sufficient performance. Existing research indicates that distributed AI can improve scalability by decomposing complex tasks and distributing them across collaborative nodes. However, previous technologies suffered from compromised real-world applicability and scalability due to the massive requirement of communication and sampled data. Here we develop a model-based decentralized policy optimization framework, which can be efficiently deployed in multi-agent systems. By leveraging local observation through the agent-level topological decoupling of global dynamics, we prove that this decentralized mechanism achieves accurate estimations of global information. Importantly, we further introduce model learning to reinforce the optimal policy for monotonic improvement with a limited amount of sampled data. Empirical results on diverse scenarios show the superior scalability of our approach, particularly in real-world systems with hundreds of agents, thereby paving the way for scaling up AI systems.

AB - The primary challenge in the development of large-scale artificial intelligence (AI) systems lies in achieving scalable decision-making—extending the AI models while maintaining sufficient performance. Existing research indicates that distributed AI can improve scalability by decomposing complex tasks and distributing them across collaborative nodes. However, previous technologies suffered from compromised real-world applicability and scalability due to the massive requirement of communication and sampled data. Here we develop a model-based decentralized policy optimization framework, which can be efficiently deployed in multi-agent systems. By leveraging local observation through the agent-level topological decoupling of global dynamics, we prove that this decentralized mechanism achieves accurate estimations of global information. Importantly, we further introduce model learning to reinforce the optimal policy for monotonic improvement with a limited amount of sampled data. Empirical results on diverse scenarios show the superior scalability of our approach, particularly in real-world systems with hundreds of agents, thereby paving the way for scaling up AI systems.

UR - http://www.scopus.com/inward/record.url?scp=85203020555&partnerID=8YFLogxK

U2 - 10.1038/s42256-024-00879-7

DO - 10.1038/s42256-024-00879-7

M3 - Article

SN - 2522-5839

VL - 6

SP - 1006

EP - 1020

JO - Nature Machine Intelligence

JF - Nature Machine Intelligence

IS - 9

ER -

Efficient and scalable reinforcement learning for large-scale network control

Abstract

Access to Document

Other files and links

Fingerprint

Cite this