Cooperative Deep Reinforcement Learning for Multiple-group NB-IoT Networks Optimization

Nan Jiang; Yansha Deng; Osvaldo Simeone; Arumugam Nallanathan

doi:10.1109/ICASSP.2019.8682697

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

18 Citations (Scopus)

Abstract

NarrowBand-Internet of Things (NB-IoT) is an emerging cellular-based technology that offers a range of flexible configurations for massive IoT radio access from groups of devices with heterogeneous requirements. A configuration specifies the amount of radio resources allocated to each group of devices for random access and for data transmission. Assuming no knowledge of the traffic statistics, the problem is to determine, in an online fashion at each Transmission Time Interval (TTI), the configurations that maximize the long-term average number of IoT devices that are able to both access and deliver data. Given the complexity of optimal algorithms, a Cooperative Multi-Agent Deep Neural Network based Q-learning (CMA-DQN) approach is developed, whereby each DQN agent independently controls a configuration variable for each group. The DQN agents are cooperatively trained in the same environment based on feedback regarding transmission outcomes. CMA-DQN is seen to considerably outperform conventional heuristic approaches based on load estimation.
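
The cooperative training idea in the abstract (several agents, each choosing one group's configuration, all learning from the same transmission feedback) can be illustrated with a minimal sketch. This is not the paper's CMA-DQN: tabular Q-learning stands in for the deep Q-networks, and the environment, the names LOADS, BUDGET, N_ACTIONS, toy_reward, QAgent and train, and all hyperparameters are hypothetical choices made only for illustration.

```python
import random

# Hypothetical toy setting, not taken from the paper.
LOADS = (2, 1)   # per-group demand in one TTI, unknown to the agents
BUDGET = 3       # total resource units available per TTI
N_ACTIONS = 4    # each agent requests 0..3 units for its group


def toy_reward(actions):
    """Devices served in one TTI; zero if the joint request exceeds the budget."""
    if sum(actions) > BUDGET:
        return 0
    return sum(min(a, load) for a, load in zip(actions, LOADS))


class QAgent:
    """Tabular Q-learning agent, a small stand-in for one DQN agent."""

    def __init__(self, n_states, n_actions, rng, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.q = [[0.0] * n_actions for _ in range(n_states)]
        self.n_actions = n_actions
        self.rng = rng
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def act(self, state):
        # Epsilon-greedy: explore with probability epsilon, else pick the best action.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        row = self.q[state]
        return row.index(max(row))

    def update(self, state, action, reward, next_state):
        # Standard one-step Q-learning update.
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


def train(n_steps=5000, seed=0):
    rng = random.Random(seed)
    agents = [QAgent(2, N_ACTIONS, rng) for _ in LOADS]
    state = 0  # 0: last TTI stayed within budget, 1: last TTI exceeded it
    for _ in range(n_steps):
        # Each agent independently picks the configuration for its own group.
        actions = [agent.act(state) for agent in agents]
        reward = toy_reward(actions)
        next_state = int(sum(actions) > BUDGET)
        # Cooperation: every agent is updated with the same shared reward.
        for agent, action in zip(agents, actions):
            agent.update(state, action, reward, next_state)
        state = next_state
    return agents


if __name__ == "__main__":
    trained = train()
    for group, agent in enumerate(trained):
        print("group", group, "Q-values in state 0:", agent.q[0])
```

Because every agent receives the same reward, an agent that over-requests resources lowers the value estimates of all agents, which is what pushes the independent learners toward a jointly good configuration in this toy setting.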

Original language	English
Title of host publication	2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	8424-8428
Number of pages	5
ISBN (Electronic)	9781479981311
DOIs	https://doi.org/10.1109/ICASSP.2019.8682697
Publication status	Published - 2019
Event	44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Brighton, United Kingdom Duration: 12 May 2019 → 17 May 2019

Publication series

Name	2010 Ieee International Conference On Acoustics, Speech, And Signal Processing
ISSN (Print)	1520-6149

Conference

Conference	44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
Country/Territory	United Kingdom
City	Brighton
Period	12/05/2019 → 17/05/2019

Keywords

Deep Reinforcement Learning
Multi-Agent
NB-IoT
Random Access
Resource Configuration

Access to Document

10.1109/ICASSP.2019.8682697

Cite this

Jiang, N., Deng, Y., Simeone, O., & Nallanathan, A. (2019). Cooperative Deep Reinforcement Learning for Multiple-group NB-IoT Networks Optimization. In 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings (pp. 8424-8428). Article 8682697 (2010 Ieee International Conference On Acoustics, Speech, And Signal Processing). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICASSP.2019.8682697

@inbook{7e22a1c35f124708ad77c21696981908,
title = "Cooperative Deep Reinforcement Learning for Multiple-group NB-IoT Networks Optimization",
abstract = "NarrowBand-Internet of Things (NB-IoT) is an emerging cellular-based technology that offers a range of flexible configurations for massive IoT radio access from groups of devices with heterogeneous requirements. A configuration specifies the amount of radio resources allocated to each group of devices for random access and for data transmission. Assuming no knowledge of the traffic statistics, the problem is to determine, in an online fashion at each Transmission Time Interval (TTI), the configurations that maximize the long-term average number of IoT devices that are able to both access and deliver data. Given the complexity of optimal algorithms, a Cooperative Multi-Agent Deep Neural Network based Q-learning (CMA-DQN) approach is developed, whereby each DQN agent independently controls a configuration variable for each group. The DQN agents are cooperatively trained in the same environment based on feedback regarding transmission outcomes. CMA-DQN is seen to considerably outperform conventional heuristic approaches based on load estimation.",
keywords = "Deep Reinforcement Learning, Multi-Agent, NB-IoT, Random Access, Resource Configuration",
author = "Nan Jiang and Yansha Deng and Osvaldo Simeone and Arumugam Nallanathan",
year = "2019",
doi = "10.1109/ICASSP.2019.8682697",
language = "English",
series = "2010 Ieee International Conference On Acoustics, Speech, And Signal Processing",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "8424--8428",
booktitle = "2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings",
address = "United States",
note = "44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 ; Conference date: 12-05-2019 Through 17-05-2019",
}

TY - CHAP
T1 - Cooperative Deep Reinforcement Learning for Multiple-group NB-IoT Networks Optimization
AU - Jiang, Nan
AU - Deng, Yansha
AU - Simeone, Osvaldo
AU - Nallanathan, Arumugam
PY - 2019
Y1 - 2019
AB - NarrowBand-Internet of Things (NB-IoT) is an emerging cellular-based technology that offers a range of flexible configurations for massive IoT radio access from groups of devices with heterogeneous requirements. A configuration specifies the amount of radio resources allocated to each group of devices for random access and for data transmission. Assuming no knowledge of the traffic statistics, the problem is to determine, in an online fashion at each Transmission Time Interval (TTI), the configurations that maximize the long-term average number of IoT devices that are able to both access and deliver data. Given the complexity of optimal algorithms, a Cooperative Multi-Agent Deep Neural Network based Q-learning (CMA-DQN) approach is developed, whereby each DQN agent independently controls a configuration variable for each group. The DQN agents are cooperatively trained in the same environment based on feedback regarding transmission outcomes. CMA-DQN is seen to considerably outperform conventional heuristic approaches based on load estimation.
KW - Deep Reinforcement Learning
KW - Multi-Agent
KW - NB-IoT
KW - Random Access
KW - Resource Configuration
UR - http://www.scopus.com/inward/record.url?scp=85068956808&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2019.8682697
DO - 10.1109/ICASSP.2019.8682697
M3 - Conference paper
AN - SCOPUS:85068956808
T3 - 2010 Ieee International Conference On Acoustics, Speech, And Signal Processing
SP - 8424
EP - 8428
BT - 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
Y2 - 12 May 2019 through 17 May 2019
ER -

Jiang N, Deng Y, Simeone O, Nallanathan A. Cooperative Deep Reinforcement Learning for Multiple-group NB-IoT Networks Optimization. In 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2019. p. 8424-8428. 8682697. (2010 Ieee International Conference On Acoustics, Speech, And Signal Processing). Epub 2019 Apr 16. doi: 10.1109/ICASSP.2019.8682697
