Deep Reinforcement Learning for Discrete and Continuous Massive Access Control optimization

Nan Jiang; Yansha Deng; Arumugam Nallanathan

doi:10.1109/ICC40277.2020.9149055

Nan Jiang, Yansha Deng^*, Arumugam Nallanathan

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

13 Citations (Scopus)

Abstract

Cellular-based networks are expected to offer connectivity for massive Internet of Things (mIoT) systems; however, their Random Access CHannel (RACH) procedure suffers from unreliability due to collisions during simultaneous massive access. Although this collision problem has been treated in existing RACH schemes by organizing IoT devices' transmission and retransmission via central control at the Base Station (BS), these existing RACH schemes are usually fixed over time and thus can hardly adapt to time-varying traffic patterns. In order to optimize the long-term objective in the number of successful devices, this paper aims to design Deep Reinforcement Learning (DRL)-based optimizers with Deep Q-Network (DQN) and Deep Deterministic Policy Gradients (DDPG) for optimizing RACH schemes, including Access Class Barring (ACB), Back-Off (BO), and Distributed Queuing (DQ). Specifically, we apply DQN to handle discrete action selection for the BO as well as the DQ schemes, and DDPG to handle continuous action selection for the ACB scheme. Both agents are integrated with a Gated Recurrent Unit (GRU) network to approximate their value function/policy, which can improve the optimization performance by capturing temporal traffic correlations. Numerical results show that our proposed DRL-based optimizers considerably outperform conventional heuristic solutions in terms of the number of devices that successfully access the network.
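
As a rough illustration of the collision problem and of why the ACB scheme calls for a continuous action, the sketch below simulates a single random-access slot. It is not the authors' simulator or their DRL agent: the function name `simulate_rach_slot`, the device count, and the choice of 54 preambles are assumptions made for this example only. Each device passes the barring check when a uniform draw falls below the ACB factor, then picks a preamble at random; a preamble chosen by exactly one device counts as a success.

```python
import random
from collections import Counter


def simulate_rach_slot(num_devices, num_preambles, acb_factor, rng):
    """Simulate one random-access slot under Access Class Barring (illustrative only).

    Returns the number of devices whose preamble was chosen by no other device.
    """
    # ACB check: a device contends only if its uniform draw is below the barring factor.
    contenders = sum(1 for _ in range(num_devices) if rng.random() < acb_factor)
    # Each contending device picks one preamble uniformly at random.
    picks = Counter(rng.randrange(num_preambles) for _ in range(contenders))
    # A preamble picked exactly once succeeds; a shared preamble is a collision.
    return sum(1 for count in picks.values() if count == 1)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs are comparable
    # Hypothetical load: 200 backlogged devices, 54 preambles (assumed values).
    for factor in (0.1, 0.3, 0.5, 1.0):
        trials = [simulate_rach_slot(200, 54, factor, rng) for _ in range(1000)]
        print(f"ACB factor {factor:.1f}: mean successes {sum(trials) / len(trials):.1f}")
```

Sweeping the barring factor by hand, as in the loop above, is the kind of fixed heuristic the abstract contrasts with a learned policy: the best factor depends on the backlog, which changes with the traffic pattern.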

Original language	English
Title of host publication	2020 IEEE International Conference on Communications, ICC 2020 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781728150895
DOIs	https://doi.org/10.1109/ICC40277.2020.9149055
Publication status	Published - Jun 2020
Event	2020 IEEE International Conference on Communications, ICC 2020 - Dublin, Ireland; Duration: 7 Jun 2020 → 11 Jun 2020

Publication series

Name	IEEE International Conference on Communications
Volume	2020-June
ISSN (Print)	1550-3607

Conference

Conference	2020 IEEE International Conference on Communications, ICC 2020
Country/Territory	Ireland
City	Dublin
Period	7/06/2020 → 11/06/2020

Keywords

access control
Deep reinforcement learning
dynamic optimization
random access

Access to Document

10.1109/ICC40277.2020.9149055

Cite this

Jiang, N., Deng, Y., & Nallanathan, A. (2020). Deep Reinforcement Learning for Discrete and Continuous Massive Access Control optimization. In 2020 IEEE International Conference on Communications, ICC 2020 - Proceedings, Article 9149055 (IEEE International Conference on Communications; Vol. 2020-June). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICC40277.2020.9149055

@inproceedings{ebddd51210ed4804bd246903a26c9705,

title = "Deep Reinforcement Learning for Discrete and Continuous Massive Access Control optimization",

abstract = "Cellular-based networks are expected to offer connectivity for massive Internet of Things (mIoT) systems; however, their Random Access CHannel (RACH) procedure suffers from unreliability due to collisions during simultaneous massive access. Although this collision problem has been treated in existing RACH schemes by organizing IoT devices' transmission and retransmission via central control at the Base Station (BS), these existing RACH schemes are usually fixed over time and thus can hardly adapt to time-varying traffic patterns. In order to optimize the long-term objective in the number of successful devices, this paper aims to design Deep Reinforcement Learning (DRL)-based optimizers with Deep Q-Network (DQN) and Deep Deterministic Policy Gradients (DDPG) for optimizing RACH schemes, including Access Class Barring (ACB), Back-Off (BO), and Distributed Queuing (DQ). Specifically, we apply DQN to handle discrete action selection for the BO as well as the DQ schemes, and DDPG to handle continuous action selection for the ACB scheme. Both agents are integrated with a Gated Recurrent Unit (GRU) network to approximate their value function/policy, which can improve the optimization performance by capturing temporal traffic correlations. Numerical results show that our proposed DRL-based optimizers considerably outperform conventional heuristic solutions in terms of the number of devices that successfully access the network.",

keywords = "access control, Deep reinforcement learning, dynamic optimization, random access",

author = "Nan Jiang and Yansha Deng and Arumugam Nallanathan",

year = "2020",

month = jun,

doi = "10.1109/ICC40277.2020.9149055",

language = "English",

series = "IEEE International Conference on Communications",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2020 IEEE International Conference on Communications, ICC 2020 - Proceedings",

address = "United States",

note = "2020 IEEE International Conference on Communications, ICC 2020 ; Conference date: 07-06-2020 Through 11-06-2020",

}

Jiang, N, Deng, Y & Nallanathan, A 2020, Deep Reinforcement Learning for Discrete and Continuous Massive Access Control optimization. in 2020 IEEE International Conference on Communications, ICC 2020 - Proceedings, 9149055, IEEE International Conference on Communications, vol. 2020-June, Institute of Electrical and Electronics Engineers Inc., 2020 IEEE International Conference on Communications, ICC 2020, Dublin, Ireland, 7/06/2020. https://doi.org/10.1109/ICC40277.2020.9149055

Deep Reinforcement Learning for Discrete and Continuous Massive Access Control optimization. / Jiang, Nan; Deng, Yansha; Nallanathan, Arumugam.
2020 IEEE International Conference on Communications, ICC 2020 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2020. 9149055 (IEEE International Conference on Communications; Vol. 2020-June).


TY - CHAP

T1 - Deep Reinforcement Learning for Discrete and Continuous Massive Access Control optimization

AU - Jiang, Nan

AU - Deng, Yansha

AU - Nallanathan, Arumugam

PY - 2020/6

Y1 - 2020/6

AB - Cellular-based networks are expected to offer connectivity for massive Internet of Things (mIoT) systems; however, their Random Access CHannel (RACH) procedure suffers from unreliability due to collisions during simultaneous massive access. Although this collision problem has been treated in existing RACH schemes by organizing IoT devices' transmission and retransmission via central control at the Base Station (BS), these existing RACH schemes are usually fixed over time and thus can hardly adapt to time-varying traffic patterns. In order to optimize the long-term objective in the number of successful devices, this paper aims to design Deep Reinforcement Learning (DRL)-based optimizers with Deep Q-Network (DQN) and Deep Deterministic Policy Gradients (DDPG) for optimizing RACH schemes, including Access Class Barring (ACB), Back-Off (BO), and Distributed Queuing (DQ). Specifically, we apply DQN to handle discrete action selection for the BO as well as the DQ schemes, and DDPG to handle continuous action selection for the ACB scheme. Both agents are integrated with a Gated Recurrent Unit (GRU) network to approximate their value function/policy, which can improve the optimization performance by capturing temporal traffic correlations. Numerical results show that our proposed DRL-based optimizers considerably outperform conventional heuristic solutions in terms of the number of devices that successfully access the network.

KW - access control

KW - Deep reinforcement learning

KW - dynamic optimization

KW - random access

UR - http://www.scopus.com/inward/record.url?scp=85089477390&partnerID=8YFLogxK

U2 - 10.1109/ICC40277.2020.9149055

DO - 10.1109/ICC40277.2020.9149055

M3 - Conference paper

AN - SCOPUS:85089477390

T3 - IEEE International Conference on Communications

BT - 2020 IEEE International Conference on Communications, ICC 2020 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2020 IEEE International Conference on Communications, ICC 2020

Y2 - 7 June 2020 through 11 June 2020

ER -
