TY - JOUR
T1 - Intelligent Trajectory Planning in UAV-mounted Wireless Networks: A Quantum-Inspired Reinforcement Learning Perspective
AU - Li, Yuanjian
AU - Aghvami, Abdol-Hamid
AU - Dong, Daoyi
N1 - Funding Information:
Manuscript received May 12, 2021; accepted June 8, 2021. Date of publication June 16, 2021; date of current version September 9, 2021. This work was supported by the K-CSC scholarship funded jointly by China Scholarship Council and King’s College London, under Grant CSC201908350102. The associate editor coordinating the review of this article and approving it for publication was Z. Chang. (Corresponding author: Yuanjian Li.) Yuanjian Li and A. Hamid Aghvami are with the Centre for Telecommunications Research, King’s College London, London WC2R 2LS, U.K. (e-mail: [email protected]; [email protected]).
Publisher Copyright:
© 2012 IEEE.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2021/9
Y1 - 2021/9
N2 - In this letter, we consider a wireless uplink transmission scenario in which an unmanned aerial vehicle (UAV) serves as an aerial base station collecting data from ground users. To optimize the expected sum uplink transmit rate without any prior knowledge of ground users (e.g., locations, channel state information and transmit power), the trajectory planning problem is optimized via the quantum-inspired reinforcement learning (QiRL) approach. Specifically, the QiRL method adopts novel probabilistic action selection policy and new reinforcement strategy, which are inspired by the collapse phenomenon and amplitude amplification in quantum computation theory, respectively. Numerical results demonstrate that the proposed QiRL solution can offer natural balancing between exploration and exploitation via ranking collapse probabilities of possible actions, compared to the traditional reinforcement learning approaches that are highly dependent on tuned exploration parameters.
AB - In this letter, we consider a wireless uplink transmission scenario in which an unmanned aerial vehicle (UAV) serves as an aerial base station collecting data from ground users. To optimize the expected sum uplink transmit rate without any prior knowledge of ground users (e.g., locations, channel state information and transmit power), the trajectory planning problem is optimized via the quantum-inspired reinforcement learning (QiRL) approach. Specifically, the QiRL method adopts novel probabilistic action selection policy and new reinforcement strategy, which are inspired by the collapse phenomenon and amplitude amplification in quantum computation theory, respectively. Numerical results demonstrate that the proposed QiRL solution can offer natural balancing between exploration and exploitation via ranking collapse probabilities of possible actions, compared to the traditional reinforcement learning approaches that are highly dependent on tuned exploration parameters.
UR - http://www.scopus.com/inward/record.url?scp=85112143901&partnerID=8YFLogxK
U2 - 10.1109/LWC.2021.3089876
DO - 10.1109/LWC.2021.3089876
M3 - Article
SN - 2162-2337
VL - 10
SP - 1994
EP - 1998
JO - IEEE Wireless Communications Letters
JF - IEEE Wireless Communications Letters
IS - 9
M1 - 9456900
ER -