Covertness-Aware Trajectory Design for UAV: A Multi-Step TD3-PER Solution

Yuanjian Li; Abdol-Hamid Aghvami

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

This paper considers maximizing the transmission throughput from an unmanned aerial vehicle (UAV) to legitimate nodes in the presence of the Warden's detection, and solves the problem through UAV trajectory design subject to covertness, velocity and mobility constraints. Because of the building-distribution-based path loss model and the uncertainty in the Warden's location, the formulated optimization problem is difficult to tackle with standard offline optimization methods. Instead, a twin delayed deep deterministic policy gradient (TD3) approach enhanced with multi-step learning and prioritized experience replay (PER), termed multi-step TD3-PER, is proposed to help the UAV adaptively select its velocity from a continuous action space. Numerical results demonstrate the effectiveness of the proposed multi-step TD3-PER solution and its advantages over the provided baselines.
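As a rough illustration of the two enhancements named in the abstract, the sketch below shows a generic multi-step target bootstrapped with the smaller of two target-critic estimates (the clipped double-Q idea used by TD3) and the proportional sampling rule commonly used in prioritized experience replay. It is not taken from the paper: the function names, the discount factor `gamma`, the priority exponent `alpha`, and the small constant `eps` are all assumptions chosen for illustration, and the paper's reward, state, and network design are not reproduced here.

```python
from typing import List, Sequence


def n_step_td3_target(rewards: Sequence[float], q1_next: float, q2_next: float,
                      gamma: float = 0.99, done: bool = False) -> float:
    """Multi-step return bootstrapped with the smaller of two critic estimates.

    `rewards` holds the n rewards collected after the sampled state;
    `q1_next` and `q2_next` are the two target-critic values at the state
    reached after those n steps (hypothetical inputs for this sketch).
    """
    # Discounted sum of the n observed rewards.
    target = 0.0
    for k, r in enumerate(rewards):
        target += (gamma ** k) * r
    # Bootstrap only if the episode did not terminate within the n steps.
    if not done:
        target += (gamma ** len(rewards)) * min(q1_next, q2_next)
    return target


def per_probabilities(td_errors: Sequence[float], alpha: float = 0.6,
                      eps: float = 1e-6) -> List[float]:
    """Proportional prioritization: P(i) = p_i^alpha / sum_j p_j^alpha,
    with priority p_i = |td_error_i| + eps."""
    priorities = [(abs(d) + eps) ** alpha for d in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]
```

For example, `n_step_td3_target([1.0, 1.0, 1.0], 10.0, 8.0, gamma=0.5)` bootstraps from the smaller critic value 8.0 after three steps, and `per_probabilities` assigns a larger sampling probability to transitions with a larger absolute TD error.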

Original language	English
Title of host publication	IEEE International Conference on Communications (ICC 2022)
Publication status	Accepted/In press - 18 Jan 2022

Cite this

@inbook{bfba13aad3d743f1b33d72b44be8954e,
title = "Covertness-Aware Trajectory Design for UAV: A Multi-Step TD3-PER Solution",
abstract = "In the presence of Warden's detection, a maximization problem on transmission throughput from unmanned aerial vehicle (UAV) to legitimate nodes is considered and solved via UAV trajectory design, subject to covert, velocity and mobility constraints. With the building-distribution-based pathloss model and the Warden's uncertain location model, the formulated optimization problem is challenging to be tackled through standard offline optimization methods. Alternatively, a twin delayed deep deterministic policy gradient approach enhanced by multi-step learning and prioritized experience replay techniques, termed as multi-step TD3-PER, is proposed to help the UAV adaptively select velocity from continuous action space. Numerical results demonstrate the effectiveness of the proposed multi-step TD3-PER solution and showcase the corresponding superiorities against provided baselines.",
author = "Yuanjian Li and Abdol-Hamid Aghvami",
year = "2022",
month = jan,
day = "18",
language = "English",
booktitle = "IEEE International Conference on Communications (ICC 2022)",
}

TY - CHAP

T1 - Covertness-Aware Trajectory Design for UAV: A Multi-Step TD3-PER Solution

AU - Li, Yuanjian

AU - Aghvami, Abdol-Hamid

PY - 2022/1/18

Y1 - 2022/1/18

N2 - In the presence of Warden's detection, a maximization problem on transmission throughput from unmanned aerial vehicle (UAV) to legitimate nodes is considered and solved via UAV trajectory design, subject to covert, velocity and mobility constraints. With the building-distribution-based pathloss model and the Warden's uncertain location model, the formulated optimization problem is challenging to be tackled through standard offline optimization methods. Alternatively, a twin delayed deep deterministic policy gradient approach enhanced by multi-step learning and prioritized experience replay techniques, termed as multi-step TD3-PER, is proposed to help the UAV adaptively select velocity from continuous action space. Numerical results demonstrate the effectiveness of the proposed multi-step TD3-PER solution and showcase the corresponding superiorities against provided baselines.

M3 - Conference paper

BT - IEEE International Conference on Communications (ICC 2022)

ER -