TY - JOUR
T1 - Average AoI-Minimal Trajectory Design for UAV-Assisted IoT Data Collection System
T2 - A Safe-TD3 Approach
AU - Sun, Hongguang
AU - Zhou, Yi
AU - Tang, Jinchen
AU - Kang, Zhangsai
AU - Wang, Xijun
AU - Quek, Tony Q.S.
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2024/2/1
Y1 - 2024/2/1
N2 - This letter investigates an unmanned aerial vehicle (UAV)-assisted data collection strategy where the UAV trajectory is optimally designed to collect status update from several Internet of Things (IoT) nodes, so as to minimize the average Age of Information (AoI). We consider a practical three-dimensional (3D) urban environment, and design the UAV's trajectory by considering the data collection, flight, and energy constraints. Motivated by the critical safety requirements for the UAV, i.e., the energy constraint during the data collection, we exploit the twin delayed deep deterministic policy gradient (TD3) approach by enforcing the safety constraint throughout the training, and propose a Safe-TD3 based trajectory design for average AoI minimization. By evaluating the long-term safety constraint via the integrated cost network, we illustrate the superiority of the proposed Safe-TD3 based trajectory design algorithm over the benchmarks in reducing the safety constraint violations during the training process while achieving a lower average AoI.
AB - This letter investigates an unmanned aerial vehicle (UAV)-assisted data collection strategy where the UAV trajectory is optimally designed to collect status update from several Internet of Things (IoT) nodes, so as to minimize the average Age of Information (AoI). We consider a practical three-dimensional (3D) urban environment, and design the UAV's trajectory by considering the data collection, flight, and energy constraints. Motivated by the critical safety requirements for the UAV, i.e., the energy constraint during the data collection, we exploit the twin delayed deep deterministic policy gradient (TD3) approach by enforcing the safety constraint throughout the training, and propose a Safe-TD3 based trajectory design for average AoI minimization. By evaluating the long-term safety constraint via the integrated cost network, we illustrate the superiority of the proposed Safe-TD3 based trajectory design algorithm over the benchmarks in reducing the safety constraint violations during the training process while achieving a lower average AoI.
UR - http://www.scopus.com/inward/record.url?scp=85179032408&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85179032408&partnerID=8YFLogxK
U2 - 10.1109/LWC.2023.3335037
DO - 10.1109/LWC.2023.3335037
M3 - Article
AN - SCOPUS:85179032408
SN - 2162-2337
VL - 13
SP - 530
EP - 534
JO - IEEE Wireless Communications Letters
JF - IEEE Wireless Communications Letters
IS - 2
ER -