Cooperative Internet of UAVs: Distributed Trajectory Design by Multi-Agent Deep Reinforcement Learning

Hu J, Zhang H, Song L, Schober R, Poor HV (2020)

Publication Type: Journal article

Publication year: 2020

Journal

IEEE Transactions on Communications Institute of Electrical and Electronics Engineers (IEEE)

Book Volume: 68

Pages Range: 6807-6821

Article Number: 9154432

Journal Issue: 11

DOI: 10.1109/TCOMM.2020.3013599

Abstract

Due to the advantages of flexible deployment and extensive coverage, unmanned aerial vehicles (UAVs) have significant potential for sensing applications in the next generation of cellular networks, which will give rise to a cellular Internet of UAVs. In this article, we consider a cellular Internet of UAVs, where the UAVs execute sensing tasks through cooperative sensing and transmission to minimize the age of information (AoI). However, the cooperative sensing and transmission is tightly coupled with the UAVs' trajectories, which makes the trajectory design challenging. To tackle this challenge, we propose a distributed sense-and-send protocol, where the UAVs determine the trajectories by selecting from a discrete set of tasks and a continuous set of locations for sensing and transmission. Based on this protocol, we formulate the trajectory design problem for AoI minimization and propose a compound-action actor-critic (CA2C) algorithm to solve it based on deep reinforcement learning. The CA2C algorithm can learn the optimal policies for actions involving both continuous and discrete variables and is suited for the trajectory design. Our simulation results show that the CA2C algorithm outperforms four baseline algorithms. Also, we show that by dividing the tasks, cooperative UAVs can achieve a lower AoI compared to non-cooperative UAVs.

Authors with CRIS profile

Robert Schober Lehrstuhl für Digitale Übertragung (IDC)

Involved external institutions

Peking University (PKU) / 北京大学

China (CN) Princeton University

United States (USA) (US)

How to cite

APA:

Hu, J., Zhang, H., Song, L., Schober, R., & Poor, H.V. (2020). Cooperative Internet of UAVs: Distributed Trajectory Design by Multi-Agent Deep Reinforcement Learning. IEEE Transactions on Communications, 68(11), 6807-6821. https://doi.org/10.1109/TCOMM.2020.3013599

MLA:

Hu, Jingzhi, et al. "Cooperative Internet of UAVs: Distributed Trajectory Design by Multi-Agent Deep Reinforcement Learning." IEEE Transactions on Communications 68.11 (2020): 6807-6821.

BibTeX: Download