☆ 4.6 Article Proceedings Paper

Adaptive UAV-Trajectory Optimization Under Quality of Service Constraints: A Model-Free Solution

IEEE ACCESS (2020)

Journal

IEEE ACCESS

Volume 8, Issue -, Pages 112253-112265

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/ACCESS.2020.3001752

Keywords

Sensors; Trajectory optimization; Unmanned aerial vehicles; Reinforcement learning; Wireless networks; Reinforcement learning; sensor data collection; trajectory optimization; UAV communications

Funding

Engineering and Physical Sciences Research Council [EP/Noo4558/1, EP/PO34284/1]
COALESCE
Royal Society's Global Challenges Research Fund
European Research Council's Advanced Fellowship under Grant QuantCom
U.K. Engineering and Physical Sciences Research Council (EPSRC) [EP/N029720/2]
EPSRC [EP/N029720/2] Funding Source: UKRI

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Unmanned aerial vehicles (UAVs) with the potential of providing reliable high-rate connectivity, are becoming a promising component of future wireless networks. A UAV collects data from a set of randomly distributed sensors, where both the locations of these sensors and their data volume to be transmitted are unknown to the UAV. In order to assist the UAV in finding the optimal motion trajectory in the face of the uncertainty without the above knowledge whilst aiming for maximizing the cumulative collected data, we formulate a reinforcement learning problem by modelling the motion-trajectory as a Markov decision process with the UAV acting as the learning agent. Then, we propose a pair of novel trajectory optimization algorithms based on stochastic modelling and reinforcement learning, which allows the UAV to optimize its flight trajectory without the need for system identification. More specifically, by dividing the considered region into small tiles, we conceive state-action-reward-state-action (Sarsa) and Q-learning based UAV-trajectory optimization algorithms (i.e., SUTOA and QUTOA) aiming to maximize the cumulative data collected during the finite flight-time. Our simulation results demonstrate that both of the proposed approaches are capable of finding an optimal trajectory under the fight-time constraint. The preference for QUTOA vs. SUTOA depends on the relative position of the start and the end points of the UAVs.

Adaptive UAV-Trajectory Optimization Under Quality of Service Constraints: A Model-Free Solution

Journal

IEEE ACCESS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Adaptive UAV-Trajectory Optimization Under Quality of Service Constraints: A Model-Free Solution

Journal

IEEE ACCESS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper