4.8 Article

A Transfer Learning Approach for UAV Path Design With Connectivity Outage Constraint

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 10, Issue 6, Pages 4998-5012

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2022.3220981

Keywords

Cellular networks; deep reinforcement learning (DRL); millimeter-wave (mmWave); path design; transfer learning (TL); unmanned aerial vehicle (UAV)

Ask authors/readers for more resources

In this article, a transfer learning approach is proposed to enhance the path learning of autonomous unmanned aerial vehicles (UAVs) in a new domain, using a teacher policy trained in an old domain. The approach is evaluated in different urban environment scenarios at sub-6 GHz and millimeter-wave (mmWave). Results show that this approach considerably reduces the training time at mmWave.
The connectivity-aware path design is crucial in the effective deployment of autonomous unmanned aerial vehicles (UAVs). Recently, reinforcement learning (RL) algorithms have become the popular approach to solving this type of complex problem, but RL algorithms suffer slow convergence. In this article, we propose a transfer learning (TL) approach, where we use a teacher policy previously trained in an old domain to boost the path learning of the agent in the new domain. As the exploration processes and the training continue, the agent refines the path design in the new domain based on the subsequent interactions with the environment. We evaluate our approach considering an old domain at sub-6 GHz and a new domain at millimeter-wave (mmWave). The teacher path policy, previously trained at the sub-6 GHz path, is the solution to a connectivity-aware path problem that we formulate as a constrained Markov decision process ( CMDP). We employ a Lyapunov-based model-free deep Q-network (DQN) to solve the path design at sub-6 GHz that guarantees connectivity constraint satisfaction. We empirically demonstrate the effectiveness of our approach for different urban environment scenarios. The results demonstrate that our proposed approach is capable of reducing the training time considerably at mmWave.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available