Article

Improving Energy Efficiency and QoS of LPWANs for IoT Using Q-Learning Based Data Routing

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCCN.2021.3114147

Keywords

Low-power wide-area networks (LPWANs); Internet of Things (IoT); energy efficiency; quality-of-service (QoS); reinforcement learning; Q-learning

The rapid development of the Internet of Things (IoT) has created the need for large-scale connectivity among smart IoT devices over a vast geographical area. This has led to the emergence of Low-Power Wide-Area Networks (LPWANs), which provide long-range communication with low power consumption. However, the increasing volume of data generated by IoT devices makes direct data transmission inefficient, hence the need for a multi-hop data routing method. This study proposes a reinforcement learning approach to address the challenges of multi-hop data transmission and demonstrates its effectiveness on simulations and real field data.
The recent proliferation of the Internet of Things (IoT) demands large-scale connectivity among smart IoT devices over a vast geographical area. However, the limited radio range and lack of scalability of conventional wireless sensor networks do not allow wide-area connectivity among IoT devices. To address these challenges, Low-Power Wide-Area Networks (LPWANs) are emerging to provide long-range communication capability with low power consumption at the end devices. Nevertheless, given the demand to deliver an increasingly large volume of data generated by IoT devices, the direct data transmission model is not suitable because of its poor network lifetime. Therefore, in this work, a multi-hop data routing method is proposed for LPWANs. Since multi-hop data transmission faces several challenges, such as increased data latency, higher interference, and reduced data throughput (i.e., poor bandwidth utilization), we propose a reinforcement learning method to address those challenges. The proposed method updates the Q-matrix of the network at varying discrete time instants and selects relay devices so as to maximize the cumulative reward value between selected device-gateway pairs. The applicability and effectiveness of the proposed method are illustrated on both a simulated LPWAN testbed and real field data sets. The results show that the proposed method improves network performance, in terms of energy efficiency and QoS, compared with various existing methods.
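
The relay-selection idea in the abstract can be illustrated with a minimal tabular Q-learning sketch. This is not the authors' algorithm: the abstract does not give the reward function, the update schedule, or the network model, so everything below is an assumption made for illustration. The five-node topology in `LINKS`, the per-hop energy costs, the `DELIVERY_BONUS`, and the learning parameters `ALPHA`, `GAMMA`, and `EPSILON` are all hypothetical. The sketch only shows how a Q-matrix indexed by (device, next hop) can be updated hop by hop and then used to pick relays that maximize cumulative reward toward a gateway.

```python
import random

# Hypothetical topology (assumption, not from the paper): node 0 is the gateway.
# LINKS[device][neighbour] is an assumed energy cost for one hop on that link.
LINKS = {
    1: {0: 1.0, 2: 0.4},
    2: {0: 2.5, 1: 0.4, 3: 0.5},
    3: {2: 0.5, 4: 0.6},
    4: {3: 0.6, 2: 1.5},
}
GATEWAY = 0
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2  # assumed learning rate, discount, exploration
DELIVERY_BONUS = 10.0                  # assumed reward for reaching the gateway


def train(episodes=2000, seed=0):
    """Learn a Q-matrix q[device][next_hop] with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in nbrs} for s, nbrs in LINKS.items()}
    for _ in range(episodes):
        state = rng.choice(list(LINKS))
        for _ in range(20):  # hop limit so an episode always terminates
            actions = list(LINKS[state])
            if rng.random() < EPSILON:
                nxt = rng.choice(actions)                       # explore
            else:
                nxt = max(actions, key=lambda a: q[state][a])   # exploit
            # Reward: negative hop energy, plus a bonus on delivery.
            reward = -LINKS[state][nxt]
            if nxt == GATEWAY:
                reward += DELIVERY_BONUS
            # The gateway is terminal, so it contributes no future value.
            future = 0.0 if nxt == GATEWAY else max(q[nxt].values())
            q[state][nxt] += ALPHA * (reward + GAMMA * future - q[state][nxt])
            if nxt == GATEWAY:
                break
            state = nxt
    return q


def route(q, source, max_hops=10):
    """Follow the greedy (highest-Q) next hop from source toward the gateway."""
    path = [source]
    while path[-1] != GATEWAY and len(path) <= max_hops:
        current = path[-1]
        path.append(max(q[current], key=q[current].get))
    return path


if __name__ == "__main__":
    q_matrix = train()
    for device in LINKS:
        print(device, route(q_matrix, device))
```

With these assumed costs, a device whose direct link to the gateway is expensive can learn to relay through a cheaper neighbour, which is the kind of trade-off between energy use and hop count that the abstract describes. A reward that also reflects latency, interference, or throughput would be needed to capture the QoS objectives the paper targets.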
