4.7 Article

Multi-Hop Deflection Routing Algorithm Based on Reinforcement Learning for Energy-Harvesting Nanonetworks

Journal

IEEE TRANSACTIONS ON MOBILE COMPUTING
Volume 21, Issue 1, Pages 211-225

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TMC.2020.3006535

Keywords

Nanonetworks; deflection routing; reinforcement learning; energy harvesting; THz communications

Funding

  1. National Natural Science Foundation of China (NSFC) [61772471, 61873240]

Ask authors/readers for more resources

This paper proposes a multi-hop deflection routing algorithm based on reinforcement learning (MDR-RL) for the design of routing protocols in nanonetworks. By implementing new routing and deflection tables in nano-nodes and designing different updating schemes, dynamic and efficient routing path exploration is achieved. Simulation results show that MDR-RL can significantly increase the packet delivery ratio and number of delivered packets while reducing the packet average hop count.
Nanonetworks are composed of interacting nano-nodes, whose size ranges from several hundred cubic nanometers to several cubic micrometers. The extremely constrained computational resources of nano-nodes, the fluctuations in their energy caused by energy harvesting processes, and their very limited transmission range at Terahertz (THz)-band frequencies (0.1-10 THz), make the design of routing protocols in nanonetworks very challenging. A multi-hop deflection routing algorithm based on reinforcement learning (MDR-RL) is proposed in this paper to dynamically and efficiently explore the routing paths during packet transmissions. First, new routing and deflection tables are implemented in nano-nodes, so that nano-nodes can deflect packets to other neighbors when route entries in the routing table are invalid. Second, one forward updating scheme and two feedback updating schemes based on reinforcement learning are designed to update the tables, namely, on-policy and off-policy updating schemes. Finally, extensive simulations in networks simulator-3 are conducted to analyze the performance of MDR-RL using different updating policies, as well as to compare the performance with other machine learning routing algorithms based on Neural Networks and Decision Tree. The results show that the MDR-RL can increase the packet delivery ratio and number of delivered packets, and can decrease the packet average hop count.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available