4.7 Article

RLID-V: Reinforcement Learning-Based Information Dissemination Policy Generation in VANETs

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TITS.2023.3300948

Keywords

~VANETs; Information Dissemination; Policy Combination; Reinforcement Learning

Ask authors/readers for more resources

In this paper, a reinforcement learning-based information dissemination policy generation scheme in VANETs, named RLID-V, is proposed. It combines multiple attribute-based access control policies to resolve conflicts and uses a decision tree to construct a manual feedback policy. Reinforcement learning is employed to dynamically update confidence weights of different policy sources. Experimental results show that RLID-V outperforms three existing schemes in accuracy and effectiveness of information dissemination in traffic guidance and accident warning scenarios. Additionally, RLID-V exhibits robustness, with a negligible cost of less than 1% of overall delay overhead for policy generation.
Ciphertext policy attribute-based encryption (CP-ABE) is popularly used to implement secure and accurate access control of disseminated information in vehicular ad hoc networks (VANETs). Nevertheless, how to improve the policy generation of CP-ABE for accurate information dissemination in the dynamic VANETs remains a challenge, as there are several access control policies rising from moving vehicles and road side units (RSUs) with different sensing boarder regarding to a specific event, such as moving vehicles and road side units (RSUs). To solve this problem, this paper proposes a reinforcement learning-based information dissemination policy generation scheme in VANETs, named RLID-V. The scheme firstly combines multiple attribute-based access control policies and resolves policy conflicts between vehicles and RSUs. Then, a manual feedback policy construction method is designed by applying decision tree to the collected feedback from all receivers. Finally, we employ reinforcement learning to dynamically update the confidence weights of different policy sources. The experiments are conducted in two classic VANETs scenarios, traffic guidance and accident warning, demonstrating that RLID-V achieves better performance in the accuracy and effectiveness of information dissemination compared with three existing schemes. Otherwise, RLID-V outperforms the compared schemes in robustness with 20% error feedback and takes a negligible cost of less than 1% of the overall delay overhead for policy generation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available