4.7 Article

Dumodds: Dual modeling approach for drowsiness detection based on spatial and spatio-temporal features

Journal

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.engappai.2022.105759

Keywords

Spatio-temporal feature; TransGAN; YOLOv3; Temporal feature; LSTM

Ask authors/readers for more resources

Road accidents have become a significant problem due to driver's drowsy behavior. To tackle this issue, a reliable system is necessary. In this study, a large drowsiness video dataset from the University of Texas was analyzed. Two models, Model-A for temporal features and Model-B for spatiotemporal characteristics, were created. Despite having lower accuracy, Model-A showed superiority in terms of training period compared to Model-B.
Road accidents have been a significant problem in recent years. As per statistics, this is primarily due to the driver's drowsy behavior. As an impact, many valuable lives have been lost in road accidents. So, a reliable system is required to overcome this issue. As part of this meticulous analysis, we have chosen a sizable realistic drowsiness video dataset created by the University of Texas. After that, we picked just the extreme classes of videos, such as alert and drowsy, from this dataset. Then, we created two distinct models, namely Model -A for temporal features and Model-B for spatiotemporal characteristics. In the first model, computer vision techniques, i.e., YOLOv3, are used to retrieve temporal characteristics, then processed using long short-term memory (LSTM). Here, we suited the occlusion issue by imposing a condition on each frame. The overfitting problem arises when occluded frames are discarded during this procedure. This issue is handled with the help of TransGAN's augmentation approach. The second model, on the other hand, extracts spatial information using a convolution neural network (CNN) called InceptionV3, which is subsequently processed using LSTM. Even though Model-A is more complicated and has lower accuracy, i.e., 86%, than Model-B, with an accuracy of 97.5%, the investigation reveals that Model-A seems much superior to Model-B regarding the training period. These differences are emphasized through the AUC-ROC score and confusion metrics.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available