☆ 4.7 Article

Dumodds: Dual modeling approach for drowsiness detection based on spatial and spatio-temporal features

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

Journal

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Volume 119, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.engappai.2022.105759

Keywords

Spatio-temporal feature; TransGAN; YOLOv3; Temporal feature; LSTM

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Road accidents have become a significant problem due to driver's drowsy behavior. To tackle this issue, a reliable system is necessary. In this study, a large drowsiness video dataset from the University of Texas was analyzed. Two models, Model-A for temporal features and Model-B for spatiotemporal characteristics, were created. Despite having lower accuracy, Model-A showed superiority in terms of training period compared to Model-B.

Road accidents have been a significant problem in recent years. As per statistics, this is primarily due to the driver's drowsy behavior. As an impact, many valuable lives have been lost in road accidents. So, a reliable system is required to overcome this issue. As part of this meticulous analysis, we have chosen a sizable realistic drowsiness video dataset created by the University of Texas. After that, we picked just the extreme classes of videos, such as alert and drowsy, from this dataset. Then, we created two distinct models, namely Model -A for temporal features and Model-B for spatiotemporal characteristics. In the first model, computer vision techniques, i.e., YOLOv3, are used to retrieve temporal characteristics, then processed using long short-term memory (LSTM). Here, we suited the occlusion issue by imposing a condition on each frame. The overfitting problem arises when occluded frames are discarded during this procedure. This issue is handled with the help of TransGAN's augmentation approach. The second model, on the other hand, extracts spatial information using a convolution neural network (CNN) called InceptionV3, which is subsequently processed using LSTM. Even though Model-A is more complicated and has lower accuracy, i.e., 86%, than Model-B, with an accuracy of 97.5%, the investigation reveals that Model-A seems much superior to Model-B regarding the training period. These differences are emphasized through the AUC-ROC score and confusion metrics.

Dumodds: Dual modeling approach for drowsiness detection based on spatial and spatio-temporal features

Journal

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Dumodds: Dual modeling approach for drowsiness detection based on spatial and spatio-temporal features

Journal

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper