4.6 Article

Automatic recognition of schizophrenia from facial videos using 3D convolutional neural network

Journal

ASIAN JOURNAL OF PSYCHIATRY
Volume 77, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.ajp.2022.103263

Keywords

Schizophrenia; Facial video; Convolutional neural network; Deep learning

Categories

Funding

  1. Beijing Natural Science Foundation
  2. Beijing Municipal Science & Technology Commission
  3. Ascent Plan
  4. [7202072]
  5. [Z191100006619104]
  6. [D171100007017002]
  7. [DFL20192001]

Ask authors/readers for more resources

This study proposes a rapid detection method for schizophrenia based on deep learning and facial videos. The results show that the method can differentiate between healthy controls and schizophrenic patients by analyzing changes in facial area, providing assistance for diagnosis in a clinical setting.
Schizophrenia affects patients and their families and society because of chronic impairments in cognition, behavior, and emotion. However, its clinical diagnosis mainly depends on the clinicians' knowledge of the pa-tients' symptoms. Other auxiliary diagnostic methods such as MRI and EEG are cumbersome and time-consuming. Recently, the convolutional neural network (CNN) has been applied to the auxiliary diagnosis of psychiatry. Hence, in this study, a method based on deep learning and facial videos is proposed for the rapid detection of schizophrenia. Herein, 125 videos from 125 schizophrenic patients and 75 videos from 75 healthy controls based on emotional stimulation tasks were obtained. The video preprocessing included the experiment clips extraction, face detection, facial region cropping, resizing to 500 x 500 pixel size, and uniform sampling of 100 frames. The preprocessed facial videos were used to train the Resnet18_3D. We utilized ten-fold cross -validation, and held-out testing set to evaluate the model with the accuracy, the precision, the sensitivity, the specificity, the balanced accuracy, and the AUC. The Resnet18_3D trained on Film_order achieved the best performance with accuracy, sensitivity, specificity, balanced accuracy, and AUC of 89.00%, 96.80%, 76.00%, 86.40% and 0.9397. The neural network model indeed recognizes healthy controls and schizophrenic patients through the changes in the area of the face. The results show that facial video under emotional stimulation can be used to classify schizophrenic patients and help clinicians with diagnosis in the clinical environment. Among the different types of stimuli, the video stimuli with fixed emotional order showed the best classification performance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available