4.7 Article

Depression Level Prediction Using Deep Spatiotemporal Features and Multilayer Bi-LTSM

Journal

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING
Volume 13, Issue 2, Pages 864-870

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TAFFC.2020.2970418

Keywords

Depression; Feature extraction; Dynamics; Spatiotemporal phenomena; Histograms; Nonhomogeneous media; Optical imaging; Depression; deep spatiotemporal network; Inception-ResNet-v2 network; volume local directional number; bidirectional long short-term memory; temporal median pooling

Funding

  1. Institute for Information and communications Technology Promotion (IITP) - Korea government (MSIT) [2016-0-00406]

Ask authors/readers for more resources

This article proposes a framework for estimating depression levels from video data by using a two-stream deep spatiotemporal network. The approach extracts spatial and temporal information and shows promising performance compared to existing methods.
Depression is a serious psychiatric disorder that restricts an individuals ability to work properly in both their daily and professional lives. Usually, the diagnosis of depression often needs a thorough assessment by an expert. Recently, significant consideration has been given to automatic depression prediction for more reliable and efficient depression investigation. In this article, we propose a novel framework to estimate the depression level from video data by employing a two-stream deep spatiotemporal network. Our approach extracts spatial information using the Inception-ResNet-v2 network. In contrast, we introduce a volume local directional number (VLDN) based dynamic feature descriptor to capture facial motions. Then, the feature map obtained from the VLDN is fed into a convolutional neural network (CNN) to obtain more discriminative features. Additionally, we designed a multilayer bidirectional long short-term memory (Bi-LSTM) model to obtain temporal information by integrating the temporal median pooling (TMP) approach into the model. The TMP approach is employed on the temporal fragments of spatial and temporal features. Finally, extensive experimental analysis of two challenging datasets, AVEC2013 and AVEC2014, demonstrates that the proposed approach shows promising performance compared to the existing approaches for depression level prediction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available