☆ 4.5 Article

FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients

BRAIN SCIENCES (2022)

Journal

BRAIN SCIENCES

Volume 12, Issue 12, Pages -

Publisher

MDPI

DOI: 10.3390/brainsci12121626

Keywords

facial expression recognition (FER); vision transformer (ViT); convolutional neural networks (CNNs); stroke; rehabilitation

Funding

Wuxi Municipal Health Commission Translational Medicine Research Project [ZH202102]
Scientific and Technological Innovation Plan of Shanghai STC (Shanghai Science & Technology Commission) [21511102605]
China National Nature Science Young Foundation (National Natural Science Foundation of China) [82102665]
Shanghai Sailing Program (Shanghai Science & Technology Commission) [21YF1404600]
National Key R&D Program of China (Ministry of Science and Technology of the People's Republic of China) [2018YFC2002301]
Key Subjects Construction Program of the Health System in Jing'an District (Shanghai Municipal Health Commission) [2021PY04]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

In this study, a lightweight facial expression recognition algorithm is proposed to automatically diagnose stroke patients' training motivations. The experiments show that the algorithm achieves high accuracy while keeping the model complexity low.

Early rehabilitation with the right intensity contributes to the physical recovery of stroke survivors. In clinical practice, physicians determine whether the training intensity is suitable for rehabilitation based on patients' narratives, training scores, and evaluation scales, which puts tremendous pressure on medical resources. In this study, a lightweight facial expression recognition algorithm is proposed to diagnose stroke patients' training motivations automatically. First, the properties of convolution are introduced into the Vision Transformer's structure, allowing the model to extract both local and global features of facial expressions. Second, the pyramid-shaped feature output mode in Convolutional Neural Networks is also introduced to reduce the model's parameters and calculation costs significantly. Moreover, a classifier that can better classify facial expressions of stroke patients is designed to improve performance further. We verified the proposed algorithm on the Real-world Affective Faces Database (RAF-DB), the Face Expression Recognition Plus Dataset (FER+), and a private dataset for stroke patients. Experiments show that the backbone network of the proposed algorithm achieves better performance than Pyramid Vision Transformer (PvT) and Convolutional Vision Transformer (CvT) with fewer parameters and Floating-point Operations Per Second (FLOPs). In addition, the algorithm reaches an 89.44% accuracy on the RAF-DB dataset, which is higher than other recent studies. In particular, it obtains an accuracy of 99.81% on the private dataset, with only 4.10M parameters.

FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients

Journal

BRAIN SCIENCES

Publisher

MDPI

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients

Journal

BRAIN SCIENCES

Publisher

MDPI

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper