4.7 Article

Hierarchical attention network with progressive feature fusion for facial expression recognition

Related references

Note: Only part of the references are listed.
Article Computer Science, Information Systems

Patch attention convolutional vision transformer for facial expression recognition with occlusion

Chang Liu et al.

Summary: A Patch Attention Convolutional Vision Transformer (PACVT) is proposed to tackle the occlusion problem in Facial Expression Recognition (FER). It extracts local and global features from facial patches and uses self-attention to focus on salient patches with discriminative features. Experimental results demonstrate the superiority of PACVT in occlusion FER.

INFORMATION SCIENCES (2023)

Article Computer Science, Artificial Intelligence

SSA-ICL: Multi-domain adaptive attention with intra-dataset continual learning for Facial expression recognition

Hongxiang Gao et al.

Summary: Facial expression recognition (FER) is a method of identifying emotions in facial photographs. Despite progress, three major challenges remain unaddressed: the interaction between spatial action units, inadequate semantic information about spectral expressions, and unbalanced data distribution. In this work, we propose SSA-ICL, a novel approach that solves these challenges by integrating spectral semantics with spatial locations and using an Intra-dataset Continual Learning (ICL) module to address data distribution.

NEURAL NETWORKS (2023)

Article Engineering, Electrical & Electronic

Toward Facial Expression Recognition in the Wild via Noise-Tolerant Network

Yu Gu et al.

Summary: This paper proposes a simple but effective Facial Expression Noise-tolerant Network (FENN) that explores inter-class correlations to mitigate the ambiguity of facial expressions. FENN leverages a multivariate normal distribution to model such correlations and suppresses the heteroscedastic uncertainty caused by inter-class label noise. Additionally, FENN utilizes a feature-noise mitigation module to extract compact intra-class feature representations and preserve the intrinsic inter-class relationships.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Information Systems

Enhanced discriminative global-local feature learning with priority for facial expression recognition

Ziyang Zhang et al.

Summary: This study proposes enhanced discriminative global-local feature learning with priority (EDGL-FLP), which focuses on feature extraction without auxiliary information and feature fusion based on priority. EDGL-FLP achieves state-of-the-art performance on the FER benchmarks RAF-DB, SFEW, AffectNet, FED-RO, and MMI with accuracies of 89.63%, 62.31%, 61.09%, 71.42%, and 86.47%, respectively. Thus, EDGL-FLP is robust for both in-the-wild and in-the-lab FER datasets.

INFORMATION SCIENCES (2023)

Article Computer Science, Artificial Intelligence

Learning discriminative feature representation with pixel-level supervision for forest smoke recognition

Huanjie Tao et al.

Summary: This paper proposes a pixel-level supervision neural network (PSNet) for forest smoke recognition, which effectively separates and accurately recognizes smoke and background through pixel-level supervision information and detail-difference-aware module. Experimental results show that our method achieves better performance in smoke recognition and smoke component separation.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Probabilistic Attribute Tree Structured Convolutional Neural Networks for Facial Expression Recognition in the Wild

Jie Cai et al.

Summary: This paper proposes a novel Probabilistic Attribute Tree Convolutional Neural Network (PAT-CNN) for facial expression recognition. By explicitly modeling identity-related attributes, PAT-CNN effectively deals with large intra-class variations and achieves the best performance among state-of-the-art methods on multiple datasets. Impressively, a single model of PAT-CNN outperforms ensemble models of hundreds of CNNs.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Engineering, Electrical & Electronic

Triplet Loss With Multistage Outlier Suppression and Class-Pair Margins for Facial Expression Recognition

Weicheng Xie et al.

Summary: This paper proposes a new triplet loss method based on class-pair margins and multistage outlier suppression for facial expression recognition. This method can adapt to variations among different expression pairs and exclude difficult samples during triplet selection, improving the generalization performance of network features.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

Dynamic Facial Expression Recognition Under Partial Occlusion With Optical Flow Reconstruction

Delphine Poux et al.

Summary: Video facial expression recognition is a useful tool, but recognizing facial expressions in the presence of partial occlusion remains challenging. This paper proposes a new method based on an auto-encoder to reconstruct the occluded part of the face in the optical flow domain. Experimental results show that this method reduces the accuracy gap between occluded and unoccluded situations.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

High-Resolution Depth Maps Imaging via Attention-Based Hierarchical Multi-Modal Fusion

Zhiwei Zhong et al.

Summary: In this paper, a novel attention-based hierarchical multi-modal fusion (AHMF) network for guided depth map super-resolution (DSR) is proposed. The network effectively extracts and combines relevant information from low-resolution depth and high-resolution guidance, and leverages low-level spatial information and high-level structure information. Experimental results show that the approach outperforms state-of-the-art methods in terms of accuracy, speed, and memory efficiency.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

Adaptive Deep Disturbance-Disentangled Learning for Facial Expression Recognition

Delian Ruan et al.

Summary: This paper proposes a novel adaptive deep disturbance-disentangled learning (ADDL) method for effective facial expression recognition (FER). The method involves a two-stage learning process that identifies and separates disturbing factors from facial expression images. Experimental results show that the proposed method outperforms several state-of-the-art FER methods on multiple databases.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2022)

Article Computer Science, Information Systems

HistNet: Histogram-based convolutional neural network with Chi-squared deep metric learning for facial expression recognition

Hamid Sadeghi et al.

Summary: This paper proposes a deep histogram metric learning approach based on a Convolutional Neural Network (CNN) for facial expression recognition. By introducing a histogram calculation layer and a learnable matrix, the accuracy is improved and the proposed CNN outperforms state-of-the-art methods on multiple databases.

INFORMATION SCIENCES (2022)

Article Automation & Control Systems

Point Adversarial Self-Mining: A Simple Method for Facial Expression Recognition

Ping Liu et al.

Summary: This article presents a novel method, PASM, which enhances the accuracy of facial expression recognition by simulating human learning processes and leveraging stronger teachers and generating harder learning samples. This approach iteratively improves network capability through adaptive learning materials generation and teacher/student update.

IEEE TRANSACTIONS ON CYBERNETICS (2022)

Article Engineering, Electrical & Electronic

Adaptive Multilayer Perceptual Attention Network for Facial Expression Recognition

Hanwei Liu et al.

Summary: This paper proposes an adaptive multilayer perceptual attention network (AMP-Net) for facial expression recognition in complex real-world situations. AMP-Net extracts facial emotional features with different fine-grained features and improves robustness and generalizability.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

Improving the Facial Expression Recognition and Its Interpretability via Generating Expression Pattern-map

Jing Zhang et al.

Summary: This paper proposes a novel method for facial expression modeling, which focuses on accurately extracting expression-related regions and enhancing expression features. The proposed model consists of three submodules and achieves competitive performance compared with the state-of-the-art on four public datasets.

PATTERN RECOGNITION (2022)

Article Computer Science, Artificial Intelligence

Relation-Aware Facial Expression Recognition

Yifan Xia et al.

Summary: Research on facial expression recognition in the wild is challenging due to pose, occlusion, illumination, and intensity variations. This article proposes a relation-aware facial expression recognition method that captures the relationship between crucial regions and facial expressions, leading to improved recognition accuracy.

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

FLEPNet: Feature Level Ensemble Parallel Network for Facial Expression Recognition

Mohan Karnati et al.

Summary: With the development of deep learning, the research on facial expression recognition (FER) has gained significant attention. This study proposes a texture-based feature-level ensemble parallel network (FLEPNet) to address the challenges in FER, and experimental results demonstrate its effectiveness and reliability.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2022)

Article Computer Science, Artificial Intelligence

OAENet: Oriented attention ensemble for accurate facial expression recognition

Zhengning Wang et al.

Summary: This research proposes an oriented attention pseudo-siamese network for accurate facial expression recognition, which utilizes both global and local facial information. A direction-dependent attention mechanism is established to address the limitation of insufficient utilization of local information.

PATTERN RECOGNITION (2021)

Article Computer Science, Artificial Intelligence

Res2Net: A New Multi-Scale Backbone Architecture

Shang-Hua Gao et al.

Summary: This paper introduces a novel building block for CNNs, Res2Net, which represents multiscale features within one single residual block by constructing hierarchical residual-like connections. The Res2Net enhances the representation of multiscale features in various vision tasks and consistently outperforms baseline models in performance gains.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Article Computer Science, Information Systems

Facial expression recognition with grid-wise attention and visual transformer

Qionghao Huang et al.

Summary: This paper introduces a novel FER framework with two attention mechanisms for CNN-based models, showing superior performance in experiments, especially achieving 100% accuracy on the CK+ dataset.

INFORMATION SCIENCES (2021)

Article Computer Science, Artificial Intelligence

A nested U-shape network with multi-scale upsample attention for robust retinal vascular segmentation

Ruohan Zhao et al.

Summary: This paper introduces a new nested U-shaped attention network, NUA-Net, to enhance the robustness of vascular segmentation in retinal imaging by extending the U-shape segmentation network with a multi-scale upsample attention module. Experimental results demonstrate the network achieving high levels of accuracy, sensitivity, and specificity on multiple publicly available databases.

PATTERN RECOGNITION (2021)

Article Computer Science, Artificial Intelligence

Residual multi-task learning for facial landmark localization and expression recognition

Boyu Chen et al.

Summary: The paper introduces a residual multi-task learning framework for predicting facial landmark localization and expression recognition simultaneously, utilizing a residual learning module to strengthen the connection between the two tasks and addressing the lack of training data with multi-task labels in multi-task learning.

PATTERN RECOGNITION (2021)

Article Computer Science, Artificial Intelligence

Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild

Zengqun Zhao et al.

Summary: This paper proposed a global multi-scale and local attention network for facial expression recognition in the wild, achieving state-of-the-art results on several benchmarks.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Article Computer Science, Information Systems

Salient Object Detection by Fusing Local and Global Contexts

Qinghua Ren et al.

Summary: A novel salient object detection method is proposed in this paper, which utilizes contextual attention module to exploit pixel-level correlation information and achieves superior performance. Experimental results demonstrate that the proposed model outperforms most existing models on six public SOD datasets.

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

Article Computer Science, Artificial Intelligence

Facial Expression Recognition Using Frequency Neural Network

Yan Tang et al.

Summary: This paper presents a deep learning approach for facial expression recognition called frequency neural network (FreNet), which learns features in the frequency domain with efficient computation and spatial redundancy elimination. By utilizing learnable multiplication kernels, weight-shared multiplication kernels, and block sub-sampling, the proposed Block-FreNet achieves superior performance and reduces computational costs in facial expression recognition. The method fills a gap in frequency-based deep learning models for this task.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Deep Neural Network Augmentation: Generating Faces for Affect Analysis

Dimitrios Kollias et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)

Article Computer Science, Artificial Intelligence

Squeeze-and-Excitation Networks

Jie Hu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2020)

Article Computer Science, Artificial Intelligence

Advances in multimodal data fusion in neuroimaging: Overview, challenges, and novel orientation

Yu-Dong Zhang et al.

INFORMATION FUSION (2020)

Article Computer Science, Artificial Intelligence

Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition

Kai Wang et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)

Article Computer Science, Artificial Intelligence

Semantic Neighborhood-Aware Deep Facial Expression Recognition

Yongjian Fu et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)

Article Computer Science, Artificial Intelligence

AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild

Ali Mollahosseini et al.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2019)

Article Computer Science, Artificial Intelligence

Facial Landmark Detection: A Literature Survey

Yue Wu et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2019)

Article Computer Science, Artificial Intelligence

Deep multi-path convolutional neural network joint with salient region attention for facial expression recognition

Siyue Xie et al.

PATTERN RECOGNITION (2019)

Article Computer Science, Artificial Intelligence

Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism

Yong Li et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)

Article Computer Science, Artificial Intelligence

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition

Shan Li et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)