4.6 Article

Adapting Multiple Distributions for Bridging Emotions from Different Speech Corpora

Related references

Note: Only part of the references are listed.
Article Computer Science, Artificial Intelligence

Cross-Database Micro-Expression Recognition: A Benchmark

Tong Zhang et al.

Summary: This paper discusses the challenges and importance of cross-database micro-expression recognition (CDMER) and contributes to this field by establishing an evaluation protocol, conducting benchmark experiments, and proposing a novel DA method.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Article Computer Science, Artificial Intelligence

Spontaneous Speech Emotion Recognition Using Multiscale Deep Convolutional LSTM

Shiqing Zhang et al.

Summary: This paper proposes a multiscale deep convolutional long short-term memory (LSTM) framework for spontaneous speech emotion recognition. By combining convolutional neural network (CNN) with LSTM on different lengths of spectrograms, the proposed method achieves efficient emotion recognition. Experimental results demonstrate the superior performance of this method on two challenging emotional datasets.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2022)

Article Acoustics

Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition

Cheng Lu et al.

Summary: In this paper, a novel domain invariant feature learning method is proposed for speaker-independent speech emotion recognition. The proposed method eliminates domain shifts caused by different speakers and learns speaker-invariant emotion features. Experimental results demonstrate the superiority of the proposed method in SER performance.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2022)

Article Computer Science, Information Systems

Exploiting the potentialities of features for speech emotion recognition

Dongdong Li et al.

Summary: Recent studies on speech signals have focused on emotional information and the importance of feature representation in speech emotion recognition (SER). Different combinations of features and models have a significant impact on SER performance, with the proposed ECFW method showing promising results in improving performance across different databases.

INFORMATION SCIENCES (2021)

Article Engineering, Electrical & Electronic

A Comprehensive Survey on Transfer Learning

Fuzhen Zhuang et al.

Summary: Transfer learning aims to improve the performance of target learners by transferring knowledge from related source domains, reducing the reliance on target-domain data. This survey aims to systematize and summarize existing research studies in order to help readers understand the current status and ideas in the area of transfer learning.

PROCEEDINGS OF THE IEEE (2021)

Article Acoustics

Learning deep multimodal affective features for spontaneous speech emotion recognition

Shiqing Zhang et al.

Summary: This paper proposes a new method for spontaneous speech emotion recognition using deep multimodal audio feature learning based on multiple deep convolutional neural networks. By combining three different types of audio inputs and a multi-CNN fusion network, significant improvement in emotion classification performance is achieved.

SPEECH COMMUNICATION (2021)

Article Computer Science, Artificial Intelligence

Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG)

John Gideon et al.

Summary: Automatic speech emotion recognition provides computers with important context for user understanding. While current methods often fail when applied to unseen datasets, recent research has focused on adversarial methods to create more generalized representations of emotional speech. The introduced Adversarial Discriminative Domain Generalization (ADDoG) method improves cross-dataset generalization by iteratively moving representations learned for each dataset closer to one another.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2021)

Article Computer Science, Artificial Intelligence

Deep Subdomain Adaptation Network for Image Classification

Yongchun Zhu et al.

Summary: This study introduces a deep subdomain adaptation network (DSAN) that aligns relevant subdomain distributions across different domains based on the local maximum mean discrepancy (LMMD). DSAN is simple but effective, does not require adversarial training, and converges quickly. It can be easily integrated into feedforward network models to achieve efficient adaptation via backpropagation.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021)

Proceedings Paper Acoustics

CROSS-CORPUS SPEECH EMOTION RECOGNITION USING JOINT DISTRIBUTION ADAPTIVE REGRESSION

Jiacheng Zhang et al.

Summary: This paper focuses on cross-corpus speech emotion recognition (SER) research, proposing a novel domain adaptation method called JDAR to alleviate the feature distribution difference between training and testing speech signals. Experimental results demonstrate that JDAR achieves satisfactory performance and outperforms most state-of-the-art subspace learning based DA methods on EmoDB, eNTERFACE, and CASIA speech databases.

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) (2021)

Article Computer Science, Artificial Intelligence

Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition

Peng Song

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2019)

Review Computer Science, Hardware & Architecture

Speech Emotion Recognition Two Decades in a Nutshell, Benchmarks, and Ongoing Trends

Bjoern W. Schuller

COMMUNICATIONS OF THE ACM (2018)

Article Computer Science, Artificial Intelligence

Deep visual domain adaptation: A survey

Mei Wang et al.

NEUROCOMPUTING (2018)

Article Engineering, Electrical & Electronic

3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition

Mingyi Chen et al.

IEEE SIGNAL PROCESSING LETTERS (2018)

Article Engineering, Electrical & Electronic

Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition

Jun Deng et al.

IEEE SIGNAL PROCESSING LETTERS (2017)

Article Engineering, Electrical & Electronic

Double sparse learning model for speech emotion recognition

Yuan Zong et al.

ELECTRONICS LETTERS (2016)

Article Engineering, Electrical & Electronic

Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition

Jun Deng et al.

IEEE SIGNAL PROCESSING LETTERS (2014)

Article Acoustics

On Acoustic Emotion Recognition: Compensating for Covariate Shift

Ali Hassan et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2013)

Article Computer Science, Artificial Intelligence

Domain Adaptation via Transfer Component Analysis

Sinno Jialin Pan et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS (2011)

Article Computer Science, Artificial Intelligence

Survey on speech emotion recognition: Features, classification schemes, and databases

Moataz El Ayadi et al.

PATTERN RECOGNITION (2011)

Article Computer Science, Artificial Intelligence

A Survey on Transfer Learning

Sinno Jialin Pan et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2010)

Article Computer Science, Artificial Intelligence

Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies

Bjoern Schuller et al.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2010)