☆ 4.6 Article

Improved TOPSIS method for peak frame selection in audio-video human emotion recognition

MULTIMEDIA TOOLS AND APPLICATIONS (2019)

期刊

MULTIMEDIA TOOLS AND APPLICATIONS

卷 78, 期 5, 页码 6277-6308

出版社

SPRINGER

DOI: 10.1007/s11042-018-6402-x

关键词

Face recognition; Improved TOPSIS method; Peak frame selection; Audio-video emotion recognition

类别

Computer Science, Information Systems Computer Science, Software Engineering Computer Science, Theory & Methods Engineering, Electrical & Electronic

资金

University Grant Commission (UGC), Ministry of Human Resource Development (MHRD) of India [F. 25-1/2013-14(BSR)/7-379/2012(BSR)]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The peak frame selection with corresponding voice segment identification is a challenging problem in the audio-video human emotion recognition. The peak frame is a most relevant descriptor of facial expression that can be inferred from varied emotional states. In this paper, an improved Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) is proposed to select the key frame based on facial action units co-occurrence behavior in the visual sequences. The proposed method utilizes the experts judgments while identifying the peak frame in video modality. It locates the peak voiced segment in audio modality using synchronous and asynchronous temporal relationship with selected peak visual frame. The facial action unit features of peak frame are fused with nine statistical characteristics of spectral features of the voiced segment. The weighted product rule-based decision level fusion is performed to combine the posterior probabilities of two independent (i.e., audio, and video) support vector machines based classification models. The performance of the proposed peak frame and voiced segment selection method is evaluated and compared with the existing Maximum-Dissimilarity (MAX-DIST), Dendrogram-Clustering (DEND-CLUSTER), and Emotion Intensity (EIFS) based peak frame selection methods on two challenging emotion datasets in two different languages namely eNTERFACE'05 in English and BAUM-1a in Turkish. The results show that the system with the proposed method has performed better than the existing techniques, and it achieved 88.03%, and 84.61% emotion recognition accuracies on the eNTERFACE'05 and BAUM-1a datasets respectively.

Improved TOPSIS method for peak frame selection in audio-video human emotion recognition

期刊

MULTIMEDIA TOOLS AND APPLICATIONS

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Improved TOPSIS method for peak frame selection in audio-video human emotion recognition

期刊

MULTIMEDIA TOOLS AND APPLICATIONS

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文