4.3 Article

A New Amharic Speech Emotion Dataset and Classification Benchmark

Related references

Note: Only part of the references are listed.
Article Computer Science, Artificial Intelligence

A classification benchmark for Arabic alphabet phonemes with diacritics in deep neural networks

Sir Eiad Almekhlafi et al.

Summary: This paper focuses on the classification of Arabic alphabet phonemes and introduces a new dataset called AAPD. Various classification systems using different feature extraction techniques and deep neural networks are built and compared based on AAPD. The experimental results show that MFCC is the most effective feature extraction method, and the proposed VGG-based model achieves the highest accuracy with the least computational load.

COMPUTER SPEECH AND LANGUAGE (2022)

Article Engineering, Biomedical

Sentiment analysis in non-fixed length audios using a Fully Convolutional Neural Network

Maria Teresa Garcia-Ordas et al.

Summary: The study introduces a sentiment analysis method that can process audio of any length, using Mel spectrogram and Mel Frequency Cepstral Coefficients as audio description methods, and a Fully Convolutional Neural Network architecture as classifier. The results, validated on three well-known datasets, show promising performance surpassing existing methods, and the method's ability to analyze sentiment in near real time is particularly useful for a wide range of fields such as call centers, medical consultations, and financial brokers.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2021)

Article Automation & Control Systems

A novel dual attention-based BLSTM with hybrid features in speech emotion recognition

Qiupu Chen et al.

Summary: Although emotional state does not change the content of language, it plays a significant role in human communication by providing positive feedback. The proposed dual attention-BLSTM architecture helps in recognizing speech emotion effectively and improves performance compared to baseline methods. The experiments on IEMOCAP databases demonstrate the ability of the designed architecture to better distinguish emotional features.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2021)

Proceedings Paper Computer Science, Artificial Intelligence

AN EXPLORATION OF LOG-MEL SPECTROGRAM AND MFCC FEATURES FOR ALZHEIMER'S DEMENTIA RECOGNITION FROM SPONTANEOUS SPEECH

Amit Meghanani et al.

Summary: In this study, deep neural networks were used to identify and predict scores for patients with Alzheimer's disease. The results indicate that log-Mel spectrograms and MFCC features are effective in solving the AD recognition problem.

2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT) (2021)

Article Engineering, Electrical & Electronic

Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference

Sudarsana Reddy Kadin et al.

CIRCUITS SYSTEMS AND SIGNAL PROCESSING (2020)

Article Engineering, Biomedical

Speech emotion recognition with deep convolutional neural networks

Dias Issa et al.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2020)

Article Chemistry, Multidisciplinary

Amharic OCR: An End-to-End Learning

Birhanu Belay et al.

APPLIED SCIENCES-BASEL (2020)

Article Computer Science, Information Systems

Clustering-Based Speech Emotion Recognition by Incorporating Learned Features and Deep BiLSTM

Mustaqeem et al.

IEEE ACCESS (2020)

Review Computer Science, Information Systems

Speech Emotion Recognition Using Deep Learning Techniques: A Review

Ruhul Amin Khalil et al.

IEEE ACCESS (2019)

Review Computer Science, Hardware & Architecture

Speech Emotion Recognition Two Decades in a Nutshell, Benchmarks, and Ongoing Trends

Bjoern W. Schuller

COMMUNICATIONS OF THE ACM (2018)

Article Acoustics

Speech Emotion Recognition for Performance Interaction

Nikolaos Vryzas et al.

JOURNAL OF THE AUDIO ENGINEERING SOCIETY (2018)

Article Computer Science, Artificial Intelligence

CHEAVD: a Chinese natural emotional audio-visual database

Ya Li et al.

JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING (2017)

Article Computer Science, Interdisciplinary Applications

IEMOCAP: interactive emotional dyadic motion capture database

Carlos Busso et al.

LANGUAGE RESOURCES AND EVALUATION (2008)