4.6 Article

Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Artificial Intelligence

A review of speaker diarization: Recent advances with deep learning

Tae Jin Park et al.

Summary: Speaker diarization is a task to label audio or video recordings with speaker identity classes, and with the advancement of deep learning technology, rapid progress has been made in this field, showing the importance of complementary relationship between speaker diarization and speech recognition.

COMPUTER SPEECH AND LANGUAGE (2022)

Review Computer Science, Artificial Intelligence

Generative adversarial networks for speech processing: A review

Aamir Wali et al.

Summary: Generative adversarial networks (GANs) have shown remarkable progress in speech processing, especially in areas like speech synthesis, enhancement, and data augmentation. This paper reviews novel GAN-based frameworks and algorithms, provides an overview of common datasets and evaluation metrics used in speech GANs, and suggests future research directions and challenges.

COMPUTER SPEECH AND LANGUAGE (2022)

Review Computer Science, Artificial Intelligence

Deep learning for depression recognition with audiovisual cues: A review

Lang He et al.

Summary: As the pace of work and life accelerates, people are facing increasing pressure that can lead to depression. Deep Learning is being utilized to automatically detect depression by extracting cues from audio and video data. Research in automatic depression detection using DL faces challenges but shows promising directions.

INFORMATION FUSION (2022)

Article Computer Science, Artificial Intelligence

Ambient acoustic event assistive framework for identification, detection, and recognition of unknown acoustic events of a residence

Sharnil Pandya et al.

Summary: Smart living, a subset of Ambient Assisted Living, utilizes the latest technologies and intelligent processes to enable residents to live independently with a virtual companion 24 x 7. This approach not only saves critical energy resources but also improves residents' quality of life.

ADVANCED ENGINEERING INFORMATICS (2021)

Article Acoustics

FluentNet: End-to-End Detection of Stuttered Speech Disfluencies With Deep Learning

Tedd Kourkounakis et al.

Summary: FluentNet is a deep neural network capable of detecting and recognizing various types of stuttering effectively, utilizing a Squeeze-and-Excitation Residual convolutional neural network and bidirectional long short-term memory layers. Through experiments on the UCLASS dataset and LibriStutter dataset, FluentNet demonstrates strong performance and outperforms other solutions in the field.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Employing PCA and t-statistical approach for feature extraction and classification of emotion from multichannel EEG signal

Md Asadur Rahman et al.

EGYPTIAN INFORMATICS JOURNAL (2020)

Article Computer Science, Artificial Intelligence

Effect of speech segment samples selection in stutter block detection and remediation

Pierre Arbajian et al.

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS (2019)

Article Acoustics

Dysarthric speech classification from coded telephone speech using glottal features

N. P. Narendra et al.

SPEECH COMMUNICATION (2019)

Article Engineering, Electrical & Electronic

Parameterization of Excitation Signal for Improving the Quality of HMM-Based Speech Synthesis System

N. P. Narendra et al.

CIRCUITS SYSTEMS AND SIGNAL PROCESSING (2017)

Article Engineering, Biomedical

Speech rate estimation in disordered speech based on spectral landmark detection

Hernandez-Diaz Huici et al.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2016)

Article Computer Science, Artificial Intelligence

Gaussian Mixture Model Based Classification of Stuttering Dysfluencies

P. Mahesha et al.

JOURNAL OF INTELLIGENT SYSTEMS (2016)

Article Engineering, Electrical & Electronic

Efficient One-Pass Decoding with NNLM for Speech Recognition

Yongzhe Shi et al.

IEEE SIGNAL PROCESSING LETTERS (2014)

Article Computer Science, Artificial Intelligence

Learning by abstraction: Hierarchical classification model using evidential theoretic approach and Bayesian ensemble model

Mahdi Pakdaman Naeini et al.

NEUROCOMPUTING (2014)

Article Acoustics

Robust Speaker Identification in Noisy and Reverberant Conditions

Xiaojia Zhao et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2014)

Article Computer Science, Artificial Intelligence

Hierarchical ANN system for stuttering identification

Izabela Swietlicka et al.

COMPUTER SPEECH AND LANGUAGE (2013)

Article Computer Science, Artificial Intelligence

Two-stage intonation modeling using feedforward neural networks for syllable based text-to-speech synthesis

V. Ramu Reddy et al.

COMPUTER SPEECH AND LANGUAGE (2013)

Article Automation & Control Systems

Self-Adjustable Neural Network for speech recognition

Hua-Nong Ting et al.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2013)

Article Engineering, Electrical & Electronic

Phoneme recognition using zerocrossing interval distribution of speech patterns and ANN

R. Kumar et al.

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY (2013)

Article Engineering, Electrical & Electronic

Time-domain non-linear feature parameter for consonant classification

T. M. Thasleema et al.

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY (2012)

Article Engineering, Biomedical

Automatic detection of voice impairments from text-dependent running speech

J. I. Godino-Llorente et al.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2009)

Article Computer Science, Artificial Intelligence

Classification of audio signals using SVM and RBFNN

P. Dhanalakshmi et al.

EXPERT SYSTEMS WITH APPLICATIONS (2009)

Article Computer Science, Artificial Intelligence

Speech nonfluency detection using Kohonen networks

Izabela Szczurowska et al.

NEURAL COMPUTING & APPLICATIONS (2009)

Article Computer Science, Artificial Intelligence

Text-dependent speaker recognition using wavelets and neural networks

Chee Peng Lim et al.

SOFT COMPUTING (2007)

Article Computer Science, Interdisciplinary Applications

A classification technique based on radial basis function neural networks

H Sarimveis et al.

ADVANCES IN ENGINEERING SOFTWARE (2006)

Article Engineering, Biomedical

Pathological voice quality assessment using artificial neural networks

RT Ritchings et al.

MEDICAL ENGINEERING & PHYSICS (2002)

Article Audiology & Speech-Language Pathology

Individual and consensus judgments of disfluency types in the speech of persons who stutter

AK Cordes

JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH (2000)