☆ 4.6 Article

Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition

SENSORS (2022)

相关参考文献

注意：仅列出部分参考文献，下载原文获取全部文献信息。

Article Computer Science, Artificial Intelligence

A review of speaker diarization: Recent advances with deep learning

Tae Jin Park et al.

Summary: Speaker diarization is a task to label audio or video recordings with speaker identity classes, and with the advancement of deep learning technology, rapid progress has been made in this field, showing the importance of complementary relationship between speaker diarization and speech recognition.

COMPUTER SPEECH AND LANGUAGE (2022)

添加到收藏夹

Review Computer Science, Artificial Intelligence

Generative adversarial networks for speech processing: A review

Aamir Wali et al.

Summary: Generative adversarial networks (GANs) have shown remarkable progress in speech processing, especially in areas like speech synthesis, enhancement, and data augmentation. This paper reviews novel GAN-based frameworks and algorithms, provides an overview of common datasets and evaluation metrics used in speech GANs, and suggests future research directions and challenges.

COMPUTER SPEECH AND LANGUAGE (2022)

添加到收藏夹

Review Computer Science, Artificial Intelligence

Deep learning for depression recognition with audiovisual cues: A review

Lang He et al.

Summary: As the pace of work and life accelerates, people are facing increasing pressure that can lead to depression. Deep Learning is being utilized to automatically detect depression by extracting cues from audio and video data. Research in automatic depression detection using DL faces challenges but shows promising directions.

INFORMATION FUSION (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Ambient acoustic event assistive framework for identification, detection, and recognition of unknown acoustic events of a residence

Sharnil Pandya et al.

Summary: Smart living, a subset of Ambient Assisted Living, utilizes the latest technologies and intelligent processes to enable residents to live independently with a virtual companion 24 x 7. This approach not only saves critical energy resources but also improves residents' quality of life.

ADVANCED ENGINEERING INFORMATICS (2021)

添加到收藏夹

Article Acoustics

FluentNet: End-to-End Detection of Stuttered Speech Disfluencies With Deep Learning

Tedd Kourkounakis et al.

Summary: FluentNet is a deep neural network capable of detecting and recognizing various types of stuttering effectively, utilizing a Squeeze-and-Excitation Residual convolutional neural network and bidirectional long short-term memory layers. Through experiments on the UCLASS dataset and LibriStutter dataset, FluentNet demonstrates strong performance and outperforms other solutions in the field.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Employing PCA and t-statistical approach for feature extraction and classification of emotion from multichannel EEG signal

Md Asadur Rahman et al.

EGYPTIAN INFORMATICS JOURNAL (2020)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Effect of speech segment samples selection in stutter block detection and remediation

Pierre Arbajian et al.

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS (2019)

添加到收藏夹

Article Acoustics

Dysarthric speech classification from coded telephone speech using glottal features

N. P. Narendra et al.

SPEECH COMMUNICATION (2019)

添加到收藏夹

Article Engineering, Electrical & Electronic

Parameterization of Excitation Signal for Improving the Quality of HMM-Based Speech Synthesis System

N. P. Narendra et al.

CIRCUITS SYSTEMS AND SIGNAL PROCESSING (2017)

添加到收藏夹

Article Engineering, Biomedical

Automatic classification of speech dysfluencies in continuous speech based on similarity measures and morphological image processing tools

Iman Esmaili et al.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2016)

添加到收藏夹

Article Engineering, Biomedical

Speech rate estimation in disordered speech based on spectral landmark detection

Hernandez-Diaz Huici et al.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2016)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Gaussian Mixture Model Based Classification of Stuttering Dysfluencies

P. Mahesha et al.

JOURNAL OF INTELLIGENT SYSTEMS (2016)

添加到收藏夹

Article Engineering, Electrical & Electronic

Efficient One-Pass Decoding with NNLM for Speech Recognition

Yongzhe Shi et al.

IEEE SIGNAL PROCESSING LETTERS (2014)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Learning by abstraction: Hierarchical classification model using evidential theoretic approach and Bayesian ensemble model

Mahdi Pakdaman Naeini et al.

NEUROCOMPUTING (2014)

添加到收藏夹

Article Acoustics

Robust Speaker Identification in Noisy and Reverberant Conditions

Xiaojia Zhao et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2014)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Hierarchical ANN system for stuttering identification

Izabela Swietlicka et al.

COMPUTER SPEECH AND LANGUAGE (2013)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Two-stage intonation modeling using feedforward neural networks for syllable based text-to-speech synthesis

V. Ramu Reddy et al.

COMPUTER SPEECH AND LANGUAGE (2013)

添加到收藏夹

Article Automation & Control Systems

Self-Adjustable Neural Network for speech recognition

Hua-Nong Ting et al.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2013)

添加到收藏夹

Article Engineering, Electrical & Electronic

Phoneme recognition using zerocrossing interval distribution of speech patterns and ANN

R. Kumar et al.

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY (2013)

添加到收藏夹

Article Engineering, Electrical & Electronic

Time-domain non-linear feature parameter for consonant classification

T. M. Thasleema et al.

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY (2012)

添加到收藏夹

Article Engineering, Biomedical

Automatic detection of voice impairments from text-dependent running speech

J. I. Godino-Llorente et al.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2009)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Classification of audio signals using SVM and RBFNN

P. Dhanalakshmi et al.

EXPERT SYSTEMS WITH APPLICATIONS (2009)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Speech nonfluency detection using Kohonen networks

Izabela Szczurowska et al.

NEURAL COMPUTING & APPLICATIONS (2009)

添加到收藏夹

Article Audiology & Speech-Language Pathology

Identification of children's stuttered and nonstuttered speech by highly experienced judges: Binary judgments and comparisons with disfluency-types definitions

Anne K. Bothe

JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH (2008)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Text-dependent speaker recognition using wavelets and neural networks

Chee Peng Lim et al.

SOFT COMPUTING (2007)

添加到收藏夹

Article Computer Science, Interdisciplinary Applications

A classification technique based on radial basis function neural networks

H Sarimveis et al.

ADVANCES IN ENGINEERING SOFTWARE (2006)

添加到收藏夹

Article Engineering, Biomedical

Pathological voice quality assessment using artificial neural networks

RT Ritchings et al.

MEDICAL ENGINEERING & PHYSICS (2002)

添加到收藏夹

Article Audiology & Speech-Language Pathology

Individual and consensus judgments of disfluency types in the speech of persons who stutter

AK Cordes

JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH (2000)

添加到收藏夹

© Peeref 2019-2024. All rights reserved.