4.6 Article

Towards Robust Waveform-Based Acoustic Models

Related references

Note: Only some of the references are listed.
Article Acoustics

Learning Waveform-Based Acoustic Models Using Deep Variational Convolutional Neural Networks

Dino Oglic et al.

Summary: This study explores the potential of stochastic neural networks for learning effective waveform-based acoustic models, combining deep convolutional neural networks with stochastic variational inference and reporting strong empirical results.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Frame-Level SpecAugment for Deep Convolutional Neural Networks in Hybrid ASR Systems

Xinwei Li et al.

Summary: Inspired by SpecAugment, a frame-level SpecAugment method (f-SpecAugment) is proposed to improve the performance of deep CNNs in hybrid HMM ASR systems. By applying transformations to each convolution window independently during training, f-SpecAugment reduces word error rate (WER) across different ASR tasks. It remains effective even with large amounts of training data, with benefits for deep CNNs comparable to doubling the amount of training data.

2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT) (2021)

Proceedings Paper Acoustics

Multi-Scale Octave Convolutions for Robust Speech Recognition

Joanna Rownicka et al.

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Multi-Modal Data Augmentation for End-to-End ASR

Adithya Renduchintala et al.

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES (2018)

Article Computer Science, Artificial Intelligence

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

Emmanuel Vincent et al.

COMPUTER SPEECH AND LANGUAGE (2017)

Article Acoustics

Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition

Yanmin Qian et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2016)

Article Acoustics

Data Augmentation for Deep Neural Network Acoustic Modeling

Xiaodong Cui et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2015)

Article Acoustics

An Overview of Noise-Robust Automatic Speech Recognition

Jinyu Li et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2014)

Article Engineering, Electrical & Electronic

Making Machines Understand Us in Reverberant Rooms

Takuya Yoshioka et al.

IEEE SIGNAL PROCESSING MAGAZINE (2012)

Article Acoustics

Combined Features and Kernel Design for Noise Robust Phoneme Classification Using Support Vector Machines

Jibran Yousafzai et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2011)