4.6 Article

Deep Learning-Based Automated Lip-Reading: A Survey

Related References

Note: Only some of the references are listed here; download the original article for the complete reference information.
Article Engineering, Electrical & Electronic

Lipreading with DenseNet and resBi-LSTM

Xuejuan Chen et al.

SIGNAL IMAGE AND VIDEO PROCESSING (2020)

Article Computer Science, Artificial Intelligence

Guided autoencoder for dimensionality reduction of pedestrian features

Xuan Li et al.

APPLIED INTELLIGENCE (2020)

Proceedings Paper Acoustics

LIPREADING USING TEMPORAL CONVOLUTIONAL NETWORKS

Brais Martinez et al.

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (2020)

Article Computer Science, Information Systems

Lip Reading Sentences Using Deep Learning With Only Visual Cues

Souheil Fenghour et al.

IEEE ACCESS (2020)

Article Computer Science, Information Systems

A Survey of Research on Lipreading Technology

Mingfeng Hao et al.

IEEE ACCESS (2020)

Article Computer Science, Artificial Intelligence

Lip reading with Hahn Convolutional Neural Networks

Abderrahim Mesbah et al.

IMAGE AND VISION COMPUTING (2019)

Proceedings Paper Engineering, Electrical & Electronic

LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild

Shuang Yang et al.

2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019) (2019)

Article Computer Science, Information Systems

Lip Reading Using Committee Networks With Two Different Types of Concatenated Frame Images

Dong-Won Jang et al.

IEEE ACCESS (2019)

Article Acoustics

A corpus of audio-visual Lombard speech with frontal and profile views

Najwa Alghamdi et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2018)

Review Computer Science, Artificial Intelligence

Survey on automatic lip-reading in the era of deep learning

Adriana Fernandez-Lopez et al.

IMAGE AND VISION COMPUTING (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Deep Lip Reading: a comparison of models and an online application

Triantafyllos Afouras et al.

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Improving Viseme Recognition using GAN-based Frontal View Mapping

Dario Augusto Borges Oliveira et al.

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

LCANet: End-to-End Lipreading with Cascaded Attention-CTC

Kai Xu et al.

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018) (2018)

Article Computer Science, Artificial Intelligence

An audio-visual corpus for multimodal automatic speech recognition

Andrzej Czyzewski et al.

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS (2017)

Article Computer Science, Information Systems

3D Convolutional Neural Networks for Cross Audio-Visual Matching Recognition

Amirsina Torfi et al.

IEEE ACCESS (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Lip Reading Sentences in the Wild

Joon Son Chung et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Editorial Material Computer Science, Artificial Intelligence

Visual units and confusion modelling for automatic lip-reading

Dominic Howell et al.

IMAGE AND VISION COMPUTING (2016)

Article Computer Science, Information Systems

TCD-TIMIT: An Audio-Visual Corpus of Continuous Speech

Naomi Harte et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2015)

Editorial Material Computer Science, Artificial Intelligence

A review of recent advances in visual speech decoding

Ziheng Zhou et al.

IMAGE AND VISION COMPUTING (2014)

Proceedings Paper Imaging Science & Photographic Technology

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data

Chris McCool et al.

2012 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW) (2012)

Article Computer Science, Artificial Intelligence

A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities

Yee Wan Wong et al.

PATTERN RECOGNITION LETTERS (2011)

Article Acoustics

Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition

George Papandreou et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2009)

Article Computer Science, Information Systems

Lipreading With Local Spatiotemporal Descriptors

Guoying Zhao et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2009)

Article Acoustics

An audio-visual corpus for speech perception and automatic speech recognition (L)

Martin Cooke et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2006)

Article Audiology & Speech-Language Pathology

Speechreading and its association with reading among deaf, hearing and dyslexic individuals

Tara Mohammed et al.

CLINICAL LINGUISTICS & PHONETICS (2006)

Article Acoustics

Audio-visual speech recognition using an infrared headset

J Huang et al.

SPEECH COMMUNICATION (2004)

Article Computer Science, Artificial Intelligence

Extraction of visual features for lipreading

I Matthews et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2002)

Article Computer Science, Information Systems

Audio-Visual Speech Modeling for Continuous Speech Recognition

Stephane Dupont et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2000)