4.6 Article

Deep Learning-Based Automated Lip-Reading: A Survey

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 121184-121205

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3107946

Keywords

Feature extraction; Lips; Videos; Speech recognition; Neural networks; Visualization; Speech; Visual speech recognition; lip-reading; deep learning; feature extraction; classification; computer vision; natural language processing

Funding

  1. Chinasoft International Ltd.
  2. London South Bank University

This paper surveys automated lip-reading approaches, focusing on deep learning methods. It compares the components of automated lip-reading systems, highlights the advantages of Convolutional Neural Networks for feature extraction and of Attention-Transformers and Temporal Convolutional Networks for classification, compares the classification schemas used for lip-reading, and reviews the most recent lip-reading systems.
This paper presents a survey of automated lip-reading approaches, focusing mainly on deep-learning-related methodologies, which have proven more fruitful for both feature extraction and classification. The survey also compares the components that make up automated lip-reading systems, including the audio-visual databases, feature extraction, classification networks and classification schemas. The main contributions and unique insights of this survey are: 1) a comparison of Convolutional Neural Networks with other neural network architectures for feature extraction; 2) a critical review of the advantages of Attention-Transformers and Temporal Convolutional Networks over Recurrent Neural Networks for classification; 3) a comparison of the different classification schemas used for lip-reading, including ASCII characters, phonemes and visemes; and 4) a review of the most up-to-date lip-reading systems as of early 2021.
