Related references
Proceedings Paper
Acoustics
Developing Real-Time Streaming Transformer Transducer for Speech Recognition on Large-Scale Dataset
Xie Chen et al.
Summary: This paper investigates Transformer Transducer (T-T) models for low-latency, fast decoding on large-scale datasets. It combines Transformer-XL with chunk-wise streaming processing and reports that the resulting model outperforms other models in streaming scenarios. Runtime cost and latency can be optimized with a relatively small look-ahead.
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) (2021)
Proceedings Paper
Computer Science, Information Systems
TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis
Zilong Wang et al.
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020) (2020)