Related references
Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown
Silvan Heller et al.
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL (2022)
MARS: Learning Modality-Agnostic Representation for Scalable Cross-Media Retrieval
Yunbo Wang et al.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)
Fusion and Orthogonal Projection for Improved Face-Voice Association
Muhammad Saad Saeed et al.
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) (2022)
Disentangled Representation Learning for Cross-Modal Biometric Matching
Hailong Ning et al.
IEEE TRANSACTIONS ON MULTIMEDIA (2022)
Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association
Peisong Wen et al.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)
Learning Audio-Visual Correlations from Variational Cross-Modal Generation
Ye Zhu et al.
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) (2021)
An Overview of Image Caption Generation Methods
Haoran Wang et al.
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE (2020)
Modality-specific and shared generative adversarial network for cross-modal retrieval
Fei Wu et al.
PATTERN RECOGNITION (2020)
A study on deep learning spatiotemporal models and feature extraction techniques for video understanding
M. Suresha et al.
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL (2020)
On Learning Associations of Faces and Voices
Changil Kim et al.
COMPUTER VISION - ACCV 2018, PT V (2019)
The Modality-Specific Learning Style Hypothesis: A Mini-Review
Karoline Aslaksen et al.
FRONTIERS IN PSYCHOLOGY (2018)
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Arsha Nagrani et al.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)
Look, Listen and Learn
Relja Arandjelovic et al.
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)
Matching novel face and voice identity using static and dynamic facial images
Harriet M. J. Smith et al.
ATTENTION PERCEPTION & PSYCHOPHYSICS (2016)