Related references
Note: Only part of the references are listed.AST: Audio Spectrogram Transformer
Yuan Gong et al.
INTERSPEECH 2021 (2021)
ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio
Andrey Guzhov et al.
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) (2021)
ESResNet: Environmental Sound Classification Based on Visual Domain Models
Andrey Guzhov et al.
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2021)
Zero-Shot Audio Classification Via Semantic Embeddings
Huang Xie et al.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)
ZERO-SHOT AUDIO CLASSIFICATION BASED ON CLASS LABEL EMBEDDINGS
Huang Xie et al.
2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) (2019)
Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
Justin Salamon et al.
IEEE SIGNAL PROCESSING LETTERS (2017)
Look, Listen and Learn
Relja Arandjelovic et al.
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)
ESC: Dataset for Environmental Sound Classification
Karol J. Piczak
MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE (2015)
Attribute-Based Classification for Zero-Shot Visual Object Categorization
Christoph H. Lampert et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2014)