4.6 Article

SampleCNN: End-to-End Deep Convolutional Neural Networks Using Very Small Filters for Music Classification

Journal

APPLIED SCIENCES-BASEL
Volume 8, Issue 1, Pages -

Publisher

MDPI
DOI: 10.3390/app8010150

Keywords

convolutional neural networks; music classification; raw waveforms; sample-level filters; downsampling; filter visualization; transfer learning

Funding

  1. Basic Science Research Program through the National Research Foundation of Korea - Ministry of Science, ICT & Future Planning [2015R1C1A1A02036962]
  2. Korea Advanced Institute of Science and Technology [G04140049]

Ask authors/readers for more resources

Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw waveform-based approach has been explored to directly learn hierarchical characteristics of audio. However, the majority of previous studies have limited their model capacity by taking a frame-level structure similar to short-time Fourier transforms. We previously proposed a CNN architecture which learns representations using sample-level filters beyond typical frame-level input representations. The architecture showed comparable performance to the spectrogram-based CNN model in music auto-tagging. In this paper, we extend the previous work in three ways. First, considering the sample-level model requires much longer training time, we progressively downsample the input signals and examine how it affects the performance. Second, we extend the model using multi-level and multi-scale feature aggregation technique and subsequently conduct transfer learning for several music classification tasks. Finally, we visualize filters learned by the sample-level CNN in each layer to identify hierarchically learned features and show that they are sensitive to log-scaled frequency.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available