4.7 Article

Environmental sound classification with dilated convolutions

Journal

APPLIED ACOUSTICS
Volume 148, Issue -, Pages 123-132

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apacoust.2018.12.019

Keywords

Sound information retrieval; Environmental sound classification; Dilated convolutions

Categories

Funding

  1. National Natural Science Fund of China [61672332, 61322211, 61432011, U1435212, 11671006, 61603173, 61802238]
  2. Program for New Century Excellent Talents in University [NCET-12-1031]
  3. Program for the Outstanding Innovative Teams of Higher Learning Institutions of Shanxi
  4. Program for the Young San Jin Scholars of Shanxi
  5. National Key Basic Research and Development Program of China (973) [2013CB329404, 2013CB329502]

Ask authors/readers for more resources

In sound information retrieval (SIR) area, environmental sound classification (ESC) emerges as a new issue, which aims at classifying environments by analysing the complex features extracted from the various sound data. As one of the most efficient feature extraction methods, convolution neural networks (CNN) has made its success in speech and music signal processing, and in particular, CNN with pooling has worked effectively in classifying environmental and urban sound sources. However, pooling causes information loss. In this paper, dilated CNN, being introduced to ESC problem, achieves better results than that of CNN with max-pooling and other state-of-the-art approaches. At the same time, we explore the effect of different dilation rate and the number of layers of dilated convolution to the experimental results, and find that expanding the number of covered frames or enlarging the dilation rate will make the accuracy reduce. That may be the sound signal has short-term stability, the size of the overlay frame seriously affects the feature extraction of the sound signal, and there is an inherent gridding in the dilation model conjunction defect. (C) 2018 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available