Journal
APPLIED ACOUSTICS
Volume 148, Issue -, Pages 123-132Publisher
ELSEVIER SCI LTD
DOI: 10.1016/j.apacoust.2018.12.019
Keywords
Sound information retrieval; Environmental sound classification; Dilated convolutions
Categories
Funding
- National Natural Science Fund of China [61672332, 61322211, 61432011, U1435212, 11671006, 61603173, 61802238]
- Program for New Century Excellent Talents in University [NCET-12-1031]
- Program for the Outstanding Innovative Teams of Higher Learning Institutions of Shanxi
- Program for the Young San Jin Scholars of Shanxi
- National Key Basic Research and Development Program of China (973) [2013CB329404, 2013CB329502]
Ask authors/readers for more resources
In sound information retrieval (SIR) area, environmental sound classification (ESC) emerges as a new issue, which aims at classifying environments by analysing the complex features extracted from the various sound data. As one of the most efficient feature extraction methods, convolution neural networks (CNN) has made its success in speech and music signal processing, and in particular, CNN with pooling has worked effectively in classifying environmental and urban sound sources. However, pooling causes information loss. In this paper, dilated CNN, being introduced to ESC problem, achieves better results than that of CNN with max-pooling and other state-of-the-art approaches. At the same time, we explore the effect of different dilation rate and the number of layers of dilated convolution to the experimental results, and find that expanding the number of covered frames or enlarging the dilation rate will make the accuracy reduce. That may be the sound signal has short-term stability, the size of the overlay frame seriously affects the feature extraction of the sound signal, and there is an inherent gridding in the dilation model conjunction defect. (C) 2018 Elsevier Ltd. All rights reserved.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available