☆ 4.7 Article

Environmental sound classification with dilated convolutions

APPLIED ACOUSTICS (2019)

Journal

APPLIED ACOUSTICS

Volume 148, Issue -, Pages 123-132

Publisher

ELSEVIER SCI LTD

DOI: 10.1016/j.apacoust.2018.12.019

Keywords

Sound information retrieval; Environmental sound classification; Dilated convolutions

Funding

National Natural Science Fund of China [61672332, 61322211, 61432011, U1435212, 11671006, 61603173, 61802238]
Program for New Century Excellent Talents in University [NCET-12-1031]
Program for the Outstanding Innovative Teams of Higher Learning Institutions of Shanxi
Program for the Young San Jin Scholars of Shanxi
National Key Basic Research and Development Program of China (973) [2013CB329404, 2013CB329502]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

In sound information retrieval (SIR) area, environmental sound classification (ESC) emerges as a new issue, which aims at classifying environments by analysing the complex features extracted from the various sound data. As one of the most efficient feature extraction methods, convolution neural networks (CNN) has made its success in speech and music signal processing, and in particular, CNN with pooling has worked effectively in classifying environmental and urban sound sources. However, pooling causes information loss. In this paper, dilated CNN, being introduced to ESC problem, achieves better results than that of CNN with max-pooling and other state-of-the-art approaches. At the same time, we explore the effect of different dilation rate and the number of layers of dilated convolution to the experimental results, and find that expanding the number of covered frames or enlarging the dilation rate will make the accuracy reduce. That may be the sound signal has short-term stability, the size of the overlay frame seriously affects the feature extraction of the sound signal, and there is an inherent gridding in the dilation model conjunction defect. (C) 2018 Elsevier Ltd. All rights reserved.

Environmental sound classification with dilated convolutions

Journal

APPLIED ACOUSTICS

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Environmental sound classification with dilated convolutions

Journal

APPLIED ACOUSTICS

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper