Article

Environment Sound Classification Using a Two-Stream CNN Based on Decision-Level Fusion

Journal

Sensors
Volume 19, Issue 7

Publisher

MDPI
DOI: 10.3390/s19071733

Keywords

Auditory Cognition; Environment Sound Classification; Convolutional Neural Network; Dempster-Shafer Evidence Theory; Fusion Model

Funding

  1. China Scholarship Council [201606290083]
  2. National Science Foundation of China [61502391]
  3. Natural Science Basic Research Plan in Shaanxi Province of China [2017JM6043]

Abstract

With the popularity of deep learning-based models in various categorization problems and their proven robustness compared to conventional methods, a growing number of researchers have applied such methods to environment sound classification (ESC) tasks in recent years. However, the performance of existing models that train deep neural networks on auditory features such as the log-mel spectrogram (LM) and mel-frequency cepstral coefficients (MFCC), or on the raw waveform, remains unsatisfactory. In this paper, we first propose two combined features to give a more comprehensive representation of environment sounds. Then, a four-layer convolutional neural network (CNN) is presented to improve ESC performance with the proposed aggregated features. Finally, the CNNs trained on the different features are fused using the Dempster-Shafer evidence theory to compose the TSCNN-DS model. The experimental results indicate that our combined features with the four-layer CNN are well suited to environment sound classification problems and dramatically outperform other conventional methods. The proposed TSCNN-DS model achieves a classification accuracy of 97.2%, the highest reported on the UrbanSound8K dataset among existing models.
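The abstract's two combined features are built from the base auditory features it names, LM and MFCC. As a minimal sketch of how such features can be extracted, the snippet below uses librosa; the library choice, sample rate, and band counts are illustrative assumptions, not the paper's reported configuration.

```python
import librosa

def extract_base_features(path, sr=22050, n_mels=60, n_mfcc=40):
    """Extract the two base auditory features named in the abstract.

    The sample rate, mel-band count, and MFCC count here are
    illustrative assumptions, not the paper's reported settings.
    """
    y, sr = librosa.load(path, sr=sr)
    # Log-mel spectrogram (LM): mel power spectrogram on a dB scale.
    lm = librosa.power_to_db(
        librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    )
    # Mel-frequency cepstral coefficients (MFCC).
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return lm, mfcc  # each is (bands, frames); aggregation into the
                     # combined features follows the paper's recipe
```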
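For the four-layer CNN, a hedged PyTorch sketch is shown below: four convolutional layers followed by a linear classifier. The channel widths, kernel sizes, pooling scheme, and class count (10, matching UrbanSound8K) are assumptions for illustration; only the four-convolutional-layer depth comes from the abstract.

```python
import torch
import torch.nn as nn

class FourLayerCNN(nn.Module):
    """Four-convolutional-layer CNN for spectrogram-like inputs.

    Channel widths, kernel sizes, and pooling are illustrative
    assumptions; only the four-layer depth comes from the abstract.
    """
    def __init__(self, n_classes=10, in_channels=1):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(128, 128, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # global average pooling
        )
        self.classifier = nn.Linear(128, n_classes)

    def forward(self, x):          # x: (batch, 1, bands, frames)
        h = self.features(x).flatten(1)
        return self.classifier(h)  # logits; softmax these before fusion

# One such network is trained per feature stream; their softmax
# outputs are then fused at decision level (next sketch).
```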
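Decision-level fusion with the Dempster-Shafer evidence theory combines the two streams' per-class outputs by Dempster's rule of combination. Treating each stream's softmax vector as a basic probability assignment over singleton classes (an assumption about the paper's exact formulation), the rule reduces to an elementwise product renormalized by the non-conflicting mass, as sketched below.

```python
import numpy as np

def dempster_combine(m1, m2):
    """Fuse two basic probability assignments over the same singleton
    classes with Dempster's rule of combination.

    m1, m2: 1-D arrays of per-class masses (e.g., softmax outputs),
    each summing to 1. Returns the normalized fused mass vector.
    """
    m1, m2 = np.asarray(m1, float), np.asarray(m2, float)
    joint = m1 * m2                # agreement mass per class
    conflict = 1.0 - joint.sum()   # K: mass assigned to conflicts
    if np.isclose(conflict, 1.0):
        raise ValueError("total conflict: sources are incompatible")
    return joint / (1.0 - conflict)

# Example: fuse the two streams' softmax outputs for one audio clip.
p_stream1 = np.array([0.70, 0.20, 0.10])  # e.g., LM-feature stream
p_stream2 = np.array([0.60, 0.30, 0.10])  # e.g., MFCC-feature stream
fused = dempster_combine(p_stream1, p_stream2)
print(fused.round(3), fused.argmax())     # [0.857 0.122 0.02] -> class 0
```

In this singleton-only form, classes on which both streams agree are reinforced, while the division by 1 - K redistributes the mass lost to conflicting hypotheses.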


