4.7 Article

A new lateral geniculate nucleus pattern-based environmental sound classification using a new large sound dataset

Journal

APPLIED ACOUSTICS
Volume 196, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apacoust.2022.108897

Keywords

Environmental sound classification; Human auditory system; Human visual system; Sound classification; Hand-modeled learning method

Categories

Ask authors/readers for more resources

In this research, a highly accurate sound classification architecture is developed using a biologically inspired feature extraction function. A new environmental sound classification dataset is collected as a testbed and a hand-modeled sound classification model is proposed. The model achieves 93.34% classification accuracy on the collected dataset.
Background and purpose: One of the essential purposes of sound classification is to achieve similar/over classification ability of the human auditory system (HAS). A new dataset and a biologically inspired feature extraction function have been proposed to realize this aim. We have developed a highly accurate sound classification architecture using the proposed biological-inspired feature extraction function. Materials and methods: In this research, a new environmental sound classification (ESC) dataset has been collected as a testbed, and this dataset contains 5000 sounds with 50 classes. Moreover, the collected ESC sound dataset is balanced. A new hand-modeled sound classification model has been proposed to classify sounds of this dataset. This model consists of (i) feature generation using a new lateral geniculate nucleus pattern (LGNPat), statistical moments and discrete wavelet transform (DWT), (ii) loop-based (iterative) neighborhood component analysis (INCA) based feature selection, (iii) classification using the selected features by k nearest neighbors (kNN) with 10-fold cross-validation. The proposed sound classification architecture is an extendable model. In this aspect, a new generation of hand-modeled sound/onedimensional signal classification methods can be proposed. Results: The presented hand-modeled learning method was applied to the ESC dataset acquired, and our LGNPat-based model attained 93.34% classification. Conclusions: The computed 93.34% on the collected ESC dataset has demonstrated the success of our model. Moreover, the collected dataset has been publicly published. In this aspect, the published dataset can be used to improve advanced sound classification models. (C) 2022 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available