Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

IEEE SIGNAL PROCESSING LETTERS (2017)

Journal

IEEE SIGNAL PROCESSING LETTERS

Volume 24, Issue 3, Pages 279-283

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/LSP.2017.2657381

Keywords

Deep convolutional neural networks (CNNs); deep learning; environmental sound classification; urban sound dataset

Category

Engineering, Electrical & Electronic

Funding

NSF [1544753]
National Science Foundation, Directorate for Computer and Information Science and Engineering, Division of Computer and Network Systems [1544753]

Abstract

The ability of deep convolutional neural networks (CNNs) to learn discriminative spectro-temporal patterns makes them well suited to environmental sound classification. However, the relative scarcity of labeled data has impeded the exploitation of this family of high-capacity models. This study has two primary contributions: first, we propose a deep CNN architecture for environmental sound classification. Second, we propose the use of audio data augmentation for overcoming the problem of data scarcity and explore the influence of different augmentations on the performance of the proposed CNN architecture. Combined with data augmentation, the proposed model produces state-of-the-art results for environmental sound classification. We show that the improved performance stems from the combination of a deep, high-capacity model and an augmented training set: this combination outperforms both the proposed CNN without augmentation and a shallow dictionary learning model with augmentation. Finally, we examine the influence of each augmentation on the model's classification accuracy for each class, and observe that the accuracy for each class is influenced differently by each augmentation, suggesting that the performance of the model could be improved further by applying class-conditional data augmentation.
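To make the idea of audio data augmentation concrete, here is a minimal sketch of how a labeled clip might be expanded into several deformed copies that keep the same label. The abstract does not list the specific augmentations or their parameters, so everything below is an assumption made for illustration: the function names (`add_noise`, `change_speed`, `augment`), the speed rates, the 20 dB signal-to-noise ratio, and the 8 kHz sample rate are all hypothetical. The speed change is a naive linear-interpolation resampling, which alters pitch along with duration, so it is not a true time stretch.

```python
import math
import random


def add_noise(signal, snr_db, rng):
    """Mix white Gaussian noise into `signal` at roughly the given SNR (dB)."""
    power = sum(x * x for x in signal) / len(signal)
    noise_power = power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def change_speed(signal, rate):
    """Resample by linear interpolation; rate > 1 shortens the clip.

    Naive resampling: pitch shifts together with duration.
    """
    n_out = max(1, int(round(len(signal) / rate)))
    out = []
    for i in range(n_out):
        # Clamp the read position to the last valid sample index.
        pos = min(i * rate, len(signal) - 1)
        lo = int(pos)
        hi = min(lo + 1, len(signal) - 1)
        frac = pos - lo
        out.append((1 - frac) * signal[lo] + frac * signal[hi])
    return out


def augment(signal, rng):
    """Return the original clip plus a few deformed copies (same label)."""
    copies = [signal]
    for rate in (0.9, 1.1):  # hypothetical rates, not taken from the paper
        copies.append(change_speed(signal, rate))
    copies.append(add_noise(signal, snr_db=20.0, rng=rng))  # hypothetical SNR
    return copies


# One second of a 440 Hz tone at an assumed 8 kHz sample rate.
sr = 8000
clip = [math.sin(2 * math.pi * 440 * t / sr) for t in range(sr)]
augmented = augment(clip, random.Random(0))
print([len(a) for a in augmented])
```

Each training clip yields four examples here instead of one. A class-conditional variant, as the abstract suggests, could choose which deformations to apply based on the clip's label rather than applying the same set to every class.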
