3.8 Proceedings Paper

Direction of Arrival Estimation of Moving Sound Sources using Deep Learning

出版社

IEEE
DOI: 10.1109/I2MTC48687.2022.9806668

关键词

-

资金

  1. Natural Sciences and Engineering Research Council of Canada

向作者/读者索取更多资源

In this paper, we evaluate the performance of a deep learning classification system for localization of moving sound sources, and investigate the effect of different hyperparameters and acoustic conditions. The study demonstrates the significance of hyperparameters and acoustic conditions in localizing high-speed sources.
Sound source localization is an important task for several applications and the use of deep learning for this task has recently become a popular research topic. While nearly all previous work has focused on static sound sources, in this paper we evaluate the performance of a deep learning classification system for localization of moving sound sources and we evaluate the effect of different hyperparameters and acoustic conditions. A feedforward neural network is used to estimate the direction of arrival of moving sound sources, with Short Time Fourier Transform input features. Diverse synthetic datasets are generated to represent different acoustic conditions, and hyperparameters are tested to determine which combination results in better direction-ofarrival detection for moving sources. We evaluate the performance of the different combinations in terms of precision and recall, in a multi-class multi-label classification framework, and we find that (1) the number of frequency bins and the reverberation time have a significant effect for localizing high-speed sources, and (2) precision and recall decay slowly at low speeds while dropping sharply at high speeds.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据