期刊
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
卷 -, 期 -, 页码 8802-8806出版社
IEEE
DOI: 10.1109/ICASSP43922.2022.9746947
关键词
Sound event detection (SED); source separation (SS); multi-task learning (MTL); weakly supervised
In weakly supervised sound event detection, a novel multi-task learning method is proposed to utilize the prior knowledge of time-frequency masks for each sound event. The method treats SED as the main task and source separation as the auxiliary task, and incorporates regularization constraints using shared masks.
In weakly supervised sound event detection (SED), only coarse-grained labels are available, and thus the supervision information is quite limited. To fully utilize prior knowledge of the time-frequency masks of each sound event, we propose a novel multi-task learning (MTL) method that takes SED as the main task and source separation as the auxiliary task. For active events, we minimize the overlap of their masks as the segment loss to learn distinguishing features. For inactive events, the proposed method measures the activity of masks as silent loss to reduce the insertion error. The auxiliary source separation task calculates an extra penalty according to the shared masks, which can further incorporate prior knowledge in the form of regularization constraints. We demonstrated that the proposed method can effectively reduce the insertion error and achieve a better performance in SED task than single-task methods.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据