4.6 Article

Source Separation Using Dilated Time-Frequency DenseNet for Music Identification in Broadcast Contents

期刊

APPLIED SCIENCES-BASEL
卷 10, 期 5, 页码 -

出版社

MDPI
DOI: 10.3390/app10051727

关键词

source separation; DenseNet; broadcast contents; music identification; dilated convolution

资金

  1. Ministry of Culture, Sports and Tourism (MCST)
  2. Korea Copyright Commission
  3. Korea Creative Content Agency (KOCCA) [2018-MICRO-9500] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)
  4. National Research Foundation of Korea [21A20131600002] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

向作者/读者索取更多资源

We propose a source separation architecture using dilated time-frequency DenseNet for background music identification of broadcast content. We apply source separation techniques to the mixed signals of music and speech. For the source separation purpose, we propose a new architecture to add a time-frequency dilated convolution to the conventional DenseNet in order to effectively increase the receptive field in the source separation scheme. In addition, we apply different convolutions to each frequency band of the spectrogram in order to reflect the different frequency characteristics of the low- and high-frequency bands. To verify the performance of the proposed architecture, we perform singing-voice separation and music-identification experiments. As a result, we confirm that the proposed architecture produces the best performance in both experiments because it uses the dilated convolution to reflect wide contextual information.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据