4.7 Article

Tag Propagation and Cost-Sensitive Learning for Music Auto-Tagging

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 23, 期 -, 页码 1605-1616

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2020.3001521

关键词

Training; Training data; Propagation losses; Multimedia systems; Tagging; Music; Social networking (online); Music auto-tagging; music information retrieval; tag propagation; cost-sensitive learning

资金

  1. Ministry of Science and Technology of Taiwan [106-2221-E-002-041-MY3, 108-2218-E-002-054]

向作者/读者索取更多资源

This study introduces a cost-sensitive tag propagation learning method that successfully improves the performance of three auto-tagging models. The cost-sensitive loss function helps reduce the impact of missing tags, and the artist music context is found to be more effective for tag propagation than other music contexts.
The performance of music auto-tagging depends on the quality of training data. In practice, the links between songs and tags in the manually labeled training data can be incorrect (false positive) or missing (false negative). In this paper, we propose a cost-sensitive tag propagation learning method to improve auto-tagging. Specifically, we exploit music context to determine similar songs and propagate tags between them. Both propagated tags and original tags are used to optimize the auto-tagging models, and cost-sensitivity is incorporated into the loss function to enhance the robustness by adjusting the weight of relevant (positive) links with respect to irrelevant (negative) links. The proposed method is tested on three auto-tagging models: 2D-CNN, CRNN, and SampleCNN. The Million Song Dataset is used for training, and four music contexts, artist, playlist, tag, and listener, are used for song similarity measurement. The experimental results show 1) The proposed method can successfully improve the performance of the three auto-tagging models, 2) The cost-sensitive loss function helps reduce the impact of missing tags, and 3) The artist music context is more powerful for tag propagation than the other three music contexts.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据