4.7 Article

Label augmented and weighted majority voting for crowdsourcing

期刊

INFORMATION SCIENCES
卷 606, 期 -, 页码 397-409

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.05.066

关键词

Crowdsourcing learning; Label integration; Label augmentation; Label weighting; Majority voting

资金

  1. Science and Technology Project of Hubei Province-Unveiling System [2021BEC007]
  2. Industry-University-Research Innovation Funds for Chinese Universities [2020ITA05008]
  3. Foundation of Key Laboratory of Artificial Intelligence, Ministry of Education, P.R. China [AI2020002]

向作者/读者索取更多资源

Crowdsourcing is an efficient way to obtain multiple noisy labels for each unlabeled instance. Label integration methods aim to infer the true label of each instance from its multiple noisy labels. This paper proposes a novel label integration method called LAWMA, which improves the performance by augmenting and weighting the labels.
Crowdsourcing provides an efficient way to obtain multiple noisy labels from different crowd workers for each unlabeled instance. Label integration methods are designed to infer the unknown true label of each instance from its multiple noisy label set. We argue that when the label quality is higher than random classification, the more the number of labels, the better the performance of label integration methods. However, in real-world crowd-sourcing scenarios, each instance cannot obtain enough labels for saving costs. To solve this problem, this paper proposes a novel label integration method called label augmented and weighted majority voting (LAWMV). At first, LAWMV uses the K-nearest neighbors (KNN) algorithm to find each instance's K-nearest neighbors (including itself) and merges their multiple noisy label sets to obtain its augmented multiple noisy label set. Then, the labels from different neighbors are weighted by the distances and the label similarities between each instance and its neighbors. Finally, the integrated label of each instance is inferred by weighted majority voting (MV). The experimental results on 34 simulated and two real-world crowdsourced datasets show that LAWMV significantly outperforms all the other state-of-the-art label integration methods. (C) 2022 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据