4.6 Article

Attribute augmentation-based label integration for crowdsourcing

Journal

FRONTIERS OF COMPUTER SCIENCE
Volume 17, Issue 5, Pages -

Publisher

HIGHER EDUCATION PRESS
DOI: 10.1007/s11704-022-2225-z

Keywords

crowdsourcing; label integration; attribute augmentation; instance filtering

Crowdsourcing is an effective and low-cost method for collecting labels; however, the quality of these labels is often low because crowd workers lack sufficient professional knowledge. To address this issue, this paper proposes a novel three-stage label integration method called Attribute Augmentation-based Label Integration (AALI).
Crowdsourcing provides an effective and low-cost way to collect labels from crowd workers. Because these workers lack professional knowledge, the quality of crowdsourced labels is relatively low. A common approach to addressing this issue is to collect multiple labels for each instance from different crowd workers and then use a label integration method to infer its true label. However, to our knowledge, almost all existing label integration methods use only the original attribute information and ignore the quality of each instance's multiple noisy label set. To solve these issues, this paper proposes a novel three-stage label integration method called attribute augmentation-based label integration (AALI). In the first stage, we design an attribute augmentation method to enrich the original attribute space. In the second stage, we develop a filter to single out reliable instances with high-quality multiple noisy label sets. In the third stage, we use majority voting to initialize the integrated labels of reliable instances and then use cross-validation to build multiple component classifiers on the reliable instances to predict all instances. Experimental results on simulated and real-world crowdsourced datasets demonstrate that AALI outperforms all the other state-of-the-art competitors.
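
The three stages described above can be illustrated with a rough sketch. The abstract does not specify how attributes are augmented or how the filter scores a label set, so everything here beyond majority voting is an assumption made for illustration: `augment` appends per-class label frequencies to the attribute vector, and `select_reliable` keeps instances whose majority label reaches a hypothetical agreement `threshold`. The cross-validated component classifiers of the third stage are omitted. This is not the authors' implementation.

```python
from collections import Counter


def majority_vote(labels):
    """Return the most frequent label; ties go to the label seen first."""
    return Counter(labels).most_common(1)[0][0]


def agreement(labels):
    """Fraction of labels that match the majority label."""
    return Counter(labels).most_common(1)[0][1] / len(labels)


def augment(attributes, labels, classes):
    """Hypothetical augmentation: append per-class label frequencies
    to the original attribute vector (an assumption, not the paper's method)."""
    counts = Counter(labels)
    return list(attributes) + [counts[c] / len(labels) for c in classes]


def select_reliable(label_sets, threshold=0.8):
    """Hypothetical filter: indices of instances whose label agreement
    is at least `threshold` (the paper's actual criterion is not given)."""
    return [i for i, ls in enumerate(label_sets) if agreement(ls) >= threshold]


# Toy data: three instances, each with a multiple noisy label set.
label_sets = [["cat", "cat", "dog"], ["dog", "dog", "dog"], ["cat", "dog"]]

reliable = select_reliable(label_sets, threshold=0.6)
initial_labels = {i: majority_vote(label_sets[i]) for i in reliable}
print(reliable)
print(initial_labels)
```

In this toy run, the third instance has an evenly split label set and is filtered out, so only the first two instances receive majority-vote labels. In the method as described, classifiers trained on such reliable instances would then predict labels for all instances.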
