4.7 Article

Unsupervised Ensemble Classification With Sequential and Networked Data

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume 34, Issue 10, Pages 5009-5022

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2020.3046645

Keywords

Data models; Task analysis; Signal processing algorithms; Inference algorithms; Tensors; Distributed databases; Tagging; Ensemble learning; unsupervised; sequential classification; crowdsourcing; dependent data

Funding

  1. US National Science Foundation [1500713, 1514056]

Ask authors/readers for more resources

This study explores the method of unsupervised ensemble classification and introduces novel algorithms for handling data dependencies in both sequential and networked data. Evaluation on synthetic and real datasets shows that knowledge of data dependencies in the meta-learner has a positive impact on unsupervised ensemble classification task.
Ensemble learning, the machine learning paradigm where multiple models are combined, has exhibited promising perfomance in a variety of tasks. The present work focuses on unsupervised ensemble classification. The term unsupervised refers to the ensemble combiner who has no knowledge of the ground-truth labels that each classifier has been trained on. While most prior works on unsupervised ensemble classification are designed for independent and identically distributed (i.i.d.) data, the present work introduces an unsupervised scheme for learning from ensembles of classifiers in the presence of data dependencies. Two types of data dependencies are considered: sequential data and networked data whose dependencies are captured by a graph. For both, novel moment matching and Expectation-Maximization algorithms are developed. Performance of these algorithms is evaluated on synthetic and real datasets, which indicate that knowledge of data dependencies in the meta-learner is beneficial for the unsupervised ensemble classification task.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available