4.7 Article

Semi-supervised learning with missing values imputation

期刊

KNOWLEDGE-BASED SYSTEMS
卷 284, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2023.111171

关键词

Missing value; Imputation and classification; Semi-supervised; Normalizing flow; Conditional distribution estimation

向作者/读者索取更多资源

This paper proposes a novel semi-supervised conditional normalizing flow (SSCFlow) algorithm that combines unsupervised imputation and supervised classification. By estimating the conditional distribution of incomplete instances, SSCFlow facilitates imputation and classification simultaneously, addressing the issue of separated tasks ignoring data distribution and label information in traditional methods.
Incomplete instances with various missing attributes in many real-world applications have brought challenges to the classification tasks. Unsupervised imputation is often employed to replace the missing values with substitute values before supervised classification. However, this process often separates the imputation and classification, which may lead to inferior performance since the separated two tasks ignore the data distribution and label information contained in each other. Besides, traditional methods may rely on improper assumptions to initialize the missing values, whereas the unreliability of such initialization might degrade the performance. To address these problems, a novel semi-supervised conditional normalizing flow (SSCFlow) is proposed in this paper. SSCFlow combines unsupervised imputation and supervised classification as a joint semi-supervised task, which estimates the conditional distribution of incomplete instances to facilitate the imputation and classification simultaneously. Moreover, SSCFlow treats the initialized missing values as corrupted initial imputations and iteratively reconstructs their latent representations to approximate their true conditional distribution. Experiments on real-world datasets demonstrate the robustness and effectiveness of the proposed algorithm.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据