☆ 4.2 Article

Multi-scale affinities with missing data: Estimation and applications

STATISTICAL ANALYSIS AND DATA MINING (2022)

期刊

STATISTICAL ANALYSIS AND DATA MINING

卷 15, 期 3, 页码 303-313

出版社

WILEY

DOI: 10.1002/sam.11561

关键词

kernels; missing data; penalized estimation

类别

Computer Science, Artificial Intelligence Computer Science, Interdisciplinary Applications Statistics & Probability

资金

National Institutes of Health [R01EB026936, R01GM135928]
National Science Foundation [DMS-1752692]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The paper introduces a new method for constructing row and column affinities even when data are missing by leveraging a co-clustering technique. It exploits solving the optimization problem for multiple pairs of cost parameters and filling in missing values with increasingly smooth estimates. This approach takes advantage of the coupled similarity structure among both the rows and columns of a data matrix.

Many machine learning algorithms depend on weights that quantify row and column similarities of a data matrix. The choice of weights can dramatically impact the effectiveness of the algorithm. Nonetheless, the problem of choosing weights has arguably not been given enough study. When a data matrix is completely observed, Gaussian kernel affinities can be used to quantify the local similarity between pairs of rows and pairs of columns. Computing weights in the presence of missing data, however, becomes challenging. In this paper, we propose a new method to construct row and column affinities even when data are missing by building off a co-clustering technique. This method takes advantage of solving the optimization problem for multiple pairs of cost parameters and filling in the missing values with increasingly smooth estimates. It exploits the coupled similarity structure among both the rows and columns of a data matrix. We show these affinities can be used to perform tasks such as data imputation, clustering, and matrix completion on graphs.

Multi-scale affinities with missing data: Estimation and applications

期刊

STATISTICAL ANALYSIS AND DATA MINING

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multi-scale affinities with missing data: Estimation and applications

期刊

STATISTICAL ANALYSIS AND DATA MINING

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文