☆ 4.7 Article

Ensemble correlation-based low-rank matrix completion with applications to traffic data imputation

KNOWLEDGE-BASED SYSTEMS (2017)

期刊

KNOWLEDGE-BASED SYSTEMS

卷 132, 期 -, 页码 249-262

出版社

ELSEVIER

DOI: 10.1016/j.knosys.2017.06.010

关键词

Missing data; Low-rank matrix completion; Nearest neighbor; Pearson's correlation; Ensemble learning

类别

Computer Science, Artificial Intelligence

资金

National Natural Science Foundation of China [61203244, U1564201, U1664258, 61601203, 61403172, 61572241]
Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University) [MJUKF201724]
Key Research and Development Program of Jiangsu Province [BE2016149]
Natural Science Foundation of Jiangsu Province [BK20140555]
China Postdoctoral Science Foundation [2016M600375]
National Key Research and Development Program of China [2017YFB0102500]
Talent Foundation of Jiangsu University, China [14JDG066]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Low-rank matrix completion (LRMC) is a recently emerging technique which has achieved promising performance in many real-world applications, such as traffic data imputation. In order to estimate missing values, the current LRMC based methods optimize the rank of the matrix comprising the whole traffic data, potentially assuming that all traffic data is equally important. As a result, it puts more emphasis on the commonality of traffic data while ignoring its subtle but crucial difference due to different locations of loop detectors as well as dates of sampling. To handle this problem and further improve imputation performance, a novel correlation-based LRMC method is proposed in this paper. Firstly, LRMC is applied to get initial estimations of missing values. Then, a distance matrix containing pairwise distance between samples is built based on a weighted Pearson's correlation which strikes a balance between observed values and imputed values. For a specific sample, its most similar samples based on the distance matrix constructed are chosen by using an adaptive K-nearest neighboring (KNN) search. LRMC is then applied on these samples with much stronger correlation to obtain refined estimations of missing values. Finally, we also propose a simple but effective ensemble learning strategy to integrate multiple imputed values for a specific sample for further improving imputation performance. Extensive numerical experiments are performed on both traffic flow volume data as well as standard benchmark datasets. The results confirm that the proposed correlation-based LRMC and its ensemble learning version achieve better imputation performance than competing methods. (C) 2017 Elsevier B.V. All rights reserved.

Ensemble correlation-based low-rank matrix completion with applications to traffic data imputation

期刊

KNOWLEDGE-BASED SYSTEMS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Ensemble correlation-based low-rank matrix completion with applications to traffic data imputation

期刊

KNOWLEDGE-BASED SYSTEMS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文