☆ 4.7 Article

An intelligent scheme for big data recovery in Internet of Things based on Multi-Attribute assistance and Extremely randomized trees

INFORMATION SCIENCES (2021)

期刊

INFORMATION SCIENCES

卷 557, 期 -, 页码 66-83

出版社

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2020.12.041

关键词

Data recovery; Multi-attribute; Extremely randomized trees; Internet of Things

类别

Computer Science, Information Systems

资金

National Natural Science Foundation of China [61370210]
Science Foundation of Fujian Province of China [2019J01245]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Due to the constraints of sensor nodes in the Internet of Things, data loss is a key issue for applications dependent on data completeness. Current solutions, such as interpolation, use data correlation, but it can be difficult to extract data correlation in cases where the coupling degree between different attributes is low. This paper proposes an intelligent recovery scheme based on Multi-Attribute assistance and Extremely randomized Trees (MAET) for big data in the Internet of Things, which has shown efficiency and improved accuracy compared to other algorithms.

Due to the inherent characteristics of sensor nodes in Internet of Things, such as constrained energy, data redundancy, limited communication range and computing capabilities, the data loss problem becomes one key issue for applications which depends heavily on the data completeness. Some of the current solutions, such as interpolation, are designed to use data correlation for the data recovery problem. Spatiotemporal correlation is an important characteristic of sensory data since the nodes are generally deployed to observed similar physical phenomenon. However, it is very difficult to extract data correlation especially in case that the coupling degree between different perceptual attributes is low. Machine learning is an efficient auto-learning method that can obtain the inherent rules automatically. This paper has proposed an intelligent recovery scheme for big data in Internet of Things based on Multi-Attribute assistance and Extremely randomized Trees (MAET). Firstly, the collected dataset is denoised by detecting and removing the outliers. Secondly, the slave attributes are chosen whose correlations are high with the target attribute by using the Spearman correlation coefficient. Thirdly, the proposed scheme is trained by using extremely randomized trees with slave attributes. Finally, the missing data can be recovered with the trained model as well as the help of other attributes whose data is not lost. Experiment shows that the proposed scheme with multiple attributes is efficient and can improve the accuracy of recovered data compared with other algorithms. (C) 2020 Elsevier Inc. All rights reserved.

An intelligent scheme for big data recovery in Internet of Things based on Multi-Attribute assistance and Extremely randomized trees

期刊

INFORMATION SCIENCES

出版社

ELSEVIER SCIENCE INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

An intelligent scheme for big data recovery in Internet of Things based on Multi-Attribute assistance and Extremely randomized trees

期刊

INFORMATION SCIENCES

出版社

ELSEVIER SCIENCE INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文