4.8 Article

Stochastic Configuration Networks Based Adaptive Storage Replica Management for Power Big Data Processing

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
Volume 16, Issue 1, Pages 373-383

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2019.2919268

Keywords

Big Data; Power industry; Bandwidth; Cloud computing; Real-time systems; Prediction algorithms; Urban areas; Cloud storage; data replica optimization; geodistributed; power big data processing; stochastic configuration networks (SCNs)

Funding

  1. National Natural Science Foundation of China [61877020]
  2. Science and Technology Projects of Guangdong Province, China [2014B010117007, 2018B010109002]
  3. Science and Technology Project of Guangzhou Municipality, China [201904010393]

Ask authors/readers for more resources

In the power industry, processing business big data from geographically distributed locations, such as online line-loss analysis, has emerged as an important application. How to achieve highly efficient big data storage to meet the requirements of low latency processing applications is quite challenging. In this paper, we propose a novel adaptive power storage replica management system, named PARMS, based on stochastic configuration networks (SCNs), in which the network traffic and the data center (DC) geodistribution are taken into consideration to improve data real-time processing. First, as a fast learning model with less computation burden and sound prediction performance, the SCN model is employed to estimate the traffic state of power data networks. Then, a series of data replica management algorithms is proposed to lower the effects of limited bandwidths and a fixed underlying infrastructure. Finally, the proposed PARMS is implemented using data-parallel computing frameworks (DCFs) for the power industry. Experiments are carried out in an electric power corporation of 230 million users, China Southern power grid, and the results show that our proposed solution can deal with power big data storage efficiently and the job completion times across geodistributed DCs are reduced by 12.19 on average.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available