4.6 Article

Estimation Accuracy on Execution Time of Run-Time Tasks in a Heterogeneous Distributed Environment

Journal

SENSORS
Volume 16, Issue 9, Pages -

Publisher

MDPI
DOI: 10.3390/s16091386

Keywords

cloud computing; data convergence; MapReduce; data analysis; speculative execution; J0101

Funding

  1. NSFC [61300238, 61300237, 61232016, 1405254, 61373133]
  2. Marie Curie Fellowship [701697-CAR-MSCA-IFEF-ST]
  3. Project of six personnel in Jiangsu Province [2014-WLW-013, R2015L06]
  4. Basic Research Programs (Natural Science Foundation) of Jiangsu Province [BK20131004]
  5. PAPD fund

Ask authors/readers for more resources

Distributed Computing has achieved tremendous development since cloud computing was proposed in 2006, and played a vital role promoting rapid growth of data collecting and analysis models, e.g., Internet of things, Cyber-Physical Systems, Big Data Analytics, etc. Hadoop has become a data convergence platform for sensor networks. As one of the core components, MapReduce facilitates allocating, processing and mining of collected large-scale data, where speculative execution strategies help solve straggler problems. However, there is still no efficient solution for accurate estimation on execution time of run-time tasks, which can affect task allocation and distribution in MapReduce. In this paper, task execution data have been collected and employed for the estimation. A two-phase regression (TPR) method is proposed to predict the finishing time of each task accurately. Detailed data of each task have drawn interests with detailed analysis report being made. According to the results, the prediction accuracy of concurrent tasks' execution time can be improved, in particular for some regular jobs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available