4.1 Article

Parallel Data Processing with MapReduce: A Survey

期刊

SIGMOD RECORD
卷 40, 期 4, 页码 11-20

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.1145/2094114.2094118

关键词

-

资金

  1. National Research Fund (NRF)
  2. Korea government(MEST) [2011-0016282]
  3. National Research Foundation of Korea [2011-0016282] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

向作者/读者索取更多资源

A prominent parallel data processing tool MapReduce is gaining significant momentum from both industry and academia as the volume of data to analyze grows rapidly. While MapReduce is used in many areas where massive data analysis is required, there are still debates on its performance, efficiency per node, and simple abstraction. This survey intends to assist the database and open source communities in understanding various technical aspects of the MapReduce framework. In this survey, we characterize the MapReduce framework and discuss its inherent pros and cons. We then introduce its optimization strategies reported in the recent literature. We also discuss the open issues and challenges raised on parallel data analysis with MapReduce.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据