☆ 4.7 Article

The Family of MapReduce and Large-Scale Data Processing Systems

ACM COMPUTING SURVEYS (2013)

期刊

ACM COMPUTING SURVEYS

卷 46, 期 1, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/2522968.2522979

关键词

Design; Algorithms; Performance; MapReduce; big data; large-scale data processing

类别

Computer Science, Theory & Methods

资金

Australian Government

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

In the last two decades, the continuous increase of computational power has produced an overwhelming flow of data which has called for a paradigm shift in the computing architecture and large-scale data processing mechanisms. MapReduce is a simple and powerful programming model that enables easy development of scalable parallel applications to process vast amounts of data on large clusters of commodity machines. It isolates the application from the details of running a distributed program such as issues on data distribution, scheduling, and fault tolerance. However, the original implementation of the MapReduce framework had some limitations that have been tackled by many research efforts in several followup works after its introduction. This article provides a comprehensive survey for a family of approaches and mechanisms of large-scale data processing mechanisms that have been implemented based on the original idea of the MapReduce framework and are currently gaining a lot of momentum in both research and industrial communities. We also cover a set of introduced systems that have been implemented to provide declarative programming interfaces on top of the MapReduce framework. In addition, we review several large-scale data processing systems that resemble some of the ideas of the MapReduce framework for different purposes and application scenarios. Finally, we discuss some of the future research directions for implementing the next generation of MapReduce-like solutions.

The Family of MapReduce and Large-Scale Data Processing Systems

期刊

ACM COMPUTING SURVEYS

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

The Family of MapReduce and Large-Scale Data Processing Systems

期刊

ACM COMPUTING SURVEYS

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文