4.6 Article

A Survey of Distributed Data Stream Processing Frameworks

期刊

IEEE ACCESS
卷 7, 期 -, 页码 154300-154316

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2019.2946884

关键词

Dataflow architectures; data stream architectures; distributed processing systems comparison; survey; taxonomy

资金

  1. SOSCIP under the TalentEdge Post-doctoral Fellowship (OCE Project) [28379]
  2. Cloud Project A Multilevel Streaming Data Analytics Infrastructure for Predictive Analytics [SPEDF1-042]
  3. Federal Economic Development Agency of Southern Ontario
  4. Province of Ontario
  5. IBM Canada Ltd
  6. Ontario Centres of Excellence
  7. Mitacs

向作者/读者索取更多资源

Big data processing systems are evolving to be more stream oriented where each data record is processed as it arrives by distributed and low-latency computational frameworks on a continuous basis. As the stream processing technology matures and more organizations invest in digital transformations, new applications of stream analytics will be identified and implemented across a wide spectrum of industries. One of the challenges in developing a streaming analytics infrastructure is the difficulty in selecting the right stream processing framework for the different use cases. With a view to addressing this issue, in this paper we present a taxonomy, a comparative study of distributed data stream processing and analytics frameworks, and a critical review of representative open source (Storm, Spark Streaming, Flink, Kafka Streams) and commercial (IBM Streams) distributed data stream processing frameworks. The study also reports our ongoing study on a multilevel streaming analytics architecture that can serve as a guide for organizations and individuals planning to implement a real-time data stream processing and analytics framework.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据