3.8 Proceedings Paper

IOMiner: Large-scale Analytics Framework for Gaining Knowledge from I/O Logs

出版社

IEEE
DOI: 10.1109/CLUSTER.2018.00062

关键词

-

资金

  1. Office of Science, Office of Advanced Scientific Computing Research, of the U.S. Department of Energy [DE-AC02-05CH11231, DE-AC02-06CH11357]
  2. Office of Science of the U.S. Department of Energy [DE-AC02-05CH11231]

向作者/读者索取更多资源

Modern HPC systems are collecting large amounts of I/O performance data. The massive volume and heterogeneity of this data, however, have made timely performance of in-depth integrated analysis difficult. To overcome this difficulty and to allow users to identify the root causes of poor application I/O performance, we present IOMiner, an I/O log analytics framework. IOMiner provides an easy-to-use interface for analyzing instrumentation data, a unified storage schema that hides the heterogeneity of the raw instrumentation data, and a sweep-line-based algorithm for root cause analysis of poor application I/O performance. IOMiner is implemented atop Spark to facilitate efficient, interactive, parallel analysis. We demonstrate the capabilities of IOMiner by using it to analyze logs collected on a large-scale production HPC system. Our analysis techniques not only uncover the root cause of poor I/O performance in key application case studies but also provide new insight into HPC I/O workload characterization.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据