☆ 4.5 Article

Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA)

JOURNAL OF SUPERCOMPUTING (2013)

期刊

JOURNAL OF SUPERCOMPUTING

卷 64, 期 3, 页码 942-967

出版社

SPRINGER

DOI: 10.1007/s11227-011-0672-7

关键词

Parallel computing; CUDA; Data mining; Classification; Clustering; Association rules mining

类别

Computer Science, Hardware & Architecture Computer Science, Theory & Methods Engineering, Electrical & Electronic

资金

Natural Science Foundation of China [70621001/70921061, 70531040]
NVIDIA's Professor Partnership
Graduate University of Chinese Academy of Sciences [085102 GNOO, 085102 HNOO]
Chinese Academy of Sciences

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Recent development in Graphics Processing Units (GPUs) has enabled inexpensive high performance computing for general-purpose applications. Compute Unified Device Architecture (CUDA) programming model provides the programmers adequate C language like APIs to better exploit the parallel power of the GPU. Data mining is widely used and has significant applications in various domains. However, current data mining toolkits cannot meet the requirement of applications with large-scale databases in terms of speed. In this paper, we propose three techniques to speedup fundamental problems in data mining algorithms on the CUDA platform: scalable thread scheduling scheme for irregular pattern, parallel distributed top-k scheme, and parallel high dimension reduction scheme. They play a key role in our CUDA-based implementation of three representative data mining algorithms, CU-Apriori, CU-KNN, and CU-K-means. These parallel implementations outperform the other state-of-the-art implementations significantly on a HP xw8600 workstation with a Tesla C1060 GPU and a Core-quad Intel Xeon CPU. Our results have shown that GPU + CUDA parallel architecture is feasible and promising for data mining applications.

Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA)

期刊

JOURNAL OF SUPERCOMPUTING

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA)

期刊

JOURNAL OF SUPERCOMPUTING

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文