4.6 Article

Kernel Density Estimation, Kernel Methods, and Fast Learning in Large Data Sets

期刊

IEEE TRANSACTIONS ON CYBERNETICS
卷 44, 期 1, 页码 1-20

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSMCB.2012.2236828

关键词

Kernel density estimate (KDE); kernel methods; quadratic programming (QP); sampling; support vector machine (SVM)

资金

  1. Hong Kong Polytechnic University [1-ZV5V]
  2. National Natural Science Foundation of China [60903100, 61272210, 61170122]
  3. Fundamental Research Funds for the Central Universities [JUSRP111A38, JUSRP21128]
  4. Natural Science Foundation of Jiangsu Province [BK2009067, BK2011417]

向作者/读者索取更多资源

Kernel methods such as the standard support vector machine and support vector regression trainings take O(N-3) time and O(N-2) space complexities in their naive implementations, where N is the training set size. It is thus computationally infeasible in applying them to large data sets, and a replacement of the naive method for finding the quadratic programming (QP) solutions is highly desirable. By observing that many kernel methods can be linked up with kernel density estimate (KDE) which can be efficiently implemented by some approximation techniques, a new learning method called fast KDE (FastKDE) is proposed to scale up kernel methods. It is based on establishing a connection between KDE and the QP problems formulated for kernel methods using an entropy-based integrated-squared-error criterion. As a result, FastKDE approximation methods can be applied to solve these QP problems. In this paper, the latest advance in fast data reduction via KDE is exploited. With just a simple sampling strategy, the resulted FastKDE method can be used to scale up various kernel methods with a theoretical guarantee that their performance does not degrade a lot. It has a time complexity of O(m(3)) where m is the number of the data points sampled from the training set. Experiments on different benchmarking data sets demonstrate that the proposed method has comparable performance with the state-of-art method and it is effective for a wide range of kernel methods to achieve fast learning in large data sets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据