4.7 Article

Novel kernel density estimator based on ensemble unbiased cross-validation

期刊

INFORMATION SCIENCES
卷 581, 期 -, 页码 327-344

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2021.09.045

关键词

Probability density function; Kernel density estimator; Bandwidth; Unbiased cross-validation; Data-block based UCV; Data-point based UCV

资金

  1. National Natural Science Foundation of China [61972261]
  2. Basic Research Foundation of Shenzhen [20210312191246002]
  3. Scientific Research Foundation of Shenzhen University [860/000002110628]
  4. Key R&D Program of Science and Technology Foundation of Hebei Province [19210310D]
  5. Natural Science Foundation of Hebei Province [F2021201020]

向作者/读者索取更多资源

This paper proposes a novel ensemble UCV based KDE (EUCV-KDE), which determines the expectation of an estimated PDF using an ensemble of data-block based UCVs. A novel objective function is designed for EUCV-KDE by considering the empirical and structural risk of KDE together. The experimental results show that EUCV-KDE is more stable and performs better than classical UCV-KDE and RCV-KDE.
Unbiased cross-validation (UCV) is a commonly-used method to calculate the optimal bandwidth for the kernel density estimator (KDE), which estimates the underlying probability density function (PDF) for a given data set. Since the UCV method was proposed, there have been few studies that have pointed out its instability when determining the KDE bandwidth. Following the principle of stability improvement, this paper presents a novel ensemble UCV based KDE (EUCV-KDE), which determines the expectation of an estimated PDF using an ensemble of data-block based UCVs rather than a single data-point based UCV. To derive the optimal bandwidth, a novel objective function is designed for EUCV-KDE by considering the empirical and structural risk of KDE together. We validate the rationality and effectiveness of EUCV-KDE on 10 probability distributions. The experimental results show that EUCV-KDE is convergent as the number of data-block based UCVs increases and can obtain a more stable and better prediction performance than the classical UCV-KDE and the revisited cross-validation (RCV) based KDE (RCV-KDE). In addition, a real-world application based on UK climate data is provided to further validate the effectiveness of EUCV-KDE by determining the optimal bandwidth for Nadaraya-Watson kernel regression estimator. (c) 2021 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据