4.6 Article

Bandwidth choice for nonparametric classification

Journal

ANNALS OF STATISTICS
Volume 33, Issue 1, Pages 284-306

Publisher

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/009053604000000959

Keywords

Bayes risk; bootstrap; cross-validation; classification error; discrimination; error rate; kernel methods; nonparametric density estimation

Ask authors/readers for more resources

It is shown that, for kernel-based classification with univariate distributions and two populations, optimal bandwidth choice has a dichotomous character. If the two densities cross at just one point. where their curvature.,, have the same signs, then minimum Bayes risk is achieved using bandwidths which are an order of magnitude larger than those which minimize pointwise estimation error. On the other hand, if the curvature signs are different, or if there are multiple crossing points. then bandwidths of conventional size are generally appropriate. The range of different modes of behavior is narrower in multivariate settings. There, the optimal size of bandwidth is generally the same as that which is appropriate for pointwise density estimation. These properties motivate empirical rules for bandwidth choice.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available