4.7 Article

Neighborhood size selection in the k-nearest-neighbor rule using statistical confidence

Journal

PATTERN RECOGNITION
Volume 39, Issue 3, Pages 417-423

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2005.08.009

Keywords

pattern classification; nearest-neighbor rule; probability of error; statistical confidence

Ask authors/readers for more resources

The k-nearest-neighbor rule is one of the most attractive pattern classification algorithms. In practice. the choice of k is determined by the cross-validation method. In this work, we propose a new method for neighborhood size selection that is based on the concept of statistical confidence. We define the confidence associated with a decision that is made by the majority rule from a finite number of observations and use it as a criterion to determine the number of nearest neighbors needed. The new algorithm is tested on several real-world datasets and yields results comparable to the k-nearest-neighbor rule. However, in contrast to the k-nearest-neighbor rule that uses a fixed number of nearest neighbors throughout the feature space, our method locally adjusts the number of nearest neighbors until a satisfactory level of confidence is reached. In addition, the statistical confidence provides a natural way to balance the trade-off between the reject rate and the error rate by excluding patterns that have low confidence levels. We believe that this property of our method can be of great importance in applications where the confidence with which a decision is made is equally or more important than the overall error rate. (c) 2005 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available