4.7 Article

A hybrid discretization method for naive Bayesian classifiers

Journal

PATTERN RECOGNITION
Volume 45, Issue 6, Pages 2321-2325

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2011.12.014

Keywords

Hybrid discretization; Naive Bayesian classifier; Nonparametric measure

Funding

  1. National Science Council in Taiwan [99-2410-H-006-072-MY2]

Ask authors/readers for more resources

Since naive Bayesian classifiers are suitable for processing discrete attributes, many methods have been proposed for discretizing continuous ones. However, none of the previous studies apply more than one discretization method to the continuous attributes in a data set for naive Bayesian classifiers. Different approaches employ different information embedded in continuous attributes to determine the boundaries for discretization. It is likely that discretizing the continuous attributes in a data set using different methods can utilize the information embedded in the attributes more thoroughly and thus improve the performance of nave Bayesian classifiers. In this study, we propose a nonparametric measure to evaluate the dependence level between a continuous attribute and the class. The nonparametric measure is then used to develop a hybrid method for discretizing continuous attributes so that the accuracy of the nave Bayesian classifier can be enhanced. This hybrid method is tested on 20 data sets, and the results demonstrate that discretizing the continuous attributes in a data set by various methods can generally have a higher prediction accuracy. (C) 2011 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available