4.7 Article

Improving Tree augmented Naive Bayes for class probability estimation

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 26, Issue -, Pages 239-245

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2011.08.010

Keywords

Naive Bayes; Tree Augmented Naive Bayes; Class probability estimation; Conditional log likelihood; Ensemble learning

Funding

  1. National Natural Science Foundation of China [60905033, 61075063]
  2. Provincial Natural Science Foundation of Hubei [2009CDB139]
  3. Fundamental Research Funds for the Central Universities [CUG090109]

Ask authors/readers for more resources

Numerous algorithms have been proposed to improve Naive Bayes (NB) by weakening its conditional attribute independence assumption, among which Tree Augmented Naive Bayes (TAN) has demonstrated remarkable classification performance in terms of classification accuracy or error rate, while maintaining efficiency and simplicity. In many real-world applications, however, classification accuracy or error rate is not enough. For example, in direct marketing, we often need to deploy different promotion strategies to customers with different likelihood (class probability) of buying some products. Thus, accurate class probability estimation is often required to make optimal decisions. In this paper, we investigate the class probability estimation performance of TAN in terms of conditional log likelihood (CLL) and present a new algorithm to improve its class probability estimation performance by the spanning TAN classifiers. We call our improved algorithm Averaged Tree Augmented Naive Bayes (ATAN). The experimental results on a large number of UCI datasets published on the main web site of Weka platform show that ATAN significantly outperforms TAN and all the other algorithms used to compare in terms of CLL. (C) 2011 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available