4.5 Article

Genetically engineered decision trees: Population diversity produces smarter trees

Journal

OPERATIONS RESEARCH
Volume 51, Issue 6, Pages 894-907

Publisher

INST OPERATIONS RESEARCH MANAGEMENT SCIENCES
DOI: 10.1287/opre.51.6.894.24919

Keywords

statistics, data analysis : data mining marketing, estimation/statistical techniques : decision trees computers/computer science, artificial; intelligence : genetic algorithms

Ask authors/readers for more resources

When considering a decision tree for the purpose of classification, accuracy is usually the sole performance measure used in the construction process. In this paper, we introduce the idea of combining a decision tree's expected value and variance in a new probabilistic measure for assessing the performance of a tree. We develop a genetic algorithm for constructing a tree using our new measure and conduct computational experiments that show the advantages of our approach. Further, we investigate the effect of introducing diversity into the population used by our genetic algorithm. We allow the genetic algorithm to simultaneously focus on two distinct probabilistic measures-one that is risk averse and one that is risk seeking. Our bivariate genetic algorithm for constructing a decision tree performs very well, scales up quite nicely to handle data sets with hundreds of thousands of points, and requires only a small percent of the data to generate a high-quality decision tree. We demonstrate the effectiveness of our algorithm on three large data sets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available