4.5 Article

Multi-valued attribute and multi-labeled data decision tree algorithm

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s13042-011-0015-2

Keywords

Decision tree; Multi-valued attribute; Multi-labeled data; Similarity

Funding

  1. National Natural Science Foundation of China [61073133, 60773084, 60603023]
  2. National Research Foundation for the Doctoral Program of Higher Education of China [20070151009]

Ask authors/readers for more resources

This paper analyzes the existing decision tree algorithms for dealing with multi-valued and multi-labeled data. These algorithms have the following shortcomings: The choice of which attributes is difficult and the calculation for similarity is not precise enough. Based on these deficiencies, this paper proposes a new decision tree algorithm for multi-valued and multi-labeled data (AMDT). In the algorithm, firstly a new formula sim5 is proposed for calculating the similarity between two label-sets in the child nodes. It comprehensively considers the condition which the elements appear and not appear in both of the two label-sets at the same time and adjusts the proportion of them by the coefficient alpha, so that the similarity calculations of the label-sets are more comprehensive and accurate. Secondly, we propose the new conditions of the corresponding node to stop splitting. Lastly, we give the prediction method. Results of comparison experiments with the existing algorithms (MMC, SSC and SCC_SP_1) show that AMDT has the higher predictive accuracy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available