Article

Multi-valued attribute and multi-labeled data decision tree algorithm

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2011)

Journal

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS

Volume 2, Issue 2, Pages 67-74

Publisher

SPRINGER HEIDELBERG

DOI: 10.1007/s13042-011-0015-2

Keywords

Decision tree; Multi-valued attribute; Multi-labeled data; Similarity

Funding

National Natural Science Foundation of China [61073133, 60773084, 60603023]
National Research Foundation for the Doctoral Program of Higher Education of China [20070151009]

Abstract

This paper analyzes existing decision tree algorithms for multi-valued and multi-labeled data. These algorithms have two shortcomings: the choice of attributes is difficult, and the similarity calculation is not precise enough. To address these deficiencies, the paper proposes a new decision tree algorithm for multi-valued and multi-labeled data (AMDT). First, a new formula, sim5, is proposed for calculating the similarity between two label-sets in the child nodes. It considers both the elements that appear in both label-sets and the elements that appear in neither, and adjusts their relative proportion with the coefficient alpha, so that the similarity calculation for label-sets is more comprehensive and accurate. Second, new conditions are proposed for when a node stops splitting. Lastly, a prediction method is given. Comparison experiments with the existing algorithms (MMC, SSC and SCC_SP_1) show that AMDT has higher predictive accuracy.
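
The abstract does not state the sim5 formula, so the following Python sketch is not the paper's method. It only illustrates the general idea described above: a label-set similarity that counts both the labels present in both sets and the labels absent from both, with a coefficient alpha balancing the two. The function name `weighted_label_similarity`, the `universe` parameter and the exact weighting are assumptions made for illustration.

```python
def weighted_label_similarity(a, b, universe, alpha=0.5):
    """Illustrative similarity between two label-sets (NOT the paper's sim5).

    a, b     : iterables of labels
    universe : iterable of all possible labels (hypothetical parameter)
    alpha    : weight in [0, 1] given to labels present in both sets;
               (1 - alpha) is given to labels absent from both sets
    """
    a, b, universe = set(a), set(b), set(universe)
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if not (a <= universe and b <= universe):
        raise ValueError("label-sets must be subsets of the universe")

    both_present = len(a & b)               # labels in both sets
    both_absent = len(universe - (a | b))   # labels in neither set
    disagreements = len(a ^ b)              # labels in exactly one set

    agreement = alpha * both_present + (1 - alpha) * both_absent
    total = agreement + disagreements
    if total == 0:
        # No disagreements and no weighted agreements: the sets are identical.
        return 1.0
    return agreement / total


if __name__ == "__main__":
    labels = {"a", "b", "c", "d", "e"}
    print(weighted_label_similarity({"a", "b"}, {"b", "c"}, labels, alpha=0.5))
    print(weighted_label_similarity({"a", "b"}, {"b", "c"}, labels, alpha=1.0))
```

Under this assumed weighting, alpha = 1 ignores jointly absent labels and the measure reduces to the Jaccard index, while smaller values of alpha give more credit to labels that both sets omit.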
