4.7 Article

Improved support vector machine algorithm for heterogeneous data

Journal

PATTERN RECOGNITION
Volume 48, Issue 6, Pages 2072-2083

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2014.12.015

Keywords

Support vector machine; Heterogeneous data; Nominal attribute; Numerical attribute; Classification learning

Funding

  1. National Program on Key Basic Research Project [2013CB329304]
  2. National Natural Science Foundation of China [61222210, 61432011]
  3. New Century Excellent Talents in University [NCET-12-0399]

Ask authors/readers for more resources

A support vector machine (SVM) is a popular algorithm for classification learning. The classical SVM effectively manages classification tasks defined by means of numerical attributes. However, both numerical and nominal attributes are used in practical tasks and the classical SVM does not fully consider the difference between them. Nominal attributes are usually regarded as numerical after coding. This may deteriorate the performance of learning algorithms. In this study, we propose a novel SVM algorithm for learning with heterogeneous data, known as a heterogeneous SVM (HSVM). The proposed algorithm learns an mapping to embed nominal attributes into a real space by minimizing an estimated generalization error, instead of by direct coding. Extensive experiments are conducted, and some interesting results are obtained. The experiments show that HSVM improves classification performance for both nominal and heterogeneous data. (C) 2014 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available