4.7 Article

GMDH-based semi-supervised feature selection for customer classification

期刊

KNOWLEDGE-BASED SYSTEMS
卷 132, 期 -, 页码 236-248

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2017.06.018

关键词

Feature selection; Group method of data handling (GMDH); Customer classification; Semi-supervised learning

资金

  1. National Natural Science Foundation of China [71471124, 71571126]
  2. Youth Foundation of Sichuan Province [2015RZ0056]
  3. Excellent Youth Fund of Sichuan University [skqx201607, 2013SCU04A08, skzx2016-rcrw14]
  4. Young Teachers Visiting Scholar Program of Sichuan University

向作者/读者索取更多资源

Data dimension reduction is an important step for customer classification modeling, and feature selection has been a research focus of the data dimension reduction field. This study introduces the group method of data handling (GMDH), puts forward a GMDH-based semi-supervised feature selection (GMDH-SSFS) algorithm, and applies it to customer feature selection. The algorithm can utilize a few samples with class labels L, and a large number of samples without class labels U simultaneously. What is more, it considers the relationship between features and class labels, and that between features during feature selection. The GMDH-SSFS model mainly consists of three stages: 1) Train N basic classification models based on the dataset L with class labels; 2) Label samples selectively in the dataset U without class labels, and add them to L; 3) Train the GMDH neural network based on the new training set L, and select the optimal feature subset Fs. Based on an empirical analysis of four customer classification datasets, results suggest that the features selected by the GMDH-SSFS model have a good explainability. Meanwhile, the customer classification performance of the classification model trained by the selected feature subset is superior to that of the models trained by the commonly used Laplacian score (an unsupervised feature selection algorithm), Fisher score (a supervised feature selection algorithm), and the FW-SemiFS and S3VM-FS (two semi-supervised feature selection algorithms). (C) 2017 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据