4.7 Article

Using sensitivity analysis and visualization techniques to open black box data mining models

期刊

INFORMATION SCIENCES
卷 225, 期 -, 页码 1-17

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2012.10.039

关键词

Sensitivity analysis; Visualization; Input importance; Supervised data mining; Regression; Classification

资金

  1. FEDER, through the program COMPETE
  2. Portuguese Foundation for Science and Technology (FCT) [FCOMP-01-0124-FEDER-022674]

向作者/读者索取更多资源

In this paper, we propose a new visualization approach based on a Sensitivity Analysis (SA) to extract human understandable knowledge from supervised learning black box data mining models, such as Neural Networks (NNs), Support Vector Machines (SVMs) and ensembles, including Random Forests (RFs). Five SA methods (three of which are purely new) and four measures of input importance (one novel) are presented. Also, the SA approach is adapted to handle discrete variables and to aggregate multiple sensitivity responses. Moreover, several visualizations for the SA results are introduced, such as input pair importance color matrix and variable effect characteristic surface. A wide range of experiments was performed in order to test the SA methods and measures by fitting four well-known models (NN, SVM, RF and decision trees) to synthetic datasets (five regression and five classification tasks). In addition, the visualization capabilities of the SA are demonstrated using four real-world datasets (e.g., bank direct marketing and white wine quality). (C) 2012 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据