☆ 4.7 Article

Using sensitivity analysis and visualization techniques to open black box data mining models

INFORMATION SCIENCES (2013)

Journal

INFORMATION SCIENCES

Volume 225, Issue -, Pages 1-17

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2012.10.039

Keywords

Sensitivity analysis; Visualization; Input importance; Supervised data mining; Regression; Classification

Funding

FEDER, through the program COMPETE
Portuguese Foundation for Science and Technology (FCT) [FCOMP-01-0124-FEDER-022674]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

In this paper, we propose a new visualization approach based on a Sensitivity Analysis (SA) to extract human understandable knowledge from supervised learning black box data mining models, such as Neural Networks (NNs), Support Vector Machines (SVMs) and ensembles, including Random Forests (RFs). Five SA methods (three of which are purely new) and four measures of input importance (one novel) are presented. Also, the SA approach is adapted to handle discrete variables and to aggregate multiple sensitivity responses. Moreover, several visualizations for the SA results are introduced, such as input pair importance color matrix and variable effect characteristic surface. A wide range of experiments was performed in order to test the SA methods and measures by fitting four well-known models (NN, SVM, RF and decision trees) to synthetic datasets (five regression and five classification tasks). In addition, the visualization capabilities of the SA are demonstrated using four real-world datasets (e.g., bank direct marketing and white wine quality). (C) 2012 Elsevier Inc. All rights reserved.

Using sensitivity analysis and visualization techniques to open black box data mining models

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Using sensitivity analysis and visualization techniques to open black box data mining models

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper