4.7 Article

Resampling ensemble model based on data distribution for imbalanced credit risk evaluation in P2P lending

Journal

INFORMATION SCIENCES
Volume 536, Issue -, Pages 120-134

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.05.040

Keywords

Big data applications; Credit risk evaluation; Imbalanced classification; Data distribution; Ensemble model

Funding

  1. National Natural Science Foundation of China [61872135, 61702053]
  2. Natural Science Foundation of Hunan Province [2018JJ2066]
  3. Open Fund Project for Innovation Platform of Universities in Hunan [11K002]

Ask authors/readers for more resources

The misclassification of loan applicants by credit scoring model is one of the main factors causing the loss of investors' profits in P2P lending. Class imbalance of credit data is a main factor that affects classification performance of the model. Most existing methods of addressing class imbalance in credit scoring worked on improving the prediction accuracy for minority class samples (bad credit), which usually led to decreasing the prediction performance for majority class samples (good credit) significantly. In this paper, we propose a novel resampling ensemble model based on data distribution (REMDD) for imbalanced credit risk evaluation in P2P lending. REMMD solves class imbalance problem by using proposed undersampling method based on majority class data distribution (UMCDD). To further improve classification performance of REMMD, base classifiers with better comprehensive performance on the validation set are used to conduct class prediction. We validate the classification performance of REMDD on the three real and representative P2P lending credit datasets. The experimental results demonstrate that REMDD not only has good prediction performance for both majority class and minority class, but also effectively improves the comprehensive classification performance for imbalanced credit risk evaluation in P2P lending, compared with existing models. (C) 2020 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available