Article

Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending

Journal

ANNALS OF OPERATIONS RESEARCH
Volume 266, Issue 1-2, Pages 511-529

Publisher

SPRINGER
DOI: 10.1007/s10479-017-2668-z

Keywords

P2P lending; Default prediction; Soft information; Topic model

Funding

  1. National Natural Science Foundation of China [71571059, 71331002, 71731005]
  2. Humanities and Social Sciences Fund Projects of the Ministry of Education [13YJA630037, 15YJA630010]

Predicting whether a borrower will default on a loan is a significant concern for platforms and investors in online peer-to-peer (P2P) lending. Because the data used by online platforms are complex and include unstructured information such as text, which is difficult to quantify and analyze, loan default prediction faces new challenges in P2P lending. To address this, we propose a default prediction method for P2P lending that incorporates soft information extracted from textual descriptions. We introduce a topic model to extract valuable features from the descriptive text of loans and construct four default prediction models to demonstrate the performance of these features. Moreover, a two-stage method is designed to select an effective feature set containing both soft and hard information. An empirical analysis using real-world data from a major P2P lending platform in China shows that the proposed method improves loan default prediction performance compared with existing methods based only on hard information.
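The idea of turning descriptive text into topic features and joining them with hard information can be illustrated with a minimal sketch. The abstract does not name the topic model, so latent Dirichlet allocation (LDA) fitted by collapsed Gibbs sampling is assumed here; the function name, hyperparameters, toy descriptions, and hard features (amount, interest rate) are all hypothetical and are not taken from the paper.

```python
import random

def lda_doc_topics(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA (assumed topic model).

    Returns one smoothed topic-proportion vector per document.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)
    # Count tables: document-topic, topic-word, and per-topic totals.
    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [[0] * n_words for _ in range(n_topics)]
    topic_total = [0] * n_topics
    # Random initial topic assignment for every token.
    assign = []
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            z = rng.randrange(n_topics)
            z_doc.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[w]] += 1
            topic_total[z] += 1
        assign.append(z_doc)
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v, z = word_id[w], assign[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][z] -= 1
                topic_word[z][v] -= 1
                topic_total[z] -= 1
                # Unnormalized conditional probability of each topic.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][v] + beta)
                    / (topic_total[k] + n_words * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][v] += 1
                topic_total[z] += 1
    # Smoothed proportions: the "soft information" vector for each loan.
    return [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in counts]
        for doc, counts in zip(docs, doc_topic)
    ]

# Hypothetical toy data: tokenized loan descriptions.
descriptions = [
    "need loan to expand shop inventory business".split(),
    "business shop needs inventory and equipment".split(),
    "pay hospital bills family medical treatment".split(),
    "medical treatment for family hospital stay".split(),
]
# Hypothetical hard information: (loan amount, interest rate).
hard_features = [[12000.0, 0.12], [8000.0, 0.15], [5000.0, 0.18], [6000.0, 0.20]]

soft_features = lda_doc_topics(descriptions, n_topics=2)
# One combined feature row per loan: hard features followed by topic proportions.
combined = [hard + soft for hard, soft in zip(hard_features, soft_features)]
```

Each row of `combined` could then be passed to any classifier. The abstract does not specify which four prediction models were used or how the two-stage feature selection works, so neither is sketched here.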
