4.6 Article

Explainability of Machine Learning Models for Bankruptcy Prediction

期刊

IEEE ACCESS
卷 9, 期 -, 页码 124887-124899

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3110270

关键词

Predictive models; Bankruptcy; Data models; Machine learning; Companies; Feature extraction; Analytical models; Bankruptcy prediction; machine learning; explainable AI; feature importance

资金

  1. National Research Foundation of Korea (NRF) Grants through Korean Government [Ministry of Science and ICT (MSIT)] [NRF-2017R1E1A1A03070105, NRF-2019R1A5A1028324]
  2. Institute for the Information and Communications Technology Promotion (IITP) Grant through Korean Government [Ministry of Science, ICT and Future Planning (MSIP)] [2019-0-01906]
  3. Information Technology Research Center (ITRC) [IITP-2020-2018-0-01441]

向作者/读者索取更多资源

With the increasing amount of data, there is a growing demand for utilizing machine learning methodologies in bankruptcy prediction. This study focuses on the interpretability of machine learning models, particularly measuring feature importance using the LIME algorithm, to address the black-box issue.
As the amount of data increases, it is more likely that the assumptions in the existing economic analysis model are unsatisfied or make it difficult to establish a new analysis model. Therefore, there has been increased demand for applying the machine learning methodology to bankruptcy prediction due to its high performance. By contrast, machine learning models usually operate as black-boxes but credit rating regulatory systems require the provisioning of appropriate information regarding credit rating standards. If machine learning models have sufficient interpretablility, they would have the potential to be used as effective analytical models in bankruptcy prediction. From this aspect, we study the explainability of machine learning models for bankruptcy prediction by applying the Local Interpretable Model-Agnostic Explanations (LIME) algorithm, which measures the feature importance for each data point. To compare how the feature importance measured through LIME differs from that of models themselves, we first applied this algorithm to typical tree-based models that have ability to measure the feature importance of the models themselves. We showed that the feature importance measured through LIME could be a consistent generalization of the feature importance measured by tree-based models themselves. Moreover, we study the consistency of the feature importance through the model's predicted bankruptcy probability, which suggests the possibility that observations of important features can be used as a basis for the fair treatment of loan eligibility requirements.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据