4.5 Article

Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis

期刊

INFLAMMATION RESEARCH
卷 72, 期 6, 页码 1315-1324

出版社

SPRINGER BASEL AG
DOI: 10.1007/s00011-023-01755-7

关键词

SLE; Lupus nephritis; Machine learning; Meteorological data

向作者/读者索取更多资源

The objective of this study was to establish a machine learning pipeline for the diagnosis of lupus nephritis (LN). A cohort of SLE patients was analyzed, and important features such as ASO, RBP, LA1, LA2, and proteinuria were identified using a collective feature selection method. The XGB model based on these features showed the best performance for LN diagnosis.
ObjectiveDiagnosis of lupus nephritis (LN) is a complex process, which usually requires renal biopsy. We aim to establish a machine learning pipeline to help diagnosis of LN.MethodsA cohort of 681 systemic lupus erythematosus (SLE) patients without LN and 786 SLE patients with LN was established, and a total of 95 clinical, laboratory data and 17 meteorological indicators were collected. After tenfold cross-validation, the patients were divided into training set and test set. The features selected by collective feature selection method of mutual information (MI) and multisurf were used to construct the models of logistic regression, decision tree, random forest, naive Bayes, support vector machine (SVM), light gradient boosting (LGB), extreme gradient boosting (XGB), and artificial neural network (ANN), the models were compared and verified in post-analysis.ResultsCollective feature selection method screens out antistreptolysin (ASO), retinol binding protein (RBP), lupus anticoagulant 1 (LA1), LA2, proteinuria and other features, and the hyperparameter optimized XGB (ROC: AUC = 0.995; PRC: AUC = 1.000, APS = 1.000; balance accuracy: 0.990) has the best performance, followed by LGB (ROC: AUC = 0.992; PRC: AUC = 0.997, APS = 0.977; balance accuracy: 0.957). The worst performance is naive Bayes model (ROC: AUC = 0.799; PRC: AUC = 0.822, APS = 0.823; balance accuracy: 0.693). In the composite feature importance bar plots, ASO, RF, Up/Ucr, and other features play important roles in LN.ConclusionWe developed and validated a new and simple machine learning pathway for diagnosis of LN, especially the XGB model based on ASO, LA1, LA2, proteinuria, and other features screened out by collective feature selection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据