4.7 Article

Identification of most important features based on a fuzzy ensemble technique: Evaluation on joint space narrowing progression in knee osteoarthritis patients

Journal

Publisher

ELSEVIER IRELAND LTD
DOI: 10.1016/j.ijmedinf.2021.104614

Keywords

Feature selection; Classification problem; Prediction models; JSN progression; KOA; Fuzzy logic

Funding

  1. European Community's H2020 Programme [777,159]

Ask authors/readers for more resources

This study proposed an ensemble fuzzy feature selection method that aggregates results from multiple feature selection algorithms to select informative features, ultimately showing better and more stable performance in JSN prediction task.
Objective: Feature selection (FS) is a crucial and at the same time challenging processing step that aims to reduce the dimensionality of complex classification or regression problems. Various techniques have been proposed in the literature to address this challenge with emphasis to medical applications. However, each one of the existing FS algorithms come with its own advantages and disadvantages introducing a certain level of bias. Materials and Methods: To avoid bias and alleviate the defectiveness of single feature selection results, an ensemble FS methodology is proposed in this paper that aggregates the results of several FS algorithms (filter, wrapper and embedded ones). Fuzzy logic is employed to combine multiple feature importance scores thus leading to a more robust selection of informative features. The proposed fuzzy ensemble FS methodology was applied on the problem of knee osteoarthritis (KOA) prediction with special emphasis on the progression of joint space narrowing (JSN). The proposed FS methodology was integrated into an end-to-end machine learning pipeline and a thorough experimental evaluation was conducted using data from the Osteoarthritis Initiative (OAI) database. Several classifiers were investigated for their suitability in the task of JSN prediction and the best performing model was then post-hoc analyzed by using the SHAP method. Results: The results showed that the proposed method presented a better and more stable performance in contrast to other competitive feature selection methods, leading to an average accuracy of 78.14% using XG Boost at 31 selected features. The post-hoc explainability highlighted the important features that contribute to the classification of patients with JSN progression. Conclusions: The proposed fuzzy feature selection approach improves the performance of the predictive models by selecting a small optimal subset of features compared to popular feature selection methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available