4.6 Article

A novel model to optimize multiple imputation algorithm for missing data using evolution methods

期刊

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.bspc.2022.103661

关键词

Multiple imputation; Fitness value; Multiple regression; Genetic algorithm; Missing data; Optimization

向作者/读者索取更多资源

This study introduces a novel method that optimizes multiple regression imputation processes by combining multiple imputations with a genetic algorithm to improve fitness values for missing patient data. Significant improvements were achieved according to the experimental results.
The concept of missing data is considered significant when applying statistical methods to a dataset and the quality of the data analysis results is based on the correct data completeness. As a result, improving missing data filling processes is vital in order to give more reliable data throughout the phase of analysis. Here, we present a novel method for optimizing multiple regression imputation processes and obtaining the best fitness values for missing data from patients by combining multiple imputations with a genetic algorithm. To train and assess our proposed method, we employed 583 patient records from a publicly available database, divided into 416 records of liver patients and 167 records of the non-liver patients. The proposed approach offers the largest improvement for missing data findings, according to the results. Instead of employing the normal equation in multiple imputations, which yielded 92.72 as the utmost fitness value with Mean Absolute Error (MAE) 0.5877 from 1.1840 after our second optimization, we were able to achieve a fitness value of 233. The proposed approach might be tested using a large database and used in Hepatocellular carcinoma (HCC) labs to help clinicians make accurate diagnoses.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据