期刊
MATHEMATICS
卷 11, 期 23, 页码 -出版社
MDPI
DOI: 10.3390/math11234778
关键词
healthcare costs; CGBN; regression algorithm; hybrid algorithm
类别
This paper introduces a hybrid machine learning algorithm for predicting healthcare cost by first learning the isolated characteristic variables using network structure learning algorithms and then training regression algorithms. Experimental results show that the proposed scheme can achieve similar or higher prediction accuracy with a reduced amount of data, compared to popular single machine learning algorithms.
Healthcare cost is an issue of concern right now. While many complex machine learning algorithms have been proposed to analyze healthcare cost and address the shortcomings of linear regression and reliance on expert analyses, these algorithms do not take into account whether each characteristic variable contained in the healthcare data has a positive effect on predicting healthcare cost. This paper uses hybrid machine learning algorithms to predict healthcare cost. First, network structure learning algorithms (a score-based algorithm, constraint-based algorithm, and hybrid algorithm) for a Conditional Gaussian Bayesian Network (CGBN) are used to learn the isolated characteristic variables in healthcare data without changing the data properties (i.e., discrete or continuous). Then, the isolated characteristic variables are removed from the original data and the remaining data used to train regression algorithms. Two public healthcare datasets are used to test the performance of the proposed hybrid machine learning algorithm model. Experiments show that when compared to popular single machine learning algorithms (Long Short Term Memory, Random Forest, etc.) the proposed scheme can obtain similar or higher prediction accuracy with a reduced amount of data.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据