4.5 Article

LASSO Regression Modeling on Prediction of Medical Terms among Seafarers' Health Documents Using Tidy Text Mining

期刊

BIOENGINEERING-BASEL
卷 9, 期 3, 页码 -

出版社

MDPI
DOI: 10.3390/bioengineering9030124

关键词

seafarers; text mining; lasso regression; disease mapping; correlations

资金

  1. Italian Ministry of Health [J59J21011210001]
  2. Epidemiological Observatory of Seafarers Pathologies and Injuries

向作者/读者索取更多资源

Seafarers face a higher risk of illnesses and accidents compared to land workers. Lack of medical professionals on seagoing vessels makes disease diagnosis even more challenging. This study proposes a text mining approach combined with sentiment analysis and the LASSO regression algorithm to classify and establish an Epidemiological Observatory of Seafarers' Pathologies and Injuries. The proposed approach achieves a high accuracy in classifying text documents and provides potential for health assistance and disease classification.
Generally, seafarers face a higher risk of illnesses and accidents than land workers. In most cases, there are no medical professionals on board seagoing vessels, which makes disease diagnosis even more difficult. When this occurs, onshore doctors may be able to provide medical advice through telemedicine by receiving better symptomatic and clinical details in the health abstracts of seafarers. The adoption of text mining techniques can assist in extracting diagnostic information from clinical texts. We applied lexicon sentimental analysis to explore the automatic labeling of positive and negative healthcare terms to seafarers' text healthcare documents. This was due to the lack of experimental evaluations using computational techniques. In order to classify diseases and their associated symptoms, the LASSO regression algorithm is applied to analyze these text documents. A visualization of symptomatic data frequency for each disease can be achieved by analyzing TF-IDF values. The proposed approach allows for the classification of text documents with 93.8% accuracy by using a machine learning model called LASSO regression. It is possible to classify text documents effectively with tidy text mining libraries. In addition to delivering health assistance, this method can be used to classify diseases and establish health observatories. Knowledge developed in the present work will be applied to establish an Epidemiological Observatory of Seafarers' Pathologies and Injuries. This Observatory will be a collaborative initiative of the Italian Ministry of Health, University of Camerino, and International Radio Medical Centre (C.I.R.M.), the Italian TMAS.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据