4.3 Article

A divisive hierarchical clustering methodology for enhancing the ensemble prediction power in large scale population studies: the ATHLOS project

Journal

HEALTH INFORMATION SCIENCE AND SYSTEMS
Volume 10, Issue 1, Pages -

Publisher

SPRINGER
DOI: 10.1007/s13755-022-00171-1

Keywords

Clustering; Prediction enhancement; ATHLOS cohort; Ensemble methods

Funding

  1. ATHLOS (Aging Trajectories of Health) - European Union [635316]

Ask authors/readers for more resources

The ATHLOS cohort consists of harmonized datasets from international groups focusing on health and aging. The Healthy Aging index is constructed based on selected variables from 16 individual studies. This paper explores additional variables in ATHLOS and investigates their use in predicting the Healthy Aging index. By utilizing data clustering and unsupervised learning, the study demonstrates the predictive utility of exploiting hidden data structures.
The ATHLOS cohort is composed of several harmonized datasets of international groups related to health and aging. As a result, the Healthy Aging index has been constructed based on a selection of variables from 16 individual studies. In this paper, we consider additional variables found in ATHLOS and investigate their utilization for predicting the Healthy Aging index. For this purpose, motivated by the volume and diversity of the dataset, we focus our attention upon data clustering, where unsupervised learning is utilized to enhance prediction power. Thus we show the predictive utility of exploiting hidden data structures. In addition, we demonstrate that imposed computation bottlenecks can be surpassed when using appropriate hierarchical clustering, within a clustering for ensemble classification scheme, while retaining prediction benefits. We propose a complete methodology that is evaluated against baseline methods and the original concept. The results are very encouraging suggesting further developments in this direction along with applications in tasks with similar characteristics. A straightforward open source implementation for the R project is also provided (https://github.com/Petros-Barmpas/HCEP).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available