4.6 Article

Hierarchical continuous-time inhomogeneous hidden Markov model for cancer screening with extensive followup data

Journal

STATISTICAL METHODS IN MEDICAL RESEARCH
Volume 31, Issue 12, Pages 2383-2399

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/09622802221122390

Keywords

Disease progression; Cancer screening; Continuous time Markov model; Non-homogeneous Markov process; Hierarchical mixture model

Ask authors/readers for more resources

Continuous-time hidden Markov models are a useful approach for disease modeling. Most applications use time-homogeneous models, but this assumption is too simplistic for accurately modeling disease progression. We consider the heterogeneity of disease progression and risks in the population using piece-wise constant intensity functions and a latent mixture structure. We propose an efficient algorithm for inference and demonstrate the superior performance of our method using synthetic data and a real-world cervical cancer screening dataset.
Continuous-time hidden Markov models are an attractive approach for disease modeling because they are explainable and capable of handling both irregularly sampled, skewed and sparse data arising from real-world medical practice, in particular to screening data with extensive followup. Most applications in this context consider time-homogeneous models due to their relative computational simplicity. However, the time homogeneous assumption is too strong to accurately model the natural history of many diseases including cancer. Moreover, cancer risk across the population is not homogeneous either, since exposure to disease risk factors can vary considerably between individuals. This is important when analyzing longitudinal datasets and different birth cohorts. We model the heterogeneity of disease progression and regression using piece-wise constant intensity functions and model the heterogeneity of risks in the population using a latent mixture structure. Different submodels under the mixture structure employ the same types of Markov states reflecting disease progression and allowing both clinical interpretation and model parsimony. We also consider flexible observational models dealing with model over-dispersion in real data. An efficient, scalable Expectation-Maximization algorithm for inference is proposed with the theoretical guaranteed convergence property. We demonstrate our method's superior performance compared to other state-of-the-art methods using synthetic data and a real-world cervical cancer screening dataset from the Cancer Registry of Norway. Moreover, we present two model-based risk stratification methods that identify the risk levels of individuals.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available