4.6 Article

Improving the Use of Mortality Data in Public Health: A Comparison of Garbage Code Redistribution Models

Journal

AMERICAN JOURNAL OF PUBLIC HEALTH
Volume 110, Issue 2, Pages 222-229

Publisher

AMER PUBLIC HEALTH ASSOC INC
DOI: 10.2105/AJPH.2019.305439

Keywords

-

Funding

  1. Higher Education Sprout Project, Ministry of Education in Taiwan [NTU-CC-108L891601]
  2. Sustainability Science Research Program, Academia Sinica in Taiwan [AS-SS-107-01]

Ask authors/readers for more resources

Objectives. To describe and compare 3 garbage code (GC) redistribution models: naive Bayes classifier (NB), coarsened exact matching (CEM), and multinomial logistic regression (MLR). Methods. We analyzed Taiwan Vital Registration data (2008-2016) using a 2-step approach. First, we used non-GC death records to evaluate 3 different prediction models (NB, CEM, and MLR), incorporating individual-level information on multiple causes of death (MCDs) and demographic characteristics. Second, we applied the best-performing model to GC death records to predict the underlying causes of death. We conducted additional simulation analyses for evaluating the predictive performance of models. Results. When we did not account for MCDs, all 3 models presented high average misclassification rates in GC assignment (NB, 81%; CEM, 86%; MLR, 81%). In the presence of MCD information, NB and MLR exhibited significant improvement in assignment accuracy (19% and 17% misclassification rate, respectively). Furthermore, CEM without a variable selection procedure resulted in a substantially higher misclassification rate (40%). Conclusions. Comparing potential GC redistribution approaches provides guidance for obtaining better estimates of cause-of-death distribution and highlights the significance of MCD information for vital registration system reform.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available