4.7 Article

Predicting and mapping neighborhood-scale health outcomes: A machine learning approach

Journal

COMPUTERS ENVIRONMENT AND URBAN SYSTEMS
Volume 85, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.compenvurbsys.2020.101562

Keywords

Urban health; Neighborhood; Machine learning; Crowdsourced data; 311 service

Funding

  1. Good Systems Grand Challenge at The University of Texas at Austin

Ask authors/readers for more resources

This paper presents a machine-learning approach to predict the prevalence of common non-communicable chronic diseases at the neighborhood level, demonstrating fairly accurate results in the City of Austin. Socioeconomic variables are found to be the strongest predictors for health outcomes, and historical records of 311 service requests can improve model performance. The machine learning models developed in this study can help evaluate future scenarios and identify areas in need of intervention.
Estimating health outcomes at a neighborhood scale is important for promoting urban health, yet costly and time-consuming. In this paper, we present a machine-learning-enabled approach to predicting the prevalence of six common non-communicable chronic diseases at the census tract level. We apply our approach to the City of Austin and show that our method can yield fairly accurate predictions. In searching for the best predictive models, we experiment with eight different machine learning algorithms and 60 predictor variables that characterize the social environment, the physical environment, and the aspects and degrees of neighborhood disorder. Our analysis suggests that (a) the sociodemographic and socioeconomic variables are the strongest predictors for tract-level health outcomes and (b) the historical records of 311 service requests can be a useful complementary data source as the information distilled from the 311 data often helps improve the models' performance. The machine learning models yielded from this study can help the public and city officials evaluate future scenarios and understand how changes in the neighborhood conditions can lead to changes in the health outcomes. By analyzing where the most significant discrepancies between the predicted and the actual values are, we will also be ready to identify areas of best practice and areas in need of greater investment or policy intervention.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available