Article

A multi-stage approach to maximizing geocoding success in a large population-based cohort study through automated and interactive processes

Journal

GEOSPATIAL HEALTH
Volume 6, Issue 2, Pages 273-284

Publisher

UNIV NAPLES FEDERICO II
DOI: 10.4081/gh.2012.145

Keywords

epidemiologic methods; geographical information systems; prospective studies; residence characteristics; United States of America

Funding

  1. National Cancer Institute at the National Institutes of Health [3R01 CA092447-08S1]


To enable spatial analyses within a large, prospective cohort study of nearly 86,000 adults enrolled in a 12-state area in the southeastern United States of America from 2002 to 2009, a multi-stage geocoding protocol was developed to efficiently maximize the proportion of participants assigned an address-level geographic coordinate. Addresses were parsed, cleaned and standardized before applying a combination of automated and interactive geocoding tools. Our full protocol increased the non-Post Office (PO) Box match rate from 74.5% to 97.6%. Overall, we geocoded 99.96% of participant addresses, with only 5.2% at the ZIP code centroid level (2.8% PO Box and 2.3% non-PO Box addresses). One key to reducing the need for interactive geocoding was the use of multiple base maps. Still, addresses in areas with population density <44 persons/km² were much more likely to require resource-intensive interactive geocoding than those in areas with >920 persons/km² (odds ratio (OR) = 5.24; 95% confidence interval (CI) = 4.23, 6.49), as were addresses collected from participants during in-person interviews compared with mailed questionnaires (OR = 1.83; 95% CI = 1.59, 2.11). This study demonstrates that population density and address ascertainment method can influence automated geocoding results and that high success in address-level geocoding is achievable for large-scale studies covering wide geographical areas.
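
The staged fallback described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the stage ordering, the `standardize` cleaning rule, the PO Box pattern, the function and class names, and the toy lookup tables standing in for base maps are all assumptions made for illustration. The idea shown is only that each address is cleaned, tried against several automated base maps in turn, passed to an interactive step if those fail, and assigned a ZIP code centroid as a last resort.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple

Coordinate = Tuple[float, float]                  # (latitude, longitude)
Geocoder = Callable[[str], Optional[Coordinate]]  # returns None when there is no match

# Hypothetical PO Box detector; real address data would need a broader pattern.
PO_BOX = re.compile(r"\bP\.?\s*O\.?\s*BOX\b")


@dataclass
class GeocodeResult:
    address: str
    coordinate: Optional[Coordinate]
    level: str   # "address", "zip_centroid", or "unmatched"
    stage: str   # name of the stage that produced the result


def standardize(address: str) -> str:
    """Toy cleaning step (assumption): upper-case and collapse whitespace."""
    return " ".join(address.upper().split())


def geocode_multistage(
    address: str,
    base_maps: Sequence[Tuple[str, Geocoder]],
    interactive: Geocoder,
    zip_centroid: Geocoder,
) -> GeocodeResult:
    cleaned = standardize(address)
    # PO Boxes have no street location, so they skip the address-level stages.
    if not PO_BOX.search(cleaned):
        # Automated stage: try each base map in order and stop at the first match.
        for name, geocoder in base_maps:
            hit = geocoder(cleaned)
            if hit is not None:
                return GeocodeResult(cleaned, hit, "address", name)
        # Interactive stage: the resource-intensive manual review step.
        hit = interactive(cleaned)
        if hit is not None:
            return GeocodeResult(cleaned, hit, "address", "interactive")
    # Last resort: fall back to the ZIP code centroid.
    hit = zip_centroid(cleaned)
    if hit is not None:
        return GeocodeResult(cleaned, hit, "zip_centroid", "zip_centroid")
    return GeocodeResult(cleaned, None, "unmatched", "none")


# Toy stand-ins for base maps and a centroid table (invented values).
map_a = {"1 MAIN ST, SPRINGFIELD 00001": (35.0, -85.0)}
map_b = {"2 RURAL RD, SPRINGFIELD 00001": (35.1, -85.1)}
centroids = {"00001": (35.05, -85.05)}


def by_zip(cleaned: str) -> Optional[Coordinate]:
    """Look up the centroid for the trailing ZIP code token."""
    return centroids.get(cleaned.split()[-1])


def no_review(cleaned: str) -> Optional[Coordinate]:
    """Placeholder interactive step that never finds a match."""
    return None


stages = [("map_a", map_a.get), ("map_b", map_b.get)]
rural = geocode_multistage("2  rural rd,  Springfield 00001", stages, no_review, by_zip)
po_box = geocode_multistage("PO Box 9, Springfield 00001", stages, no_review, by_zip)
```

In this toy run the rural address misses the first base map and matches the second, which mirrors the abstract's point that multiple base maps reduce the number of addresses sent to interactive review. The PO Box address goes straight to the centroid fallback.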
