Journal
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE
Volume 36, Issue 1, Pages 1-30Publisher
TAYLOR & FRANCIS LTD
DOI: 10.1080/13658816.2021.1893323
Keywords
VGI; OpenStreetMap; Topic modelling; LDA; L-LDA; latent labeled dirichlet allocation; spatial data quality; FIXME; fixme; text mining
Categories
Funding
- Australian Research Council [ARC DP170100153]
Ask authors/readers for more resources
This article discusses the assessment of OpenStreetMap (OSM) data quality by examining issues documented through the FIXME tag. It presents a classification and analysis of these quality issues across USA and Australia, grounded in ISO-19157 standard. The research aims to inform the development of automated error correction methods for VGI datasets by linking established ISO data quality standards to OSM issue categorization.
Assessing OpenStreetMap (OSM) data quality against authoritative data sources may not always be viable. This is primarily because of the multi-dimensional nature and heterogeneity of the maps, yet the activity is pivotal for targeted data cleansing and quality enhancement undertakings in these data sets. A salient facet of OSM, allowing contributors to flag potential problems encountered during the mapping process, is the FIXME tag. In this article, we examine and discuss OSM data quality through the vast expanse of issues (knowledge) documented via FIXME. We present a classification and analysis of these quality issues, exposed as topic models and grounded in the ISO-19157 standard, across USA and Australia. Regional distributions of these topics are further qualitatively analyzed to ascertain the variation of key issues in OSM. We also present a comparison of the intrinsic issue classification against those identified in an issue corpus of an authoritative map data source. Due to the considerable heterogeneity in user mapping and reporting, OSM issue detection and classification remains problematic. This research presents a flexible and intrinsic data-mining approach, linking established ISO data quality standards to OSM issue categorization. Our work, thus informs the development of automated error correction methods for VGI datasets.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available