Article

Automated Quality Assessment of Metadata across Open Data Portals

Journal

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.1145/2964909

Keywords

Open Data; quality assessment; data quality; data portal

Funding

  1. Austrian Research Promotion Agency (FFG) under the project ADE-QUATe [849982]

The Open Data movement has become a driver for publicly available data on the Web. More and more data, from governments and public institutions but also from the private sector, are made available online and are mainly published in so-called Open Data portals. However, with the increasing number of published resources, there are a number of concerns with regard to the quality of the data sources and the corresponding metadata, which compromise the searchability, discoverability, and usability of resources. In order to get a more complete picture of the severity of these issues, the present work aims at developing a generic metadata quality assessment framework for various Open Data portals: We treat data portals independently from the portal software frameworks by mapping the specific metadata of three widely used portal software frameworks (CKAN, Socrata, OpenDataSoft) to the standardized Data Catalog Vocabulary metadata schema. We subsequently define several quality metrics, which can be evaluated automatically and in an efficient manner. Finally, we report findings based on monitoring a set of over 260 Open Data portals with 1.1M datasets. This includes the discussion of general quality issues, for example, the retrievability of data, and the analysis of our specific quality metrics.
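The two steps named in the abstract, mapping portal-specific metadata to the Data Catalog Vocabulary (DCAT) and then scoring it automatically, can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the four-key CKAN-to-DCAT mapping is a small subset chosen for the example, and `to_dcat` and `existence_score` are hypothetical helpers, not the mapping or the metric definitions used in the paper.

```python
# Illustrative subset of a CKAN-to-DCAT key mapping.
# Assumption: chosen for this example only, not the paper's mapping table.
CKAN_TO_DCAT = {
    "title": "dct:title",
    "notes": "dct:description",
    "tags": "dcat:keyword",
    "license_id": "dct:license",
}


def to_dcat(ckan_dataset):
    """Rename known CKAN keys to DCAT properties; unmapped keys are dropped."""
    return {
        dcat_key: ckan_dataset.get(ckan_key)
        for ckan_key, dcat_key in CKAN_TO_DCAT.items()
    }


def existence_score(dcat_record):
    """Hypothetical metric: share of mapped properties with a non-empty value."""
    if not dcat_record:
        return 0.0
    filled = sum(1 for value in dcat_record.values() if value)
    return filled / len(dcat_record)


# A made-up CKAN dataset record with two of four fields filled in.
example = {
    "title": "Budget 2015",
    "notes": "",
    "tags": ["finance"],
    "license_id": None,
}
record = to_dcat(example)
score = existence_score(record)
```

Because the score is computed on the DCAT-shaped record rather than on the raw portal response, the same function could be reused for Socrata or OpenDataSoft once an analogous mapping exists for those portals.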
