4.6 Article

Identifying, attributing, and overcoming common data quality issues of manned station observations

Journal

INTERNATIONAL JOURNAL OF CLIMATOLOGY
Volume 37, Issue 11, Pages 4131-4145

Publisher

WILEY
DOI: 10.1002/joc.5037

Keywords

quality control; error attribution; station observations; data rescue; metadata; data homogenization; Bolivia; Peru

Funding

  1. Swiss Program for Research on Global Issues for Development (r4d) [IZ01Z0_147320]
  2. Swiss Agency for Development and Cooperation (SDC) [7F-08453.01]
  3. FP7 project ERA-CLIM2
  4. Swiss National Science Foundation (SNF) [IZ01Z0_147320] Funding Source: Swiss National Science Foundation (SNF)

Ask authors/readers for more resources

In situ climatological observations are essential for studies related to climate trends and extreme events. However, in many regions of the globe, observational records are affected by a large number of data quality issues. Assessing and controlling the quality of such datasets is an important, often overlooked aspect of climate research. Besides analysing the measurement data, metadata are important for a comprehensive data quality assessment. However, metadata are often missing, but may partly be reconstructed by suitable actions such as station inspections. This study identifies and attributes the most important common data quality issues in Bolivian and Peruvian temperature and precipitation datasets. The same or similar errors are found in many other predominantly manned station networks worldwide. A large fraction of these issues can be traced back to measurement errors by the observers. Therefore, the most effective way to prevent errors is to strengthen the training of observers and to establish a near real-time quality control (QC) procedure. Many common data quality issues are hardly detected by usual QC approaches. Data visualization, however, is an effective tool to identify and attribute those issues, and therefore enables data users to potentially correct errors and to decide which purposes are not affected by specific problems. The resulting increase in usable station records is particularly important in areas where station networks are sparse. In such networks, adequate selection and treatment of time series based on a comprehensive QC procedure may contribute to improving data homogeneity more than statistical data homogenization methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available