4.6 Article

Testing for differences between two distributions in the presence of serial correlation using the Kolmogorov-Smirnov and Kuiper's tests

Journal

INTERNATIONAL JOURNAL OF CLIMATOLOGY
Volume 41, Issue 14, Pages 6314-6323

Publisher

WILEY
DOI: 10.1002/joc.7196

Keywords

distributional testing; effective sample size; Kolmogorov– Smirnov test; Kuiper' s test; Monte Carlo simulation; serial correlation; temporal coherence

Ask authors/readers for more resources

Testing for distributional differences in climate research often involves the KS and KU tests. However, ignoring temporal coherence due to daily autocorrelation in the data can lead to significant inference errors. These errors can be mitigated through effective use of look-up tables or broadly applying polynomial coefficients fit to simulation results.
Testing for differences between two states is a staple of climate research, for example, applying a Student's t test to test for the differences in means. A more general approach is to test for differences in the entire distributions. Increasingly, this latter approach is being used in the context of climate change research where some societal impacts may be more sensitive to changes further from the centre of the distribution. The Kolmogorov-Smirnov (KS) test, probably the most widely-used method in distributional testing, along with the closely related, but lesser known Kuiper's (KU) test are examined here. These, like most common statistical tests, assume that the data to which they are applied consist of independent observations. Unfortunately, commonly used data such as daily time series of temperature typically violate this assumption due to day-to-day autocorrelation. This work explores the consequences of this. Three variants of the KS and KU tests are explored: the traditional approach ignoring autocorrelation, use of an 'effective sample size' based on the lag-1 autocorrelation, and Monte Carlo simulations employing a first order autoregressive model appropriate to a variety of data commonly used in climate science. Results indicate that large errors in inferences are possible when the temporal coherence is ignored. The guidance and materials provided here can be used to anticipate the magnitude of the errors. Bias caused by the errors can be mitigated via easy to use 'look-up' tables or more broadly through application of polynomial coefficients fit to the simulation results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available