4.7 Article

Univariate statistical analysis of environmental (compositional) data: Problems and possibilities

期刊

SCIENCE OF THE TOTAL ENVIRONMENT
卷 407, 期 23, 页码 6100-6108

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.scitotenv.2009.08.008

关键词

Compositional data; Closure problem; Univariate statistical analysis; Exploratory data analysis; Log transformation

资金

  1. Council of the Czech Government [MSM 6198959214]

向作者/读者索取更多资源

For almost 30 years it has been known that compositional (closed) data have special geometrical properties. in environmental sciences, where the concentration of chemical elements in different sample materials is investigated, almost all datasets are compositional. In general, compositional data are parts of a whole which only give relative information. Data that sum up to a constant, e.g. 100wt.%, 1,000,000 mg/kg are the best known example. It is widely neglected that the closure characteristic remains even if only one of all possible elements is measured, it is an inherent property of compositional data. No variable is free to vary independent of all the others. Existing transformations to open closed data are seldom applied. They are more complicated than a log transformation and the relationship to the original data unit is lost. Results obtained when using classical statistical techniques for data analysis appeared reasonable and the possible consequences of working with closed data were rarely questioned. Here the simple univariate case of data analysis is investigated. It can be demonstrated that data closure must be overcome prior to calculating even simple statistical measures like mean or standard deviation or plotting graphs of the data distribution, e.g. a histogram. Some measures like the standard deviation (or the variance) make no statistical sense with closed data and all statistical tests building on the standard deviation (or variance) will thus provide erroneous results if used with the original data. (C) 2009 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据