期刊
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 8, 2021
卷 8, 期 -, 页码 271-299出版社
ANNUAL REVIEWS
DOI: 10.1146/annurev-statistics-042720-124436
关键词
amalgamation; coherence; composition; correspondence analysis; count data; logratios; regression; redundancy analysis; subcomposition
Compositional data are nonnegative data with a constant-sum constraint, with logratios as the fundamental transformation. Combining components can alleviate the issue of zero values. Various statistical analysis can be performed after transforming the data into logratios.
Compositional data are nonnegative data carrying relative, rather than absolute, information-these are often data with a constant-sum constraint on the sample values, for example, proportions or percentages summing to 1% or 100%, respectively. Ratios between components of a composition are important since they are unaffected by the particular set of components chosen. Logarithms of ratios (logratios) are the fundamental transformation in the ratio approach to compositional data analysis-all data thus need to be strictly positive, so that zero values present a major problem. Components that group together based on domain knowledge can be amalgamated (i.e., summed) to create new components, and this can alleviate the problem of data zeros. Once compositional data are transformed to logratios, regular univariate and multivariate statistical analysis can be performed, such as dimension reduction and clustering, as well as modeling. Alternative methodologies that come close to the ideals of the logratio approach are also considered, especially those that avoid the problem of data zeros, which is particularly acute in large bioinformatic data sets.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据