Review

From the statistics of data to the statistics of knowledge: Symbolic data analysis

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2003)

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

Volume 98, Issue 462, Pages 470-487

Publisher

AMER STATISTICAL ASSOC

DOI: 10.1198/016214503000242

Keywords

clustering; concepts; descriptive statistics; principal components; symbolic data

Category

Statistics & Probability

Abstract

Increasingly, datasets are so large they must be summarized in some fashion so that the resulting summary dataset is of a more manageable size, while still retaining as much knowledge inherent to the entire dataset as possible. One consequence of this situation is that the data may no longer be formatted as single values such as is the case for classical data, but rather may be represented by lists, intervals, distributions, and the like. These summarized data are examples of symbolic data. This article looks at the concept of symbolic data in general, and then attempts to review the methods currently available to analyze such data. It quickly becomes clear that the range of methodologies available draws analogies with developments before 1900 that formed a foundation for the inferential statistics of the 1900s, methods largely limited to small (by comparison) datasets and classical data formats. The scarcity of available methodologies for symbolic data also becomes clear and so draws attention to an enormous need for the development of a vast catalog (so to speak) of new symbolic methodologies along with rigorous mathematical and statistical foundational work for these methods.
