4.3 Article

Principal component analysis

期刊

NATURE REVIEWS METHODS PRIMERS
卷 2, 期 1, 页码 -

出版社

SPRINGERNATURE
DOI: 10.1038/s43586-022-00184-w

关键词

-

资金

  1. National Science Foundation [DMS2013736, IIS1837931]
  2. National Institutes of Health [5R01 EB001988-21]
  3. Stanford Data Science Institute

向作者/读者索取更多资源

Principal component analysis is a versatile statistical method that reduces a large data table to its essential features. It explains the variance of the data by finding major components and supports graphical interpretation. Additionally, it can be used for handling incomplete data matrices and analyzing images, shapes, and functions.
Principal component analysis is a versatile statistical method for reducing a cases-by-variables data table to its essential features, called principal components. Principal components are a few linear combinations of the original variables that maximally explain the variance of all the variables. In the process, the method provides an approximation of the original data table using only these few major components. This Primer presents a comprehensive review of the method's definition and geometry, as well as the interpretation of its numerical and graphical results. The main graphical result is often in the form of a biplot, using the major components to map the cases and adding the original variables to support the distance interpretation of the cases' positions. Variants of the method are also treated, such as the analysis of grouped data and categorical data, known as correspondence analysis. Also described and illustrated are the latest innovative applications of principal component analysis: for estimating missing values in huge data matrices, sparse component estimation, and the analysis of images, shapes and functions. Supplementary material includes video animations and computer scripts in the R environment.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据