Journal
STATISTICA SINICA
Volume 31, Issue 2, Pages 571-601Publisher
STATISTICA SINICA
DOI: 10.5705/ss.202019.0052
Keywords
Eigenstructure; sample correlation; spiked models
Categories
Funding
- NIH [R01 EB001988, RO1 GM134483]
- Hong Kong RGC General Research Fund [16202918]
- Samsung Scholarship
Ask authors/readers for more resources
For high-dimensional data, using random matrix theory can help derive asymptotic results for the spectral properties of sample correlation matrices.
While the first-order spectral properties of sample correlation matrices match those of sample covariance matrices, their asymptotic distributions can differ significantly.
The fluctuations of both sample eigenvalues and eigenvectors based on correlations are often much smaller than those of their sample covariance counterparts.
Sample correlation matrices are widely used, but for high-dimensional data little is known about their spectral properties beyond null models , which assume the data have independent coordinates. In the class of spiked models, we apply random matrix theory to derive asymptotic first-order and distributional results for both leading eigenvalues and eigenvectors of sample correlation matrices, assuming a high-dimensional regime in which the ratio p/n, of number of variables p to sample size n, converges to a positive constant. While the first-order spectral properties of sample correlation matrices match those of sample covariance matrices, their asymptotic distributions can differ significantly. Indeed, the correlation-based fluctuations of both sample eigenvalues and eigenvectors are often remarkably smaller than those of their sample covariance counterparts.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available