4424 articles – 2353 references  [version française]
HAL: hal-00565149, version 1

Detailed view  Export this paper
Australian & New Zealand Journal of Statistics 52, 4 (2010) 423-437
On estimation of the population spectral distribution from a high-dimensional sample covariance matrix
Zhidong Bai 1, 2, Jiaqi Chen, Jian-Feng Yao 3, 4
(2010)

Sample covariance matrices play a central role in numerous popular statistical methodologies, for example principal components analysis, Kalman filtering and independent component analysis. However, modern random matrix theory indicates that, when the dimension of a random vector is not negligible with respect to the sample size, the sample covariance matrix demonstrates significant deviations from the underlying population covariance matrix. There is an urgent need to develop new estimation tools in such cases with high-dimensional data to recover the characteristics of the population covariance matrix from the observed sample covariance matrix. We propose a novel solution to this problem based on the method of moments. When the parametric dimension of the population spectrum is finite and known, we prove that the proposed estimator is strongly consistent and asymptotically Gaussian. Otherwise, we combine the first estimation method with a cross-validation procedure to select the unknown model dimension. Simulation experiments demonstrate the consistency of the proposed procedure. We also indicate possible extensions of the proposed estimator to the case where the population spectrum has a density.
1:  Key Laboratory of Applied Statistics under Ministry of Education (KLASMOE)
Northeast Normal University
2:  Department of Statistics and Applied Probability (DSAP)
National University of Singapore
3:  VISTAS (INRIA - IRISA)
INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – CNRS : UMR6074 – Université de Rennes 1 – École normale supérieure de Cachan - ENS Cachan
4:  Institut de Recherche Mathématique de Rennes (IRMAR)
CNRS : UMR6625 – Université de Rennes 1 – École normale supérieure de Cachan - ENS Cachan – Institut National des Sciences Appliquées (INSA) : - RENNES – Université de Rennes II - Haute Bretagne
Mathematics/Statistics

Statistics/Statistics Theory
eigenvalues of covariance matrices – high-dimensional statistics – Marčenko–Pastur distribution – sample covariance matrices