Effective PCA for high-dimension, low-sample-size data with singular value decomposition of cross data matrix
In this paper, we propose a new methodology to deal with PCA in high-dimension, low-sample-size (HDLSS) data situations. We give an idea of estimating eigenvalues via singular values of a cross data matrix. We provide consistency properties of the eigenvalue estimation as well as its limiting distribution when the dimension d and the sample size n both grow to infinity in such a way that n is much lower than d. We apply the new methodology to estimating PC directions and PC scores in HDLSS data situations. We give an application of the findings in this paper to a mixture model to classify a dataset into two clusters. We demonstrate how the new methodology performs by using HDLSS data from a microarray study of prostate cancer.
Year of publication: |
2010
|
---|---|
Authors: | Yata, Kazuyoshi ; Aoshima, Makoto |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 101.2010, 9, p. 2060-2077
|
Publisher: |
Elsevier |
Keywords: | Consistency Eigenvalue distribution HDLSS Microarray data analysis Mixture model Principal component analysis Singular value |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
A distance-based, misclassification rate adjusted classifier for multiclass, high-dimensional data
Aoshima, Makoto, (2014)
-
Generalized time-dependent conditional linear models under left truncation and right censoring
Aoshima, Makoto, (2010)
-
Asymptotic second-order consistency for two-stage estimation methodologies and its applications
Aoshima, Makoto, (2010)
- More ...