Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical Aspects

Camacho Páez, José; Ferrer Riquelme, Alberto José

Published in

Elsevier, Chemometrics and Intelligent Laboratory Systems, (131), p. 37-50

DOI: 10.1016/j.chemolab.2013.12.003

Tools

Export citation

Search in Google Scholar

Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical Aspects

Journal article published in 2014 by José Camacho Páez

, Alberto José Ferrer Riquelme

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving forbidden

Policy details

Data provided by

Abstract

This is the second paper of a series devoted to provide theoretical and practical results and new algorithms for the selection of the number of Principal Components (PCs) in Principal Component Analysis (PCA) using crossvalidation. The study is especially focused on the element-wise k-fold (ekf), which is among the most used algorithms for that purpose. In this paper, a taxonomy of PCA applications is proposed and it is argued that cross-validatory algorithms computing the prediction error in observable variables, like ekf, are only suited for a class of applications. A number of cross-validation methods, several of which are original, are compared in two applications of this class: missing data imputation and compression. The results showthat the ekf is especially suited for missing data applications while other traditional cross-validation methods, those by Wold and Eastment and Krzanowski, are not found to provide useful outcomes in any of the two applications. These results are of special value considering that the methods investigated are computed in the main commercial software packets for chemometrics. Finally, the choice of the missing data algorithm within ekf is also investigated. ; Spanish Ministry of Science and Innovation ; FEDER funds from the European Union DPI2008-06880-C03-01 DPI2008-06880-C03-03 TEC2011-22579 ; Juan de la Cierva program ; Camacho Páez, J.; Ferrer Riquelme, AJ. (2014). Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical Aspects. Chemometrics and Intelligent Laboratory Systems. 131:37-50. doi:10.1016/j.chemolab.2013.12.003. ; Senia ; 37 ; 50 ; 131

Published in

Links

Tools

Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical Aspects

Abstract