4.5 Article

Orthogonal projection to latent structures solution properties for chemometrics and systems biology data

Journal

JOURNAL OF CHEMOMETRICS
Volume 25, Issue 9, Pages 514-525

Publisher

WILEY-BLACKWELL
DOI: 10.1002/cem.1398

Keywords

O-PLS; partial least squares (PLS); Mid-infrared (MIR) calibration; Beer's law; systems biology

Funding

  1. DOE Office of Biological and Environmental Research [DE-AC36-08GO28308]
  2. BioEnergy Research Center
  3. Office of Biological and Environmental Research in the DOE Office of Science
  4. Department of Energy, Office of Science
  5. ASCR and BioEnergy Research Offices within the Office of Science [DE-AC36-99GO10337]

Ask authors/readers for more resources

Partial least squares (PLS) is a widely used algorithm in the field of chemometrics. In calibration studies, a PLS variant called orthogonal projection to latent structures (O-PLS) has been shown to successfully reduce the number of model components while maintaining good prediction accuracy, although no theoretical analysis exists demonstrating its applicability in this context. Using a discrete formulation of the linear mixture model known as Beer's law, we explicitly analyze O-PLS solution properties for calibration data. We find that, in the absence of noise and for large n, O-PLS solutions are simpler but just as accurate as PLS solutions for systems in which analyte and background concentrations are uncorrelated. However, the same is not true for the most general chemometric data in which correlations between the analyte and background concentrations are nonzero and pure profiles overlap. On the contrary, forcing the removal of orthogonal components may actually degrade interpretability of the model. This situation can also arise when the data are noisy and n is small, because O-PLS may identify and model the noise as orthogonal when it is statistically uncorrelated with the analytes. For the types of data arising from systems biology studies, in which the number of response variables may be much greater than the number of observations, we show that O-PLS is unlikely to discover orthogonal variation whether or not it exists. In this case, O-PLS and PLS solutions are the same. Copyright (C) 2011 John Wiley & Sons, Ltd.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available