4.7 Article

Sparse non-negative generalized PCA with applications to metabolomics

Journal

BIOINFORMATICS
Volume 27, Issue 21, Pages 3029-3035

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btr522

Keywords

-

Funding

  1. National Institute of Neurological Disorders and Stroke [R21NS05875-1, K08NS0044276]
  2. McKnight Endowment Fund
  3. DANA Foundation
  4. Lisa and Robert Lourie Foundation
  5. NIH Intellectual and Developmental Disabilities Research Grant [P30HD024064]

Ask authors/readers for more resources

Motivation: Nuclear magnetic resonance (NMR) spectroscopy has been used to study mixtures of metabolites in biological samples. This technology produces a spectrum for each sample depicting the chemical shifts at which an unknown number of latent metabolites resonate. The interpretation of this data with common multivariate exploratory methods such as principal components analysis (PCA) is limited due to high-dimensionality, non-negativity of the underlying spectra and dependencies at adjacent chemical shifts. Results: We develop a novel modification of PCA that is appropriate for analysis of NMR data, entitled Sparse Non-Negative Generalized PCA. This method yields interpretable principal components and loading vectors that select important features and directly account for both the non-negativity of the underlying spectra and dependencies at adjacent chemical shifts. Through the reanalysis of experimental NMR data on five purified neural cell types, we demonstrate the utility of our methods for dimension reduction, pattern recognition, sample exploration and feature selection. Our methods lead to the identification of novel metabolites that reflect the differences between these cell types.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available