4.8 Article

A hierarchical approach to removal of unwanted variation for large-scale metabolomics data

期刊

NATURE COMMUNICATIONS
卷 12, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41467-021-25210-5

关键词

-

资金

  1. Australian Research Council Discovery Project [DP170100654]
  2. Judith and David Coffey Lifelab Scholarship
  3. Australian Commonwealth Government Research Training Program Stipend Scholarship
  4. University of Sydney Postgraduate Research Scholarship - Heart Research Australia
  5. National Health and Medical Research Council [APP11359290, APP1173469]
  6. Heart Research Australia
  7. New South Wales Office of Health and Medical Research
  8. Australian Research Council
  9. Sydney Medical School Foundation, New South Wales Health [DOH1003, DOH1006]
  10. National Heart Foundation [NHF104853]
  11. Parker-Hughes Bequest
  12. Frecker Family
  13. NSW Office of Health and Medical Research
  14. NSW Statewide Biobank
  15. The Vonwiller Foundationand Heart Research Australia

向作者/读者索取更多资源

A new computational workflow is developed to remove unwanted data variation while preserving biologically relevant information in large scale metabolomics studies.
Liquid chromatography-mass spectrometry-based metabolomics studies are increasingly applied to large population cohorts, which run for several weeks or even years in data acquisition. This inevitably introduces unwanted intra- and inter-batch variations over time that can overshadow true biological signals and thus hinder potential biological discoveries. To date, normalisation approaches have struggled to mitigate the variability introduced by technical factors whilst preserving biological variance, especially for protracted acquisitions. Here, we propose a study design framework with an arrangement for embedding biological sample replicates to quantify variance within and between batches and a workflow that uses these replicates to remove unwanted variation in a hierarchical manner (hRUV). We use this design to produce a dataset of more than 1000 human plasma samples run over an extended period of time. We demonstrate significant improvement of hRUV over existing methods in preserving biological signals whilst removing unwanted variation for large scale metabolomics studies. Our tools not only provide a strategy for large scale data normalisation, but also provides guidance on the design strategy for large omics studies. Mass spectrometry-based metabolomics is a powerful method for profiling large clinical cohorts but batch variations can obscure biologically meaningful differences. Here, the authors develop a computational workflow that removes unwanted data variation while preserving biologically relevant information.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据