4.7 Article

A computationally efficient modular optimal discovery procedure

Journal

BIOINFORMATICS
Volume 27, Issue 4, Pages 509-515

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btq701

Funding

  1. NIH [HG002913]

Motivation: It is well known that patterns of differential gene expression across biological conditions are often shared by many genes, particularly those within functional groups. Taking advantage of these patterns can lead to increased statistical power and biological clarity when testing for differential expression in a microarray experiment. The optimal discovery procedure (ODP), which maximizes the expected number of true positives for each fixed number of expected false positives, is a framework aimed at this goal. Storey et al. introduced an estimator of the ODP for identifying differentially expressed genes. However, the computational time of their ODP estimator grows quadratically with the number of genes. Reducing this computational burden is a key step in making the ODP practical for use in a variety of high-throughput problems.

Results: Here, we propose a new estimate of the ODP called the modular ODP (mODP). The existing 'full ODP' requires that the likelihood function for each gene be evaluated according to the parameter estimates for all genes. The mODP assigns genes to modules according to a Kullback-Leibler distance, and then evaluates the statistic only at the module-averaged parameter estimates. We show that the mODP is relatively insensitive to the choice of the number of modules, but dramatically reduces the computational complexity from quadratic to linear in the number of genes. We compare the full ODP algorithm and the mODP on simulated data and on gene expression data from a recent study of Moroccan Amazighs. The mODP and full ODP algorithm perform very similarly across a range of comparisons.
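The cost reduction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a one-sample normal model per gene, uses a simple Lloyd-style loop with the Kullback-Leibler divergence between normals to form modules, and computes only a summed likelihood rather than the complete ODP statistic. All function names (`log_lik`, `kl_normal`, `assign_modules`, `summed_lik_full`, `summed_lik_modular`) and the simulated data are hypothetical.

```python
import numpy as np

def log_lik(x, mu, var):
    """Log-likelihood of each gene's data (rows of x) under each (mu, var) pair.
    Returns an array of shape (n_genes, n_params)."""
    x = x[:, None, :]                       # (m, 1, n)
    mu = mu[None, :, None]                  # (1, p, 1)
    var = var[None, :, None]                # (1, p, 1)
    return (-0.5 * np.log(2 * np.pi * var) - 0.5 * (x - mu) ** 2 / var).sum(axis=2)

def kl_normal(mu1, var1, mu2, var2):
    """KL divergence KL(N(mu1, var1) || N(mu2, var2)) for univariate normals."""
    return 0.5 * (np.log(var2 / var1) + (var1 + (mu1 - mu2) ** 2) / var2 - 1.0)

def assign_modules(mu, var, n_modules, n_iter=20):
    """Assumed clustering step: assign each gene to the module whose averaged
    parameters are closest in KL divergence, then re-average (Lloyd-style)."""
    order = np.argsort(mu)
    idx = order[np.linspace(0, len(mu) - 1, n_modules).astype(int)]
    c_mu, c_var = mu[idx].copy(), var[idx].copy()
    for _ in range(n_iter):
        d = kl_normal(mu[:, None], var[:, None], c_mu[None, :], c_var[None, :])
        labels = d.argmin(axis=1)
        for k in range(n_modules):
            members = labels == k
            if members.any():               # empty modules keep their old parameters
                c_mu[k], c_var[k] = mu[members].mean(), var[members].mean()
    return labels, c_mu, c_var

def summed_lik_full(x, mu, var):
    """Evaluate every gene at every gene's estimates: m x m likelihood evaluations."""
    return np.exp(log_lik(x, mu, var)).sum(axis=1)

def summed_lik_modular(x, mu, var, n_modules):
    """Evaluate every gene only at the K module-averaged estimates, weighting
    each module by its size: m x K likelihood evaluations."""
    labels, c_mu, c_var = assign_modules(mu, var, n_modules)
    counts = np.bincount(labels, minlength=n_modules)
    return (np.exp(log_lik(x, c_mu, c_var)) * counts[None, :]).sum(axis=1)

# Simulated example: 200 genes, 6 samples, three shared expression patterns.
rng = np.random.default_rng(0)
m, n = 200, 6
true_mu = rng.choice([-2.0, 0.0, 2.0], size=m)
x = true_mu[:, None] + rng.normal(size=(m, n))
mu_hat, var_hat = x.mean(axis=1), x.var(axis=1, ddof=1)

full = summed_lik_full(x, mu_hat, var_hat)                      # 200 x 200 evaluations
modular = summed_lik_modular(x, mu_hat, var_hat, n_modules=10)  # 200 x 10 evaluations
```

With the number of modules held fixed, the modular version grows linearly in the number of genes, while the full version grows quadratically. How closely the two agree in this toy setting depends on the assumed model and clustering, so the sketch should not be read as reproducing the paper's comparisons.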

Recommended

Article Biochemical Research Methods

Probabilistic models of genetic variation in structured populations applied to global human studies

Wei Hao, Minsun Song, John D. Storey

BIOINFORMATICS (2016)

Article Biochemical Research Methods

Polyester: simulating RNA-seq datasets with differential transcript expression

Alyssa C. Frazee, Andrew E. Jaffe, Ben Langmead, Jeffrey T. Leek

BIOINFORMATICS (2015)

Article Biochemical Research Methods

Test set bias affects reproducibility of gene signatures

Prasad Patil, Pierre-Olivier Bachant-Winner, Benjamin Haibe-Kains, Jeffrey T. Leek

BIOINFORMATICS (2015)

Article Biochemical Research Methods

Practical impacts of genomic data cleaning on biological discovery using surrogate variable analysis

Andrew E. Jaffe, Thomas Hyde, Joel Kleinman, Daniel R. Weinberger, Joshua G. Chenoweth, Ronald D. Mckay, Jeffrey T. Leek, Carlo Colantuoni

BMC BIOINFORMATICS (2015)

Editorial Material Multidisciplinary Sciences

P values are just the tip of the iceberg

Jeffrey T. Leek, Roger D. Peng

NATURE (2015)

Letter Biotechnology & Applied Microbiology

Ballgown bridges the gap between transcriptome assembly and expression analysis

Alyssa C. Frazee, Geo Pertea, Andrew E. Jaffe, Ben Langmead, Steven L. Salzberg, Jeffrey T. Leek

NATURE BIOTECHNOLOGY (2015)

Article Biochemistry & Molecular Biology

A nested parallel experiment demonstrates differences in intensity-dependence between RNA-seq and microarrays

David G. Robinson, Jean Y. Wang, John D. Storey

NUCLEIC ACIDS RESEARCH (2015)

Editorial Material Multidisciplinary Sciences

Opinion: Reproducible research can still be wrong: Adopting a prevention approach

Jeffrey T. Leek, Roger D. Peng

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2015)

Editorial Material Multidisciplinary Sciences

What is the question?

Jeffrey T. Leek, Roger D. Peng

SCIENCE (2015)

Article Biochemical Research Methods

Beyond the E-Value: Stratified Statistics for Protein Domain Prediction

Alejandro Ochoa, John D. Storey, Manuel Llinas, Mona Singh

PLOS COMPUTATIONAL BIOLOGY (2015)

Article Genetics & Heredity

Scaling probabilistic models of genetic variation to millions of humans

Prem Gopalan, Wei Hao, David M. Blei, John D. Storey

NATURE GENETICS (2016)

Article Multidisciplinary Sciences

Systems-level analysis of mechanisms regulating yeast metabolic flux

Sean R. Hackett, Vito R. T. Zanotelli, Wenxin Xu, Jonathan Goya, Junyoung O. Park, David H. Perlman, Patrick A. Gibney, David Botstein, John D. Storey, Joshua D. Rabinowitz

SCIENCE (2016)

Article Multidisciplinary Sciences

Genome-wide real-time in vivo transcriptional dynamics during Plasmodium falciparum blood-stage development

Heather J. Painter, Neo Christopher Chung, Aswathy Sebastian, Istvan Albert, John D. Storey, Manuel Llinas

NATURE COMMUNICATIONS (2018)

Article Biochemical Research Methods

Statistical significance of variables driving systematic variation in high-dimensional data

Neo Christopher Chung, John D. Storey

BIOINFORMATICS (2015)

Article Medicine, Research & Experimental

Genomic and clinical predictors for improving estimator precision in randomized trials of breast cancer treatments

Prasad Patil, Elizabeth Colantuoni, Jeffrey T. Leek, Michael Rosenblum

CONTEMPORARY CLINICAL TRIALS COMMUNICATIONS (2016)
