Article

RANK DISCRIMINANTS FOR PREDICTING PHENOTYPES FROM RNA EXPRESSION

Journal

ANNALS OF APPLIED STATISTICS
Volume 8, Issue 3, Pages 1469-1491

Publisher

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/14-AOAS738

Keywords

Cancer classification; gene expression; rank discriminant; order statistics

Funding

  1. NIH-NCRR [UL1 RR 025005]
  2. National Science Foundation [CCF-0845407]

Abstract

Statistical methods for analyzing large-scale biomolecular data are commonplace in computational biology. A notable example is phenotype prediction from gene expression data, for instance, detecting human cancers, differentiating subtypes and predicting clinical outcomes. Still, clinical applications remain scarce. One reason is that the complexity of the decision rules that emerge from standard statistical learning impedes biological understanding, in particular, any mechanistic interpretation. Here we explore decision rules for binary classification utilizing only the ordering of expression among several genes; the basic building blocks are then two-gene expression comparisons. The simplest example, just one comparison, is the TSP classifier, which has appeared in a variety of cancer-related discovery studies. Decision rules based on multiple comparisons can better accommodate class heterogeneity, and thereby increase accuracy, and might provide a link with biological mechanism. We consider a general framework (rank-in-context) for designing discriminant functions, including a data-driven selection of the number and identity of the genes in the support (context). We then specialize to two examples: voting among several pairs and comparing the median expression in two groups of genes. Comprehensive experiments assess accuracy relative to other, more complex, methods, and reinforce earlier observations that simple classifiers are competitive.
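The three kinds of decision rule named in the abstract (a single two-gene comparison, voting among several pairs, and comparing the median expression of two gene groups) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy expression values, and the convention that "gene i below gene j" maps to class 1 are all assumptions made for illustration, and the gene pairs and groups are fixed by hand rather than selected from data.

```python
from statistics import median

def tsp_predict(x, i, j):
    """Single two-gene comparison: return 1 if gene i is expressed below gene j.
    The mapping of this ordering to class 1 is an assumed convention."""
    return int(x[i] < x[j])

def vote_predict(x, pairs):
    """Majority vote over several two-gene comparisons (hypothetical helper)."""
    votes = sum(x[i] < x[j] for i, j in pairs)
    return int(votes > len(pairs) / 2)

def median_predict(x, group_a, group_b):
    """Return 1 if the median expression of group_a is below that of group_b."""
    med_a = median(x[g] for g in group_a)
    med_b = median(x[g] for g in group_b)
    return int(med_a < med_b)

# Toy expression profile for six genes (made-up values).
x = [2.1, 5.3, 0.7, 4.0, 3.2, 1.5]

single = tsp_predict(x, 0, 1)
voted = vote_predict(x, [(0, 1), (2, 3), (4, 5)])
by_median = median_predict(x, [0, 2, 5], [1, 3, 4])

# Any strictly increasing transform keeps the ordering of the genes,
# so rules built only on comparisons give the same predictions.
x_scaled = [10.0 * v + 3.0 for v in x]
same_after_transform = (
    tsp_predict(x_scaled, 0, 1) == single
    and vote_predict(x_scaled, [(0, 1), (2, 3), (4, 5)]) == voted
    and median_predict(x_scaled, [0, 2, 5], [1, 3, 4]) == by_median
)
```

Because each rule reads only the relative ordering of expression values within one sample, the sketch needs no normalization step, which is one reason such rules are easy to inspect.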
