4.8 Article

Impossibility of successful classification when useful features are rare and weak

出版社

NATL ACAD SCIENCES
DOI: 10.1073/pnas.0903931106

关键词

higher criticism; phase diagram; region of impossibility; region of possibility; threshold feature selection

资金

  1. National Science Foundation [DMS-0908613]
  2. Direct For Mathematical & Physical Scien
  3. Division Of Mathematical Sciences [0908613] Funding Source: National Science Foundation

向作者/读者索取更多资源

We study a two-class classification problem with a large number of features, out of which many are useless and only a few are useful, but we do not know which ones they are. The number of features is large compared with the number of training observations. Calibrating the model with 4 key parameters-the number of features, the size of the training sample, the fraction, and strength of useful features-we identify a region in parameter space where no trained classifier can reliably separate the two classes on fresh data. The complement of this region-where successful classification is possible-is also briefly discussed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据