4.5 Article

Feature selection, mutual information, and the classification of high-dimensional patterns

期刊

PATTERN ANALYSIS AND APPLICATIONS
卷 11, 期 3-4, 页码 309-319

出版社

SPRINGER
DOI: 10.1007/s10044-008-0107-0

关键词

filter feature selection; mutual information; entropic spanning graphs; microarray

资金

  1. DPI2005-01280 [Spanish Government]

向作者/读者索取更多资源

We propose a novel feature selection filter for supervised learning, which relies on the efficient estimation of the mutual information between a high-dimensional set of features and the classes. We bypass the estimation of the probability density function with the aid of the entropic-graphs approximation of Renyi entropy, and the subsequent approximation of the Shannon entropy. Thus, the complexity does not depend on the number of dimensions but on the number of patterns/samples, and the curse of dimensionality is circumvented. We show that it is then possible to outperform algorithms which individually rank features, as well as a greedy algorithm based on the maximal relevance and minimal redundancy criterion. We successfully test our method both in the contexts of image classification and microarray data classification. For most of the tested data sets, we obtain better classification results than those reported in the literature.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据