4.7 Article

Feature Selection via Global Redundancy Minimization

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2015.2426703

关键词

Feature selection; feature ranking; redundancy minimization

资金

  1. US National Science Foundation [(NSF)-IIS 1117965, NSF-IIS 1302675, NSF-IIS 1344152, NSF-DBI 1356628]
  2. Direct For Computer & Info Scie & Enginr
  3. Div Of Information & Intelligent Systems [1302675] Funding Source: National Science Foundation
  4. Div Of Biological Infrastructure
  5. Direct For Biological Sciences [1356628] Funding Source: National Science Foundation
  6. Div Of Information & Intelligent Systems
  7. Direct For Computer & Info Scie & Enginr [1302564] Funding Source: National Science Foundation

向作者/读者索取更多资源

Feature selection has been an important research topic in data mining, because the real data sets often have high-dimensional features, such as the bioinformatics and text mining applications. Many existing filter feature selection methods rank features by optimizing certain feature ranking criterions, such that correlated features often have similar rankings. These correlated features are redundant and don't provide large mutual information to help data mining. Thus, when we select a limited number of features, we hope to select the top non-redundant features such that the useful mutual information can be maximized. In previous research, Ding et al. recognized this important issue and proposed the minimum Redundancy Maximum Relevance Feature Selection (mRMR) model to minimize the redundancy between sequentially selected features. However, this method used the greedy search, thus the global feature redundancy wasn't considered and the results are not optimal. In this paper, we propose a new feature selection framework to globally minimize the feature redundancy with maximizing the given feature ranking scores, which can come from any supervised or unsupervised methods. Our new model has no parameter so that it is especially suitable for practical data mining application. Experimental results on benchmark data sets show that the proposed method consistently improves the feature selection results compared to the original methods. Meanwhile, we introduce a new unsupervised global and local discriminative feature selection method which can be unified with the global feature redundancy minimization framework and shows superior performance.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据