4.7 Article

Transductive Multilabel Learning via Label Set Propagation

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2011.141

关键词

Data mining; machine learning; multilabel learning; transductive learning; semi-supervised learning; unlabeled data

资金

  1. National Fundamental Research Program of China [2010CB327900]
  2. National Science Foundation of China [61073097]
  3. Jiangsu Science Foundation [BK2011566]
  4. Jiangsu 333 High-Level Talent Cultivation Program
  5. Hong Kong Baptist University Faculty Research Grants
  6. Hong Kong Research Grant Council [201508]

向作者/读者索取更多资源

The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e. g., automatic image annotations and gene function analysis. Current research on multilabel classification focuses on supervised settings which assume existence of large amounts of labeled training data. However, in many applications, the labeling of multilabeled data is extremely expensive and time consuming, while there are often abundant unlabeled data available. In this paper, we study the problem of transductive multilabel learning and propose a novel solution, called TRAsductive Multilabel Classification (TRAM), to effectively assign a set of multiple labels to each instance. Different from supervised multilabel learning methods, we estimate the label sets of the unlabeled instances effectively by utilizing the information from both labeled and unlabeled data. We first formulate the transductive multilabel learning as an optimization problem of estimating label concept compositions. Then, we derive a closed-form solution to this optimization problem and propose an effective algorithm to assign label sets to the unlabeled instances. Empirical studies on several real-world multilabel learning tasks demonstrate that our TRAM method can effectively boost the performance of multilabel classification by using both labeled and unlabeled data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据