4.7 Article

HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING
卷 26, 期 4, 页码 1923-1938

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2017.2667405

关键词

Large-scale visual recognition; hierarchical deep multi-task learning (HD-MTL); group-specific deep representations; incremental deep learning; tree classifier; soft prediction

资金

  1. National High-Technology Program of China (863 Program) [2014AA012301]
  2. Program for Changjiang Scholars and Innovative Research Team in University [IRT13090]
  3. Program of Shaanxi Province Innovative Research Team [2014KCT-17]
  4. National Science Foundation [1651166-CNS]
  5. National Natural Science Foundation of China [61622205]
  6. Zhejiang Provincial Natural Science Foundation of China [LR15F020002]

向作者/读者索取更多资源

In this paper, a hierarchical deep multi-task learning (HD-MTL) algorithm is developed to support large-scale visual recognition (e.g., recognizing thousands or even tens of thousands of atomic object classes automatically). To achieve more effective accomplishment of the coarse-to-fine tasks for hierarchical visual recognition, multiple sets of deep features are first extracted from the different layers of deep convolutional neural networks (deep CNNs). A visual tree is then learned by assigning the visually-similar atomic object classes with similar learning complexities into the same group, and it can provide a good environment for identifying the inter-related learning tasks automatically. By leveraging the inter-task relatedness (interclass similarities) to learn more discriminative group-specific deep representations, our deep multi-task learning algorithm can achieve the global optimum easily and obtain more discriminative node classifiers for distinguishing the visually-similar atomic object classes (in the same group) effectively. Our HD-MTL algorithm can control the inter-level error propagation effectively by using an end-to-end approach for jointly learning more representative deep CNNs (for image representation) and more discriminative tree classifier (for large-scale visual recognition) and updating them simultaneously. Our incremental deep learning algorithms can effectively adapt both the deep CNNs and the tree classifier to the new training images and the new object classes. Our experimental results have demonstrated that our HD-MTL algorithm can achieve very competitive results on both the accuracy rates and the computational efficiency for large-scale visual recognition.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据