4.7 Article

Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 17, 期 4, 页码 498-511

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2015.2398195

关键词

Concept fusion; domain adaption; multimodal fusion; video concept indexing

资金

  1. NSFC [6127327461370127]
  2. 973 Program [2011CB302203]
  3. National Key Technology R&D Program of China [2012BAH01F03, NSFB4123104, FRFCU 2014JBZ004, Z131110001913143]
  4. Tsinghua-Tencent Joint Laboratory for IIT
  5. Natural Sciences and Engineering Research Council of Canada (NSERC) [RGPIN239031]

向作者/读者索取更多资源

Semantic models are widely used to bridge the semantic gap between low-level features and high-level features in video concept indexing. Multimodal fusion and concept fusion are two commonly used approaches in building semantic models. In the previous work, domain adaptation is neglected in multimodal fusion, and many probability maximization based and unsupervised concept fusion methods are counterintuitive since they do not incorporate subjective human intuition. In this paper, we present a new two-stage semantic model combining the multimodal fusion and the concept fusion incorporating human heuristics. In the multimodal fusion model, we employ a new generic unsupervised method, namely, domain adaptive linear combination (DALC), to update the linear combination (LC) weights by incorporating the differences of element distributions between training and testing domains. In the concept fusion model, a novel mechanical node equilibrium (NE) model is developed by using forces to model the concept correlations to update the score of concepts represented by nodes. It is intuitive and can incorporate multiple kinds of correlations simultaneously to construct more sophisticated semantic structure. Compared to other state-of-the-art supervised and unsupervised methods, the new model can use either unsupervised or supervised factors to significantly improve the mean inferred average precision (MAP) performance on all datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据