Article

A Generic Framework for Video Annotation via Semi-Supervised Learning

IEEE TRANSACTIONS ON MULTIMEDIA (2012)

Journal

IEEE TRANSACTIONS ON MULTIMEDIA

Volume 14, Issue 4, Pages 1206-1219

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TMM.2012.2191944

Keywords

Broadcast video; concave-convex procedure (CCCP); event detection; graph; Internet; multiple instance learning; semi-supervised learning; web-casting text

Categories

Computer Science, Information Systems; Computer Science, Software Engineering; Telecommunications

Funding

973 Program [2010CB327905, 2012CB316304]
National Natural Science Foundation of China [60833006, 61070104, 90920303]

Abstract

Learning-based video annotation is essential for video analysis and understanding, and various approaches have been proposed to avoid the intensive labor cost of purely manual annotation. However, a generic framework is still lacking because of several difficulties: dependence on domain knowledge, insufficient training data, lack of precise localization, and inefficacy on large-scale video datasets. In this paper, we propose a novel approach based on semi-supervised learning that uses information from the Internet to annotate interesting events in videos. Concretely, we propose a Fast Graph-based Semi-Supervised Multiple Instance Learning (FGSSMIL) algorithm, which aims to tackle these difficulties simultaneously in a generic framework for various video domains (e.g., sports, news, and movies) by jointly exploring small-scale expert-labeled videos and large-scale unlabeled videos to train the models. The expert-labeled videos are obtained from the analysis and alignment of well-structured video-related text (e.g., movie scripts, web-casting text, closed captions). The unlabeled data are obtained by querying related events from video search engines (e.g., YouTube, Google) in order to provide more distributional information for event modeling. Two critical issues of FGSSMIL are: 1) how to assign the weights for graph construction, where the weight of an edge specifies the similarity between two data points; to tackle this problem, we propose a novel Multiple Instance Learning Induced Similarity (MILIS) measure by learning instance-sensitive classifiers; and 2) how to solve the algorithm efficiently for large-scale datasets through an optimization approach; to address this issue, the Concave-Convex Procedure (CCCP) and a nonnegative multiplicative updating rule are adopted. We perform extensive experiments in three popular video domains: movies, sports, and news. The results, compared with state-of-the-art methods, are promising and demonstrate the effectiveness and efficiency of the proposed approach.
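
To make the graph-based idea in the abstract concrete, the following is a minimal sketch of generic graph-based semi-supervised learning, in which edge weights encode the similarity between data points and a few labels spread over the graph. It is not FGSSMIL: the Gaussian similarity stands in for the paper's MILIS measure, a plain iterative label propagation stands in for the CCCP and multiplicative-update solver, and the function names, toy points, and parameters (`sigma`, `alpha`, `iters`) are all assumptions made for illustration.

```python
import math


def gaussian_weights(points, sigma=1.0):
    """Edge weight w_ij = exp(-||x_i - x_j||^2 / (2 sigma^2)), no self-loops.

    Assumption: a Gaussian kernel is used here only as a stand-in similarity.
    """
    n = len(points)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
                w[i][j] = math.exp(-d2 / (2.0 * sigma ** 2))
    return w


def propagate_labels(w, labels, num_classes, alpha=0.9, iters=200):
    """Iterate F <- alpha * S F + (1 - alpha) * Y, with S = D^-1/2 W D^-1/2.

    labels[i] is a class index for a labeled point and None for an unlabeled one.
    Returns the predicted class index for every point.
    """
    n = len(w)
    deg = [sum(row) for row in w]
    # Symmetrically normalized similarity matrix.
    s = [[w[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]
    # One-hot label matrix; rows of unlabeled points are all zeros.
    y = [[1.0 if labels[i] == c else 0.0 for c in range(num_classes)] for i in range(n)]
    f = [row[:] for row in y]
    for _ in range(iters):
        f = [
            [
                alpha * sum(s[i][k] * f[k][c] for k in range(n)) + (1 - alpha) * y[i][c]
                for c in range(num_classes)
            ]
            for i in range(n)
        ]
    return [max(range(num_classes), key=lambda c: f[i][c]) for i in range(n)]


# Toy data: two well-separated clusters, one labeled point in each.
points = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (3.0, 3.0), (3.2, 2.9), (2.9, 3.1)]
labels = [0, None, None, 1, None, None]
predicted = propagate_labels(gaussian_weights(points, sigma=0.5), labels, num_classes=2)
print(predicted)
```

In this toy setting each unlabeled point takes the label of the labeled point in its own cluster, because within-cluster edge weights dominate the cross-cluster ones. The dense O(n^2) matrices used here would not scale to large video collections, which is the efficiency problem the abstract says the paper addresses.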

Instance elimination strategy for non-convex multiple-instance support vector machine

Min Yuan, Yitian Xu

Summary: This research introduces a safe screening rule to reduce the storage burden of the multiple-instance SVM. It designs an instance elimination strategy, a dual screening method, and a smart dual coordinate descent method to speed up the solution.

APPLIED SOFT COMPUTING (2022)