4.7 Article

Soft Margin Multiple Kernel Learning

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2012.2237183

Keywords

Multiple kernel learning; support vector machines

Funding

  1. Multi-plAtform Game Innovation Centre (MAGIC) in Nanyang Technological University
  2. Interactive Digital Media Programme Office (IDMPO)

Multiple kernel learning (MKL) has been proposed for kernel methods by learning the optimal kernel from a set of predefined base kernels. However, in some practical applications the traditional L1MKL method often achieves worse results than the simplest method, which uses the average of the base kernels (i.e., the average kernel). To improve the effectiveness of MKL, this paper presents a novel soft margin perspective for MKL. Specifically, we introduce an additional slack variable, called the kernel slack variable, to each quadratic constraint of MKL, where each constraint corresponds to one support vector machine model using a single base kernel. We first show that L1MKL can be viewed as hard margin MKL, and then we propose a novel soft margin framework for MKL. Three commonly used loss functions, namely the hinge loss, the square hinge loss, and the square loss, can be readily incorporated into this framework, leading to new soft margin MKL objective functions. Many existing MKL methods can be shown to be special cases of our soft margin framework. For example, the hinge loss soft margin MKL leads to a new box constraint for the kernel combination coefficients. Using different hyper-parameter values for this formulation, we can inherently bridge the method using the average kernel, L1MKL, and the hinge loss soft margin MKL. The square hinge loss soft margin MKL unifies the family of elastic net constraint/regularizer based approaches, and the square loss soft margin MKL incorporates L2MKL naturally. Moreover, we develop efficient algorithms for solving both the hinge loss and the square hinge loss soft margin MKL. Comprehensive experimental studies of various MKL algorithms on several benchmark data sets and two real world applications, video action recognition and event recognition, demonstrate that our proposed algorithms can efficiently achieve an effective yet sparse solution for MKL.
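To make the box constraint mentioned in the abstract more concrete, here is a minimal sketch, not the authors' algorithm. It assumes the kernel combination coefficients mu satisfy sum(mu) = 1 and 0 <= mu_m <= theta, where the name `theta`, the per-kernel `scores`, and both function names are hypothetical choices made for illustration. With a linear objective over this set, a cap of 1/M forces the average kernel, while a cap of 1 or more recovers the plain simplex constraint used by L1MKL.

```python
import numpy as np

def box_constrained_weights(scores, theta):
    """Maximize sum_m mu[m] * scores[m] s.t. sum(mu) == 1 and 0 <= mu[m] <= theta.

    `scores` and `theta` are illustrative assumptions, not the paper's notation.
    """
    scores = np.asarray(scores, dtype=float)
    n_kernels = scores.size
    if theta * n_kernels < 1.0 - 1e-12:
        raise ValueError("theta must be at least 1/M, otherwise no feasible mu exists")
    mu = np.zeros(n_kernels)
    remaining = 1.0
    # Greedy fill: visit kernels from best score to worst and give each one
    # as much weight as the cap allows until the unit budget is spent.
    for m in np.argsort(-scores, kind="stable"):
        mu[m] = min(theta, remaining)
        remaining -= mu[m]
        if remaining <= 0.0:
            break
    return mu

def combine_kernels(kernels, mu):
    """Return the weighted sum of base kernel matrices, sum_m mu[m] * kernels[m]."""
    return np.tensordot(mu, np.asarray(kernels, dtype=float), axes=1)

if __name__ == "__main__":
    scores = [0.2, 0.9, 0.5]  # hypothetical per-kernel scores
    for theta in (1.0 / 3.0, 0.5, 1.0):
        print(theta, box_constrained_weights(scores, theta))
```

Intermediate caps spread the weight over several kernels while still leaving some at zero, which is one way to read the abstract's claim of an "effective yet sparse" combination.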

Recommended

Article Computer Science, Information Systems

Toward Equivalent Transformation of User Preferences in Cross Domain Recommendation

Xu Chen, Ya Zhang, Ivor W. Tsang, Yuangang Pan, Jingchao Su

Summary: This article discusses cross-domain recommendation in scenarios where different domains have the same set of users but no overlapping items. Most existing methods focus on shared-user representation, but fail to capture domain-specific features. In this article, an equivalent transformation learner (ETL) is proposed to preserve both domain-specific and overlapped features by modeling the joint distribution of user behaviors across domains.

ACM TRANSACTIONS ON INFORMATION SYSTEMS (2023)

Article Engineering, Industrial

Operation twins: production-intralogistics synchronisation in Industry 4.0

Mingxing Li, Daqiang Guo, Ming Li, Ting Qu, George Q. Huang

Summary: The widespread adoption of Industry 4.0 technologies is revolutionising manufacturing operations. This paper introduces a novel concept of operations twins (OT) for achieving synchronisation between production and intralogistics (PiL) through the use of Industry 4.0 technologies and innovative operations management strategies.

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH (2023)

Article Computer Science, Artificial Intelligence

LADDER: Latent boundary-guided adversarial training

Xiaowei Zhou, Ivor W. Tsang, Jie Yin

Summary: Deep Neural Networks have achieved great success in classification tasks, but they are vulnerable to adversarial attacks. Adversarial training is an effective strategy to improve the robustness of DNN models, but existing methods fail to generalize well to standard test data. To achieve a better trade-off between standard accuracy and adversarial robustness, a novel adversarial training framework called LADDER is proposed, which generates high-quality adversarial examples through perturbations on latent features.

MACHINE LEARNING (2023)

Article Computer Science, Artificial Intelligence

Open Set Domain Adaptation With Soft Unknown-Class Rejection

Yiming Xu, Lin Chen, Lixin Duan, Ivor W. Tsang, Jiebo Luo

Summary: This article studies the problem of open set domain adaptation and proposes a method that performs soft rejection of unknown target classes and simultaneously matches the source and target domains. Extensive experiments on three standard datasets validate the effectiveness of the proposed method.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Latent Class-Conditional Noise Model

Jiangchao Yao, Bo Han, Zhihan Zhou, Ya Zhang, Ivor W. Tsang

Summary: Learning with noisy labels is important in the Big Data era to save costs. Previous noise-transition-based methods achieved good performance but relied on impractical anchor sets. Our approach introduces a Bayesian framework for parameterizing the noise transition and solves the problem of ill-posed stochastic learning in back-propagation.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

DifFormer: Multi-Resolutional Differencing Transformer With Dynamic Ranging for Time Series Analysis

Bing Li, Wei Cui, Le Zhang, Ce Zhu, Wei Wang, Ivor W. Tsang, Joey Tianyi Zhou

Summary: Time series analysis is crucial in various fields such as economics, finance, and surveillance. However, traditional Transformer models have limitations in representing nuanced patterns in time series data. To overcome these challenges, we propose a novel Transformer architecture called DifFormer, which incorporates a multi-resolutional differencing mechanism. DifFormer outperforms existing models in classification, regression, and forecasting tasks, while also exhibiting efficiency and lower time consumption.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Noisy Label Learning With Provable Consistency for a Wider Family of Losses

Defu Liu, Wen Li, Lixin Duan, Ivor W. Tsang, Guowu Yang

Summary: Deep models have achieved impressive performance in various visual recognition tasks, but their generalization ability is compromised by noisy labels. This paper presents a dynamic label learning algorithm that allows the use of different loss functions for classification in the presence of label noise, ensuring that the search for the optimal classifier of noise-free samples is not hindered by label noise.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Missingness-Pattern-Adaptive Learning With Incomplete Data

Yongshun Gong, Zhibin Li, Wei Liu, Xiankai Lu, Xinwang Liu, Ivor W. W. Tsang, Yilong Yin

Summary: Many real-world problems involve data with missing values, which can hinder learning achievements. Existing methods use a universal model for all incomplete data, resulting in suboptimal models for each missingness pattern. This paper proposes a general model that can adjust to different missingness patterns, minimizing competition between data. The model is based on observable features and does not rely on data imputation, and a low-rank constraint is introduced to improve generalization ability.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

A Multi-View Multi-Task Learning Framework for Multi-Variate Time Series Forecasting

Jinliang Deng, Xiusi Chen, Renhe Jiang, Xuan Song, Ivor W. Tsang

Summary: Multi-variate time series (MTS) data is a common type of data abstraction in the real world, generated from a hybrid dynamical system. MTS data can be categorized into spatial and temporal attributes, and can be analyzed from the spatial view or temporal view. A novel multi-view multi-task (MVMT) learning framework is proposed to extract hidden MVMT information from MTS data while predicting. The framework improves effectiveness and efficiency of canonical architectures according to extensive experiments on three datasets.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Latent Representation Guided Multi-View Clustering

Shudong Huang, Ivor W. W. Tsang, Zenglin Xu, Jiancheng Lv

Summary: Multi-view clustering aims to reveal correlations between different input modalities in an unsupervised way. This paper proposes a novel model that learns a robust structured similarity graph and performs multi-view clustering simultaneously. The similarity graph is adaptively learned based on a latent representation that is invulnerable to noise and outliers. Experimental results on benchmark datasets demonstrate the effectiveness of the proposed model.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Taming Overconfident Prediction on Unlabeled Data From Hindsight

Jing Li, Yuangang Pan, Ivor W. Tsang

Summary: This article proposes a dual mechanism called adaptive sharpening (ADS) to minimize prediction uncertainty in semi-supervised learning. ADS applies a soft-threshold to mask out uncertain and negligible predictions, and sharpens the informed ones to distill certain predictions.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Distribution Matching for Machine Teaching

Xiaofeng Cao, Ivor W. Tsang

Summary: Machine teaching is a reverse problem of machine learning, aiming to guide the student towards its target hypothesis using known learning parameters. Previous studies focused on balancing teaching risk and cost to find the best teaching examples. However, when the student doesn't disclose any cue of the learning parameters, the optimization solver becomes ineffective. This article presents a distribution matching-based machine teaching strategy that iteratively shrinks teaching cost to eliminate boundary perturbations, providing an effective solution.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Structure-Informed Shadow Removal Networks

Yuhao Liu, Qing Guo, Lan Fu, Zhanghan Ke, Ke Xu, Wei Feng, Ivor W. Tsang, Rynson W. H. Lau

Summary: In this paper, a novel structure-informed shadow removal network (StructNet) is proposed to address the problem of shadow remnants in existing deep learning-based methods. StructNet reconstructs the structure information of the input image without shadows and uses it to guide the image-level shadow removal. Two main modules, MSFE and MFRA, are developed to extract image structural features and regularize feature consistency. Additionally, an extension called MStructNet is proposed to exploit multi-level structure information and improve shadow removal performance.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Complexity-guided Slimmable Decoder for Efficient Deep Video Compression

Zhihao Hu, Dong Xu

Summary: In this paper, we propose a combination of complexity-guided slimmable decoder (cgSlimDecoder) and skip-adaptive entropy coding (SaEC) for efficient deep video compression. The cgSlimDecoder automatically determines the optimal channel width for each slimmable convolution layer and allocates the optimal number of parameters for different modules, supporting multiple complexity levels. The SaEC further speeds up the decoding process by skipping the entropy coding for well-predicted elements. Experimental results demonstrate that the proposed methods significantly improve coding efficiency with minimal performance drop.

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2023)

Article Computer Science, Artificial Intelligence

Data-Efficient Learning via Minimizing Hyperspherical Energy

Xiaofeng Cao, Weiyang Liu, Ivor W. Tsang

Summary: This paper addresses the problem of data-efficient learning from scratch in scenarios where data or labels are expensive to collect. It proposes the MHEAL algorithm based on active learning on homeomorphic tubes of spherical manifolds, and provides comprehensive theoretical guarantees. Empirical results demonstrate the effectiveness of MHEAL in various applications for data-efficient learning.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)
