4.8 Article

Incorporating Diversity and Informativeness in Multiple-Instance Active Learning

Journal

IEEE TRANSACTIONS ON FUZZY SYSTEMS
Volume 25, Issue 6, Pages 1460-1475

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TFUZZ.2017.2717803

Keywords

Clustering; diversity; fuzzy rough set; multiple-instance active learning (MIAL)

Funding

  1. National Natural Science Foundation of China [61402460, 71371063, 61472257, 61672443]
  2. Natural Science Foundation of SZU [2017060]
  3. Basic Research Project of Knowledge Innovation Program in Shenzhen [JCYJ20150324140036825]
  4. Guangdong Provincial Science and Technology Plan Project [2013B040403005]
  5. HD Video R&D Platform for Intelligent Analysis and Processing in Guangdong Engineering Technology Research Centre of Colleges and Universities [GCZX-A1409]
  6. Hong Kong RGC General Research Fund [9042322 (CityU 11200116)]

Ask authors/readers for more resources

Multiple-instance active learning (MIAL) is a paradigm to collect sufficient training bags for a multiple-instance learning (MIL) problem, by selecting and querying the most valuable unlabeled bags iteratively. Existing works on MIAL evaluate an unlabeled bag by its informativeness with regard to the current classifier, but neglect the internal distribution of its instances, which can reflect the diversity of the bag. In this paper, two diversity criteria, i.e., clustering-based diversity and fuzzy rough set based diversity, are proposed for MIAL by utilizing a support vector machine (SVM) based MIL classifier. In the first criterion, a kernel k-means clustering algorithm is used to explore the hidden structure of the instances in the feature space of the SVM, and the diversity degree of an unlabeled bag is measured by the number of unique clusters covered by the bag. In the second criterion, the lower approximations in fuzzy rough sets are used to define a new concept named dissimilarity degree, which depicts the uniqueness of an instance so as to measure the diversity degree of a bag. By incorporating the proposed diversity criteria with existing informativeness measurements, new MIAL algorithms are developed, which can select bags with both high informativeness and diversity. Experimental comparisons demonstrate the feasibility and effectiveness of the proposed methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Self-representative kernel concept factorization

Wenhui Wu, Yujie Chen, Ran Wang, Le Ou-Yang

Summary: This paper proposes a semi-supervised self-representative kernel concept factorization (S3RKCF) method that integrates adaptive kernel learning and low-dimensional data representation learning into a unified model. An adaptive local geometric structure is acquired in the KCF-induced self-representation space to facilitate data representation learning. Limited supervisory information is imposed as constraints to enhance the discriminability of data representation. The proposed S3RKCF outperforms state-of-the-art methods in clustering and classification tasks according to experimental results.

KNOWLEDGE-BASED SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Joint Decision Tree and Visual Feature Rate Control Optimization for VVC UHD Coding

Mingliang Zhou, Xuekai Wei, Weijia Jia, Sam Kwong

Summary: In this paper, a joint decision tree and visual feature optimization rate control scheme for ultrahigh-definition (UHD) versatile video coding (VVC) is proposed. The scheme includes a new rate-distortion (R-D) model for UHD videos, a decision-tree-based multiclass classification scheme, and a convex optimization algorithm. Experimental results show that compared to other state-of-the-art algorithms, the proposed method achieves significant bit rate reductions while maintaining a given peak signal-to-noise ratio (PSNR) or structural similarity index measure (SSIM).

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Artificial Intelligence

A meta-framework for multi-label active learning based on deep reinforcement learning

Shuyue Chen, Ran Wang, Jian Lu

Summary: Multi-label Active Learning (MLAL) is an effective method that improves the performance of multi-label classifiers with less annotation effort. This paper proposes a deep reinforcement learning (DRL) model to explore a general evaluation method for MLAL and addresses label correlation and data imbalanced problems using a self-attention mechanism and a reward function. Experimental results show that the DRL-based MLAL method achieves comparable results to other methods reported in the literature.

NEURAL NETWORKS (2023)

Article Computer Science, Artificial Intelligence

Surrogate-Assisted Hybrid-Model Estimation of Distribution Algorithm for Mixed-Variable Hyperparameters Optimization in Convolutional Neural Networks

Jian-Yu Li, Zhi-Hui Zhan, Jin Xu, Sam Kwong, Jun Zhang

Summary: This article proposes a novel estimation of distribution algorithm (EDA), named surrogate-assisted hybrid-model EDA (SHEDA), for efficient hyperparameters optimization. The algorithm design includes hybrid-model EDA, orthogonal initialization strategy, and surrogate-assisted multi-level evaluation method. Experimental results show that SHEDA is very effective and efficient for hyperparameters optimization on widely used classification benchmark problems.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Automation & Control Systems

Many-Objective Job-Shop Scheduling: A Multiple Populations for Multiple Objectives-Based Genetic Algorithm Approach

Si-Chen Liu, Zong-Gan Chen, Zhi-Hui Zhan, Sang-Woon Jeon, Sam Kwong, Jun Zhang

Summary: This article addresses the job-shop scheduling problem with multiple objectives, including completion time, total tardiness, advance time, production cost, and machine loss. A multiple populations for multiple objectives genetic algorithm (MPMOGA) is proposed to optimize these objectives simultaneously. The MPMOGA algorithm utilizes an archive sharing technique and an archive update strategy to improve the quality and diversity of the solutions. Experimental results show that MPMOGA outperforms other state-of-the-art algorithms on most test instances.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Automation & Control Systems

Global-and-Local Collaborative Learning for Co-Salient Object Detection

Runmin Cong, Ning Yang, Chongyi Li, Huazhu Fu, Yao Zhao, Qingming Huang, Sam Kwong

Summary: This article proposes a global-and-local collaborative learning architecture (GLNet) to effectively extract interimage correspondence in co-salient object detection. The GLNet utilizes global and local correspondence modeling, pairwise correlation transformation, and correspondence aggregation to enhance the comprehensive interimage collaboration cues. The evaluation results demonstrate the superiority of GLNet over state-of-the-art competitors.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Engineering, Electrical & Electronic

Task-Oriented Compact Representation of 3D Point Clouds via A Matrix Optimization-Driven Network

Yue Qian, Junhui Hou, Qijian Zhang, Yiming Zeng, Sam Kwong, Ying He

Summary: This paper presents MOPS-Net, a deep learning-based method for compact representation of 3D point clouds using matrix optimization. It achieves favorable performance in various tasks and exhibits robustness to noisy data.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

A CTU-Level Screen Content Rate Control for Low-Delay Versatile Video Coding

Yi Chen, Meng Wang, Shiqi Wang, Zhangkai Ni, Sam Kwong

Summary: In this paper, a rate control scheme is proposed for screen content video coding in the VVC standard. The method relies on pre-analysis to obtain content information and incorporates complexity-aware rate models and distortion models to achieve optimal bit allocations. Experimental results demonstrate the effectiveness of the proposed method in improving coding performance.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

?-Domain VVC Rate Control Based on Nash Equilibrium

Jielian Lin, Aiping Huang, Tiesong Zhao, Xu Wang, Sam Kwong

Summary: In this paper, a solution is proposed to address the bit allocation problem in VVC video compression by formulating it as a Nash equilibrium problem. By introducing ?-domain RD models, a constrained optimization problem is derived and solved using a Newton method and Nash equilibrium. Experimental results demonstrate the effectiveness and superiority of the proposed method.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Artificial Intelligence

Region Purity-Based Local Feature Selection: A Multiobjective Perspective

Yu Zhou, Yan Qiu, Sam Kwong

Summary: In contrast to traditional feature selection methods, local feature selection methods partition the sample space and obtain feature subsets for each local region. However, most existing local feature selection algorithms lack a problem-specific objective function and instead use a distance-like objective function, leading to limited classification performance. In this article, we propose a novel objective function called region purity (RP) for local feature selection. To solve this problem, we use an improved nondominated sorting genetic algorithm III and develop a regional feature sharing strategy. Experimental results on various datasets demonstrate the effectiveness of our proposed RP-LFS. Compared to other state-of-the-art feature selection and local feature selection algorithms, RP-LFS achieves competitive classification accuracy while reducing the feature subset size.

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION (2023)

Article Computer Science, Artificial Intelligence

Extending version-space theory to multi-label active learning with imbalanced data

Ran Wang, Shuyue Chen, Yu Yu

Summary: Version space is a crucial concept in supervised learning, but its application in multi-label active learning has not been explored. This paper extends the version space theory from single-label scenario to multi-label scenario, establishes a spatial structure for the multi-label version space, and proposes a simplified representation and a new multi-label active learning algorithm. The algorithm is further enhanced by addressing the issue of class imbalance in multi-label data. Experimental comparisons demonstrate the feasibility and effectiveness of the proposed methods.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN With Dual-Discriminators

Runmin Cong, Wenyu Yang, Wei Zhang, Chongyi Li, Chun-Le Guo, Qingming Huang, Sam Kwong

Summary: Due to light absorption and scattering in water, underwater images often suffer from degradation issues such as low contrast, color distortion, and blurriness, making underwater understanding tasks more challenging. To address this, the study proposes a physical model-guided GAN model called PUGAN, which combines the advantages of GANs in visual aesthetics and physical model-based methods in scene adaptability. The proposed model includes a Parameters Estimation subnetwork for physical model inversion and a Two-Stream Interaction Enhancement subnetwork with a Degradation Quantization module. Dual-Discriminators are also designed for adversarial constraint to improve authenticity and visual aesthetics.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Engineering, Electrical & Electronic

Full-Reference Image Quality Assessment: Addressing Content Misalignment Issue by Comparing Order Statistics of Deep Features

Xingran Liao, Xuekai Wei, Mingliang Zhou, Sam Kwong

Summary: This paper proposes a deep order statistical similarity (DOSS) FR-IQA model to evaluate content-misaligned image pairs encountered in image reconstruction and texture synthesis tasks. DOSS compares the order statistics of deep features in the reference and distorted images to output perceptual quality scores. It mimics the human visual system's behavior and possesses advanced texture perception capability, producing superior quality assessment results on various texture synthesis algorithms.

IEEE TRANSACTIONS ON BROADCASTING (2023)

Article Engineering, Electrical & Electronic

Enhanced Motion Compensation for Deep Video Compression

Haifeng Guo, Sam Kwong, Chuanmin Jia, Shiqi Wang

Summary: Most deep learning-based video compression frameworks rely on motion estimation and compensation, but the artifacts of warped frames limit the performance. In this work, we propose enhanced motion compensation to reduce error propagation. We incorporate a designed convolutional neural network into Open DVC as the enhancement network, and optimize the framework with a single loss function considering the trade-off between bit cost and frame quality. Experimental results show that our model achieves significant bit savings and outperforms Open DVC in terms of PSNR and bit rate savings.

IEEE SIGNAL PROCESSING LETTERS (2023)

Article Computer Science, Hardware & Architecture

Anchor-Free Tracker Based on Space-Time Memory Network

Guang Han, Chen Cao, Jixin Liu, Sam Kwong

Summary: This article proposes a new Anchor-free Tracker based on Space-time Memory Network (ATSMN) to solve the appearance problems in object tracking. By utilizing space-time memory network, memory feature fusion network, and transformer feature cross fusion network, the tracker can effectively use temporal context information and better adapt to appearance changes, achieving accurate classification and regression results. Extensive experimental results show that ATSMN outperforms other advanced trackers on challenging benchmarks.

IEEE MULTIMEDIA (2023)

No Data Available