4.6 Article

Generalized Pair-Counting Similarity Measures for Clustering and Cluster Ensembles

Journal

IEEE ACCESS
Volume 5, Issue -, Pages 16904-16918

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2017.2741221

Keywords

Clustering evaluation; cluster ensembles; similarity measures

Funding

  1. Scientific and Technological Project of Guangzhou, China [201607010053, 201607010191, 201707010284]
  2. Outstanding Young Teachers in the Higher Education Institutions of Guangdong Province, China [Yq201401]
  3. Teaching Reform Project of Guangzhou University [JY2016033]
  4. Education Reform Project of the Guangdong Province, through the Project Research on Construction and Mining methods in Knowledge Graphs of Computer Sciences Courses [426, 236]
  5. Natural Science Foundation of Guangdong, China [2014A030313524, 2016A030313540]
  6. Science and Technology Projects of Guangdong Province, China [2016B010127001]
  7. Research Grants Council of the Hong Kong Special Administrative Region, China [CityU 11300715]
  8. City University of Hong Kong [7004674]

Ask authors/readers for more resources

In this paper, a number of pair-counting similarity measures associated with a general formulation of cluster ensemble are proposed. These measures are formulated based on our motivation to evaluate the consistency between an individual clustering solution and a cluster ensemble solution, or that between different cluster ensemble solutions, in a uniform manner. A number of criteria are proposed for the comparison of these generalized measures, from both the perspectives of theoretical analysis and experimental validation. We identify their different behaviors and their correlations in different scenarios of traditional clustering solutions and cluster ensembles, with the hope that the results of these studies could 1) serve as important criteria for the design and selection of evaluation measures for clustering solutions, and 2) provide explanations for ambiguous clustering results in related scenarios. Experiments with both synthetic and real data sets are conducted to verify our findings.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Hardware & Architecture

Adversarial Adaptive Interpolation in Autoencoders for Dually Regularizing Representation Learning

Guanyue Li, Xiwen Wei, Si Wu, Zhiwen Yu, Sheng Qian, Hau-San Wong

Summary: This article proposes an Adversarial Adaptive Interpolation-based AutoEncoder (AdvAI-AE) to address the mismatch issue caused by linear interpolation in the latent space. By training an additional interpolation correction module and applying prior matching, the performance gains in downstream tasks on benchmark datasets are significantly improved.

IEEE MULTIMEDIA (2022)

Article Computer Science, Information Systems

Unreliable-to-Reliable Instance Translation for Semi-Supervised Pedestrian Detection

Sihao Lin, Wenhao Wu, Si Wu, Yong Xu, Hau-San Wong

Summary: This paper proposes a method called Un2Reliab for generating realistic pedestrian instances in a semi-supervised setting. Un2Reliab uses an encoder-decoder-like generative network and a discriminative network to jointly train, and applies regularization and consistency to ensure the similarity and consistency of the synthesized instances with the real instances. Experimental results show that Un2Reliab is able to synthesize high-fidelity pedestrian instances and improves the state-of-the-art results on multiple semi-supervised pedestrian detection benchmarks.

IEEE TRANSACTIONS ON MULTIMEDIA (2022)

Article Computer Science, Artificial Intelligence

Learning scene-adaptive pseudo annotations for pedestrian detection in semi-supervised scenarios

Wenhao Wu, Qianfen Jiao, Hau-San Wong, Gaozhe Li, Si Wu

Summary: Sufficient labeled training data may not be available for pedestrian detection in many real-world scenes. In this paper, a Scene-adaptive Pseudo Annotation (SaPA) approach is proposed to improve the generalization performance and pseudo annotation quality for training a more precise and scene-specific pedestrian detector, by exploiting both source data with sufficient supervision and unannotated target data with domain-specific information.

KNOWLEDGE-BASED SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Semisupervised Classification With Novel Graph Construction for High-Dimensional Data

Zhiwen Yu, Fengxu Ye, Kaixiang Yang, Wenming Cao, C. L. Philip Chen, Lianglun Cheng, Jane You, Hau-San Wong

Summary: In this article, a novel graph construction method for semisupervised classification is proposed. The method optimizes the similarity matrix in both label space and an additional subspace to achieve a better and more robust result compared to the original data space. Additionally, a high-quality subspace is obtained by learning the projection matrix.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Asymmetric Graph-Guided Multitask Survival Analysis With Self-Paced Learning

Cheng Liu, Wenming Cao, Si Wu, Wenjun Shen, Dazhi Jiang, Zhiwen Yu, Hau-San Wong

Summary: This article introduces an asymmetric graph-guided multitask learning method combined with self-paced learning for survival analysis. The experimental results demonstrate that this method can improve the performance of survival analysis and achieve higher prediction accuracy compared to the previous state-of-the-art methods.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Automation & Control Systems

Semisupervised Multiple Choice Learning for Ensemble Classification

Jian Zhong, Xiangping Zeng, Wenming Cao, Si Wu, Cheng Liu, Zhiwen Yu, Hau-San Wong

Summary: This article proposes a semisupervised multiple choice learning approach to enhance the predictive performance of classification models by jointly training a network ensemble on partially labeled data.

IEEE TRANSACTIONS ON CYBERNETICS (2022)

Article Automation & Control Systems

Progressive Hybrid Classifier Ensemble for Imbalanced Data

Kaixiang Yang, Zhiwen Yu, C. L. Philip Chen, Wenming Cao, Hau-San Wong, Jane You, Guoqiang Han

Summary: The research team proposed a hybrid classifier ensemble framework that includes data transformation and an adaptive undersampling process, which effectively solves the classification problem of imbalanced data. They also designed a progressive ensemble framework to improve the performance of the classifier ensemble.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Unsupervised discriminative feature learning via finding a clustering-friendly embedding space

Wenming Cao, Zhongfan Zhang, Cheng Liu, Rui Li, Qianfen Jiao, Zhiwen Yu, Hau-San Wong

Summary: In this paper, an enhanced deep clustering network (EDCN) is proposed, which consists of a Feature Extractor, a Conditional Generator, a Discriminator, and a Siamese Network. The EDCN utilizes adversarial training to generate two types of data and original data, which are used to train the Feature Extractor for effective latent representation learning. A Siamese network is adopted to generate realistic data using pseudo-labels for enhancing the Feature Extractor. Extensive experiments show the effectiveness and superiority of the proposed EDCN.

PATTERN RECOGNITION (2022)

Article Engineering, Biomedical

View-Aware Collaborative Learning for Survival Prediction and Subgroup Identification

Cheng Liu, Si Wu, Dazhi Jiang, Zhiwen Yu, Hau-San Wong

Summary: Advances in high throughput experimental methods have led to the availability of diverse omic datasets in clinical analysis applications. Different types of omic data contribute to the understanding of disease progression. By integrating multiple omics data, the proposed View-aware Collaborative Learning (VaCoL) method can jointly boost the performance of survival prediction and subgroup identification.

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Self-Supervised Graph Completion for Incomplete Multi-View Clustering

Cheng Liu, Si Wu, Rui Li, Dazhi Jiang, Hau-San Wong

Summary: This work proposes a new approach for incomplete multi-view clustering by addressing the issue of missing instances through similarity graph completion and self-supervised multi-view graph completion algorithm. The inferred graph is leveraged in representation learning by incorporating constrained feature learning. The experimental results demonstrate the effectiveness of the proposed method.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Incremental Weighted Ensemble Broad Learning System for Imbalanced Data

Kaixiang Yang, Zhiwen Yu, C. L. Philip Chen, Wenming Cao, Jane You, Hau-San Wong

Summary: In this paper, a weighted broad learning system (BLS) model is proposed to address the imbalance problem and outliers and noises in imbalanced data. By introducing a weighting mechanism and density calculation, the adaptive weighted BLS (AWBLS) and incremental weighted ensemble BLS (IWEB) are proposed, which demonstrate better classification performance in experimental results.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Article Computer Science, Artificial Intelligence

Discriminator feature-based progressive GAN inversion?

Quanpeng Song, Guanyue Li, Si Wu, Wenjun Shen, Hau-San Wong

Summary: In this paper, a Discriminator Feature-based Progressive Inversion (DFPI) model is proposed for GAN-based image reconstruction and enhancement. By learning the mapping relationship between discriminator features and generator features, high-quality reconstruction and enhancement are achieved.

KNOWLEDGE-BASED SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

TSEV-GAN: Generative Adversarial Networks with Target-aware Style Encoding and Verification for facial makeup transfer

Zhen Xu, Si Wu, Qianfen Jiao, Hau-San Wong

Summary: This paper introduces a GAN-based generative model for accurately extracting and transferring makeup styles from reference facial images to target faces. The proposed model utilizes target-aware makeup style encoding and verification, and improves the accuracy and fidelity of makeup transfer through encoding the difference map and learning style consistency.

KNOWLEDGE-BASED SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Self-Guided Partial Graph Propagation for Incomplete Multiview Clustering

Cheng Liu, Rui Li, Si Wu, Hangjun Che, Dazhi Jiang, Zhiwen Yu, Hau-San Wong

Summary: In this work, the authors study the incomplete multiview clustering (IMVC) scenario, where some instances are missing in certain views. They propose a graph propagation-based approach to address the issue of missing entries in the partial graph, and to exploit both consistency and complementary information. The proposed method outperforms state-of-the-art methods in extensive experiments.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Information Systems

Semantic Regularized Class-Conditional GANs for Semi-Supervised Fine-Grained Image Synthesis

Tianyi Chen, Si Wu, Xuhui Yang, Yong Xu, Hau-San Wong

Summary: Learning effective generative models for natural image synthesis is essential to reduce the reliance of deep models on massive training data. This research introduces a Semantic Regularized class-conditional Generative Adversarial Network, named SReGAN, to address the issue of limited data and labels in Fine-Grained Image Synthesis (FGIS) tasks. By incorporating additional discriminators and classifiers, the generator is encouraged to model both marginal and class-conditional data distributions, and to discover the differences between object categories, ultimately generating high-fidelity images on various FGIS benchmarks.

IEEE TRANSACTIONS ON MULTIMEDIA (2022)

No Data Available