4.7 Article

Fast affinity propagation clustering: A multilevel approach

Journal

PATTERN RECOGNITION
Volume 45, Issue 1, Pages 474-486

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2011.04.032

Keywords

Clustering; Affinity propagation; Graph partitioning; Spectral clustering; Manifold structure

Funding

  1. National Natural Science Foundation of China [60803097, 61003198, 61001202, 60970067]
  2. National Science and Technology Ministry of China [9140A07011810DZ0107, 9140A07021010DZ0131]
  3. Fund for Foreign Scholars in University Research and Teaching Programs [B07048]
  4. Fundamental Research Funds for the Central Universities [JY10000902001, K50510020001, JY10000902045]
  5. National Research Foundation for the Doctoral Program of Higher Education of China [20100203120008]

Ask authors/readers for more resources

In this paper, we propose a novel Fast Affinity Propagation clustering approach (FAP). FAP simultaneously considers both local and global structure information contained in datasets, and is a high-quality multilevel graph partitioning method that can implement both vector-based and graph-based clustering. First, a new Fast Sampling algorithm (FS) is proposed to coarsen the input sparse graph and choose a small number of final representative exemplars. Then a density-weighted spectral clustering method is presented to partition those exemplars on the global underlying structure of data manifold. Finally, the cluster assignments of all data points can be achieved through their corresponding representative exemplars. Experimental results on two synthetic datasets and many real-world datasets show that our algorithm outperforms the state-of-the-art original affinity propagation and spectral clustering algorithms in terms of speed, memory usage, and quality on both vector-based and graph-based clustering. (C) 2011 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Environmental Sciences

Deep Fully Convolutional Embedding Networks for Hyperspectral Images Dimensionality Reduction

Na Li, Deyun Zhou, Jiao Shi, Mingyang Zhang, Tao Wu, Maoguo Gong

Summary: Due to the superior spatial-spectral extraction capability of CNN, it has great potential in DR of HSIs. However, most CNN-based methods are supervised, while unsupervised methods often focus on data reconstruction rather than discriminability. Therefore, a deep fully convolutional embedding network (DFCEN) is proposed in this study to address these issues and improve the effectiveness of unsupervised learning.

REMOTE SENSING (2021)

Article Neurosciences

Comprehensive subtyping of Parkinson's disease patients with similarity fusion: a case study with BioFIND data

Matthew Brendel, Chang Su, Yu Hou, Claire Henchcliffe, Fei Wang

Summary: This study aimed to identify subtypes of moderate-to-advanced Parkinson's disease by comprehensively considering motor and non-motor manifestations. Three unique subtypes emerged from the clustering results, each characterized by different levels of symptom severity. These subtypes showed significant differences in motor and non-motor clinical features, providing important information for further research on the treatment and management of Parkinson's disease.

NPJ PARKINSONS DISEASE (2021)

Article Health Care Sciences & Services

Clinical subphenotypes in COVID-19: derivation, validation, prediction, temporal patterns, and interaction with social determinants of health

Chang Su, Yongkang Zhang, James H. Flory, Mark G. Weiner, Rainu Kaushal, Edward J. Schenck, Fei Wang

Summary: The study identified 4 biologically distinct subphenotypes of COVID-19 using machine learning and clinical data, which were highly predictive of clinical outcomes. It found varying prevalence of subphenotypes across the peak of the outbreak in NYC. Furthermore, social determinants of health specifically influenced mortality outcomes in certain subphenotypes.

NPJ DIGITAL MEDICINE (2021)

Article Clinical Neurology

Comparison of the Parkinson's KinetiGraph to off/on levodopa response testing: Single center experience

Ivan Guan, Maissa Trabilsy, Samantha Barkan, Ashwin Malhotra, Yu Hou, Fei Wang, Natalie Hellmers, Harini Sarva, Claire Henchcliffe

Summary: This study compared levodopa off/on testing with Parkinson's Kinetigraph motor scores in PD patients. The results showed that a robust off/on response does not necessarily indicate adequately controlled motor symptoms. The PKG may provide additional clinically relevant data on motor symptoms for prospective observational studies.

CLINICAL NEUROLOGY AND NEUROSURGERY (2021)

Article Computer Science, Interdisciplinary Applications

CODER: Knowledge-infused cross-lingual medical term embedding for term normalization

Zheng Yuan, Zhengyun Zhao, Haixia Sun, Jiao Li, Fei Wang, Sheng Yu

Summary: This paper introduces knowledge-aware embedding, CODER, a critical tool for medical term normalization. By utilizing contrastive learning and a medical knowledge graph, CODER can extract semantic similarity and relatedness of medical concepts, which can be used for medical term normalization or feature extraction for machine learning.

JOURNAL OF BIOMEDICAL INFORMATICS (2022)

Article Oncology

Simulating Colorectal Cancer Trials Using Real-World Data

Zhaoyi Chen, Hansi Zhang, Thomas J. George, Yi Guo, Mattia Prosperi, Jingchuan Guo, Dejana Braithwaite, Fei Wang, Warren Kibbe, Lynne Wagner, Jiang Bian

Summary: In this study, we simulated colorectal cancer trials using real-world data and tested two simulation scenarios. The results showed that our simulations can generate effectiveness and safety outcomes comparable with the original trials.

JCO CLINICAL CANCER INFORMATICS (2022)

Article Environmental Sciences

An Efficient Feature Extraction Network for Unsupervised Hyperspectral Change Detection

Hongyu Zhao, Kaiyuan Feng, Yue Wu, Maoguo Gong

Summary: This paper proposes a novel feature extraction network that combines Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN) for hyperspectral change detection tasks. The experimental results demonstrate that the proposed method yields reliable detection results and has fewer noise regions.

REMOTE SENSING (2022)

Article Environmental Sciences

Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images

Junyu Gao, Maoguo Gong, Xuelong Li

Summary: In this paper, we propose a method named SwinCounter for object counting in remote sensing. The method addresses the issue of imbalanced object labels by introducing a Balanced MSE Loss and captures multi-scale information accurately using the attention mechanism. Experiments on the RSOC dataset demonstrate the competitiveness and superiority of the proposed method.

REMOTE SENSING (2022)

Review Medicine, General & Internal

Algorithmic fairness in computational medicine

Jie Xu, Yunyu Xiao, Wendy Hui Wang, Yue Ning, Elizabeth A. Shenkman, Jiang Bian, Fei Wang

Summary: Machine learning models are increasingly being used in clinical decision-making, but recent research has highlighted the potential biases that these techniques may introduce, particularly for vulnerable ethnic minorities. This paper provides a comprehensive review of algorithmic fairness in computational medicine, discussing different types of bias, metrics for quantifying fairness, and methods for mitigating bias. It also summarizes popular software libraries and tools for evaluating and mitigating bias, serving as a valuable resource for researchers and practitioners in computational medicine.

EBIOMEDICINE (2022)

Article Computer Science, Information Systems

Ontology-aware Prescription Recommendation in Treatment Pathways Using Multi-evidence Healthcare Data

Zijun Yao, Bin Liu, Fei Wang, Daby Sow, Ying Li

Summary: This article proposes a novel prescription recommendation framework called OntoPath, which predicts the next drug in chronic disease treatment pathways by integrating multiple medical evidence from domain knowledge guidance, medical history profiling, and side information utilization. Extensive experiments on a large-scale depression cohort demonstrate the effectiveness of OntoPath in prescription recommendation.

ACM TRANSACTIONS ON INFORMATION SYSTEMS (2023)

Review Genetics & Heredity

Application of Deep Learning on Single-cell RNA Sequencing Data Analysis: A Review

Matthew Brendel, Chang Su, Zilong Bai, Hao Zhang, Olivier Elemento, Fei Wang

Summary: Single-cell RNA sequencing (scRNA-seq) is widely used to quantify the gene expression profile of thousands of single cells simultaneously. Deep learning techniques have emerged as a promising tool for scRNA-seq data analysis, allowing for the extraction of informative and compact features from noisy and high-dimensional data. This review surveys recent developments in deep learning for scRNA-seq data analysis, highlights key advancements made by deep learning in the analysis pipeline, and discusses the benefits and challenges of applying deep learning to scRNA-seq data.

GENOMICS PROTEOMICS & BIOINFORMATICS (2022)

Review Health Care Sciences & Services

Multimodal machine learning in precision health: A scoping review

Adrienne Kline, Hanyin Wang, Yikuan Li, Saya Dennis, Meghan Hutch, Zhenxing Xu, Fei Wang, Feixiong Cheng, Yuan Luo

Summary: This review summarizes current studies on multi-modal data fusion in the health sector, highlighting the common use of multi-modal methods in neurology and oncology and the improved predictive performance achieved through data fusion. However, the lack of clear clinical deployment strategies, FDA approval, and analysis of biases and healthcare disparities in diverse sub-populations was noted.

NPJ DIGITAL MEDICINE (2022)

Article Computer Science, Artificial Intelligence

On Expressivity and Trainability of Quadratic Networks

Feng-Lei Fan, Mengzhou Li, Fei Wang, Rongjie Lai, Ge Wang

Summary: Inspired by the diversity of biological neurons, this paper explores the application of quadratic artificial neurons in deep learning and highlights the differences in expressivity and training risk between traditional neurons and quadratic networks with or without quadratic activation. By applying spline theory and algebraic geometry, the superior model expressivity of quadratic networks over traditional networks is mathematically demonstrated, and an effective training strategy called ReLinear is proposed to stabilize the training process of quadratic networks.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Proceedings Paper Computer Science, Information Systems

Harnessing Biomedical Literature to Calibrate Clinicians' Trust in AI Decision Support Systems

Qian Yang, Yuexing Hao, Kexin Quan, Stephen Yang, Yiran Zhao, Volodymyr Kuleshov, Fei Wang

Summary: Clinical decision support tools (DSTs) powered by AI can improve diagnostic and treatment decision-making, but AI models are not always correct. To address this, researchers investigated how clinicians validate each other's suggestions and designed a new DST that incorporates these interactions. The design uses GPT-3 to provide literature evidence showcasing the robustness and applicability of AI suggestions. A prototype study with clinicians demonstrated the promise of this approach and revealed new opportunities for design and research.

PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023) (2023)

Article Health Care Sciences & Services

Exploring the feasibility of using real-world data from a large clinical data research network to simulate clinical trials of Alzheimer's disease

Zhaoyi Chen, Hansi Zhang, Yi Guo, Thomas J. George, Mattia Prosperi, William R. Hogan, Zhe He, Elizabeth A. Shenkman, Fei Wang, Jiang Bian

Summary: This study explored the feasibility of using real-world data to simulate clinical trials for Alzheimer's disease, comparing different formulations of donepezil. Two main simulation scenarios were considered: one-arm simulation and two-arm simulation with propensity score matching. Higher SAE rates were observed in the simulated trials compared to the original trial, indicating potential limitations of the approach.

NPJ DIGITAL MEDICINE (2021)

Article Computer Science, Artificial Intelligence

Exploiting sublimated deep features for image retrieval

Guang-Hai Liu, Zuo-Yong Li, Jing-Yu Yang, David Zhang

Summary: This article introduces a novel image retrieval method that improves retrieval performance by using sublimated deep features. The method incorporates orientation-selective features and color perceptual features, effectively mimicking these mechanisms to provide a more discriminating representation.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Region-adaptive and context-complementary cross modulation for RGB-T semantic segmentation

Fengguang Peng, Zihan Ding, Ziming Chen, Gang Wang, Tianrui Hui, Si Liu, Hang Shi

Summary: RGB-Thermal (RGB-T) semantic segmentation is an emerging task that aims to improve the robustness of segmentation methods under extreme imaging conditions by using thermal infrared modality. The challenges of foreground-background distinguishment and complementary information mining are addressed by proposing a cross modulation process with two collaborative components. Experimental results show that the proposed method achieves state-of-the-art performances on current RGB-T segmentation benchmarks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

F-SCP: An automatic prompt generation method for specific classes based on visual language pre-training models

Baihong Han, Xiaoyan Jiang, Zhijun Fang, Hamido Fujita, Yongbin Gao

Summary: This paper proposes a novel automatic prompt generation method called F-SCP, which focuses on generating accurate prompts for low-accuracy classes and similar classes. Experimental results show that our approach outperforms state-of-the-art methods on six multi-domain datasets.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Residual Deformable Convolution for better image de-weathering

Huikai Liu, Ao Zhang, Wenqian Zhu, Bin Fu, Bingjian Ding, Shengwu Xiong

Summary: Adverse weather conditions present challenges for computer vision tasks, and image de-weathering is an important component of image restoration. This paper proposes a multi-patch skip-forward structure and a Residual Deformable Convolutional module to improve feature extraction and pixel-wise reconstruction.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

A linear transportation LP distance for pattern recognition

Oliver M. Crook, Mihai Cucuringu, Tim Hurst, Carola-Bibiane Schonlieb, Matthew Thorpe, Konstantinos C. Zygalakis

Summary: The transportation LP distance (TLP) is a generalization of the Wasserstein WP distance that can be applied directly to color or multi-channelled images, as well as multivariate time-series. TLP interprets signals as functions, while WP interprets signals as measures. Although both distances are powerful tools in modeling data with spatial or temporal perturbations, their computational cost can be prohibitively high for moderate pattern recognition tasks. The linear Wasserstein distance offers a method for projecting signals into a Euclidean space, and in this study, we propose linear versions of the TLP distance (LTLP) that show significant improvement over the linear WP distance in signal processing tasks while being several orders of magnitude faster to compute than the TLP distance.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Learning a target-dependent classifier for cross-domain semantic segmentation: Fine-tuning versus meta-learning

Haitao Tian, Shiru Qu, Pierre Payeur

Summary: This paper proposes a method of target-dependent classifier, which optimizes the joint hypothesis of domain adaptation into a target-dependent hypothesis that better fits with the target domain clusters through an unsupervised fine-tuning strategy and the concept of meta-learning. Experimental results demonstrate that this method outperforms existing techniques in synthetic-to-real adaptation and cross-city adaptation benchmarks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

KGSR: A kernel guided network for real-world blind super-resolution

Qingsen Yan, Axi Niu, Chaoqun Wang, Wei Dong, Marcin Wozniak, Yanning Zhang

Summary: Deep learning-based methods have achieved remarkable results in the field of super-resolution. However, the limitation of paired training image sets has led researchers to explore self-supervised learning. However, the assumption of inaccurate downscaling kernel functions often leads to degraded results. To address this issue, this paper introduces KGSR, a kernel-guided network that trains both upscaling and downscaling networks to generate high-quality high-resolution images even without knowing the actual downscaling process.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Gait feature learning via spatio-temporal two-branch networks

Yifan Chen, Xuelong Li

Summary: Gait recognition is a popular technology for identification due to its ability to capture gait features over long distances without cooperation. However, current methods face challenges as they use a single network to extract both temporal and spatial features. To solve this problem, we propose a two-branch network that focuses on spatial and temporal feature extraction separately. By combining these features, we can effectively learn the spatio-temporal information of gait sequences.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

PAMI: Partition Input and Aggregate Outputs for Model Interpretation

Wei Shi, Wentao Zhang, Wei-shi Zheng, Ruixuan Wang

Summary: This article proposes a simple yet effective visualization framework called PAMI, which does not require detailed model structure and parameters to obtain visualization results. It can be applied to various prediction tasks with different model backbones and input formats.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Disturbance rejection with compensation on features

Xiaobo Hu, Jianbo Su, Jun Zhang

Summary: This paper reviews the latest technologies in pattern recognition, highlighting their instabilities and failures in practical applications. From a control perspective, the significance of disturbance rejection in pattern recognition is discussed, and the existing problems are summarized. Finally, potential solutions related to the application of compensation on features are discussed to emphasize future research directions.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

ECLAD: Extracting Concepts with Local Aggregated Descriptors

Andres Felipe Posada-Moreno, Nikita Surya, Sebastian Trimpe

Summary: Convolutional neural networks are widely used in critical systems, and explainable artificial intelligence has proposed methods for generating high-level explanations. However, these methods lack the ability to determine the location of concepts. To address this, we propose a novel method for automatic concept extraction and localization based on pixel-wise aggregations, and validate it using synthetic datasets.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Dynamic Graph Contrastive Learning via Maximize Temporal Consistency

Peng Bao, Jianian Li, Rong Yan, Zhongyi Liu

Summary: In this paper, a novel Dynamic Graph Contrastive Learning framework, DyGCL, is proposed to capture the temporal consistency in dynamic graphs and achieve good performance in node representation learning.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

ConvGeN: A convex space learning approach for deep-generative oversampling and imbalanced classification of small tabular datasets

Kristian Schultz, Saptarshi Bej, Waldemar Hahn, Markus Wolfien, Prashant Srivastava, Olaf Wolkenhauer

Summary: Research indicates that deep generative models perform poorly compared to linear interpolation-based methods for synthetic data generation on small, imbalanced tabular datasets. To address this, a new approach called ConvGeN, combining convex space learning with deep generative models, has been proposed. ConvGeN improves imbalanced classification on small datasets while remaining competitive with existing linear interpolation methods.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

H-CapsNet: A capsule network for hierarchical image classification

Khondaker Tasrif Noor, Antonio Robles-Kelly

Summary: In this paper, the authors propose H-CapsNet, a capsule network designed for hierarchical image classification. The network effectively captures hierarchical relationships using dedicated capsules for each class hierarchy. A modified hinge loss is utilized to enforce consistency among the involved hierarchies. Additionally, a strategy for dynamically adjusting training parameters is presented to achieve better balance between the class hierarchies. Experimental results demonstrate that H-CapsNet outperforms competing hierarchical classification networks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

CS-net: Conv-simpleformer network for agricultural image segmentation

Lei Liu, Guorun Li, Yuefeng Du, Xiaoyu Li, Xiuheng Wu, Zhi Qiao, Tianyi Wang

Summary: This study proposes a new agricultural image segmentation model called CS-Net, which uses Simple-Attention Block and Simpleformer to improve accuracy and inference speed, and addresses the issue of performance collapse of Transformers in agricultural image processing.

PATTERN RECOGNITION (2024)