Article

A novel graph-based k-means for nonlinear manifold clustering and representative selection

Journal

NEUROCOMPUTING
Volume 143, Pages 109-122

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2014.05.067

Keywords

k-means; Manifold clustering; Random walk; Graph learning

Funding

  1. NSFC China [61273258]
  2. Ph.D. Programs Foundation of Ministry of Education of China [20120073110018]

In many real-world applications, the data lie on a low-dimensional nonlinear manifold rather than filling the high-dimensional input space. This poses a major challenge for most existing clustering and representative selection algorithms, which do not take manifold characteristics into account. The performance of such learning algorithms can be greatly improved if the manifold structure is considered. In this paper, we propose a graph-based k-means algorithm, GKM, which retains the simplicity of classic k-means while incorporating global information about the geometric distribution of the data. GKM exploits the intrinsic manifold structure for appropriate data clustering and representative selection. GKM is evaluated on both synthetic and real-life data sets and compares favorably with state-of-the-art approaches for clustering and for representative selection, including classic k-means, kernel k-means, spectral clustering, and clustering through ranking. Given the widespread appearance of manifold structures in real-world problems, GKM shows promising potential for partitioning manifold-distributed data. © 2014 Elsevier B.V. All rights reserved.
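
The general idea of clustering with graph distances instead of straight-line distances can be illustrated with a small sketch. This is not the GKM algorithm from the paper, whose details are not given on this page. It is a plain k-medoids loop run on shortest-path distances over a k-nearest-neighbour graph, written only to show why graph distances follow the shape of the data. All function names and parameters (`knn_graph`, `graph_distances`, `graph_kmedoids`, `k`, `init`) are hypothetical.

```python
import heapq
import math


def knn_graph(points, k):
    """Symmetric k-nearest-neighbour graph with Euclidean edge weights."""
    n = len(points)
    adj = [dict() for _ in range(n)]
    for i in range(n):
        nearest = sorted(
            (math.dist(points[i], points[j]), j) for j in range(n) if j != i
        )
        for d, j in nearest[:k]:
            adj[i][j] = d
            adj[j][i] = d
    return adj


def graph_distances(adj, source):
    """Dijkstra shortest-path distances from `source` to every node."""
    dist = [math.inf] * len(adj)
    dist[source] = 0.0
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, w in adj[u].items():
            nd = d + w
            if nd < dist[v]:
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def graph_kmedoids(points, n_clusters, k=3, init=None, max_iter=20):
    """k-medoids on graph distances (an illustrative stand-in, not GKM)."""
    adj = knn_graph(points, k)
    dist = [graph_distances(adj, i) for i in range(len(points))]
    medoids = list(init) if init is not None else list(range(n_clusters))
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point joins the medoid closest along the graph.
        labels = [
            min(range(n_clusters), key=lambda c: dist[medoids[c]][i])
            for i in range(len(points))
        ]
        # Update step: the new medoid minimises total graph distance to members.
        new_medoids = []
        for c in range(n_clusters):
            members = [i for i, lab in enumerate(labels) if lab == c]
            if not members:
                new_medoids.append(medoids[c])
                continue
            new_medoids.append(
                min(members, key=lambda m: sum(dist[m][i] for i in members))
            )
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return labels, medoids


if __name__ == "__main__":
    # Two parallel rows of points, ten units apart.
    pts = [(float(i), 0.0) for i in range(5)] + [(float(i), 10.0) for i in range(5)]
    print(graph_kmedoids(pts, 2, k=2, init=[0, 5]))
```

In this sketch the medoids double as cluster representatives, which loosely mirrors the representative selection task named in the abstract. The neighbourhood size `k` controls how closely the graph follows the data: if it is too small the graph splits into disconnected pieces, and if it is too large, edges start to cut across gaps between clusters.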

Recommended

Article Biology

BiT-MAC: Mortality prediction by bidirectional time and multi-feature attention coupled network on multivariate irregular time series

Qinfen Wang, Geng Chen, Xuting Jin, Siyuan Ren, Gang Wang, Longbing Cao, Yong Xia

Summary: Mortality prediction is crucial in evaluating illness severity and improving patient prognosis. Existing methods for analyzing multivariate time series (MTSs) suffer from sparse and incomplete data. We propose a BiT-MAC network that captures both intra-time series coupling and inter-time series coupling to estimate missing values and improve MTS-based prediction. Extensive experiments on clinical datasets demonstrate the superiority of BiT-MAC and the interpretability of its features.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Engineering, Electrical & Electronic

Learning a Coordinated Network for Detail-Refinement Multiexposure Image Fusion

Jiawei Li, Jinyuan Liu, Shihua Zhou, Qiang Zhang, Nikola K. Kasabov

Summary: In this paper, we propose a coordinated learning network for detail-refinement in multi-exposure image fusion. Our network obtains shallow feature maps from over/under-exposed source images and generates smooth attention weight maps to establish global connections. By cooperating with an edge revision module, our method effectively refines edge details and suppresses noise in fused images.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Artificial Intelligence

An Efficient Method for Modeling Nonoccurring Behaviors by Negative Sequential Patterns With Loose Constraints

Ping Qiu, Yongshun Gong, Yuhai Zhao, Longbing Cao, Chengqi Zhang, Xiangjun Dong

Summary: This article explores an efficient method for mining negative sequential patterns (NSPs) using temporal point processes (TPPs) to model frequently occurring and nonoccurring events and behaviors. By loosening constraints, a new definition of negative containment is provided, and an efficient method for calculating the supports of negative sequences is proposed. Finally, a novel and efficient algorithm is presented to identify valuable NSPs.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Editorial Material Computer Science, Artificial Intelligence

AI and data science for smart emergency, crisis and disaster resilience

Longbing Cao

Summary: The uncertain world faces increasing emergencies, crises and disasters (ECDs), including the COVID-19 pandemic, Hurricane Ian, global financial inflation and recession, misinformation disasters, and cyberattacks. AI for smart disaster resilience (AISDR) transforms traditional reactive and scripted disaster management into proactive and intelligent resilience in the face of diverse ECDs. This article provides a systematic overview of various ECDs, conventional ECD management, ECD data complexities, and the research landscape of AISDR. Translational disaster AI is crucial in enabling smart disaster resilience.

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS (2023)

Article Computer Science, Artificial Intelligence

Trans-AI/DS: transformative, transdisciplinary and translational artificial intelligence and data science

Longbing Cao

Summary: After years of development, a new generation of AI and data science has emerged, based on the integration of science, technology, and engineering. This new generation embraces Trans-AI/DS thinking, which combines AI and data science to promote transformative, transdisciplinary, and translational approaches. These paradigm shifts encourage innovative thinking beyond traditional AI and data-driven methods, and focus on the complexities of human intelligence, nature, society, and their creations.

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS (2023)

Article Computer Science, Artificial Intelligence

Trans-AI/DS: transformative, transdisciplinary and translational artificial intelligence and data science

Longbing Cao

Summary: After 70 years of AI and 50 years of data science (DS), AI/DS has entered a new age in which it is built upon the integration of science, technology, and engineering. This integration has resulted in Trans-AI/DS, which promotes transformative, transdisciplinary, and translational thinking, methodologies, and practices in AI/DS.

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS (2023)

Review Biotechnology & Applied Microbiology

Machine Learning for Brain MRI Data Harmonisation: A Systematic Review

Grace Wen, Vickie Shim, Samantha Jane Holdsworth, Justin Fernandez, Miao Qiao, Nikola Kasabov, Alan Wang

Summary: This study explores the performance of various machine learning algorithms in harmonizing MRI data, summarizing the findings from relevant peer-reviewed articles. It provides guidelines for current methods and identifies potential future research directions. MRI data can be harmonized either implicitly (n = 21) or explicitly (n = 20).

BIOENGINEERING-BASEL (2023)

Article Computer Science, Artificial Intelligence

Image Segmentation Based on Fuzzy Low-Rank Structural Clustering

Sensen Song, Zhenhong Jia, Jie Yang, Nikola Kasabov

Summary: This paper proposes a fuzzy clustering method based on low-rank representation for image segmentation, which improves clustering results by enhancing superpixels constructed by edges and adding a fuzzy regularization term.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Copula Variational LSTM for High-Dimensional Cross-Market Multivariate Dependence Modeling

Jia Xu, Longbing Cao

Summary: This paper proposes a method that combines deep variational sequential learning with copula-based statistical dependence modeling to address the challenging problem of modeling high-dimensional, long-range dependencies between nonnormal multivariates. The method can characterize both the temporal dependence degrees and structures between the hidden variables representing the nonnormal multivariates, and it outperforms benchmarks in terms of both technical significance and portfolio forecasting performance.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

GeSeNet: A General Semantic-Guided Network With Couple Mask Ensemble for Medical Image Fusion

Jiawei Li, Jinyuan Liu, Shihua Zhou, Qiang Zhang, Nikola K. Kasabov

Summary: Currently, multimodal medical image fusion technology has become an essential means for predicting diseases and studying pathology. To address the challenge of preserving unique features from different modal source images while ensuring time efficiency, a flexible semantic-guided architecture called GeSeNet is proposed. The experimental results demonstrate that our method outperforms ten state-of-the-art methods in generating high-quality fused images.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Revealing the Distributional Vulnerability of Discriminators by Implicit Generators

Zhilin Zhao, Longbing Cao, Kun-Yu Lin

Summary: In deep neural learning, training a discriminator on in-distribution samples may lead to misclassification of out-of-distribution samples, which poses a significant challenge for robust and safe deep learning. To address this issue, we propose a general approach called Fine-tuning Discriminators by Implicit Generators (FIG) that enhances the discriminatory power of standard discriminators in distinguishing in-distribution and out-of-distribution samples. FIG leverages information theory to infer an energy-based implicit generator from a discriminator and uses a Langevin dynamic sampler to draw specific out-of-distribution samples. Experimental results demonstrate that FIG achieves state-of-the-art out-of-distribution detection performance.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Out-of-Distribution Detection by Cross-Class Vicinity Distribution of In-Distribution Data

Zhilin Zhao, Longbing Cao, Kun-Yu Lin

Summary: Deep neural networks for image classification only learn to map in-distribution inputs to their corresponding ground-truth labels in training, without differentiating out-of-distribution samples from in-distribution ones. To address this issue, we draw out-of-distribution samples from the vicinity distribution of training in-distribution samples for learning to reject the prediction on out-of-distribution inputs. Experiments show that the proposed method significantly outperforms existing methods in improving the capacity for discriminating between in- and out-of-distribution samples.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Engineering, Electrical & Electronic

GALFusion: Multi-Exposure Image Fusion via a Global-Local Aggregation Learning Network

Jia Lei, Jiawei Li, Jinyuan Liu, Shihua Zhou, Qiang Zhang, Nikola K. Kasabov

Summary: The goal of multi-exposure image fusion is to generate synthetic results with abundant details and balanced exposure from low dynamic range (LDR) images. Because existing methods consider pixel values only within a local field of view, we propose a global-local aggregation network for fusing extreme exposure images in an unsupervised way. Our method achieves the best results in terms of MEF-structure similarity index measure (SSIM) and peak signal-to-noise ratio (PSNR), outperforming 12 state-of-the-art fusion methods.

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2023)

Article Computer Science, Artificial Intelligence

Gray Learning From Non-IID Data With Out-of-Distribution Samples

Zhilin Zhao, Longbing Cao, Chang-Dong Wang

Summary: The integrity of training data is uncertain, especially for non-IID datasets. Experts may misclassify samples, leading to unreliable labels. This study proposes a gray learning (GL) method that leverages both ground-truth and complementary labels to improve the robustness of neural networks.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Theory & Methods

AI in Finance: Challenges, Techniques, and Opportunities

Longbing Cao

Summary: This article provides an overview of the application of artificial intelligence techniques in the finance industry. It offers a comprehensive and dense landscape of the challenges, techniques, and opportunities of AI and data science (AIDS) research in finance over the past decades. The article outlines the challenges of financial businesses and data, categorizes the decades of AIDS research in finance, illustrates the data-driven analytics and learning in financial businesses, compares classic and modern AIDS techniques, and discusses future opportunities for AIDS-empowered finance and finance-motivated AIDS research.

ACM COMPUTING SURVEYS (2023)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory and, based on the Lottery Ticket Hypothesis, validate the rationality of tensor mapping and its superiority over traditional tensor approximation. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy than the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence-based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy on natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C++ implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model, S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on the China A-shares and NASDAQ markets demonstrate the effectiveness of the model in stock market trends prediction, and its effectiveness is further validated in real-world applications at a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)