4.6 Article

ACOSampling: An ant colony optimization-based undersampling method for classifying imbalanced DNA microarray data

期刊

NEUROCOMPUTING
卷 101, 期 -, 页码 309-318

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2012.08.018

关键词

DNA microarray; Ant colony optimization; Class imbalance; Undersampling; Support vector machine

资金

  1. National Natural Science Foundation of China [60873036]
  2. Ph.D Programs Foundation for young researchers of Ministry of Education of China [20070217051]
  3. Ph.D Foundation of Jiangsu University of Science and Technology [35301002]

向作者/读者索取更多资源

In DNA microarray data, class imbalance problem occurs frequently, causing poor prediction performance for minority classes. Moreover, its other features, such as high-dimension, small sample, high noise etc., intensify this damage. In this study, we propose ACOSampling that is a novel undersampling method based on the idea of ant colony optimization (ACO) to address this problem. The algorithm starts with feature selection technology to eliminate noisy genes in data. Then we randomly and repeatedly divided the original training set into two groups: training set and validation set. In each division, one modified ACO algorithm as a variant of our previous work is conducted to filter less informative majority samples and search the corresponding optimal training sample subset. At last, the statistical results from all local optimal training sample subsets are given in the form of frequence list, where each frequence indicates the importance of the corresponding majority sample. We only extracted those high frequency ones and combined them with all minority samples to construct the final balanced training set. We evaluated the method on four benchmark skewed DNA microarray datasets by support vector machine (SVM) classifier, showing that the proposed method outperforms many other sampling approaches, which indicates its superiority. (C) 2012 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

An Improved Mean Imputation Clustering Algorithm for Incomplete Data

Hong Shi, Pingxin Wang, Xin Yang, Hualong Yu

Summary: This study proposes an improved clustering algorithm for handling incomplete data sets caused by random noise, data loss, data acquisition limitations, etc. The algorithm divides the data set into two sets based on missing and non-missing values, and uses mean imputation to fill the missing attribute values. Experimental results demonstrate the effectiveness of the algorithm in clustering.

NEURAL PROCESSING LETTERS (2022)

Article Computer Science, Artificial Intelligence

Adaptive Decision Threshold-Based Extreme Learning Machine for Classifying Imbalanced Multi-label Data

Shang Gao, Wenlu Dong, Ke Cheng, Xibei Yang, Shang Zheng, Hualong Yu

NEURAL PROCESSING LETTERS (2020)

Article Computer Science, Software Engineering

Software Defect Prediction Based on Fuzzy Weighted Extreme Learning Machine with Relative Density Information

Shang Zheng, Jinjing Gai, Hualong Yu, Haitao Zou, Shang Gao

SCIENTIFIC PROGRAMMING (2020)

Article Computer Science, Software Engineering

Probability Density Machine: A New Solution of Class Imbalance Learning

Ruihan Cheng, Longfei Zhang, Shiqi Wu, Sen Xu, Shang Gao, Hualong Yu

Summary: In this paper, a novel solution of Class Imbalance Learning called Probability Density Machine (PDM) is introduced, which is an improved Gaussian Naive Bayes (GNB) model and shows promising results in experiments.

SCIENTIFIC PROGRAMMING (2021)

Article Chemistry, Analytical

Early Warning of Gas Concentration in Coal Mines Production Based on Probability Density Machine

Yadong Cai, Shiqi Wu, Ming Zhou, Shang Gao, Hualong Yu

Summary: Gas explosion is a significant factor impacting coal mine safety, and employing the probability density machine algorithm can effectively prevent gas explosion accidents and enhance early warning accuracy.

SENSORS (2021)

Article Mathematics

Beam-Influenced Attribute Selector for Producing Stable Reduct

Wangwang Yan, Jing Ba, Taihua Xu, Hualong Yu, Jinlong Shi, Bin Han

Summary: This study proposes a novel attribute selector called Beam-Influenced Selector (BIS), which enhances the stability of attribute reduction through random partition and beam strategies. Experimental results show that this selector significantly improves the stability of the derived reducts and achieves excellent performance in classification tasks.

MATHEMATICS (2022)

Article Computer Science, Artificial Intelligence

Instance weighted SMOTE by indirectly exploring the data distribution

Aimin Zhang, Hualong Yu, Shanlin Zhou, Zhangjun Huan, Xibei Yang

Summary: This study presents the instance weighted SMOTE (IW-SMOTE) algorithm, which improves the SMOTE algorithm by indirectly exploiting distribution data. It uses an UnderBagging-like undersampling ensemble algorithm to classify each training instance and acquire confusing information. Based on the confusing information, the algorithm accurately estimates the location information of each instance and handles noisy and borderline instances accordingly. The balanced instance set is then used to train multiple classifiers to verify the algorithm's generality and effectiveness.

KNOWLEDGE-BASED SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Active Learning by Extreme Learning Machine with Considering Exploration and Exploitation Simultaneously

Yan Gu, Hualong Yu, Xibei Yang, Shang Gao

Summary: This paper introduces an improved active learning algorithm AL-SNN-ELM, which combines shared nearest neighbor clustering algorithm and online-sequential extreme learning machine. It can reduce labeling cost and improve performance simultaneously.

NEURAL PROCESSING LETTERS (2023)

Article Computer Science, Information Systems

Heterogeneous Cross-Project Defect Prediction via Optimal Transport

Xing Zong, Guiyu Li, Shang Zheng, Haitao Zou, Hualong Yu, Shang Gao

Summary: Heterogeneous Cross-Project Defect Prediction (HCPDP) aims to learn a prediction model from a heterogeneous source project and apply it to a target project. This paper introduces optimal transport (OT) theory to establish the relationship between source and target data distributions and proposes two prediction algorithms based on OT theory. Experimental results demonstrate the effectiveness of the proposed methods in helping developers identify defects in the early phase of software development.

IEEE ACCESS (2023)

Article Computer Science, Information Systems

RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning

Shanlin Zhou, Yan Gu, Hualong Yu, Xibei Yang, Shang Gao

Summary: This article introduces a novel strategy called RUE for estimating location information and cost assignment to address the problem of class-imbalance learning. The strategy indirectly explores location information through a random undersampling ensemble, is robust towards data distribution, and accurately estimates the significance of each instance.

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES (2023)

Article Computer Science, Artificial Intelligence

PLVI-CE: a multi-label active learning algorithm with simultaneously considering uncertainty and diversity

Yan Gu, Jicong Duan, Hualong Yu, Xibei Yang, Shang Gao

Summary: This study focuses on the challenge of designing an effective query strategy for measuring uniform information about unlabeled instances in multi-label active learning (MLAL). The proposed query strategy, PLVI-CE, combines uncertainty and diversity measures to reduce labeling cost and obtain an accurate and robust classification model. Additionally, the study explores the use of label-weighted extreme learning machine (LW-ELM) as the base classifier, which has advantages such as low computational cost and strong generalization performance, and can handle multi-label data with class imbalance distributions.

APPLIED INTELLIGENCE (2023)

Article Mathematics

A Broad TSK Fuzzy Classifier with a Simplified Set of Fuzzy Rules for Class-Imbalanced Learning

Jinghong Zhang, Yingying Li, Bowen Liu, Hao Chen, Jie Zhou, Hualong Yu, Bin Qin

Summary: With the expansion of data scale and diversity, the issue of class imbalance has become increasingly salient. To address these challenges, a novel fuzzy classifier is proposed that can handle classification tasks with class-imbalanced data.

MATHEMATICS (2023)

Article Computer Science, Information Systems

B-APFDQN: A UAV Path Planning Algorithm Based on Deep Q-Network and Artificial Potential Field

Fuchen Kong, Qi Wang, Shang Gao, Hualong Yu

Summary: This paper proposes an optimized DQN algorithm called B-APFDQN for UAV path planning, which combines Artificial Potential Field (APF) as prior knowledge and uses a multi-output neural network. It also introduces a SA-$\varepsilon$-greedy algorithm to automatically adjust the exploration frequency and prevent falling into local optima. With the removal of unnecessary nodes and the use of the B-spline algorithm, the obtained paths are smoother and shorter. Simulation experiments show that the proposed B-APFDQN algorithm outperforms classical DQN and is effective in UAV path planning.

IEEE ACCESS (2023)

Article Computer Science, Artificial Intelligence

ECC plus plus : An algorithm family based on ensemble of classifier chains for classifying imbalanced multi-label data

Jicong Duan, Yan Gu, Hualong Yu, Xibei Yang, Shang Gao

Summary: Multi-label learning has a wide range of real-world applications and the problem of class imbalance in multilabel data has been less addressed. This study proposes the ECC++ algorithm family, which combines the ensemble classifier chain algorithm with binary-class imbalance learning techniques to tackle the challenges of class imbalance and label correlations. Experimental results demonstrate the effectiveness and superiority of ECC++ over existing class imbalance multi-label learning algorithms.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Information Systems

DME: An Adaptive and Just-in-Time Weighted Ensemble Learning Method for Classifying Block-Based Concept Drift Steam

Baoquan Feng, Yan Gu, Hualong Yu, Xibei Yang, Shang Gao

Summary: This study proposes a novel incremental learning algorithm called distribution matching ensemble (DME) for adaptive weighted ensemble learning. DME estimates the distribution of each data block and maintains a group of classifiers in a buffer. When a new data block is received, the similarity between its distribution and reserved distributions is calculated to guide weight assignment for adaptive ensemble decision. Experiments show that DME can track and adapt to various types of concept drift, outperforming state-of-the-art algorithms.

IEEE ACCESS (2022)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)