4.7 Article

A cost sensitive decision tree algorithm with two adaptive mechanisms

期刊

KNOWLEDGE-BASED SYSTEMS
卷 88, 期 -, 页码 24-33

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2015.08.012

关键词

Adaptive mechanisms; Cost sensitive; Decision tree; Granular computing

资金

  1. Key Project of Education Department of Fujian Province [JA13192]
  2. National Natural Science Foundation of China [61379049, 61379089, 61170128]
  3. Science and Technology Key Project of Fujian Province [2012H0043]
  4. Zhangzhou Municipal Natural Science Foundation [ZZ2014J14]

向作者/读者索取更多资源

Decision trees have been widely used in data mining and machine learning as a comprehensible knowledge representation. Minimal cost decision tree construction plays a crucial role in cost sensitive learning. Recently, many algorithms have been developed to tackle this problem. These algorithms choose an appropriate cut point of a numeric attribute by computing all possible cut points and assign a node through test all attributes. Therefore, the efficiency of these algorithms for large data sets is often unsatisfactory. To solve this issue, in this paper we propose a cost sensitive decision tree algorithm with two adaptive mechanisms to learn cost sensitive decision trees from training data sets based on C4.5 algorithm. The two adaptive mechanisms play an important role in cost sensitive decision tree construction. The first mechanism, adaptive selecting the cut point (ASCP) mechanism, selects the cut point adaptively to build a classifier rather than calculates each possible cut point of an attribute. It improves the efficiency of evaluating numeric attributes for cut point selection significantly. The second mechanism, adaptive removing attribute (ARA) mechanism, removes some redundant attributes in the process of selecting node. The effectiveness of the proposed algorithm is demonstrated on fourteen UCI data sets with representative test cost Normal distribution. Compared with the CS-C4.5 algorithm, the proposed algorithm significantly increases efficiency. (C) 2015 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

WDIBS: Wasserstein deterministic information bottleneck for state abstraction to balance state-compression and performance

Xianchao Zhu, Tianyi Huang, Ruiyuan Zhang, William Zhu

Summary: As an important branch of reinforcement learning, Apprenticeship learning focuses on how an agent learns behavioral decisions by observing an expert policy. State abstraction is often used to compress the state space to improve learning efficiency, but finding the right balance is crucial. DIBS aims to address this issue by using the information rate for compression and KL divergence for decision performance evaluation. WDIBS improves upon DIBS by using Wasserstein distance to measure the difference between distributions, ensuring good decision performance even with overlapping support sets.

APPLIED INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

Anchor: The achieved goal to replace the subgoal for hierarchical reinforcement learning

Ruijia Li, Zhiling Cai, Tianyi Huang, William Zhu

Summary: This paper introduces a new concept called anchor to replace subgoals in hierarchical reinforcement learning, which encourages agents to move fast away from the anchor towards the desired goal. By using intrinsic rewards based on distance from the achieved goal to the anchor and weighted by extrinsic rewards, the proposed method demonstrates effectiveness in continuous control tasks with long horizons.

KNOWLEDGE-BASED SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

Saliency: a new selection criterion of important architectures in neural architecture search

Jie Hao, Zhiling Cai, Ruijia Li, William Zhu

Summary: Neural architecture search (NAS) has been successful in automatically designing high-performance neural networks, with the proposal of architecture saliency addressing the significant deviation in architecture selection from gradient-based methods.

NEURAL COMPUTING & APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

Adaptive exploration policy for exploration-exploitation tradeoff in continuous action control optimization

Min Li, Tianyi Huang, William Zhu

Summary: This research proposes an adaptive exploration policy to address the exploration-exploitation tradeoff by adjusting the exploration noise based on training stability. The effectiveness of this policy is demonstrated through theoretical analysis and experiments.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2021)

Article Computer Science, Artificial Intelligence

MDMD options discovery for accelerating exploration in sparse-reward domains

Xianchao Zhu, Ruiyuan Zhang, William Zhu

Summary: The paper proposes a new method for option discovery to accelerate exploration in sparse-reward domains by reducing the expected cover time of the environment. The method selects two nonadjacent vertices of the state transition matrix as options, which significantly reduces the exploration time.

KNOWLEDGE-BASED SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

A CNN-based policy for optimizing continuous action control by learning state sequences

Tianyi Huang, Min Li, Xiaolong Qin, William Zhu

Summary: In this paper, a new convolutional deterministic policy is proposed for effective learning in continuous action control by utilizing convolutional neural networks to extract information from state sequences. Experimental results show that this policy enables the agent to take better actions and learn faster than existing reinforcement learning methods for continuous action control.

NEUROCOMPUTING (2022)

Article Computer Science, Artificial Intelligence

Deep graph clustering with enhanced feature representations for community detection

Jie Hao, William Zhu

Summary: Community detection is the process of partitioning community nodes into groups with similar attributes and topologies. Deep graph clustering, using graph neural networks, has become a mainstream approach due to its powerful abilities of feature representation and relationship extraction. However, existing methods lack attribute information, leading to a decrease in detection performance. To address this, an enhanced feature representation approach is proposed by incorporating a basic autoencoder and a self-supervised mechanism into the deep graph clustering model. Experimental results demonstrate the improved performance of community detection compared to other methods.

APPLIED INTELLIGENCE (2023)

Article Computer Science, Information Systems

Multi-view fuzzy clustering of deep random walk and sparse low-rank embedding

Shiping Wang, Shunxin Xiao, William Zhu, Yingya Guo

Summary: Multi-view clustering is a method to improve learning performance by utilizing discriminative information from heterogeneous data sources. This study proposes a novel multi-view fuzzy clustering method that combines deep random walk and sparse low-rank embedding to fully exploit the complementarity and consistency of multi-view data.

INFORMATION SCIENCES (2022)

Article Computer Science, Artificial Intelligence

Layered feature representation for differentiable architecture search

Jie Hao, William Zhu

Summary: The Differentiable architecture search (DARTS) method has made significant progress in designing automated neural architectures. We propose an effective approach called Layered Feature Representation for Differentiable Architecture Search (LFR-DARTS) that can extract layered features from shallow to deep layers, resulting in a competitive performance on various datasets.

SOFT COMPUTING (2022)

Article Computer Science, Cybernetics

Multigraph Random Walk for Joint Learning of Multiview Clustering and Semisupervised Classification

Shiping Wang, Lele Fu, Zhewen Wang, Haiping Xu, William Zhu

Summary: Recent research has proposed a multigraph random walk scheme for addressing multiview clustering and semisupervised classification tasks, which integrates random walk with multiview learning. The scheme recursively learns a globally stable probability distribution matrix from multiple views to obtain label indicators for clustering or semisupervised classification. An adaptive weight vector is also learned to incorporate the diversity and complementarity of multiview data. Comprehensive comparative experiments demonstrate the superiority of the proposed method in terms of both clustering and classification performance.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

ACP based reinforcement learning for long-term recommender system

Tianyi Huang, Min Li, William Zhu

Summary: This paper proposes a method that combines the ACP approach with reinforcement learning-based recommender systems to address noise and improve recommendations. By training a parallel environment and using its predicted states, noise in the recommendation process can be reduced. Theoretical analysis and experiments demonstrate that this method outperforms existing recommender systems.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2022)

Article Computer Science, Artificial Intelligence

Clustering experience replay for the effective exploitation in reinforcement learning

Min Li, Tianyi Huang, William Zhu

Summary: In this paper, a clustering experience replay method called CER is proposed to effectively exploit the experience hidden in all explored transitions in reinforcement learning. The method divides the training process into several periods, clusters the transitions explored in each period, and constructs a conditional probability density function to ensure sufficient replay of all kinds of transitions.

PATTERN RECOGNITION (2022)

Article Computer Science, Cybernetics

Multiview Deep Matrix Factorization for Shared Compact Representation

Zexi Chen, Yunhe Zhang, Sheng Huang, Yanfang Liu, William Zhu, Shiping Wang

Summary: This article proposes a multiview deep matrix factorization model to learn a shared compact representation from multiview data. By combining multiple matrix factorizations from different views with a shared hidden layer, the model obtains a high-level semantic representation. The nonnegativity constraint of the learned representation is transformed using a projection operation, and the model is trained with a joint loss of reconstruction error and compactness loss. Experiments demonstrate the superiority of the proposed method in multiview clustering.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2023)

Article Computer Science, Cybernetics

An Overview of Advanced Deep Graph Node Clustering

Shiping Wang, Jinbin Yang, Jie Yao, Yang Bai, William Zhu

Summary: Graph data and graph node clustering are important in data analysis. Recently, deep neural networks have been used for graph node clustering because of their powerful representation capabilities. This article reviews the latest methods in this field and evaluates their performance on real-world graph datasets using several metrics. The results highlight potential research challenges and directions for deep graph node clustering.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2023)

Article Mathematics, Applied

ARCHITECTURE SELF-ATTENTION MECHANISM: NONLINEAR OPTIMIZATION FOR NEURAL ARCHITECTURE SEARCH

Jie Hao, William Zhu

Summary: This paper introduces an architecture self-attention mechanism for neural architecture search, which improves the performance stability and efficiency of architecture search by constructing interrelationships among architectures.

JOURNAL OF NONLINEAR AND VARIATIONAL ANALYSIS (2021)

Article Computer Science, Artificial Intelligence

Confidence-based and sample-reweighted test-time adaptation

Hao Yang, Min Wang, Zhengfei Yu, Hang Zhang, Jinshen Jiang, Yun Zhou

Summary: In this paper, a novel method called CSTTA is proposed for test time adaptation (TTA), which utilizes confidence-based optimization and sample reweighting to better utilize sample information. Extensive experiments demonstrate the effectiveness of the proposed method.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

A novel method for generating a canonical basis for decision implications based on object-induced three-way operators

Jin Liu, Ju-Sheng Mi, Dong-Yun Niu

Summary: This article focuses on a novel method for generating a canonical basis for decision implications based on object-induced operators (OE operators). The logic of decision implication based on OE operators is described, and a method for obtaining the canonical basis for decision implications is given. The completeness, nonredundancy, and optimality of the canonical basis are proven. Additionally, a method for generating true premises based on OE operators is proposed.

KNOWLEDGE-BASED SYSTEMS (2024)

Review Computer Science, Artificial Intelligence

Efficient utilization of pre-trained models: A review of sentiment analysis via prompt learning

Kun Bu, Yuanchao Liu, Xiaolong Ju

Summary: This paper discusses the importance of sentiment analysis and pre-trained models in natural language processing, and explores the application of prompt learning. The research shows that prompt learning is more suitable for sentiment analysis tasks and can achieve good performance.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

M-EDEM: A MNN-based Empirical Decomposition Ensemble Method for improved time series forecasting

Xiangjun Cai, Dagang Li

Summary: This paper presents a new decomposition mechanism based on learned decomposition mapping. By using a neural network to learn the relationship between original time series and decomposed results, the repetitive computation overhead during rolling decomposition is relieved. Additionally, extended mapping and partial decomposition methods are proposed to alleviate boundary effects on prediction performance. Comparative studies demonstrate that the proposed method outperforms existing RDEMs in terms of operation speed and prediction accuracy.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Privacy-preserving trust management method based on blockchain for cross-domain industrial IoT

Xu Wu, Yang Liu, Jie Tian, Yuanpeng Li

Summary: This paper proposes a blockchain-based privacy-preserving trust management architecture, which adopts federated learning to train task-specific trust models and utilizes differential privacy to protect device privacy. In addition, a game theory-based incentive mechanism and a parallel consensus protocol are proposed to improve the accuracy of trust computing and the efficiency of consensus.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MV-ReID: 3D Multi-view Transformation Network for Occluded Person Re-Identification

Zaiyang Yu, Prayag Tiwari, Luyang Hou, Lusi Li, Weijun Li, Limin Jiang, Xin Ning

Summary: This study introduces a 3D view-based approach that effectively handles occlusions and leverages the geometric information of 3D objects. The proposed method achieves state-of-the-art results on occluded ReID tasks and exhibits competitive performance on holistic ReID tasks.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

City-scale continual neural semantic mapping with three-layer sampling and panoptic representation

Yongliang Shi, Runyi Yang, Zirui Wu, Pengfei Li, Caiyun Liu, Hao Zhao, Guyue Zhou

Summary: Neural implicit representations have gained attention due to their expressive, continuous, and compact properties. However, there is still a lack of research on city-scale continual implicit dense mapping based on sparse LiDAR input. In this study, a city-scale continual neural mapping system with a panoptic representation is developed, incorporating environment-level and instance-level modeling. A tailored three-layer sampling strategy and category-specific prior are proposed to address the challenges of representing geometric information in city-scale space and achieving high fidelity mapping of instances under incomplete observation.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MDSSN: An end-to-end deep network on triangle mesh parameterization

Ruihan Hu, Zhi-Ri Tang, Rui Yang, Zhongjie Wang

Summary: Mesh data is crucial for 3D computer vision applications worldwide, but traditional deep learning frameworks have struggled with handling meshes. This paper proposes MDSSN, a simple mesh computation framework that models triangle meshes and represents their shape using face-based and edge-based Riemannian graphs. The framework incorporates end-to-end operators inspired by traditional deep learning frameworks, and includes dedicated modules for addressing challenges in mesh classification and segmentation tasks. Experimental results demonstrate that MDSSN outperforms other state-of-the-art approaches.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Semi-supervised learning with missing values imputation

Buliao Huang, Yunhui Zhu, Muhammad Usman, Huanhuan Chen

Summary: This paper proposes a novel semi-supervised conditional normalizing flow (SSCFlow) algorithm that combines unsupervised imputation and supervised classification. By estimating the conditional distribution of incomplete instances, SSCFlow facilitates imputation and classification simultaneously, addressing the issue of separated tasks ignoring data distribution and label information in traditional methods.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Emotion-and-knowledge grounded response generation in an open-domain dialogue setting

Deeksha Varshney, Asif Ekbal, Erik Cambria

Summary: This paper focuses on the neural-based interactive dialogue system that aims to engage and retain humans in long-lasting conversations. It proposes a new neural generative model that combines step-wise co-attention, self-attention-based transformer network, and an emotion classifier to control emotion and knowledge transfer during response generation. The results from quantitative, qualitative, and human evaluation show that the proposed models can generate natural and coherent sentences, capturing essential facts with significant improvement over emotional content.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MvTS-library: An open library for deep multivariate time series forecasting

Junchen Ye, Weimiao Li, Zhixin Zhang, Tongyu Zhu, Leilei Sun, Bowen Du

Summary: Modeling multivariate time series has long been a topic of interest for scholars in various fields. This paper introduces MvTS, an open library based on Pytorch, which provides a unified framework for implementing and evaluating these models. Extensive experiments on public datasets demonstrate the effectiveness and universality of the models reproduced by MvTS.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

An adaptive hybrid mutated differential evolution feature selection method for low and high-dimensional medical datasets

Reham R. Mostafa, Ahmed M. Khedr, Zaher Al Aghbari, Imad Afyouni, Ibrahim Kamel, Naveed Ahmed

Summary: Feature selection is crucial in classification procedures, but it faces challenges in high-dimensional datasets. To overcome these challenges, this study proposes an Adaptive Hybrid-Mutated Differential Evolution method that incorporates the mechanics of the Spider Wasp Optimization algorithm and the concept of Enhanced Solution Quality. Experimental results demonstrate the effectiveness of the method in terms of accuracy and convergence speed, and it outperforms contemporary cutting-edge algorithms.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

TCM Model for improving track sequence classification in real scenarios with Multi-Feature Fusion and Transformer Block

Ti Xiang, Pin Lv, Liguo Sun, Yipu Yang, Jiuwu Hao

Summary: This paper introduces a Track Classification Model (TCM) based on marine radar, which can effectively recognize and classify shipping tracks. By using a feature extraction network with multi-feature fusion and a dataset production method to address missing labels, the classification accuracy is improved, resulting in successful engineering application in real scenarios.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Language model as an Annotator: Unsupervised context-aware quality phrase generation

Zhihao Zhang, Yuan Zuo, Chenghua Lin, Junjie Wu

Summary: This paper proposes a novel unsupervised context-aware quality phrase mining framework called LMPhrase, which is built upon large pre-trained language models. The framework mines quality phrases as silver labels using a parameter-free probing technique on the pre-trained language model BERT, and formalizes the phrase tagging task as a sequence generation problem by fine-tuning on the Sequence to-Sequence pre-trained language model BART. The results of extensive experiments show that LMPhrase consistently outperforms existing competitors in two different granularity phrase mining tasks.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Stochastic Gradient Descent for matrix completion: Hybrid parallelization on shared- and distributed-memory systems

Kemal Buyukkaya, M. Ozan Karsavuran, Cevdet Aykanat

Summary: The study aims to investigate the hybrid parallelization of the Stochastic Gradient Descent (SGD) algorithm for solving the matrix completion problem on a high-performance computing platform. A hybrid parallel decentralized SGD framework with asynchronous inter-process communication and a novel flexible partitioning scheme is proposed to achieve scalability up to hundreds of processors. Experimental results on real-world benchmark datasets show that the proposed algorithm achieves 6x higher throughput on sparse datasets compared to the state-of-the-art, while achieving comparable throughput on relatively dense datasets.

KNOWLEDGE-BASED SYSTEMS (2024)