4.7 Article

Incremental updating approximations in dominance-based rough sets approach under the variation of the attribute set

期刊

KNOWLEDGE-BASED SYSTEMS
卷 40, 期 -, 页码 17-26

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2012.11.002

关键词

Rough sets; Knowledge discovery; Dominance relation; Incremental updating; Approximations

资金

  1. National Science Foundation of China [60873108, 61175047, 61100117]
  2. NSAF [U1230117]
  3. Youth Social Science Foundation of the Chinese Education Commission [10YJCZH117, 11YJC630127]
  4. Fundamental Research Funds for the Central Universities [SWJTU11ZT08]
  5. Research Fund of Traction Power State Key Laboratory, Southwest Jiaotong University [2012TPL_T15]

向作者/读者索取更多资源

Dominance-based Rough Sets Approach (DRSA) is a generalized model of the classical Rough Sets Theory (RST) which may handle information with preference-ordered attribute domain. The attribute set in the information system may evolve over time. Approximations of DRSA used to induce decision rules need updating for knowledge discovery and other related tasks. We firstly introduce a kind of dominance matrix to calculate P-dominating sets and P-dominated sets in DRSA. Then we discuss the principles of updating P-dominating sets and P-dominated sets when some attributes are added into or deleted from the attribute set P. Furthermore, we propose incremental approaches and algorithms for updating approximations in DRSA. The proposed incremental approaches effectively reduce the computational time in comparison with the non-incremental approach are validated by experimental evaluations on different data sets from UCI. (C) 2012 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

An effective multi-task learning model for end-to-end emotion-cause pair extraction

Chenbing Li, Jie Hu, Tianrui Li, Shengdong Du, Fei Teng

Summary: This paper proposes a novel multi-task learning model for extracting emotion, cause, and emotion-cause pairs from documents. The model utilizes a shared module and a task-specific module to share information between tasks, and employs a sampling-based method to address the problem of label imbalance.

APPLIED INTELLIGENCE (2023)

Article Computer Science, Theory & Methods

Fuzzy-rough hybrid dimensionality reduction

Zhihong Wang, Hongmei Chen, Zhong Yuan, Tianrui Li

Summary: Dimensionality reduction is an important preprocessing method for data analysis. The proposed hybrid dimensionality reduction method combines fuzzy rough set and linear discriminant analysis to improve interpretability and retain original feature information.

FUZZY SETS AND SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Semi-supervised feature selection based on pairwise constraint-guided dual space latent representation learning and double sparse graphs discriminant

Hao Chen, Hongmei Chen, Weiyi Li, Tianrui Li

Summary: This study proposes a semi-supervised feature selection algorithm that combines latent representation learning and sparse graph discriminative model to improve the performance of a learning model. The method can consider both the structure information in data space and feature space, and effectively utilize label information. The feasibility and effectiveness of the proposed method are validated through experiments.

APPLIED INTELLIGENCE (2023)

Review Computer Science, Artificial Intelligence

A review of sequential three-way decision and multi-granularity learning

Xin Yang, Yanhua Li, Tianrui Li

Summary: The concept of three-way decision, focusing on thinking, problem solving, and information processing in threes, has been extensively studied and applied in the fields of machine learning and data engineering in recent years. The integration of dynamic and uncertainty through multi-granularity learning in an open-world environment has brought new vitality to three-way decision. This paper investigates and summarizes the initial and development models of three-way decision, traces the historical line of sequential three-way decision from rough set to granular computing, and proposes a unified framework of three-way multi-granularity learning with four crucial problems on mining uncertain regions continuously. Additionally, proposals on three-way decision associated with open-continual learning are provided.

INTERNATIONAL JOURNAL OF APPROXIMATE REASONING (2023)

Article Computer Science, Artificial Intelligence

Modeling multi-regional temporal correlation with gated recurrent unit and multiple linear regression for urban traffic flow prediction

Taha M. Rajeh, Tianrui Li, Chongshou Li, Muhammad Hafeez Javed, Zhpeng Luo, Fares Alhaek

Summary: Urban traffic flow prediction has attracted much attention, especially with the availability of large amounts of traffic data. However, existing traffic flow techniques heavily rely on limited auxiliary information. This paper proposes a framework based on data characteristics and temporal correlation to achieve accurate predictions without additional influential factors.

KNOWLEDGE-BASED SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Dual-grained human mobility learning for location-aware trip recommendation with spatial-temporal graph knowledge fusion

Qiang Gao, Wei Wang, Li Huang, Xin Yang, Tianrui Li, Hamido Fujita

Summary: Trip recommendation is a popular and significant location-aware service that can help visitors make more accurate travel plans. However, previous studies face challenges such as capturing heterogeneous interactions, dealing with data sparsity, and considering contextual facts. To address these challenges, this work proposes a novel framework called GraphTrip, which utilizes spatial-temporal graph representation learning, dual-grained human mobility learning, and explicit information fusion to improve trip inference performance. The experimental results demonstrate promising gains against cutting-edge baselines.

INFORMATION FUSION (2023)

Article Computer Science, Information Systems

Linear dimensionality reduction method based on topological properties

Yuqin Yao, Hua Meng, Yang Gao, Zhiguo Long, Tianrui Li

Summary: Dimensionality reduction is a significant data preprocessing technique. This article analyzes the drawbacks of the traditional linear unsupervised dimensionality reduction method LPP and proposes an improved model to resolve them. Furthermore, the article enhances LPP to maintain topological connectivity of data. Experimental results demonstrate that the new model outperforms the original LPP model and other classic linear or non-linear dimensionality reduction methods.

INFORMATION SCIENCES (2023)

Article Computer Science, Information Systems

Federated learning in smart cities: Privacy and security survey

Rasha Al-Huthaifi, Tianrui Li, Wei Huang, Jin Gu, Chongshou Li

Summary: Smart cities have emerged globally in the past decade, utilizing big data and the internet of things to enhance monitoring and integration of infrastructure systems, resulting in more efficient, livable, and sustainable cities. However, the increased vulnerability of big data to attacks and the implementation of stricter regulations for protecting user data present significant privacy and security challenges. Federated learning has gained attention as a method to address these challenges by distributing learning over decentralized data, providing privacy protection and artificial intelligence advantages. This study reviews the application of federated learning in smart city systems and discusses its benefits, drawbacks, research issues, and future directions, emphasizing the need for comprehensive testing to improve data protection and performance in smart cities.

INFORMATION SCIENCES (2023)

Article Computer Science, Information Systems

Incomplete multi-view clustering via kernelized graph learning

Dongxue Xia, Yan Yang, Shuhong Yang, Tianrui Li

Summary: This paper proposes a Kernelized Graph-based Incomplete Multi-view Clustering (KGIMC) algorithm to overcome the limitation of incomplete data in real-world multi-view clustering. The algorithm optimizes similarity learning, clustering analysis, and kernel completion in a mutual reinforcement manner to achieve an overall optimal clustering result. Experimental results show that KGIMC outperforms the state-of-the-art approaches on popular datasets.

INFORMATION SCIENCES (2023)

Article Computer Science, Information Systems

A Generalized Deep Learning Clustering Algorithm Based on Non-Negative Matrix Factorization

Dexian Wang, Tianrui Li, Ping Deng, Fan Zhang, Wei Huang, Pengfei Zhang, Jia Liu

Summary: In this article, a Generalized Deep Learning Clustering (GDLC) algorithm based on Non-Negative Matrix Factorization (NMF) is proposed to address the slow convergence and low clustering accuracy in the update process of NMF. A nonlinear constrained NMF (NNMF) algorithm is constructed to achieve sequential updates of the matrix elements guided by the learning rate, and the GDLC algorithm is constructed by transforming gradient values into generalized weights and biases through a nonlinear activation function. The experimental results on eight datasets demonstrate the efficient performance of the GDLC algorithm.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2023)

Article Computer Science, Cybernetics

Graph Regularized Sparse Non-Negative Matrix Factorization for Clustering

Ping Deng, Tianrui Li, Hongjun Wang, Dexian Wang, Shi-Jinn Horng, Rui Liu

Summary: This article proposes a novel graph regularized sparse NMF algorithm (GSNMF) for data reconstruction tasks, addressing the issue of sensitivity to noise in the square loss method. In addition, an extension algorithm called GSNMTF is also introduced, and experimental results demonstrate its good performance.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

SemiFREE: Semisupervised Feature Selection With Fuzzy Relevance and Redundancy

Keyu Liu, Tianrui Li, Xibei Yang, Hongmei Chen, Jie Wang, Zhixuan Deng

Summary: In this article, a novel semi-supervised feature selection scheme called SemiFREE is proposed, which redefines the feature relevance and redundancy by considering the fuzziness or uncertainty in data labeling. Experimental results demonstrate the superiority of SemiFREE in the presence of partially labeled data.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Spatio-Temporal Dynamic Graph Relation Learning for Urban Metro Flow Prediction

Peng Xie, Minbo Ma, Tianrui Li, Shenggong Ji, Shengdong Du, Zeng Yu, Junbo Zhang

Summary: This paper presents a spatio-temporal dynamic graph relational learning model for predicting urban metro station flow. The model captures the traffic patterns of different stations using a node embedding representation module, learns dynamic spatial relationships between metro stations through a dynamic graph relationship learning module, and utilizes a transformer for long-term relationship prediction. Experimental results demonstrate the advantages of our method in urban metro flow prediction.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

C2IMUFS: Complementary and Consensus Learning-Based Incomplete Multi-View Unsupervised Feature Selection

Yanyong Huang, Zongxin Shen, Yuxin Cai, Xiuwen Yi, Dongjie Wang, Fengmao Lv, Tianrui Li

Summary: This paper proposes a complementary and consensus learning-based incomplete multi-view unsupervised feature selection method ((CIMUFS)-I-2) to address the issue of incomplete multi-view data. The method integrates feature selection into an extended weighted non-negative matrix factorization model and achieves complete similarity graph reconstruction for each view.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Long sequence time-series forecasting with deep learning: A survey

Zonglei Chen, Minbo Ma, Tianrui Li, Hongjun Wang, Chongshou Li

Summary: The development of deep learning technology has greatly improved time series forecasting, particularly in the area of long sequence time series forecasting (LSTF). This article provides a comprehensive survey of LSTF studies with deep learning technology, including rigorous definitions, taxonomy based on network structure, key problems and solutions, evaluation metrics, applications, datasets, and open-source codes. It also presents a comparison of the proposed Kruskal-Wallis test based evaluation method with existing metrics and proposes potential research directions. All resources and codes are available online at https://github.com/Masterleia/TSF_LSTF_Compare.

INFORMATION FUSION (2023)

Article Computer Science, Artificial Intelligence

Confidence-based and sample-reweighted test-time adaptation

Hao Yang, Min Wang, Zhengfei Yu, Hang Zhang, Jinshen Jiang, Yun Zhou

Summary: In this paper, a novel method called CSTTA is proposed for test time adaptation (TTA), which utilizes confidence-based optimization and sample reweighting to better utilize sample information. Extensive experiments demonstrate the effectiveness of the proposed method.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

A novel method for generating a canonical basis for decision implications based on object-induced three-way operators

Jin Liu, Ju-Sheng Mi, Dong-Yun Niu

Summary: This article focuses on a novel method for generating a canonical basis for decision implications based on object-induced operators (OE operators). The logic of decision implication based on OE operators is described, and a method for obtaining the canonical basis for decision implications is given. The completeness, nonredundancy, and optimality of the canonical basis are proven. Additionally, a method for generating true premises based on OE operators is proposed.

KNOWLEDGE-BASED SYSTEMS (2024)

Review Computer Science, Artificial Intelligence

Efficient utilization of pre-trained models: A review of sentiment analysis via prompt learning

Kun Bu, Yuanchao Liu, Xiaolong Ju

Summary: This paper discusses the importance of sentiment analysis and pre-trained models in natural language processing, and explores the application of prompt learning. The research shows that prompt learning is more suitable for sentiment analysis tasks and can achieve good performance.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

M-EDEM: A MNN-based Empirical Decomposition Ensemble Method for improved time series forecasting

Xiangjun Cai, Dagang Li

Summary: This paper presents a new decomposition mechanism based on learned decomposition mapping. By using a neural network to learn the relationship between original time series and decomposed results, the repetitive computation overhead during rolling decomposition is relieved. Additionally, extended mapping and partial decomposition methods are proposed to alleviate boundary effects on prediction performance. Comparative studies demonstrate that the proposed method outperforms existing RDEMs in terms of operation speed and prediction accuracy.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Privacy-preserving trust management method based on blockchain for cross-domain industrial IoT

Xu Wu, Yang Liu, Jie Tian, Yuanpeng Li

Summary: This paper proposes a blockchain-based privacy-preserving trust management architecture, which adopts federated learning to train task-specific trust models and utilizes differential privacy to protect device privacy. In addition, a game theory-based incentive mechanism and a parallel consensus protocol are proposed to improve the accuracy of trust computing and the efficiency of consensus.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MV-ReID: 3D Multi-view Transformation Network for Occluded Person Re-Identification

Zaiyang Yu, Prayag Tiwari, Luyang Hou, Lusi Li, Weijun Li, Limin Jiang, Xin Ning

Summary: This study introduces a 3D view-based approach that effectively handles occlusions and leverages the geometric information of 3D objects. The proposed method achieves state-of-the-art results on occluded ReID tasks and exhibits competitive performance on holistic ReID tasks.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

City-scale continual neural semantic mapping with three-layer sampling and panoptic representation

Yongliang Shi, Runyi Yang, Zirui Wu, Pengfei Li, Caiyun Liu, Hao Zhao, Guyue Zhou

Summary: Neural implicit representations have gained attention due to their expressive, continuous, and compact properties. However, there is still a lack of research on city-scale continual implicit dense mapping based on sparse LiDAR input. In this study, a city-scale continual neural mapping system with a panoptic representation is developed, incorporating environment-level and instance-level modeling. A tailored three-layer sampling strategy and category-specific prior are proposed to address the challenges of representing geometric information in city-scale space and achieving high fidelity mapping of instances under incomplete observation.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MDSSN: An end-to-end deep network on triangle mesh parameterization

Ruihan Hu, Zhi-Ri Tang, Rui Yang, Zhongjie Wang

Summary: Mesh data is crucial for 3D computer vision applications worldwide, but traditional deep learning frameworks have struggled with handling meshes. This paper proposes MDSSN, a simple mesh computation framework that models triangle meshes and represents their shape using face-based and edge-based Riemannian graphs. The framework incorporates end-to-end operators inspired by traditional deep learning frameworks, and includes dedicated modules for addressing challenges in mesh classification and segmentation tasks. Experimental results demonstrate that MDSSN outperforms other state-of-the-art approaches.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Semi-supervised learning with missing values imputation

Buliao Huang, Yunhui Zhu, Muhammad Usman, Huanhuan Chen

Summary: This paper proposes a novel semi-supervised conditional normalizing flow (SSCFlow) algorithm that combines unsupervised imputation and supervised classification. By estimating the conditional distribution of incomplete instances, SSCFlow facilitates imputation and classification simultaneously, addressing the issue of separated tasks ignoring data distribution and label information in traditional methods.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Emotion-and-knowledge grounded response generation in an open-domain dialogue setting

Deeksha Varshney, Asif Ekbal, Erik Cambria

Summary: This paper focuses on the neural-based interactive dialogue system that aims to engage and retain humans in long-lasting conversations. It proposes a new neural generative model that combines step-wise co-attention, self-attention-based transformer network, and an emotion classifier to control emotion and knowledge transfer during response generation. The results from quantitative, qualitative, and human evaluation show that the proposed models can generate natural and coherent sentences, capturing essential facts with significant improvement over emotional content.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MvTS-library: An open library for deep multivariate time series forecasting

Junchen Ye, Weimiao Li, Zhixin Zhang, Tongyu Zhu, Leilei Sun, Bowen Du

Summary: Modeling multivariate time series has long been a topic of interest for scholars in various fields. This paper introduces MvTS, an open library based on Pytorch, which provides a unified framework for implementing and evaluating these models. Extensive experiments on public datasets demonstrate the effectiveness and universality of the models reproduced by MvTS.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

An adaptive hybrid mutated differential evolution feature selection method for low and high-dimensional medical datasets

Reham R. Mostafa, Ahmed M. Khedr, Zaher Al Aghbari, Imad Afyouni, Ibrahim Kamel, Naveed Ahmed

Summary: Feature selection is crucial in classification procedures, but it faces challenges in high-dimensional datasets. To overcome these challenges, this study proposes an Adaptive Hybrid-Mutated Differential Evolution method that incorporates the mechanics of the Spider Wasp Optimization algorithm and the concept of Enhanced Solution Quality. Experimental results demonstrate the effectiveness of the method in terms of accuracy and convergence speed, and it outperforms contemporary cutting-edge algorithms.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

TCM Model for improving track sequence classification in real scenarios with Multi-Feature Fusion and Transformer Block

Ti Xiang, Pin Lv, Liguo Sun, Yipu Yang, Jiuwu Hao

Summary: This paper introduces a Track Classification Model (TCM) based on marine radar, which can effectively recognize and classify shipping tracks. By using a feature extraction network with multi-feature fusion and a dataset production method to address missing labels, the classification accuracy is improved, resulting in successful engineering application in real scenarios.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Language model as an Annotator: Unsupervised context-aware quality phrase generation

Zhihao Zhang, Yuan Zuo, Chenghua Lin, Junjie Wu

Summary: This paper proposes a novel unsupervised context-aware quality phrase mining framework called LMPhrase, which is built upon large pre-trained language models. The framework mines quality phrases as silver labels using a parameter-free probing technique on the pre-trained language model BERT, and formalizes the phrase tagging task as a sequence generation problem by fine-tuning on the Sequence to-Sequence pre-trained language model BART. The results of extensive experiments show that LMPhrase consistently outperforms existing competitors in two different granularity phrase mining tasks.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

Stochastic Gradient Descent for matrix completion: Hybrid parallelization on shared- and distributed-memory systems

Kemal Buyukkaya, M. Ozan Karsavuran, Cevdet Aykanat

Summary: The study aims to investigate the hybrid parallelization of the Stochastic Gradient Descent (SGD) algorithm for solving the matrix completion problem on a high-performance computing platform. A hybrid parallel decentralized SGD framework with asynchronous inter-process communication and a novel flexible partitioning scheme is proposed to achieve scalability up to hundreds of processors. Experimental results on real-world benchmark datasets show that the proposed algorithm achieves 6x higher throughput on sparse datasets compared to the state-of-the-art, while achieving comparable throughput on relatively dense datasets.

KNOWLEDGE-BASED SYSTEMS (2024)