4.7 Article

A feature selection technique for inference of graphs from their known topological properties: Revealing scale-free gene regulatory networks

期刊

INFORMATION SCIENCES
卷 272, 期 -, 页码 1-15

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2014.02.096

关键词

Feature selection; Reverse-engineering; Gene network inference; Systems Biology; Bioinformatics; Pattern recognition

资金

  1. FAPESP [2011/50761-2]
  2. NAP eScience - PRP - USP
  3. Fundacao Araucaria
  4. CNPq
  5. CAPES
  6. Fundacao de Amparo a Pesquisa do Estado de Sao Paulo (FAPESP) [11/50761-2] Funding Source: FAPESP

向作者/读者索取更多资源

An important problem in bioinformatics is the inference of gene regulatory networks (GRNs) from expression profiles. In general, the main limitations faced by GRN inference methods are the small number of samples with huge dimensionalities and the noisy nature of the expression measurements. Alternatives are thus needed to obtain better accuracy for the GRNs inference problem. Many pattern recognition techniques rely on prior knowledge about the problem in addition to the training data to gain statistical estimation power. This work addresses the GRN inference problem by modeling prior knowledge about the network topology. The main contribution of this paper is a novel methodology that aggregates scale-free properties to a classical low-cost feature selection method, known as Sequential Floating Forward Selection (SFFS), for guiding the inference task. Such methodology explores the search space iteratively by applying a scale-free property to reduce the search space. In this way, the search space traversed by the method integrates the exploration of all combinations of predictors set when the number of combinations is small (dimensionality (k) <= 2) with a floating search when the number of combinations becomes explosive (dimensionality (k) >= 3). This process is guided by scale-free prior information. Experimental results using synthetic and real data show that this technique provides smaller estimation errors than those obtained without guiding the SFFS application by the scale-free model, thus maintaining the robustness of the SFFS method. Therefore, we show that the proposed framework may be applied in combination with other existing GRN inference methods to improve the prediction accuracy of networks with scale-free properties. (C) 2014 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Medicine, General & Internal

ACE2 Interaction Networks in COVID-19: A Physiological Framework for Prediction of Outcome in Patients with Cardiovascular Risk Factors

Zofia Wicik, Ceren Eyileten, Daniel Jakubik, Sergio N. Simoes, David C. Martins, Rodrigo Pavao, Jolanta M. Siller-Matula, Marek Postula

JOURNAL OF CLINICAL MEDICINE (2020)

Article Biochemical Research Methods

Feature extraction approaches for biological sequences: a comparative study of mathematical features

Robson P. Bonidia, Lucas D. H. Sampaio, Douglas S. Domingues, Alexandre R. Paschoal, Fabricio M. Lopes, Andre C. P. L. F. de Carvalho, Danilo S. Sanches

Summary: This study proposes a research on feature extraction approaches based on mathematical features to address the challenge of extracting significant discriminatory information from biological sequence data. Through case studies and experiments, the effectiveness and robustness of the new algorithm are demonstrated.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Automation & Control Systems

Assessing Active Learning Strategies to Improve the Quality Control of the Soybean Seed Vigor

Douglas Felipe Pereira, Pedro Henrique Bugatti, Fabricio Martins Lopes, Andre Luis Siqueira Marques de Souza, Priscila Tiemi Maeda Saito

Summary: This article discusses the challenges seed companies face in pursuing excellence in production quality, and proposes soybean seed vigor learning and classification methods to address these issues. The research found that active learning methods can achieve higher classification accuracy more quickly, while reducing the number of labeled samples required in the learning process.

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS (2021)

Article Multidisciplinary Sciences

Analysis of co-authorship networks among Brazilian graduate programs in computer science

Alex Nunes da Silva Junior, Matheus Montanini Breve, Jesus Pascual Mena-Chalco, Fabricio Martins Lopes

Summary: This study analyzes and characterizes the co-authorship networks of academic Brazilian graduate programs in computer science, exploring different network topologies and quality indices related to the assessment unit CAPES.

PLOS ONE (2022)

Letter Biochemical Research Methods

Letter on the results of the BASiNET method in the paper 'A systematic evaluation of computational tools for lncRNA identification'

Fabricio Martins Lopes, Matheus H. Pimenta-Zanon

Summary: This article identifies a conceptual error in a published paper and provides the correct method and results to prevent the method from being misused or replicated.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

Thrombosis-related circulating miR-16-5p is associated with disease severity in patients hospitalised for COVID-19

Ceren Eyileten, Zofia Wicik, Sergio N. Simoes, David C. Martins-Jr, Krzysztof Klos, Wojciech Wlodarczyk, Alice Assinger, Dariusz Soldacki, Andrzej Chcialowski, Jolanta M. Siller-Matula, Marek Postula

Summary: By utilizing bioinformatic and co-expression analysis, this study identified and validated miRNAs associated with thrombosis in COVID-19 patients, which can serve as potential diagnostic and prognostic biomarkers for disease severity. The findings contribute to our understanding of the pathogenesis of COVID-19 and the identification of novel predictive markers.

RNA BIOLOGY (2022)

Review Biochemistry & Molecular Biology

Temporal progress of gene expression analysis with RNA-Seq data: A review on the relationship between computational methods

Juliana Costa-Silva, Douglas S. Domingues, David Menotti, Mariangela Hungria, Fabricio Martins Lopes

Summary: This paper provides a review of the pipeline for differential expression analysis, discussing the steps, methods, challenges, and tutorial aspects. It aims to guide new entrants and assist established users in updating their analysis pipelines.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2023)

Article Biochemistry & Molecular Biology

Network Analysis of Biomarkers Associated with Occupational Exposure to Benzene and Malathion

Marcus Vinicius C. Santos, Arthur S. S. Feltrin, Isabele C. C. Costa-Amaral, Liliane R. R. Teixeira, Jamila A. A. Perini, David C. C. Martins Jr, Ariane L. L. Larentis

Summary: Network Medicine is a useful platform for studying the molecular complexity of complex diseases and identifying disease modules and pathways. It can provide insights into how environmental chemical exposures affect human cells and help monitor and prevent exposure-related diseases. In this study, benzene and malathion-exposed differentially expressed genes were used to construct interaction networks and identify important hub genes associated with these chemicals.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2023)

Review Biochemistry & Molecular Biology

Temporal progress of gene expression analysis with RNA-Seq data: A review on the relationship between computational methods

Juliana Costa-Silva, Mariangela Hungria, Douglas S. Domingues, David Menotti, Fabricio Martins Lopes

Summary: This paper provides a comprehensive review of the computational analysis pipeline for differential gene expression analysis from RNA-seq data. It introduces the objectives, methods, and properties of each step, presents a timeline of the computational methods, and discusses the relationships between important tools. The paper serves as a tutorial for beginners and helps established users update their analysis pipelines.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Biological Sequence Analysis Using Complex Networks and Entropy Maximization: A Case Study in SARS-CoV-2

Matheus H. Pimenta-Zanon, Vinicius Augusto De Souza, Ronaldo Fumio Hashimoto, Fabricio Martins Lopes

Summary: During the COVID-19 pandemic, genetic mutations in the SARS-CoV-2 virus have resulted in increased infectivity. The existing classification and identification methods for variants are computationally complex and cannot handle large numbers of sequences simultaneously. This study proposes an alignment-free method called BASiNETEntropy for classifying SARS-CoV-2 variants of concern. The method maps biological sequences into a network, selects informative edges through entropy maximization, and extracts topological measurements as feature vectors for classification. Experimental results demonstrate high accuracy in classifying variants of concern, contributing to reducing the feature space. Unique patterns are also extracted for each variant relative to the reference sequence. The proposed method is implemented as an open-source tool in R language.

AMBIENT INTELLIGENCE IN HEALTH CARE, ICAIHC 2022 (2023)

Meeting Abstract Cardiac & Cardiovascular Systems

Expression changes of circulating ACE2 regulating-microRNA profiles in patients with COVID-19 during hospitalisation

J. Jarosz-Popek, C. Eyileten, Z. Wicik, A. Nowak, M. Wolska, A. Shahzadi, D. Jakubik, S. N. Simoes, D. C. Martins, J. Siller-Matula, M. Postula

EUROPEAN HEART JOURNAL (2022)

Meeting Abstract Cardiac & Cardiovascular Systems

Thrombosis-related miR-16-5p predicts the disease severity in patients hospitalised for COVID-19

D. Keshwani, C. Eyileten, Z. Wicik, A. Nowak, D. Jakubik, S. N. Simoes, D. C. Martins-, A. Shahzadi, J. Jarosz-Popek, M. Wolska, J. Siller-Matula, M. Postula

EUROPEAN HEART JOURNAL (2022)

Proceedings Paper Engineering, Biomedical

A Method for Computing Attractor Fields in Coupled Boolean Networks

Carlos R. P. Tovar, David C. Martins-, Luiz C. S. Rozante, Eloi Araujo

Summary: The paper presents a computationally efficient method to identify attractor fields in coupled Boolean networks (CBNs), a class of models with potential applications in Systems Biology. Experimental results demonstrate that the proposed method is capable of recovering the dynamics structure of large-scale CBNs in a feasible time.

2022 IEEE 22ND INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2022) (2022)

Proceedings Paper Computer Science, Information Systems

Interpretability with Relevance Aggregation in Neural Networks for Absenteeism Prediction

Julio Marcos Gomes Junior, Fabricio Martins Lopes

Summary: This paper proposes an approach to classify employee absenteeism using neural networks and Layer-wise relevance propagation. It can identify the most relevant features, assign relevance scores for absenteeism classification, and explain the reasons for absenteeism, which is important for human resource management and occupational medicine.

2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22) (2022)

Proceedings Paper Computer Science, Interdisciplinary Applications

Multi-GPU Approach for Large-Scale Multiple Sequence Alignment

Rodrigo A. de O. Siqueira, Marco A. Stefanes, Luiz C. S. Rozante, David C. Martins-Jr, Jorge E. S. de Souza, Eloi Araujo

Summary: Multiple sequence alignment is essential in representing biological sequence similarities, but due to the complexity of the problem, only approximate solutions are possible. This study introduces a Multi-GPU approach for efficient handling of large-scale lengthy sequence alignments compared to existing methods.

COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT I (2021)

Article Computer Science, Information Systems

A consensus model considers managing manipulative and overconfident behaviours in large-scale group decision-making

Xia Liang, Jie Guo, Peide Liu

Summary: This paper investigates a novel consensus model based on social networks to manage manipulative and overconfident behaviors in large-scale group decision-making. By proposing a novel clustering model and improved methods, the consensus reaching is effectively facilitated. The feedback mechanism and management approach are employed to handle decision makers' behaviors. Simulation experiments and comparative analysis demonstrate the effectiveness of the model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

CGN: Class gradient network for the construction of adversarial samples

Xiang Li, Haiwang Guo, Xinyang Deng, Wen Jiang

Summary: This paper proposes a method based on class gradient networks for generating high-quality adversarial samples. By introducing a high-level class gradient matrix and combining classification loss and perturbation loss, the method demonstrates superiority in the transferability of adversarial samples on targeted attacks.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Distinguishing latent interaction types from implicit feedbacks for recommendation

Lingyun Lu, Bang Wang, Zizhuo Zhang, Shenghao Liu

Summary: Many recommendation algorithms only rely on implicit feedbacks due to privacy concerns. However, the encoding of interaction types is often ignored. This paper proposes a relation-aware neural model that classifies implicit feedbacks by encoding edges, thereby enhancing recommendation performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Proximity-based density description with regularized reconstruction algorithm for anomaly detection

Jaehong Yu, Hyungrok Do

Summary: This study discusses unsupervised anomaly detection using one-class classification, which determines whether a new instance belongs to the target class by constructing a decision boundary. The proposed method uses a proximity-based density description and a regularized reconstruction algorithm to overcome the limitations of existing one-class classification methods. Experimental results demonstrate the superior performance of the proposed algorithm.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Non-iterative border-peeling clustering algorithm based on swap strategy

Hui Tu, Shifei Ding, Xiao Xu, Haiwei Hou, Chao Li, Ling Ding

Summary: Border-Peeling algorithm is a density-based clustering algorithm, but its complexity and issues on unbalanced datasets restrict its application. This paper proposes a non-iterative border-peeling clustering algorithm, which improves the clustering performance by distinguishing and associating core points and border points.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A two-stage denoising framework for zero-shot learning with noisy labels

Long Tang, Pan Zhao, Zhigeng Pan, Xingxing Duan, Panos M. Pardalos

Summary: In this work, a two-stage denoising framework (TSDF) is proposed for zero-shot learning (ZSL) to address the issue of noisy labels. The framework includes a tailored loss function to remove suspected noisy-label instances and a ramp-style loss function to reduce the negative impact of remaining noisy labels. In addition, a dynamic screening strategy (DSS) is developed to efficiently handle the nonconvexity of the ramp-style loss.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Selection of a viable blockchain service provider for data management within the internet of medical things: An MCDM approach to Indian healthcare

Raghunathan Krishankumar, Sundararajan Dhruva, Kattur S. Ravichandran, Samarjit Kar

Summary: Health 4.0 is gaining global attention for better healthcare through digital technologies. This study proposes a new decision-making framework for selecting viable blockchain service providers in the Internet of Medical Things (IoMT). The framework addresses the limitations in previous studies and demonstrates its applicability in the Indian healthcare sector. The results show the top ranking BSPs, the importance of various criteria, and the effectiveness of the developed model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Q-learning with heterogeneous update strategy

Tao Tan, Hong Xie, Liang Feng

Summary: This paper proposes a heterogeneous update idea and designs HetUp Q-learning algorithm to enlarge the normalized gap by overestimating the Q-value corresponding to the optimal action and underestimating the Q-value corresponding to the other actions. To address the limitation, a softmax strategy is applied to estimate the optimal action, resulting in HetUpSoft Q-learning and HetUpSoft DQN. Extensive experimental results show significant improvements over SOTA baselines.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Dyformer: A dynamic transformer-based architecture for multivariate time series classification

Chao Yang, Xianzhi Wang, Lina Yao, Guodong Long, Guandong Xu

Summary: This paper proposes a dynamic transformer-based architecture called Dyformer for multivariate time series classification. Dyformer captures multi-scale features through hierarchical pooling and adaptive learning strategies, and improves model performance by introducing feature-map-wise attention mechanisms and a joint loss function.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

ESSENT: an arithmetic optimization algorithm with enhanced scatter search strategy for automated test case generation

Xiguang Li, Baolu Feng, Yunhe Sun, Ammar Hawbani, Saeed Hammod Alsamhi, Liang Zhao

Summary: This paper proposes an enhanced scatter search strategy, using opposition-based learning, to solve the problem of automated test case generation based on path coverage (ATCG-PC). The proposed ESSENT algorithm selects the path with the lowest path entropy among the uncovered paths as the target path and generates new test cases to cover the target path by modifying the dimensions of existing test cases. Experimental results show that the ESSENT algorithm outperforms other state-of-the-art algorithms, achieving maximum path coverage with fewer test cases.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

An attention based approach for automated account linkage in federated identity management

Shirin Dabbaghi Varnosfaderani, Piotr Kasprzak, Aytaj Badirova, Ralph Krimmel, Christof Pohl, Ramin Yahyapour

Summary: Linking digital accounts belonging to the same user is crucial for security, user satisfaction, and next-generation service development. However, research on account linkage is mainly focused on social networks, and there is a lack of studies in other domains. To address this, we propose SmartSSO, a framework that automates the account linkage process by analyzing user routines and behavior during login processes. Our experiments on a large dataset show that SmartSSO achieves over 98% accuracy in hit-precision.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A memetic algorithm with fuzzy-based population control for the joint order batching and picker routing problem

Renchao Wu, Jianjun He, Xin Li, Zuguo Chen

Summary: This paper proposes a memetic algorithm with fuzzy-based population control (MA-FPC) to solve the joint order batching and picker routing problem (JOBPRP). The algorithm incorporates batch exchange crossover and a two-level local improvement procedure. Experimental results show that MA-FPC outperforms existing algorithms in terms of solution quality.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Refining one-class representation: A unified transformer for unsupervised time-series anomaly detection

Guoxiang Zhong, Fagui Liu, Jun Jiang, Bin Wang, C. L. Philip Chen

Summary: In this study, we propose the AMFormer framework to address the problem of mixed normal and anomaly samples in deep unsupervised time-series anomaly detection. By refining the one-class representation and introducing the masked operation mechanism and cost sensitive learning theory, our approach significantly improves anomaly detection performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A data-driven optimisation method for a class of problems with redundant variables and indefinite objective functions

Jin Zhou, Kang Zhou, Gexiang Zhang, Ferrante Neri, Wangyang Shen, Weiping Jin

Summary: In this paper, the authors focus on the issue of multi-objective optimisation problems with redundant variables and indefinite objective functions (MOPRVIF) in practical problem-solving. They propose a dual data-driven method for solving this problem, which consists of eliminating redundant variables, constructing objective functions, selecting evolution operators, and using a multi-objective evolutionary algorithm. The experiments conducted on two different problem domains demonstrate the effectiveness, practicality, and scalability of the proposed method.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A Monte Carlo fuzzy logistic regression framework against imbalance and separation

Georgios Charizanos, Haydar Demirhan, Duygu Icen

Summary: This article proposes a new fuzzy logistic regression framework that addresses the problems of separation and imbalance while maintaining the interpretability of classical logistic regression. By fuzzifying binary variables and classifying subjects based on a fuzzy threshold, the framework demonstrates superior performance on imbalanced datasets.

INFORMATION SCIENCES (2024)