4.6 Article

Adapting Community Detection Algorithms for Disease Module Identification in Heterogeneous Biological Networks

期刊

FRONTIERS IN GENETICS
卷 10, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fgene.2019.00164

关键词

overlapping community detection; non-overlapping community detection; disease module identification; biological networks; heterogeneous networks

资金

  1. VAJRA faculty scheme of the Govt. of India [VJR/2017/000187]
  2. Intel India
  3. Industrial Consultancy & Sponsored Research, Indian Institute of Technology Madras [BIO/16-17/856/NFIG/HIMA]

向作者/读者索取更多资源

Biological networks catalog the complex web of interactions happening between different molecules, typically proteins, within a cell. These networks are known to be highly modular, with groups of proteins associated with specific biological functions. Human diseases often arise from the dysfunction of one or more such proteins of the biological functional group. The ability, to identify and automatically extract these modules has implications for understanding the etiology of different diseases as well as the functional roles of different protein modules in disease. The recent DREAM challenge posed the problem of identifying disease modules from six heterogeneous networks of proteins/genes. There exist many community detection algorithms, but all of them are not adaptable to the biological context, as these networks are densely connected and the size of biologically relevant modules is quite small. The contribution of this study is 3-fold: first, we present a comprehensive assessment of many classic community detection algorithms for biological networks to identify non-overlapping communities, and propose heuristics to identify small and structurally well-defined communities-core modules. We evaluated our performance over 180 GWAS datasets. In comparison to traditional approaches, with our proposed approach we could identify 50% more number of disease-relevant modules. Thus, we show that it is important to identify more compact modules for better performance. Next, we sought to understand the peculiar characteristics of disease-enriched modules and what causes standard community detection algorithms to detect so few of them. We performed a comprehensive analysis of the interaction patterns of known disease genes to understand the structure of disease modules and show that merely considering the known disease genes set as a module does not give good quality clusters, as measured by typical metrics such as modularity and conductance. We go on to present a methodology leveraging these known disease genes, to also include the neighboring nodes of these genes into a module, to form good quality clusters and subsequently extract a gold-standard set of disease modules. Lastly, we demonstrate, with justification, that overlapping community detection algorithms should be the preferred choice for disease module identification since several genes participate in multiple biological functions.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biology

A machine learning model for nowcasting epidemic incidence

Saumya Yashmohini Sahai, Saket Gurukar, Wasiur R. KhudaBukhsh, Srinivasan Parthasarathy, Grzegorz A. Rempala

Summary: Due to reporting delays, the accuracy of daily national and statewide COVID-19 incidence counts is often questionable, necessitating estimation from recent data. This paper presents a simple random forest statistical model for nowcasting the daily new infection counts based on historical data and simple covariates such as current reported infection counts, day of the week, and time since first reporting. Application of the model in adjusting daily infection counts in Ohio demonstrates that this data-driven method outperforms a complex hierarchical Bayesian model in terms of prediction quality and computational burden. The interactive notebook for nowcasting can be accessed online at https://tinyurl.com/simpleMLnowcasting.

MATHEMATICAL BIOSCIENCES (2022)

Article Psychology, Experimental

Knowledge Gaps: A Challenge for Agent-Based Automatic Task Completion

Goonmeet Bajaj, Sean Current, Daniel Schmidt, Bortik Bandyopadhyay, Christopher W. Myers, Srinivasan Parthasarathy

Summary: The study of human cognition and artificial intelligence have a symbiotic relationship, with human cognition possessing abilities that modern AI systems cannot compete with, such as the detection, identification, and resolution of knowledge gaps. Researchers aim to incorporate these capabilities into artificial agents to explore the understanding of knowledge gaps in visual-linguistic communication.

TOPICS IN COGNITIVE SCIENCE (2022)

Review Biochemical Research Methods

Designing Biological Circuits: From Principles to Applications

Debomita Chakraborty, Raghunathan Rengaswamy, Karthik Raman

Summary: This paper systematically organizes key works in the field of genetic circuit design using the framework of generalized morphological analysis. It maps literature based on design methodologies, modeling techniques, circuit functionalities, design characteristics, and strategies for robust design. The paper concludes with an outlook on future research areas based on the assessment of research gaps.

ACS SYNTHETIC BIOLOGY (2022)

Article Genetics & Heredity

Multi-Omic Data Improve Prediction of Personalized Tumor Suppressors and Oncogenes

Malvika Sudhakar, Raghunathan Rengaswamy, Karthik Raman

Summary: The study develops a multi-omic approach called PIVOT, which uses a machine learning model to classify genes as tumor suppressor genes, oncogenes, or neutral genes based on their functional impact in patients. The models trained on multi-omic data improve predictions and identify both common and rare driver genes.

FRONTIERS IN GENETICS (2022)

Article Physics, Multidisciplinary

Effect of dormant spare capacity on the attack tolerance of complex networks

Sai Saranga Das, Karthik Raman

Summary: This study proposes a strategy to enhance the robustness of networks by allocating spare capacity for vulnerable nodes, resulting in significant improvements in both scale-free and real-world networks.

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

HAM : Hybrid Associations Models for Sequential Recommendation

Bo Peng, Zhiyun Ren, Srinivasan Parthasarathy, Xia Ning

Summary: Sequential recommendation is a useful tool for helping users select preferred items based on their purchase/rating history. This paper proposes a hybrid associations model (HAM) that considers users' long-term preferences, association patterns in recent purchases/ratings, and item synergies to generate sequential recommendations. Experimental results show that HAM models outperform the state-of-the-art methods in terms of performance and runtime efficiency.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Article Multidisciplinary Sciences

Probing patterning in microbial consortia with a cellular automaton for spatial organisation

Sankalpa Venkatraghavan, Sathvik Anantakrishnan, Karthik Raman

Summary: Microbial consortia exhibit spatiotemporal organization through bacterial communication. Simulations show that secretion rates play a key role in controlling the behavior of the coupled consortia. These models provide a simplified and controllable approach to pattern formation in synthetic biology.

SCIENTIFIC REPORTS (2022)

Article Biology

On biological networks capable of robust adaptation in the presence of uncertainties: A linear systems-theoretic approach

Priyan Bhattacharya, Karthik Raman, Arun K. Tangirala

Summary: In this work, a generic methodology inspired by systems theory is presented to discover the design principles for robust adaptation in two different contexts: deterministic and stochastic. Contrary to the existing approaches, the proposed methodologies provide admissible network structures without resorting to computationally burdensome techniques. The proposed frameworks do not assume prior knowledge about the particular rate kinetics, thereby validating the conclusions for a large class of biological networks.

MATHEMATICAL BIOSCIENCES (2023)

Article Multidisciplinary Sciences

Sloppiness: Fundamental study, new formalism and its application in model assessment

Prem Jagadeesan, Karthik Raman, Arun K. Tangirala

Summary: Computational modelling of biological processes faces multiple challenges, such as identifiability, parameter estimation from limited data, informative experiments, and anisotropic sensitivity. Sloppiness, the property where model predictions are nearly identical over large regions in the parameter space, is a crucial but inconspicuous challenge. This study addresses critical unanswered questions about sloppiness, including its quantification, practical implications, and its impact on system identification.

PLOS ONE (2023)

Review Multidisciplinary Sciences

Big Data for a Small World: A Review on Databases and Resources for Studying Microbiomes

Pratyay Sengupta, Shobhan Karthick Muthamilselvi Sivabalan, Amrita Mahesh, Indumathi Palanikumar, Dinesh Kumar Kuppa Baskaran, Karthik Raman

Summary: Microorganisms are widely distributed in nature and form complex networks to survive in different environments. The structure of these communities is influenced by factors like nutrient availability, temperature, pH, and microbial composition. Categorizing accessible biomes according to their habitats helps in understanding the complexity of environment-specific communities.

JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE (2023)

Article Multidisciplinary Sciences

COWAVE: A labelled COVID-19 wave dataset for building predictive models

Melpakkam Pradeep, Karthik Raman

Summary: The COVID-19 pandemic has posed a significant challenge to global healthcare systems. Multiple waves of the disease have strained healthcare resources worldwide, leading to diligent data collection. This manuscript collates COVID-19 case data from the World Health Organization website to create a labelled dataset for building supervised learning classifiers. The dataset, along with a simple XGBoost model, demonstrates its utility for predicting future waves and will be valuable for epidemiologists and others interested in early prediction.

PLOS ONE (2023)

Article Ecology

Metagenome-based metabolic modelling predicts unique microbial interactions in deep-sea hydrothermal plume microbiomes

Dinesh Kumar Kuppa Baskaran, Shreyansh Umale, Zhichao Zhou, Karthik Raman, Karthik Anantharaman

Summary: Deep-sea hydrothermal vents are abundant and have important roles in ocean biogeochemistry. By studying microbial communities in the Guaymas Basin hydrothermal system, we identified key species and their interactions. Metabolic models were used to infer metabolic exchanges and horizontal gene transfer events within the community. Our findings highlight the importance of microbial interactions in driving community structure and organization in hydrothermal plume microbiomes.

ISME COMMUNICATIONS (2023)

Proceedings Paper Automation & Control Systems

Bayesian Optimal Experiment Design for Sloppy Systems br

Prem Jagadeesan, Karthik Raman, Arun K. Tangirala

Summary: In complex dynamical systems, the precise estimation of parameters and prediction quality rely on the information contained in experimental data. Optimal Experimental Design (OEL) refers to the selection of experimental schemes that maximize the information in the data. OED utilizes Fisher Information Matrix and variance-covariance matrix as central concepts. However, in sloppy models, applying OED leads to decreased predictive ability despite precise parameter estimation. This study introduces a new information gain index as an experiment design criterion in the Bayesian framework, demonstrating its effectiveness in minimizing prediction and parameter uncertainty in sloppy models through simulations.

IFAC PAPERSONLINE (2022)

Review Biology

Discovering design principles for biological functionalities: Perspectives from systems biology

Priyan Bhattacharya, Karthik Raman, Arun K. Tangirala

Summary: Network architecture plays a crucial role in governing the dynamics of biological networks, and the mapping between network structures and output functionality aids in understanding biological systems and has applications in synthetic biology and therapeutics. This review provides a qualitative and quantitative study of computational efforts, rule-based methods, and systems-theoretic approaches, based on the well-researched biological phenotypes of oscillation, toggle switching, and adaptation.

JOURNAL OF BIOSCIENCES (2022)

Article Genetics & Heredity

iCOMIC: a graphical interface-driven bioinformatics pipeline for analyzing cancer omics data

Anjana Anilkumar Sithara, Devi Priyanka Maripuri, Keerthika Moorthy, Sai Sruthi Amirtha Ganesh, Philge Philip, Shayantan Banerjee, Malvika Sudhakar, Karthik Raman

Summary: This article introduces a user-friendly pipeline tool for analyzing cancer genomic data, capable of analyzing whole-genome and transcriptome data, predicting pathogenicity of mutations, and distinguishing tumor suppressor genes from oncogenes.

NAR GENOMICS AND BIOINFORMATICS (2022)

暂无数据