4.5 Article

Prediction of small molecule binding property of protein domains with Bayesian classifiers based on Markov chains

期刊

COMPUTATIONAL BIOLOGY AND CHEMISTRY
卷 33, 期 6, 页码 457-460

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.compbiolchem.2009.09.005

关键词

Function prediction; Proteomics; Small molecule binding domains; Drug discovery; Bayesian classifiers; Markov chains

资金

  1. BMBF [01GR0450]

向作者/读者索取更多资源

Accurate computational methods that can help to predict biological function of a protein from its sequence are of great interest to research biologists and pharmaceutical companies. One approach to assume the function of proteins is to predict the interactions between proteins and other molecules. In this work, we propose a machine learning method that uses a primary sequence of a domain to predict its propensity for interaction with small molecules. By curating the Pfam database with respect to the small molecule binding ability of its component domains, we have constructed a dataset of small molecule binding and non-binding domains. This dataset was then used as training set to learn a Bayesian classifier, which should distinguish members of each class. The domain sequences of both classes are modelled with Markov chains. In a Jack-knife test, our classification procedure achieved the predictive accuracies of 77.2% and 66.7% for binding and non-binding classes respectively. We demonstrate the applicability of our classifier by using it to identify previously unknown small molecule binding domains. Our predictions are available as supplementary material and can provide very Useful information to drug discovery specialists. Given the ubiquitous and essential role small molecules play in biological processes, Our method is important for identifying pharmaceutically relevant components of complete proteomes. The software is available from the author upon request. (C) 2009 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Correction Biotechnology & Applied Microbiology

Butler enables rapid cloud-based analysis of thousands of human genomes (vol 79, pg 134, 2019)

Sergei Yakneen, Sebastian M. Waszak, Michael Gertz, Jan O. Korbel, Brice Aminou, Javier Bartolome, Keith A. Boroevich, Rich Boyce, Angela N. Brooks, Alex Buchanan, Ivo Buchhalter, Adam P. Butler, Niall J. Byrne, Andy Cafferkey, Peter J. Campbell, Zhaohong Chen, Sunghoon Cho, Wan Choi, Peter Clapham, Brandi N. Davis-Dusenbery, Francisco M. De La Vega, Jonas Demeulemeester, Michelle T. Dow, Lewis Jonathan Dursi, Juergen Eils, Roland Eils, Kyle Ellrott, Claudiu Farcas, Francesco Favero, Nodirjon Fayzullaev, Vincent Ferretti, Paul Flicek, Nuno A. Fonseca, Josep Ll. Gelpi, Gad Getz, Bob Gibson, Robert L. Grossman, Olivier Harismendy, Allison P. Heath, Michael C. Heinold, Julian M. Hess, Oliver Hofmann, Jongwhi H. Hong, Thomas J. Hudson, Barbara Hutter, Carolyn M. Hutter, Daniel Hubschmann, Seiya Imoto, Sinisa Ivkovic, Seung-Hyup Jeon, Wei Jiao, Jongsun Jung, Rolf Kabbe, Andre Kahles, Jules N. A. Kerssemakers, Hyung-Lae Kim, Hyunghwan Kim, Jihoon Kim, Youngwook Kim, Kortine Kleinheinz, Michael Koscher, Antonios Koures, Milena Kovacevic, Chris Lawerenz, Ignaty Leshchiner, Jia Liu, Dimitri Livitz, George L. Mihaiescu, Sanja Mijalkovic, Ana Mijalkovic Lazic, Satoru Miyano, Naoki Miyoshi, Hardeep K. Nahal-Bose, Hidewaki Nakagawa, Mia Nastic, Steven J. Newhouse, Jonathan Nicholson, Brian D. O'Connor, David Ocana, Kazuhiro Ohi, Lucila Ohno-Machado, Larsson Omberg, B. F. Francis Ouellette, Nagarajan Paramasivam, Marc D. Perry, Todd D. Pihl, Manuel Prinz, Montserrat Puiggros, Petar Radovic, Keiran M. Raine, Esther Rheinbay, Mara Rosenberg, Romina Royo, Gunnar Ratsch, Gordon Saksena, Matthias Schlesner, Solomon I. Shorser, Charles Short, Heidi J. Sofia, Jonathan Spring, Lincoln D. Stein, Adam J. Struck, Grace Tiao, Nebojsa Tijanic, David Torrents, Peter Van Loo, Miguel Vazquez, David Vicente, Jeremiah A. Wala, Zhining Wang, Sebastian M. Waszak, Joachim Weischenfeldt, Johannes Werner, Ashley Williams, Youngchoon Woo, Adam J. Wright, Qian Xiang, Liming Yang, Denis Yuen, Christina K. Yung, Junjun Zhang, Jan O. Korbel

NATURE BIOTECHNOLOGY (2023)

Editorial Material Biochemistry & Molecular Biology

Identifying multimodal signatures underlying the somatic comorbidity of psychosis: the COMMITMENT roadmap

Emanuel Schwarz, Dag Alnaes, Ole A. Andreassen, Han Cao, Junfang Chen, Franziska Degenhardt, Daria Doncevic, Dominic Dwyer, Roland Eils, Jeanette Erdmann, Carl Herrmann, Martin Hofmann-Apitius, Tobias Kaufmann, Nikolaos Koutsouleris, Alpha T. Kodamullil, Adyasha Khuntia, Soeren Mucha, Markus M. Noethen, Riya Paul, Mads L. Pedersen, Andres Quintero, Heribert Schunkert, Ashwini Sharma, Heike Tost, Lars T. Westlye, Youcheng Zhang, Andreas Meyer-Lindenberg

MOLECULAR PSYCHIATRY (2021)

Article Oncology

Aggressive PDACs Show Hypomethylation of Repetitive Elements and the Execution of an Intrinsic IFN Program Linked to a Ductal Cell of Origin

Elisa Espinet, Zuguang Gu, Charles D. Imbusch, Nathalia A. Giese, Magdalena Buescher, Mariam Safavi, Silke Weisenburger, Corinna Klein, Vanessa Vogel, Mattia Falcone, Jacob Insua-Rodriguez, Manuel Reitberger, Vera Thiel, Steffi O. Kossi, Alexander Muckenhuber, Karnjit Sarai, Alex Y. L. Lee, Elyne Backx, Soheila Zarei, Matthias M. Gaida, Manuel Rodriguez-Paredes, Elisa Donato, Hsi-Yu Yen, Roland Eils, Matthias Schlesner, Nicole Pfarr, Thilo Hackert, Christoph Plass, Benedikt Brors, Katja Steiger, Dieter Weichenhan, H. Efsun Arda, Ilse Rooman, Janel L. Kopp, Oliver Strobel, Wilko Weichert, Martin R. Sprick, Andreas Trumpp

Summary: Pancreatic ductal adenocarcinoma (PDAC) has two distinct subtypes, one aggressive subtype with low methylation and high IFN signaling, and another subtype with high methylation and low IFN signaling. These subtypes preserve traits from normal ductal/acinar cells associated with IFN signaling.

CANCER DISCOVERY (2021)

Article Biochemistry & Molecular Biology

Differentially methylated regions within lung cancer risk loci are enriched in deregulated enhancers

Marina Laplana, Matthias Bieg, Christian Faltus, Svitlana Melnik, Olga Bogatyrova, Zuguang Gu, Thomas Muley, Michael Meister, Hendrik Dienemann, Esther Herpel, Christopher Amos, Matthias Schlesner, Roland Eils, Christoph Plass, Angela Risch

Summary: This study utilized a targeted sequencing approach to investigate DNA methylation changes in NSCLC patients, identifying differential methylation regions and confirming potential regulatory elements. The research contributes to understanding the mechanisms of lung cancer initiation and progression, and offers new potential targets for cancer treatment.

EPIGENETICS (2022)

Article Oncology

Functional States in Tumor-Initiating Cell Differentiation in Human Colorectal Cancer

Martina K. Zowada, Stephan M. Tirier, Sebastian M. Dieter, Teresa G. Krieger, Ava Oberlack, Robert Lorenz Chua, Mario Huerta, Foo Wei Ten, Karin Laaber, Jeongbin Park, Katharina Jechow, Torsten Mueller, Mathias Kalxdorf, Mark Kriegsmann, Katharina Kriegsmann, Friederike Herbst, Jeroen Krijgsveld, Martin Schneider, Roland Eils, Hanno Glimm, Christian Conrad, Claudia R. Ball

Summary: Different cell types with tumor-initiating cell (TIC) activity in colorectal cancer (CRC) show distinct gene expression patterns at single-cell level. Metabolic states are closely linked to TIC activity in primary CRC cultures, suggesting oxidative phosphorylation as a potential target for novel therapies. Transcriptional heterogeneity at single-cell resolution identifies functional states during TIC differentiation and may reveal novel vulnerabilities in human CRC.

CANCERS (2021)

Article Pathology

Lipomatous Solitary Fibrous Tumors Harbor Rare NAB2-STAT6 Fusion Variants and Show Up-Regulation of the Gene PPARG, Encoding for a Regulator of Adipocyte Differentiation

Florian Haller, Lea D. Schlieben, Fulvia Ferrazzi, Michael Michal, Robert Stoehr, Evgeny A. Moskalev, Matthias Bieg, Judith V. M. G. Bovee, Philip Stroebel, Naveed Ishaque, Robert Gruetzmann, Norbert Meidenbauer, Roland Eils, Stefan Wiemann, Arndt Hartmann, Michal Michal, Abbas Agaimy

Summary: This study evaluated NAB2-STAT6 gene fusion variants in lipomatous SFTs and found significant differences in gene expression and fusion variants compared to nonlipomatous SFTs. The results provide a possible molecular genetic basis for the distinct morphologic features of lipomatous SFTs.

AMERICAN JOURNAL OF PATHOLOGY (2021)

Review Biochemical Research Methods

Knowledge bases and software support for variant interpretation in precision oncology

Florian Borchert, Andreas Mock, Aurelie Tomczak, Jonas Huegel, Samer Alkarkoukly, Alexander Knurr, Anna-Lena Volckmar, Albrecht Stenzinger, Peter Schirmacher, Juergen Debus, Dirk Jaeger, Thomas Longerich, Stefan Froehling, Roland Eils, Nina Bougatf, Ulrich Sax, Matthieu-P Schapranow

Summary: Precision oncology is a rapidly evolving interdisciplinary medical specialty, mainly driven by academia. The most commonly used knowledge bases provide good programmatic access options and have been integrated into software tools, but access options are limited for information regarding clinical classifications and therapy recommendations. Specialized tools are needed for different steps in the diagnostic process.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemistry & Molecular Biology

Characterizing genetic intra-tumor heterogeneity across 2,658 human cancer genomes

Stefan C. Dentro, Ignaty Leshchiner, Kerstin Haase, Maxime Tarabichi, Jeff Wintersinger, Amit G. Deshwar, Kaixian Yu, Yulia Rubanova, Geoff Macintyre, Jonas Demeulemeester, Ignacio Vazquez-Garcia, Kortine Kleinheinz, Dimitri G. Livitz, Salem Malikic, Nilgun Donmez, Subhajit Sengupta, Pavana Anur, Clemency Jolly, Marek Cmero, Daniel Rosebrock, Steven E. Schumacher, Yu Fan, Matthew Fittall, Ruben M. Drews, Xiaotong Yao, Thomas B. K. Watkins, Juhee Lee, Matthias Schlesner, Hongtu Zhu, David J. Adams, Nicholas McGranahan, Charles Swanton, Gad Getz, Paul C. Boutros, Marcin Imielinski, Rameen Beroukhim, S. Cenk Sahinalp, Yuan Ji, Martin Peifer, Inigo Martincorena, Florian Markowetz, Ville Mustonen, Ke Yuan, Moritz Gerstung, Paul T. Spellman, Wenyi Wang, Quaid D. Morris, David C. Wedge, Peter Van Loo

Summary: By extensively characterizing intra-tumor heterogeneity (ITH) across 2,658 cancer samples spanning 38 cancer types, this study found evidence of distinct subclonal expansions in nearly all informative samples, with frequent branching relationships between subclones. Positive selection of subclonal driver mutations was observed across most cancer types, indicating the importance of ITH and its drivers in tumor evolution.
Editorial Material Cardiac & Cardiovascular Systems

Hyperinflammation as underlying mechanism predisposing patients with cardiovascular diseases for severe COVID-19

Ulf Landmesser, Irina Lehmann, Roland Eils

EUROPEAN HEART JOURNAL (2021)

Article Biotechnology & Applied Microbiology

Pre-activated antiviral innate immunity in the upper airways controls early SARS-CoV-2 infection in children

J. Loske, J. Roehmel, S. Lukassen, S. Stricker, Vg Magalhaes, J. Liebig, Rl Chua, L. Thurmann, M. Messingschlager, A. Seegebarth, B. Timmermann, S. Klages, M. Ralser, B. Sawitzki, Le Sander, Vm Corman, C. Conrad, S. Laudi, M. Binder, S. Trump, R. Eils, M. A. Mall, I Lehmann

Summary: Children exhibit higher basal expression of relevant pattern recognition receptors in airway immune cells, resulting in stronger early innate antiviral responses to SARS-CoV-2 infection compared to adults. Unique immune cell subpopulations, including cytotoxic T cells and memory CD8+ T cells, predominantly occur in children.

NATURE BIOTECHNOLOGY (2022)

Article Immunology

Age-Related Differences in Structure and Function of Nasal Epithelial Cultures From Healthy Children and Elderly People

Anita Balazs, Pamela Millar-Buechner, Michael Muelleder, Vadim Farztdinov, Lukasz Szyrwiel, Annalisa Addante, Aditi Kuppe, Tihomir Rubil, Marika Drescher, Kathrin Seidel, Sebastian Stricker, Roland Eils, Irina Lehmann, Birgit Sawitzki, Jobst Roehmel, Markus Ralser, Marcus A. Mall

Summary: The nasal epithelium acts as the first line of defense against inhaled pathogens, allergens, and irritants, and plays a crucial role in the development of various respiratory diseases. This study aims to investigate the age-related differences in the structure and function of the nasal epithelium. The results showed intrinsic, age-related differences in the structure and function of the nasal epithelium, which may contribute to the development of age-dependent respiratory diseases.

FRONTIERS IN IMMUNOLOGY (2022)

Article Genetics & Heredity

SSAM-lite: A Light-Weight Web App for Rapid Analysis of Spatially Resolved Transcriptomics Data

Sebastian Tiesmeyer, Shashwat Sahay, Niklas Mueller-Boetticher, Roland Eils, Sebastian D. Mackowiak, Naveed Ishaque

Summary: The combination of a cell's transcriptional profile and location defines its function in a spatial context. Spatially resolved transcriptomics (SRT) has become a popular method for characterizing cells in situ. However, the correct aggregation of mRNA molecules into cells has been a computational problem in single-molecule SRT methods. SSAM-lite is an easy-to-use graphical interface tool that enables rapid and segmentation-free cell typing of SRT data in a web browser.

FRONTIERS IN GENETICS (2022)

Article Multidisciplinary Sciences

Temporal control of the integrated stress response by a stochastic molecular switch

Philipp Klein, Stefan M. Kallenberger, Hanna Roth, Karsten Roth, Thi Bach Nga Ly-Hartig, Vera Magg, Janez Ales, Soheil Rastgou Talemi, Yu Qiang, Steffen Wolf, Olga Oleksiuk, Roma Kurilov, Barbara Di Ventura, Ralf Bartenschlager, Roland Eils, Karl Rohr, Fred A. Hamprecht, Thomas Hoefer, Oliver T. Fackler, Georg Stoecklin, Alessia Ruggieri

Summary: This study elucidated the molecular mechanism of stress granules formation by integrating quantitative experiments and mathematical modeling. The study revealed that the stress response is controlled by a stochastic switch, with key elements including cooperative activation of PKR, ultrasensitive response of SG formation to eIF2 alpha phosphorylation, and negative feedback via GADD34. Furthermore, the study identified GADD34 mRNA levels as a molecular memory of the ISR that plays a central role in cell adaptation to acute and chronic stress.

SCIENCE ADVANCES (2022)

Article Biochemistry & Molecular Biology

Complement activation induces excessive T cell cytotoxicity in severe COVID-19

Philipp Georg, Rosario Astaburuaga-Garcia, Lorenzo Bonaguro, Sophia Brumhard, Laura Michalick, Lena J. Lippert, Tomislav Kostevc, Christiane Gaebel, Maria Schneider, Mathias Streitz, Vadim Demichev, Ioanna Gemuend, Matthias Barone, Pinkus Tober-Lau, Elisa T. Helbig, David Hillus, Lev Petrov, Julia Stein, Hannah-Philine Dey, Daniela Paclik, Christina Iwert, Michael Muelleder, Simran Kaur Aulakh, Sonja Djudjaj, Roman D. Buelow, Henrik E. Mei, Axel R. Schulz, Andreas Thiel, Stefan Hippenstiel, Antoine-Emmanuel Saliba, Roland Eils, Irina Lehmann, Marcus A. Mall, Sebastian Stricker, Jobst Roehmel, Victor M. Corman, Dieter Beule, Emanuel Wyler, Markus Landthaler, Benedikt Obermayer, Saskia von Stillfried, Peter Boor, Munevver Demir, Hans Wesselmann, Norbert Suttorp, Alexander Uhrig, Holger Mueller-Redetzky, Jacob Nattermann, Wolfgang M. Kuebler, Christian Meisel, Markus Ralser, Joachim L. Schultze, Anna C. Aschenbrenner, Charlotte Thibeault, Florian Kurth, Leif E. Sander, Nils Bluethgen, Birgit Sawitzki

Summary: Severe COVID-19 is associated with highly activated CD16(+) T cells that exhibit cytotoxic functions and contribute to endothelial injury. These CD16(+) T cells can degranulate and induce cytotoxicity through immune-complex-mediated mechanisms independent of the T cell receptor, which is not observed in other diseases. The presence of activated CD16(+) T cells and elevated levels of complement proteins upstream of C3a are associated with a fatal outcome of COVID-19, indicating the pathological role of enhanced cytotoxicity and complement activation in the disease.
Article Medical Informatics

Neural network-based integration of polygenic and clinical information: development and validation of a prediction model for 10-year risk of major adverse cardiac events in the UK Biobank cohort

Jakob Steinfeldt, Thore Buergel, Lukas Loock, Paul Kittner, Greg Ruyoga, Julius Upmeier zu Belzen, Simon Sasse, Henrik Strangalies, Lara Christmann, Noah Hollmann, Benedict Wolf, Brian Ference, John Deanfield, Ulf Landmesser, Roland Eils

Summary: In this study, a neural network-based risk model (NeuralCVD) was developed and validated to estimate cardiovascular risk for primary prevention. The model integrates polygenic and clinical predictors and improves risk discrimination compared to established clinical scores and a Cox model. The findings highlight the importance of genetic information in identifying individuals with a high genetic predisposition for preventive interventions.

LANCET DIGITAL HEALTH (2022)

Article Biology

Netting into the Sophoretin pool: An approach to trace GSTP1 inhibitors for reversing chemoresistance

Kunal Bhattacharya, Shikha Mahato, Satyendra Deka, Nongmaithem Randhoni Chanu, Amit Kumar Shrivastava, Pukar Khanal

Summary: Chemoresistance, a major challenge in cancer treatment, is associated with the cellular glutathione-related detoxification system. A study has identified GSTP1 enzyme as critical in the inactivation of anticancer drugs and suggests the need for GSTP1 inhibitors to combat chemoresistance. Through molecular docking and simulations, the study found that quercetin 7-O-beta-D-glucoside showed promise as a potential candidate for addressing chemoresistance in cancer patients.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Structure and energetics of serum protein complex of tea adulterant dye Bismarck brown Y using experimental and computational methods

Manwi Shankar, Majji Sai Sudha Rani, Priyanka Gopi, P. Arsha, Prateek Pandya

Summary: This study investigates the interaction between the food dye BBY and the serum protein BSA. The results show that BBY binds to a specific site on BSA through hydrophobic interactions, affecting the structural stability of the protein. These findings enhance our understanding of the molecular-level interactions between BBY and BSA.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Implementing link prediction in protein networks via feature fusion models based on graph neural networks

Chi Zhang, Qian Gao, Ming Li, Tianfei Yu

Summary: In this study, we propose a graph neural network-based autoencoder model, AGraphSAGE, that effectively predicts protein-protein interactions across diverse biological species by integrating gene ontology.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Named entity recognition of rice genes and phenotypes based on BiGRU neural networks

Kangjie Wu, Liqian Xu, Xinxiang Li, Youhua Zhang, Zhenyu Yue, Yujia Gao, Yiqiong Chen

Summary: Named Entity Recognition (NER) is a crucial task in natural language processing (NLP) and big data analysis, with wide application range. This paper proposes an improved neural network method for NER of rice genes and phenotypes, which can learn semantic information in the context without feature engineering. Experimental results show that the proposed model outperforms other models.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Revisiting structural organization of proteins at high temperature from a network perspective

Suman Hait, Sudip Kundu

Summary: Interactions between amino acids in proteins are crucial for stability and structural integrity. Thermophiles have more and more stable interactions to survive in extreme environments. Different types of interactions are enriched in different structural regions.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

XL1R-Net: Explainable AI-driven improved LI-regularized deep neural architecture for NSCLC biomarker identification

Kountay Dwivedi, Ankit Rajpal, Sheetal Rajpal, Virendra Kumar, Manoj Agarwal, Naveen Kumar

Summary: This study aims to identify biomarkers for non-small cell lung cancer (NSCLC) using copy number variation (CNV) data. A novel deep learning architecture, XL1R-Net, is proposed to improve the classification accuracy for NSCLC subtyping. Twenty NSCLC-relevant biomarkers are uncovered using explainable AI (XAI)-based feature identification. The results show that the identified biomarkers have high classification performance and clinical relevance. Additionally, twelve of the biomarkers are potentially druggable and eighteen of them have a high probability of predicting NSCLC patients' survival likelihood according to the Drug-Gene Interaction Database and the K-M Plotter tool, respectively. This research suggests that investigating these seven novel biomarkers can contribute to NSCLC therapy, and the integration of multiomics data and other sources will help better understand NSCLC heterogeneity.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

AMPCDA: Prediction of circRNA-disease associations by utilizing attention mechanisms on metapaths

Pengli Lu, Wenqi Zhang, Jinkai Wu

Summary: Researchers have developed a computational method, AMPCDA, to predict circRNA-disease associations using predefined metapaths, achieving high predictive accuracy. This method effectively combines node embeddings with higher-order neighborhood representations and provides valuable guidance for revealing new disease mechanisms in biological research.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)