4.5 Article

Data mining PubChem using a support vector machine with the Signature molecular descriptor: Classification of factor XIa inhibitors

Journal

JOURNAL OF MOLECULAR GRAPHICS & MODELLING
Volume 27, Issue 4, Pages 466-475

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.jmgm.2008.08.004

Keywords

De novo; Signature; Factor XIa; HTS

Funding

  1. Department of Energy's 2004 Presidential Early Career Scientist and Engineer Award

Ask authors/readers for more resources

The amount of high-throughput screening (HTS) data readily available has significantly increased because of the PubChem project (http://pubchem.ncbi.nlm.nih.gov/). There is considerable opportunity for data mining of small molecules for a variety of biological systems using cheminformatic tools and the resources available through PubChem. In this work, we trained a support vector machine (SVM) classifier using the Signature molecular descriptor on factor XIa inhibitor HTS data. The optimal number of Signatures was selected by implementing a feature selection algorithm of highly correlated clusters. Our method included an improvement that allowed clusters to work together for accuracy improvement, where previous methods have scored clusters on an individual basis. The resulting model had a 10-fold cross-validation accuracy of 89%, and additional validation was provided by two independent test sets. We applied the SVM to rapidly predict activity for approximately 12 million compounds also deposited in PubChem. Confidence in these predictions was assessed by considering the number of Signatures within the training set range for a given compound, defined as the overlap metric. To further evaluate compounds identified as active by the SVM, docking studies were performed using AutoDock. A focused database of compounds predicted to be active was obtained with several of the compounds appreciably dissimilar to those used in training the SVM. This focused database is suitable for further study. The data mining technique presented here is not specific to factor XIa inhibitors, and could be applied to other bioassays in PubChem where one is looking to expand the search for small molecules as chemical probes. (C) 2008 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemistry & Molecular Biology

Identifying de-NEDDylation inhibitors: Virtual high-throughput screens targeting SENP8

Jonathan J. Chen, Lyndsey N. Schmucker, Donald P. Visco

CHEMICAL BIOLOGY & DRUG DESIGN (2019)

Article Biology

Virtual high-throughput screens identifying hPK-M2 inhibitors: Exploration of model extrapolation

Jonathan J. Chen, Lyndsey N. Schmucker, Donald P. Visco

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2019)

Article Biochemical Research Methods

Optimizing Cell-Free Biosensors to Monitor Enzymatic Production

Amir Pandi, Ioana Grigoras, Olivier Borkowski, Jean-Loup Faulon

ACS SYNTHETIC BIOLOGY (2019)

Review Biochemical Research Methods

Custom-made transcriptional biosensors for metabolic engineering

Mathilde Koch, Amir Pandi, Olivier Borkowski, A. C. Batista, Jean-Loup Faulon

CURRENT OPINION IN BIOTECHNOLOGY (2019)

Article Multidisciplinary Sciences

Metabolic perceptrons for neural computing in biological systems

Amir Pandi, Mathilde Koch, Peter L. Voyvodic, Paul Soudier, Jerome Bonnet, Manish Kushwaha, Jean-Loup Faulon

NATURE COMMUNICATIONS (2019)

Article Biochemical Research Methods

Reinforcement Learning for Bioretrosynthesis

Mathilde Koch, Thomas Duigou, Jean-Loup Faulon

ACS SYNTHETIC BIOLOGY (2020)

Article Biotechnology & Applied Microbiology

Development of a Biosensor for Detection of Benzoic Acid Derivatives in Saccharomyces cerevisiae

Sara Castano-Cerezo, Mathieu Fournie, Philippe Urban, Jean-Loup Faulon, Gilles Truan

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2020)

Article Multidisciplinary Sciences

Large scale active-learning-guided exploration for in vitro protein production optimization

Olivier Borkowski, Mathilde Koch, Agnes Zettor, Amir Pandi, Angelo Cardoso Batista, Paul Soudier, Jean-Loup Faulon

NATURE COMMUNICATIONS (2020)

Article Materials Science, Multidisciplinary

Significance of p-Electrons in the Design of Corrosion Inhibitors for Carbon Steel in Simulated Concrete Pore Solution

A. Mohamed, D. P. Visco, D. M. Bastidas

Summary: The study shows that organic compounds with pi-electrons have better performance in inhibiting the corrosion of carbon steel reinforcements, with the pi-bond electrons playing a crucial role in the adsorption process.

CORROSION (2021)

Article Biochemical Research Methods

Differentially Optimized Cell-Free Buffer Enables Robust Expression from Unprotected Linear DNA in Exonuclease-Deficient Extracts

Angelo Cardoso Batista, Antoine Levrier, Paul Soudier, Peter L. Voyvodic, Tatjana Achmedov, Tristan Reif-Trauttmansdor, Angelique DeVisch, Martin Cohen-Gonsaud, Jean-Loup Faulon, Chase L. Beisel, Jerome Bonnet, Manish Kushwaha

Summary: This study presents a simple, efficient, and cost-effective solution for using linear DNA templates in cell-free systems by deleting the exonuclease gene cluster from Escherichia coli. The research highlights the importance of tailoring buffer composition for the optimal experimental setup, and suggests that similar strategies can be applied to other species in cell-free synthetic biology.

ACS SYNTHETIC BIOLOGY (2022)

Article Biochemical Research Methods

PeroxiHUB: A Modular Cell-Free Biosensing Platform Using H2O2 as Signal Integrator

Paul Soudier, Ana Zuniga, Thomas Duigou, Peter L. Voyvodic, Kenza Bazi-Kabbaj, Manish Kushwaha, Julie A. Vendrell, Jerome Solassol, Jerome Bonnet, Jean-Loup Faulon

Summary: This study reports the engineering of PeroxiHUB, a sensing platform centered around H2O2, that supports cell-free detection of different metabolites. The PeroxiHUB platform utilizes enzymatic transducers to convert metabolites of interest into H2O2, allowing for rapid reprogramming of sensor specificity. This platform has the potential to detect a wide range of metabolites in a modular and scalable fashion.

ACS SYNTHETIC BIOLOGY (2022)

Article Biochemistry & Molecular Biology

Sodium Succinate as a Corrosion Inhibitor for Carbon Steel Rebars in Simulated Concrete Pore Solution

Ahmed P. Mohamed, Donald M. Visco Jr, David Bastidas

Summary: Sodium succinate has been evaluated as an organic corrosion inhibitor for carbon steel rebars in simulated concrete pore solution. It showed strong inhibition performance by forming a protective film on the rebar surface. The inhibitor is classified as a mixed-type inhibitor and is able to displace water molecules and complex with ferrous ions, creating an adsorption film. Various surface characterizations and quantum chemical calculations have provided evidence for the adsorption of sodium succinate.

MOLECULES (2022)

Article Multidisciplinary Sciences

A neural-mechanistic hybrid approach improving the predictive power of genome-scale metabolic models

Leon Faure, Bastien Mollet, Wolfram Liebermeister, Jean-Loup Faulon

Summary: Constraint-based metabolic models have been used to predict microorganism phenotype, but accurate predictions require labor-intensive measurements. We propose hybrid neural-mechanistic models as a machine learning architecture to improve phenotype predictions. Our models outperform constraint-based models with smaller training set sizes, offering a time and resource-saving approach in systems biology and biological engineering projects.

NATURE COMMUNICATIONS (2023)

Article Education & Educational Research

Modification and validation of the mixed-format Engineering Concept Assessment for middle school students using many-facet Rasch measurement

Kristin L. K. Koskey, Nidaa Makki, Wondimu Ahmed, Nicholas G. Garafolo, Donald P. Visco

SCHOOL SCIENCE AND MATHEMATICS (2020)

Article Biochemical Research Methods

Engineering Escherichia coli towards de novo production of gatekeeper (2S)-flavanones: naringenin, pinocembrin, eriodictyol and homoeriodictyol

Mark S. Dunstan, Christopher J. Robinson, Adrian J. Jervis, Cunyu Yan, Pablo Carbonell, Katherine A. Hollywood, Andrew Currin, Neil Swainston, Rosalind Le Feuvre, Jason Micklefield, Jean-Loup Faulon, Rainer Breitling, Nicholas Turner, Eriko Takano, Nigel S. Scrutton

SYNTHETIC BIOLOGY (2020)

Article Biochemical Research Methods

Three-state dynamics of zinc(II) complexes yielding significant antidiabetic targets

Nousheen Parvaiz, Asma Abro, Syed Sikander Azam

Summary: Protein Tyrosine Phosphatase 1B (PTP1B) is a negative regulator of insulin signaling pathways and has potential as a medicinal target. This study explores the binding and conformational orientation of zinc(II) complexes in PTP1B using advanced computational methods. The findings suggest that zinc(II) complexes can bind to important residues in the enzyme and inhibit its activity.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

A computational insight into enhancement of photovoltaic properties of non-fullerene acceptors by end-group modulations in the structural framework of INPIC molecule

Hira Zubair, Muhamed Salim Akhter, Muhammad Waqas, Mariam Ishtiaq, Ijaz Ahmed Bhatti, Javed Iqbal, Ahmed M. Skawky, Rasheed Ahmad Khera

Summary: Improving open-circuit voltage is crucial for enhancing the overall efficiency of organic solar cells. This study successfully improved the open-circuit voltage by modulating the molecular structure and proposed a promising design concept for acceptor molecules that may contribute to the development of advanced organic solar cells.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Fragment databases from screened ligands for drug discovery (FDSL-DD)

Jerica Wilson, Bahrad A. Sokhansanj, Wei Chuen Chong, Rohan Chandraghatgi, Gail L. Rosen, Hai-Feng Ji

Summary: Fragment-based drug design is a computer-aided drug discovery method, however, it has limitations in processing time and success rate. In this study, a new method called Fragment Databases from Screened Ligands Drug Design (FDSL-DD) was proposed, which intelligently incorporates fragment characteristics into the drug design process to improve the binding affinity between drugs and protein targets.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Multiscale modeling of nanoindentation and nanoscratching by generalized particle method

M. Chamani, G. H. Farrahi

Summary: This paper employs the Generalized Particle (GP) method to simulate nanoindentation and nanoscratching, showing that this method maintains consistent atomic properties across different scales and achieves results consistent with full atomic simulations.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Understanding the contagiousness of Covid-19 strains: A geometric approach

Paola Vottero, Elena Carlotta Olivetti, Lucia Chiara D'Agostino, Luca Di Grazia, Enrico Vezzetti, Maral Aminpour, Jacek Adam Tuszynski, Federica Marcolin

Summary: This study aims to characterize the spike protein of the SARS-CoV-2 virus and investigate its interaction with the ACE2 receptor using a geometric analysis. The 3D depth maps of the proteins are filtered using a specific convolutional filter to obtain geometric features. Geometric descriptors and a Support Vector Machine classifier are used for feature extraction and classification, revealing the geometrical reasons for the higher contagiousness of the Omicron variant compared to other variants.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Optimizing biodegradable plastics: Molecular dynamics insights into starch plasticization with glycerol and oleic acid

Diana Margarita Mojica-Munoz, Karla Lizbeth Macias-Sanchez, Estefania Odemaris Juarez-Hernandez, Aurora Rodriguez-Alvarez, Jean-Michel Grevy, Armando Diaz-Valle, Mauricio Carrillo-Tripp, Jose Marcos Falcon-Gonzalez

Summary: By employing molecular dynamics simulations, we investigated the molecular mechanisms underlying the plasticization of starch. Our study revealed that chain size affects the solubility of starch, temperature influences its diffusivity and elastic properties, and oleic acid shows potential as an alternative plasticizer. Blending glycerol or oleic acid with water enhances the elasticity of starch.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

A fragment-based exploration of diverse MMP-9 inhibitors through classification-dependent structural assessment

Sandip Kumar Baidya, Suvankar Banerjee, Balaram Ghosh, Tarun Jha, Nilanjan Adhikari

Summary: This study utilized classification-based QSAR techniques and fragment-based data mining to analyze different MMP-9 inhibitors, revealing the importance of certain molecular fragments in MMP-9 inhibition. These findings have implications for the development of effective MMP-9 inhibitors in the future.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Effect of the bare and functionalized single-wall carbon nanotubes on inhibition of asphaltene molecules aggregation: A molecular dynamic simulation

Farid Faraji Chanzab, Saber Mohammadi, Fatemeh Alemi Mahmoudi

Summary: A comprehensive study using molecular dynamics technique was conducted to investigate the behavior of PAP molecules in a n-heptane/toluene solution and the role of SWCNTs, both bare and functionalized with carboxyl groups, in the aggregation of PAP molecules. The study found that the CNTs hindered the association of PAP molecules through steric hindrance and adsorption mechanisms. The presence of carboxyl groups on the CNTs improved the stability and adsorption of PAP molecules. The results have implications for future research on controlling asphaltene precipitation in the oil industry.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

An exact and vigorous kinetic Monte Carlo simulation to determine the properties of bimodal HDPE synthesized with a dual-site metallocene catalyst

Ramin Bairami Habashi, Mohammad Najafi, Reza Zarghami

Summary: A vigorous Monte Carlo strategy was developed to simulate the copolymerization of ethylene and 1-butene using a dual-site metallocene catalyst. The results showed that the second catalyst site had higher activity than the first site, with ethylene and 1-butene consumption rates five times higher and hydrogen transfer rates three times faster. The molar percentage of 1-butene in the copolymers synthesized from the second site was around 12%, while in the copolymers from the first site it was around 2%. Increasing the 1-butene concentration led to an increase in overall molecular weight, while increasing the hydrogen concentration resulted in a decrease in molecular weight. The ratio of ethylene to 1-butene affected the melt index and the weight fraction of crystals, with higher ratios leading to smaller melt indexes and higher weight fractions of crystals. Increasing the temperature caused changes in molecular weight, bimodal molecular weight distribution, crystal thickness and weight fraction, and density of HDPE.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Topological structures of DNA octahedrons determined by the number of ssDNA strands

Yufan Lu, Xingmin Guo, Shuya Liu

Summary: This paper investigates how to control the nontrivial topological structures of DNA nanocages by adjusting the number of ssDNA strands. A new algorithm and program are developed to calculate the component number of polyhedral links, filling the gap in computer programs on this aspect. The study provides a complete list of topological structures with different component numbers for DNA octahedrons assembled from one or more ssDNA strands.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Theoretical investigation of asphaltene molecules in crude oil viscoelasticity enhancement

Peng Cui, Shideng Yuan, Heng Zhang, Shiling Yuan

Summary: Understanding the mechanisms of viscosity enhancement in crude oil phases is crucial for optimizing extraction and transportation processes. This study employed molecular dynamics simulations to investigate the behavior and viscosification mechanism of asphaltene molecules in complex oil phases. The research suggests that electrostatic interactions and interactions between asphaltene and crude oil molecules contribute to the enhanced viscosity. The findings provide insight into the viscosity enhancement mechanisms in crude oil phases.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Computer-aided accurate calculation of interacted volumes for 3D isosurface point clouds of molecular electrostatic potential

Kun Lv, Jin Zhang, Xiaohua Liu, Yuqiao Zhou, Kai Liu

Summary: In this paper, the authors propose a robust method for evaluating the interactions between chiral catalysts and substrates using computer simulations. The method involves constructing 3D models from point cloud data, filtering out non-interacting points, determining interacting points, and accurately calculating interacted volumes. Experimental results demonstrate the effectiveness of the method in removing non-interacting points and calculating interacted volumes with low errors.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Theoretical models of staurosporine and analogs uncover detailed structural information in biological solution

Crisciele Fontana, Joao Luiz de Meirelles, Hugo Verli

Summary: By using the GROMOS force field and molecular simulations, this study assessed the dynamics of STA-analogs in aqueous solution and their interaction with water, expanding the knowledge of the conformational space of these ligands and providing potential implications for understanding conformational selection during complexation.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

Molecular dynamics simulations of the solubility and conformation change of chitosan grafted polyacrylamide: Impact of grafting rate

Wei Zhao, Wenjie Zou, Fengyang Liu, Fang Zhou, N. Emre Altun

Summary: The effect of grafting rate on the water solubility of chitosan-grafted polyacrylamide (Chi-gPAM) was investigated using molecular dynamics simulations. The results showed that the intramolecular hydrogen bonding of Chi-gPAM played a dominant role in its water solubility. Additionally, the interaction between Chi-gPAM and water increased with grafting rate.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)

Article Biochemical Research Methods

The effect of {O, N} = X ••• M = {Ti, Zr, Hf } interactions on the sensitivity of C-NO2 trigger bonds in FOX-7: Approach based on the QTAIM/EDA-NOCV analysis

Nassima Bachir, Samir Kenouche, Jorge I. Martinez-Araya

Summary: This study investigates the local chemical reactivity of FOX-7 and explores the interaction between the compound and different metals. The findings suggest that the stability and charge transfers of the compound are influenced by the metal involved, and the interaction between Metallocene Methyl Cations and the compound shows potential for neutralization.

JOURNAL OF MOLECULAR GRAPHICS & MODELLING (2024)