Article

A representation transfer learning approach for enhanced prediction of growth hormone binding proteins

Journal

COMPUTATIONAL BIOLOGY AND CHEMISTRY
Volume 87

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.compbiolchem.2020.107274

Keywords

Growth hormone binding proteins; Autoencoders; Feature selection; SMO-PolyK; Generalized low rank models; Principal component analysis; t-SNE


Growth hormone binding proteins (GHBPs) are soluble proteins that play an important role in modulating growth hormone signaling pathways. GHBPs are selective and bind non-covalently to growth hormones, but their functions are still not fully understood. Identifying and characterizing GHBPs is a preliminary step toward understanding their roles in various cellular processes. Because wet-lab experimental methods are costly and labor-intensive, computational methods can help narrow the search space of putative GHBPs. The performance of machine learning algorithms depends largely on the quality of the features they are given. Informative, non-redundant features generally improve performance, and feature selection algorithms are commonly used to obtain them. In the present work, a novel representation transfer learning approach is presented for the prediction of GHBPs. Deep autoencoder-based features were extracted, and an SMO-PolyK classifier was then trained on them. The prediction model was evaluated by both leave-one-out cross-validation (LOOCV) and a hold-out independent testing set. On LOOCV, the model achieved 89.8% accuracy, with 89.4% sensitivity and 90.2% specificity; on the hold-out testing set, it attained 93.5% accuracy, with 90.2% sensitivity and 96.8% specificity. Prediction accuracy was further compared across four feature representations: the full set of sequence-based features, the top-performing sequence features chosen by a feature selection algorithm, the deep autoencoder-based features, and generalized low rank model-based features. Principal component analysis of the representative features, along with t-SNE visualization, demonstrated the effectiveness of deep features in the prediction of GHBPs. The present method is robust and accurate and may complement wet-lab methods for the identification of novel GHBPs.
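The evaluation protocol described in the abstract (LOOCV scored by accuracy, sensitivity and specificity) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy two-dimensional feature vectors, the function names, and in particular the nearest-centroid classifier, which is only a stand-in and is not the SMO-PolyK classifier or the autoencoder features used in the paper.

```python
from typing import Callable, Dict, List, Tuple

Vector = List[float]
Predictor = Callable[[List[Vector], List[int], Vector], int]


def nearest_centroid_predict(train_x: List[Vector], train_y: List[int], query: Vector) -> int:
    """Stand-in classifier (NOT SMO-PolyK): pick the class whose mean vector is closest."""
    best_label, best_dist = -1, float("inf")
    for label in sorted(set(train_y)):
        members = [x for x, y in zip(train_x, train_y) if y == label]
        centroid = [sum(col) / len(members) for col in zip(*members)]
        dist = sum((q - c) ** 2 for q, c in zip(query, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def loocv_confusion(xs: List[Vector], ys: List[int], predict: Predictor) -> Tuple[int, int, int, int]:
    """Leave-one-out CV: hold out each sample once, train on the rest, count outcomes."""
    tp = tn = fp = fn = 0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        pred = predict(train_x, train_y, xs[i])
        if ys[i] == 1:
            tp, fn = (tp + 1, fn) if pred == 1 else (tp, fn + 1)
        else:
            tn, fp = (tn + 1, fp) if pred == 0 else (tn, fp + 1)
    return tp, tn, fp, fn


def summarize(tp: int, tn: int, fp: int, fn: int) -> Dict[str, float]:
    """Accuracy, sensitivity (true positive rate) and specificity (true negative rate)."""
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Hypothetical toy data: label 1 = positive class, label 0 = negative class.
features = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.3], [1.0, 0.9], [0.8, 1.1], [0.2, 0.2]]
labels = [0, 0, 0, 1, 1, 1]

counts = loocv_confusion(features, labels, nearest_centroid_predict)
metrics = summarize(*counts)
print(counts, metrics)
```

Because the classifier is passed in as a function, the same loop could score any model trained on any of the feature representations being compared; only the `predict` argument would change.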


Recommended

Article Mathematical & Computational Biology

Enhanced Prediction and Characterization of CDK Inhibitors Using Optimal Class Distribution

Abhigyan Nath, S. Karthikeyan

INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES (2017)

Article Computer Science, Artificial Intelligence

Enhanced Prediction for Observed Peptide Count in Protein Mass Spectrometry Data by Optimally Balancing the Training Dataset

Anoop Kumar Tiwari, Abhigyan Nath, Karthikeyan Subbiah, Kaushal Kumar Shukla

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (2017)

Article Biology

Enhanced prediction of recombination hotspots using input features extracted by class specific autoencoders

Abhigyan Nath, S. Karthikeyan

JOURNAL OF THEORETICAL BIOLOGY (2018)

Article Biology

Prediction and molecular insights into fungal adhesins and adhesin like proteins

Abhigyan Nath

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2019)

Article Biology

Exploiting ensemble learning to improve prediction of phospholipidosis inducing potential

Abhigyan Nath, Gopal Krishna Sahu

JOURNAL OF THEORETICAL BIOLOGY (2019)

Article Biotechnology & Applied Microbiology

Identification and characterization of trait-specific SNPs using ddRAD sequencing in water buffalo

D. C. Mishra, Poonam Sikka, Sunita Yadav, Jyotika Bhati, S. S. Paul, A. Jerome, Inderjeet Singh, Abhigyan Nath, Neeraj Budhlakoti, A. R. Rao, Anil Rai, K. K. Chaturvedi

GENOMICS (2020)

Article Veterinary Sciences

Inferring Relationship of Blood Metabolic Changes and Average Daily Gain With Feed Conversion Efficiency in Murrah Heifers: Machine Learning Approach

Poonam Sikka, Abhigyan Nath, Shyam Sundar Paul, Jerome Andonissamy, Dwijesh Chandra Mishra, Atmakuri Ramakrishna Rao, Ashok Kumar Balhara, Krishna Kumar Chaturvedi, Keerti Kumar Yadav, Sunesh Balhara

FRONTIERS IN VETERINARY SCIENCE (2020)

Article Biochemistry & Molecular Biology

Evolving scenario of big data and Artificial Intelligence (AI) in drug discovery

Manish Kumar Tripathi, Abhigyan Nath, Tej P. Singh, A. S. Ethayathulla, Punit Kaur

Summary: The accumulation of massive data in Cheminformatics databases has made big data and artificial intelligence indispensable in drug design. The development of newer algorithms and architectures has fulfilled the specific needs of various drug discovery processes, while deep learning neural networks have resulted in a paradigm shift in chemical information mining.

MOLECULAR DIVERSITY (2021)

Article Biology

Prediction for understanding the effectiveness of antiviral peptides

Abhigyan Nath

Summary: The inefficiency of current antivirals and the resistance of viruses have led to the demand for novel antiviral agents. Antiviral peptides show promise as a potential avenue for developing effective antiviral drugs, with the ability to halt the progression of viral infections.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2021)

Article Biotechnology & Applied Microbiology

Estimation of maximum recommended therapeutic dose of anti-retroviral drugs using diversified sampling and varied descriptors

Roopshikha Sahu, Amisha Yadav, Abhigyan Nath

Summary: This study developed a machine learning-based prediction model for the maximum recommended therapeutic dose (MRTD) of anti-retroviral drugs. Using a feature selection algorithm and representative training/testing sets, a subset of top features was extracted, achieving good predictive performance.

MINERVA BIOTECHNOLOGY AND BIOMOLECULAR RESEARCH (2021)

Article Biochemical Research Methods

Improved cytokine-receptor interaction prediction by exploiting the negative sample space

Abhigyan Nath, Andre Leier

BMC BIOINFORMATICS (2020)

Article Biology

Netting into the Sophoretin pool: An approach to trace GSTP1 inhibitors for reversing chemoresistance

Kunal Bhattacharya, Shikha Mahato, Satyendra Deka, Nongmaithem Randhoni Chanu, Amit Kumar Shrivastava, Pukar Khanal

Summary: Chemoresistance, a major challenge in cancer treatment, is associated with the cellular glutathione-related detoxification system. A study has identified GSTP1 enzyme as critical in the inactivation of anticancer drugs and suggests the need for GSTP1 inhibitors to combat chemoresistance. Through molecular docking and simulations, the study found that quercetin 7-O-beta-D-glucoside showed promise as a potential candidate for addressing chemoresistance in cancer patients.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Structure and energetics of serum protein complex of tea adulterant dye Bismarck brown Y using experimental and computational methods

Manwi Shankar, Majji Sai Sudha Rani, Priyanka Gopi, P. Arsha, Prateek Pandya

Summary: This study investigates the interaction between the food dye BBY and the serum protein BSA. The results show that BBY binds to a specific site on BSA through hydrophobic interactions, affecting the structural stability of the protein. These findings enhance our understanding of the molecular-level interactions between BBY and BSA.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Implementing link prediction in protein networks via feature fusion models based on graph neural networks

Chi Zhang, Qian Gao, Ming Li, Tianfei Yu

Summary: In this study, we propose a graph neural network-based autoencoder model, AGraphSAGE, that effectively predicts protein-protein interactions across diverse biological species by integrating gene ontology.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Named entity recognition of rice genes and phenotypes based on BiGRU neural networks

Kangjie Wu, Liqian Xu, Xinxiang Li, Youhua Zhang, Zhenyu Yue, Yujia Gao, Yiqiong Chen

Summary: Named Entity Recognition (NER) is a crucial task in natural language processing (NLP) and big data analysis, with a wide range of applications. This paper proposes an improved neural network method for NER of rice genes and phenotypes, which can learn semantic information from context without feature engineering. Experimental results show that the proposed model outperforms other models.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Revisiting structural organization of proteins at high temperature from a network perspective

Suman Hait, Sudip Kundu

Summary: Interactions between amino acids in proteins are crucial for stability and structural integrity. Thermophiles have more interactions, and more stable ones, to survive in extreme environments. Different types of interactions are enriched in different structural regions.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

XL1R-Net: Explainable AI-driven improved L1-regularized deep neural architecture for NSCLC biomarker identification

Kountay Dwivedi, Ankit Rajpal, Sheetal Rajpal, Virendra Kumar, Manoj Agarwal, Naveen Kumar

Summary: This study aims to identify biomarkers for non-small cell lung cancer (NSCLC) using copy number variation (CNV) data. A novel deep learning architecture, XL1R-Net, is proposed to improve the classification accuracy for NSCLC subtyping. Twenty NSCLC-relevant biomarkers are uncovered using explainable AI (XAI)-based feature identification. The results show that the identified biomarkers have high classification performance and clinical relevance. Additionally, twelve of the biomarkers are potentially druggable and eighteen of them have a high probability of predicting NSCLC patients' survival likelihood according to the Drug-Gene Interaction Database and the K-M Plotter tool, respectively. This research suggests that investigating these seven novel biomarkers can contribute to NSCLC therapy, and the integration of multiomics data and other sources will help better understand NSCLC heterogeneity.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

AMPCDA: Prediction of circRNA-disease associations by utilizing attention mechanisms on metapaths

Pengli Lu, Wenqi Zhang, Jinkai Wu

Summary: Researchers have developed a computational method, AMPCDA, to predict circRNA-disease associations using predefined metapaths, achieving high predictive accuracy. This method effectively combines node embeddings with higher-order neighborhood representations and provides valuable guidance for revealing new disease mechanisms in biological research.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)