4.7 Article

Identification of sub-Golgi protein localization by use of deep representation learning features

期刊

BIOINFORMATICS
卷 36, 期 24, 页码 5600-5609

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa1074

关键词

-

资金

  1. National Natural Science Foundation of China [62001090, 91935302, 61922020, 61822108, 61771331]
  2. China Postdoctoral Science Foundation [2020M673184]

向作者/读者索取更多资源

Motivation: The Golgi apparatus has a key functional role in protein biosynthesis within the eukaryotic cell with malfunction resulting in various neurodegenerative diseases. For a better understanding of the Golgi apparatus, it is essential to identification of sub-Golgi protein localization. Although some machine learning methods have been used to identify sub-Golgi localization proteins by sequence representation fusion, more accurate sub-Golgi protein identification is still challenging by existing methodology. Results: we developed a protein sub-Golgi localization identification protocol using deep representation learning features with 107 dimensions. By this protocol, we demonstrated that instead of multi-type protein sequence feature representation fusion as in previous state-of-the-art sub-Golgi-protein localization classifiers, it is sufficient to exploit only one type of feature representation for more accurately identification of sub-Golgi proteins. Compared with independent testing results for benchmark datasets, our protocol is able to perform generally, reliably and robustly for sub-Golgi protein localization prediction. Availabilityand implementation: A use-friendly webserver is freely accessible at http://isGP-DRLF.aibiochem.net and the prediction code is accessible at https://github.com/zhibinlv/isGP-DRLF. Contact: zouquan@nclab.net or qhjiang@hit.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Review Biotechnology & Applied Microbiology

Prediction of bio-sequence modifications and the associations with diseases

Chunyan Ao, Liang Yu, Quan Zou

Summary: This review comprehensively summarizes the predictors for protein, RNA, and DNA modification sites and their association with diseases, emphasizing the importance of accurately identifying and understanding modification sites for disease research.

BRIEFINGS IN FUNCTIONAL GENOMICS (2021)

Article Biochemistry & Molecular Biology

sgRNACNN: identifying sgRNA on-target activity in four crops using ensembles of convolutional neural networks

Mengting Niu, Yuan Lin, Quan Zou

Summary: The study introduced an ensemble convolutional neural network model to accurately predict high on-target sgRNA activity in four crops, demonstrating the importance of machine learning methods in guiding crop gene editing and academic research.

PLANT MOLECULAR BIOLOGY (2021)

Article Biochemical Research Methods

DeepAc4C: a convolutional neural network model with hybrid features composed of physicochemical patterns and distributed representation information for identification of N4-acetylcytidine in mRNA

Chao Wang, Ying Ju, Quan Zou, Chen Lin

Summary: A novel tool, DeepAc4C, was developed to identify ac4C using convolutional neural networks, achieving better and more balanced performance than existing predictors. By evaluating the impact of specific features on model predictions and their interaction effects, several interesting sequence motifs specific to ac4C were identified.

BIOINFORMATICS (2022)

Article Biochemical Research Methods

prPred-DRLF: Plant R protein predictor using deep representation learning features

Yansu Wang, Lei Xu, Quan Zou, Chen Lin

Summary: The study introduces a new computational approach called prPred-DRLF, which accurately predicts plant R proteins using deep representation learning models. The results show that prPred-DRLF outperforms traditional methods in plant R protein prediction tasks.

PROTEOMICS (2022)

Article Computer Science, Information Systems

Mul-SNO: A Novel Prediction Tool for S-Nitrosylation Sites Based on Deep Learning Methods

Qian Zhao, Jiaqi Ma, Yu Wang, Fang Xie, Zhibin Lv, Yaoqun Xu, Hua Shi, Ke Han

Summary: SNO is crucial for plant immune response and human disease treatment, with the efficient prediction tool Mul-SNO showing promising results.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2022)

Article Biochemistry & Molecular Biology

Identify Bitter Peptides by Using Deep Representation Learning Features

Jici Jiang, Xinxu Lin, Yueqi Jiang, Liangzhen Jiang, Zhibin Lv

Summary: This study presents the development of a machine learning prediction method called iBitter-DRLF, based on deep learning techniques, to accurately identify bitter peptides. By utilizing deep representation learning, this method can make accurate predictions solely based on peptide sequence data. This is of significant importance for improving the palatability of peptide therapeutics and dietary supplements.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2022)

Article Genetics & Heredity

Using Sequence Similarity Based on CKSNP Features and a Graph Neural Network Model to Identify miRNA-Disease Associations

Mingxin Li, Yu Fan, Yiting Zhang, Zhibin Lv

Summary: The research focused on the impact of different feature information of miRNA sequences on the relationship between miRNA and disease. It found that a better graph neural network prediction model of miRNA-disease relationship could be built using CKSNAP feature, and the predicted miRNAs related to lung tumors, esophageal tumors, and kidney tumors were consistent with the wet experiment validation database.
Article Genetics & Heredity

Dynamic transcriptome analysis suggests the key genes regulating seed development and filling in Tartary buckwheat (Fagopyrum tataricum Garetn.)

Liangzhen Jiang, Changying Liu, Yu Fan, Qi Wu, Xueling Ye, Qiang Li, Yan Wan, Yanxia Sun, Liang Zou, Dabing Xiang, Zhibin Lv

Summary: - This study assessed the transcriptional dynamics of filling stage Tartary buckwheat seeds and identified key genes related to seed development through RNA sequencing. Phytohormones ABA, AUX, ET, BR and CTK, along with related TFs, were found to substantially regulate seed development by targeting downstream expansin genes and structural starch biosynthetic genes. The transcriptome data could serve as a theoretical basis for improving the yield of Tartary buckwheat.

FRONTIERS IN GENETICS (2022)

Editorial Material Genetics & Heredity

Editorial: Machine learning for biological sequence analysis

Zhibin Lv, Mingxin Li, Yansu Wang, Quan Zou

FRONTIERS IN GENETICS (2023)

Article Food Science & Technology

IUP-BERT: Identification of Umami Peptides Based on BERT Features

Liangzhen Jiang, Jici Jiang, Xiao Wang, Yin Zhang, Bowen Zheng, Shuqi Liu, Yiting Zhang, Changying Liu, Yan Wan, Dabing Xiang, Zhibin Lv

Summary: This study developed a peptide sequence-based umami peptide predictor, iUP-BERT, using a deep learning pretrained neural network feature extraction method. After optimization, the model showed improved performance compared to existing methods. The built iUP-BERT web server can aid in improving the palatability of dietary supplements.
Article Chemistry, Multidisciplinary

Identification of Thermophilic Proteins Based on Sequence-Based Bidirectional Representations from Transformer-Embedding Features

Hongdi Pei, Jiayu Li, Shuhan Ma, Jici Jiang, Mingxin Li, Quan Zou, Zhibin Lv

Summary: Thermophilic proteins have the potential to be used as biocatalysts in biotechnology. BertThermo, a model using BERT as an automatic feature extraction tool, achieved high accuracy in identifying thermophilic proteins. It outperformed previous predictive algorithms and demonstrated robustness in various datasets.+

APPLIED SCIENCES-BASEL (2023)

Article Food Science & Technology

A Machine Learning Method to Identify Umami Peptide Sequences by Using Multiplicative LSTM Embedded Features

Jici Jiang, Jiayu Li, Junxian Li, Hongdi Pei, Mingxin Li, Quan Zou, Zhibin Lv

Summary: A deep learning method called iUmami-DRLF was developed to identify umami peptides based solely on peptide sequence information. The results show that deep learning significantly improved the capability of models in identifying umami peptides. This method can be used to further enhance the umami flavor of food for a satisfying umami-flavored diet.
Article Biochemistry & Molecular Biology

Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides

Yiting Deng, Shuhan Ma, Jiayu Li, Bowen Zheng, Zhibin Lv

Summary: Anticancer peptides (ACPs) are a promising new therapeutic approach in cancer treatment, as they can selectively target cancer cells. This study utilized machine learning algorithms to predict potential ACP sequences based on physicochemical features extracted from peptide sequences. By using feature selection methods, 19 key amino acid physicochemical properties were identified that can predict the likelihood of a peptide sequence functioning as an ACP. The study aims to enhance the efficiency of designing peptide sequences for cancer treatment.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2023)

Article Chemistry, Multidisciplinary

A Stacking Machine Learning Method for IL-10-Induced Peptide Sequence Recognition Based on Unified Deep Representation Learning

Jiayu Li, Jici Jiang, Hongdi Pei, Zhibin Lv

Summary: A new IL-10-induced peptide recognition method called IL10-Stack was introduced in this research, which utilized unified deep representation learning and a stacking algorithm. Feature extraction from peptide sequences was done using two approaches, Amino Acid Index (AAindex) and sequence-based unified representation (UniRep). The IL10-Stack model, constructed using a 1900-dimensional UniRep feature vector, demonstrated excellent performance in IL-10-induced peptide recognition with an accuracy of 0.910 and MCC of 0.820. Compared to existing methods, IL-10Pred and ILeukin10Pred, the IL10-Stack approach showed improved accuracy by 12.1% and 2.4% respectively. The IL10-Stack method has the potential to identify IL-10-induced peptides, aiding in the development of immunosuppressive drugs.

APPLIED SCIENCES-BASEL (2023)

Review Multidisciplinary Sciences

Review of T cell proliferation regulatory factors in treatment and prognostic prediction for solid tumors

Jiayu Li, Shuhan Ma, Hongdi Pei, Jici Jiang, Quan Zou, Zhibin Lv

Summary: This review focuses on the development of Tcprs for solid tumor therapy and prognostic prediction, and proposes strategies to enhance CAR-T cells through targeting different Tcprs, which may lead to the development of a new generation of cell therapies.

HELIYON (2023)

暂无数据