4.5 Review

Deep Learning in Proteomics

期刊

PROTEOMICS
卷 20, 期 21-22, 页码 -

出版社

WILEY
DOI: 10.1002/pmic.201900335

关键词

bioinformatics; deep learning; proteomics

资金

  1. National Cancer Institute (NCI) CPTAC [U24 CA210954]
  2. Cancer Prevention & Research Institutes of Texas (CPRIT) [RR160027]
  3. McNair Medical Institute at The Robert and Janice McNair Foundation

向作者/读者索取更多资源

Proteomics, the study of all the proteins in biological systems, is becoming a data-rich science. Protein sequences and structures are comprehensively catalogued in online databases. With recent advancements in tandem mass spectrometry (MS) technology, protein expression and post-translational modifications (PTMs) can be studied in a variety of biological systems at the global scale. Sophisticated computational algorithms are needed to translate the vast amount of data into novel biological insights. Deep learning automatically extracts data representations at high levels of abstraction from data, and it thrives in data-rich scientific research domains. Here, a comprehensive overview of deep learning applications in proteomics, including retention time prediction, MS/MS spectrum prediction, de novo peptide sequencing, PTM prediction, major histocompatibility complex-peptide binding prediction, and protein structure prediction, is provided. Limitations and the future directions of deep learning in proteomics are also discussed. This review will provide readers an overview of deep learning and how it can be used to analyze proteomics data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Chemistry, Analytical

pDeep3: Toward More Accurate Spectrum Prediction with Fast Few-Shot Learning

Ching Tarn, Wen-Feng Zeng

Summary: This study adopts few-shot learning method to enhance the prediction accuracy of deep learning spectrum prediction, validated on multiple datasets, showing significant improvement in prediction accuracy within seconds.

ANALYTICAL CHEMISTRY (2021)

Article Medicine, Research & Experimental

[Mn(PaPy2Q)(NO)]ClO4, a Near-Infrared Light activated release of Nitric Oxide drug as a nitric oxide donor for therapy of human prostate cancer cells in vitro and in vivo

Yuwan Zhao, Zhuo Li, Huancheng Tang, Shanhong Lin, Wenfeng Zeng, Dongcai Ye, Xin Zeng, Qiuming Luo, Jianwei Li, Zhixian Ao, Jierong Mo, Lixin Chen, Yiqiu Yang, Yunsheng Huang, Jianjun Liu

Summary: This study investigated the synthesis of a near-infrared light-sensitive NO prodrug and its effects on prostate cancer cells. The results showed that the drug effectively inhibited cell proliferation and promoted apoptosis in a concentration-dependent manner. Furthermore, in vivo experiments demonstrated the anti-cancer effects of the drug, with increased NO concentration in tumors after near-infrared light irradiation.

BIOMEDICINE & PHARMACOTHERAPY (2021)

Article Biochemical Research Methods

pDeepXL: MS/MS Spectrum Prediction for Cross-Linked Peptide Pairs by Deep Learning

Zhen-Lin Chen, Peng-Zhi Mao, Wen-Feng Zeng, Hao Chi, Si-Min He

Summary: pDeepXL is a deep learning tool for predicting MS/MS spectra of cross-linked peptide pairs. Trained using transfer learning, it accurately predicts spectra of both noncleavable and cleavable cross-linked peptide pairs, and shows improved robustness through online fine-tuning. Integration of pDeepXL into a database search engine increases the identification of cross-link spectra by 18% on average.

JOURNAL OF PROTEOME RESEARCH (2021)

Article Computer Science, Interdisciplinary Applications

Eight-element fifth-generation multiple-input multiple-output antenna designed by modal currents cancelation

Wen-Feng Zeng, Qing-Xin Chu

Summary: An antenna decoupling method based on modal control is proposed in this paper, which excites a pair of decoupling modes simultaneously to achieve decoupling. The effectiveness of this method is validated through the analysis and design of a head-to-head antenna pair. Additionally, an eight-element MIMO antenna is designed, fabricated, and measured to demonstrate the good performance of the proposed method.

INTERNATIONAL JOURNAL OF RF AND MICROWAVE COMPUTER-AIDED ENGINEERING (2022)

Article Biochemistry & Molecular Biology

The structural context of posttranslational modifications at a proteome-wide scale

Isabell Bludau, Sander Willems, Wen-Feng Zeng, Maximilian T. Strauss, Fynn M. Hansen, Maria C. Tanzer, Ozge Karayel, Brenda A. Schulman, Matthias Mann

Summary: The recent revolution in computational protein structure prediction has provided new insights into the study of the entire proteome. In this study, the researchers analyze posttranslational modifications (PTMs) of proteins to determine their structural context and investigate their potential regulatory sites. The analysis reveals global patterns of PTM occurrence and spatial coregulation of different types of PTMs.

PLOS BIOLOGY (2022)

Article Biochemical Research Methods

OmicsEV: a tool for comprehensive quality evaluation of omics data tables

Bo Wen, Eric J. Jaehnig, Bing Zhang

Summary: OmicsEV is an R package that evaluates the quality of omics data tables by using various methods to assess depth, normalization, biological signal, and other factors. It generates comprehensive visual and quantitative evaluation results to help assess data quality and determine the optimal data processing method and parameters.

BIOINFORMATICS (2022)

Article Engineering, Electrical & Electronic

Design of Self-Decoupling Dielectric Resonator Antenna With Shared Radiator

Yu-Zhong Liang, Fu-Chang Chen, Wen-Feng Zeng, Qing-Xin Chu

Summary: This communication investigates the method of mode cancellation for designing a two-port dielectric resonator antenna (DRA) for in-band full-duplex (IBFD) applications. The antenna structure is simple, consisting only of a single DRA element, a pair of feeding lines, and a pair of metallic probes. By utilizing different modes, the mutual coupling between the exciting port and the passive port can be suppressed to a very low level without the need for an extra decoupling structure. A prototype is fabricated and measured to verify the design, with the results demonstrating broad bandwidth and high isolation throughout the working band.

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION (2023)

Article Multidisciplinary Sciences

AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics

Wen-Feng Zeng, Xie-Xuan Zhou, Sander Willems, Constantin Ammar, Maria Wahle, Isabell Bludau, Eugenia Voytik, Maximillian T. Strauss, Matthias Mann

Summary: Machine learning and deep learning are becoming increasingly important in MS-based proteomics. AlphaPeptDeep is a modular Python framework built on PyTorch that can learn and predict peptide properties. It features a model shop that allows non-specialists to create models easily. AlphaPeptDeep can also predict sequence-based properties and performs well in predicting retention time, collisional cross sections, and fragment intensities.

NATURE COMMUNICATIONS (2022)

Article Multidisciplinary Sciences

pGlycoQuant with a deep residual network for quantitative glycoproteomics at intact glycopeptide level

Siyuan Kong, Pengyun Gong, Wen-Feng Zeng, Biyun Jiang, Xinhang Hou, Yang Zhang, Huanhuan Zhao, Mingqi Liu, Guoquan Yan, Xinwen Zhou, Xihua Qiao, Mengxi Wu, Pengyuan Yang, Chao Liu, Weiqian Cao

Summary: pGlycoQuant is a generic tool for quantitative analysis of intact glycopeptides using both primary and tandem mass spectrometry. It employs a deep learning model and a Match In Run algorithm to improve glycopeptide matching and expand the quantitative function of various search engines. Its application in N-glycoproteomic study demonstrates its potential in exploring site-specific glycosylation and its role in biological processes.

NATURE COMMUNICATIONS (2022)

Article Cardiac & Cardiovascular Systems

CYP2C19 loss-of-function is associated with increased risk of hypertension in a Hakka population: a case-control study

Nan Cai, Cunren Li, Xianfang Gu, Wenfeng Zeng, Jiawei Zhong, Jingfeng Liu, Guopeng Zeng, Junxing Zhu, Haifeng Hong

Summary: The study found that there is a relationship between CYP2C19 gene polymorphisms and hypertension in the Hakka population. Loss-of-function genotypes of CYP2C19 increase the risk of hypertension.

BMC CARDIOVASCULAR DISORDERS (2023)

Article Medicine, Research & Experimental

Quantitative multiorgan proteomics of fatal COVID-19 uncovers tissue-specific effects beyond inflammation

Lisa Schweizer, Tina Schaller, Maximilian Zwiebel, Oezge Karayel, Johannes Bruno Mueller-Reif, Wen-Feng Zeng, Sebastian Dintner, Thierry M. Nordmann, Klaus Hirschbuehl, Bruno Maerkl, Rainer Claus, Matthias Mann

Summary: SARS-CoV-2 can cause damage to lung tissue and other organs in the human body, and this study aimed to analyze these effects comprehensively. Using a mass spectrometry proteomics workflow, the researchers identified inflammatory responses as the initial reaction in all tissues. They also found specific patterns of damage in different organs, such as diffuse alveolar damage in the lungs and organ-specific changes in the kidneys, liver, and lymphatic and vascular systems. In the brain, secondary inflammatory effects were linked to neurotransmitter receptors and myelin degradation. These findings contribute to our understanding of the mechanisms of COVID-19 and provide insights for organ-specific therapeutic interventions.

EMBO MOLECULAR MEDICINE (2023)

Article Biochemistry & Molecular Biology

Robust dimethyl-based multiplex-DIA doubles single-cell proteome depth via a reference channel

Marvin Thielert, Ericka C. M. Itang, Constantin Ammar, Florian A. Rosenberger, Isabell Bludau, Lisa Schweizer, Thierry M. Nordmann, Patricia Skowronek, Maria Wahle, Wen-Feng Zeng, Xie-Xuan Zhou, Andreas-David Brunner, Sabrina Richter, Mitchell P. Levesque, Fabian J. Theis, Martin Steger, Matthias Mann

Summary: Single-cell proteomics allows unbiased characterization of biological function and heterogeneity at the protein level. However, current limitations include proteomic depth, throughput, and robustness. In this study, we introduce a streamlined multiplexed workflow using mDIA to address these limitations. Our approach enables automated and complete dimethyl labeling of bulk or single-cell samples, without compromising proteomic depth. We also demonstrate the ability to quantify twice as many proteins per single cell compared to previous methods, and our workflow allows routine analysis of 80 single cells per day. Additionally, we combine mDIA with spatial proteomics to increase the throughput for microdissection and MS analysis, and successfully identify proteomic signatures of cells within distinct tumor microenvironments in primary cutaneous melanoma.

MOLECULAR SYSTEMS BIOLOGY (2023)

Article Multidisciplinary Sciences

Efficient cancer modeling through CRISPR-Cas9/HDR-based somatic precision gene editing in mice

Wen Bu, Chad J. Creighton, Kelsey S. Heavener, Carolina Gutierrez, Yongchao Dou, Amy T. Ku, Yiqun Zhang, Weiyu Jiang, Jazmin Urrutia, Wen Jiang, Fei Yue, Luyu Jia, Ahmed Atef Ibrahim, Bing Zhang, Shixia Huang, Yi Li

Summary: Technological modifications to the CRISPR-Cas9 vector system allow for precise gene editing in mice, generating tumor models with high flexibility and efficiency. This advancement bridges the gap between CRISPR technology and accurate mouse models, providing more consistent models for studying human tumor evolution and drug testing.

SCIENCE ADVANCES (2023)

Article Radiology, Nuclear Medicine & Medical Imaging

Toward Practical Integration of Omic and Imaging Data in Co-Clinical Trials

Emel Alkim, Heidi Dowst, Julie DiCarlo, Lacey E. Dobrolecki, Anadulce Hernandez-Herrera, David A. Hormuth II, Yuxing Liao, Apollo McOwiti, Robia Pautler, Mothaffar Rimawi, Ashley Roark, Ramakrishnan Rajaram Srinivasan, Jack Virostko, Bing Zhang, Fei Zheng, Daniel L. Rubin, Thomas E. Yankeelov, Michael T. Lewis

Summary: Co-clinical trials involve evaluating therapeutics in both patients and patient-derived xenografts (PDX) to determine how well PDX responses match patient responses, in order to inform pre-clinical and clinical trials. The challenge lies in managing and analyzing the vast amount of data generated across different scales and species. To overcome this challenge, a web-based tool called MIRACCL is being developed to correlate MRI-based changes in tumor characteristics with mRNA expression data in a co-clinical trial setting.

TOMOGRAPHY (2023)

Article Engineering, Electrical & Electronic

Antenna Decoupling Based on Characteristic Modes Cancellation

Qingxin Chu, Wenfeng Zeng

Summary: This paper focuses on the antenna coupling within the MIMO system in 5G and summarizes decoupling techniques. It also elaborates on the new decoupling research developments based on the theory of characteristic mode and provides design examples to validate the proposed decoupling method.

CHINESE JOURNAL OF ELECTRONICS (2022)

暂无数据