4.8 Article

Improved Peptide Retention Time Prediction in Liquid Chromatography through Deep Learning

Journal

ANALYTICAL CHEMISTRY
Volume 90, Issue 18, Pages 10881-10888

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.8b02386

Keywords

-

Funding

  1. National Key R&D Program of China [2017YFC0908400]
  2. National Natural Science Foundation of China [31500670]
  3. National Key Basic Research Program of China [2014CBA02002, 2014CBA02005]

Ask authors/readers for more resources

The accuracy of peptide retention time (RT) prediction model in liquid chromatography (LC) is still not sufficient for wider implementation in proteomics practice. Herein, we propose deep learning as an ideal tool to considerably improve this prediction. A new peptide RT prediction tool, DeepRT, was designed using a capsule network model, and the public data sets containing peptides separated by reverse-phase liquid chromatography were used to evaluate the DeepRT performance. Compared with other prevailing RT predictors, DeepRT attained overall improvement in the prediction of peptide RTs with an R-2 of similar to 0.994. Moreover, DeepRT was able to accommodate to the peptides that were separated by different types of LC, such as strong cation exchange (SCX) and hydrophilic interaction liquid chromatography (HILIC) and to reach the RT prediction with R-2 values of similar to 0.996 for SCX and similar to 0.993 for HILIC, respectively. If a large peptide data set is available for one type of LC, DeepRT can be promoted to DeepRT(+) using transfer learning. Based on a large peptide data set gained from SWATH, DeepRT(+) further elevated the accuracy of RT prediction for peptides in a small data set and enabled a satisfactory prediction upon limited peptides approximating hundreds. Further, DeepRT automatically learns retention-related properties of amino acids under different separation mechanisms, which are well consistent with retention coefficients (Rc) of the amino acids. DeepRT was thus proven to be an improved RT predictor with high flexibility and efficiency. DeepRT is available at https://github.com/horsepurve/DeepRTplus.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemistry & Molecular Biology

Chromosome-scale genomes reveal genomic consequences of inbreeding in the South China tiger: A comparative study with the Amur tiger

Le Zhang, Tianming Lan, Chuyu Lin, Wenyuan Fu, Yaohua Yuan, Kaixiong Lin, Haimeng Li, Sunil Kumar Sahu, Zhaoyang Liu, Daqing Chen, Qunxiu Liu, Aishan Wang, Xiaohong Wang, Yue Ma, Shizhou Li, Yixin Zhu, Xingzhuo Wang, Xiaotong Ren, Haorong Lu, Yunting Huang, Jieyao Yu, Boyang Liu, Qing Wang, Shaofang Zhang, Xun Xu, Huanming Yang, Dan Liu, Huan Liu, Yanchun Xu

Summary: The South China tiger is critically endangered due to functional extinction in the wild and inbreeding depression among the captive population. This research assembled and characterized the genomes of the South China tiger and six other tiger subspecies, revealing the genomic signatures of inbreeding depression in the South China tiger. The study provides important information for genetic management policies for the South China tiger.

MOLECULAR ECOLOGY RESOURCES (2023)

Correction Multidisciplinary Sciences

Patterns of somatic structural variation in human cancer genomes (vol 578, pg 112, 2020)

Yilong Li, Nicola D. Roberts, Jeremiah A. Wala, Ofer Shapira, Steven E. Schumacher, Kiran Kumar, Ekta Khurana, Sebastian Waszak, Jan O. Korbel, James E. Haber, Marcin Imielinski, Joachim Weischenfeldt, Rameen Beroukhim, Peter J. Campbell

NATURE (2023)

Correction Multidisciplinary Sciences

Analyses of non-coding somatic drivers in 2,658 cancer whole genomes (vol 578, pg 102, 2020)

Esther Rheinbay, Morten Muhlig Nielsen, Federico Abascal, Jeremiah A. Wala, Ofer Shapira, Grace Tiao, Henrik Hornshoj, Julian M. Hess, Randi Istrup Juul, Ziao Lin, Lars Feuerbach, Radhakrishnan Sabarinathan, Tobias Madsen, Jaegil Kim, Loris Mularoni, Shimin Shuai, Andres Lanzos, Carl Herrmann, Yosef E. Maruvka, Ciyue Shen, Samirkumar B. Amin, Pratiti Bandopadhayay, Johanna Bertl, Keith A. Boroevich, John Busanovich, Joana Carlevaro-Fita, Dimple Chakravarty, Calvin Wing Yiu Chan, David Craft, Priyanka Dhingra, Klev Diamanti, Nuno A. Fonseca, Abel Gonzalez-Perez, Qianyun Guo, Mark P. Hamilton, Nicholas J. Haradhvala, Chen Hong, Keren Isaev, Todd A. Johnson, Malene Juul, Andre Kahles, Abdullah Kahraman, Youngwook Kim, Jan Komorowski, Kiran Kumar, Sushant Kumar, Donghoon Lee, Kjong-Van Lehmann, Yilong Li, Eric Minwei Liu, Lucas Lochovsky, Keunchil Park, Oriol Pich, Nicola D. Roberts, Gordon Saksena, Steven E. Schumacher, Nikos Sidiropoulos, Lina Sieverling, Nasa Sinnott-Armstrong, Chip Stewart, David Tamborero, Jose M. C. Tubio, Husen M. Umer, Liis Uuskuela-Reimand, Claes Wadelius, Lina Wadi, Xiaotong Yao, Cheng-Zhong Zhang, Jing Zhang, James E. Haber, Asger Hobolth, Marcin Imielinski, Manolis Kellis, Michael S. Lawrence, Christian von Mering, Hidewaki Nakagawa, Benjamin J. Raphael, Mark A. Rubin, Chris Sander, Lincoln D. Stein, Joshua M. Stuart, Tatsuhiko Tsunoda, David A. Wheeler, Rory Johnson, Jueri Reimand, Mark Gerstein, Ekta Khurana, Peter J. Campbell, Nuria Lopez-Bigas, Joachim Weischenfeldt, Rameen Beroukhim, Inigo Martincorena, Jakob Skou Pedersen, Gad Getz

NATURE (2023)

Review Public, Environmental & Occupational Health

Towards precision medicine: Omics approach for COVID-19

Xiaoping Cen, Fengao Wang, Xinhe Huang, Dragomirka Jovic, Fred Dubee, Huanming Yang, Yixue Li

Summary: This article reviews the role of omics technologies in studying COVID-19, including genomics, proteomics, single-cell multi-omics, and clinical phenomics. Large-scale sequencing and advanced analysis methods contribute to the understanding of virus evolution, prediction of severity risk, and identification of potential treatments. Omics technologies enable precise and global prevention and medicine for COVID-19 by utilizing big data capability and phenotypes refinement. Additionally, deep learning models can be used to decode the evolution rule of SARS-CoV-2, forecast new variants, and prevent future pandemics.

BIOSAFETY AND HEALTH (2023)

Article Chemistry, Analytical

LC-MS/MS-Based Absolute Quantitation of Hemoglobin Subunits from Dried Blood Spots Reveals Novel Biomarkers for α-Thalassemia Silent Carriers

Zhe Ren, Guoying Sun, Qianqian Zhang, Shaomin Zou, Jianhong Chen, Weining Zhao, Guixue Hou, Zeyan Zhong, Jialong Li, Yuhua Ye, Xiangmin Xu, Liang Lin

Summary: A LC-MS/MS-based approach was used to discover unique expression patterns of hemoglobin subunits in different alpha-thalassemia subtypes. Hemoglobin subunit mu showed significant upregulation in silent alpha-thalassemia patients, indicating its potential as a novel biomarker for clinical screening.

ANALYTICAL CHEMISTRY (2023)

Article Multidisciplinary Sciences

Mendelian randomization analyses reveal causal relationships between the human microbiome and longevity

Xiaomin Liu, Leying Zou, Chao Nie, Youwen Qin, Xin Tong, Jian Wang, Huanming Yang, Xun Xu, Xin Jin, Liang Xiao, Tao Zhang, Junxia Min, Yi Zeng, Huijue Jia, Yong Hou

Summary: Although the association between human microbiome, especially gut microbiota, and longevity has been revealed in recent studies, the causality between them is still unclear. This study used bidirectional two-sample Mendelian randomization (MR) analyses to assess the relationship between the human microbiome (gut and oral microbiota) and longevity. The findings suggest that certain disease-protected gut microbiota and probiotics are associated with increased odds of longevity, while other gut microbiota are negatively associated with longevity.

SCIENTIFIC REPORTS (2023)

Article Biotechnology & Applied Microbiology

Intermittent fasting modulates the intestinal microbiota and improves obesity and host energy metabolism

Xiangwei Hu, Kai Xia, Minhui Dai, Xiaofeng Han, Peng Yuan, Jia Liu, Shiwei Liu, Fuhuai Jia, Jiayu Chen, Fangfang Jiang, Jieyao Yu, Huanming Yang, Jian Wang, Xun Xu, Xin Jin, Karsten Kristiansen, Liang Xiao, Wei Chen, Mo Han, Shenglin Duan

Summary: Intermittent fasting is a promising weight loss method that modulates the gut microbiota. A three-week IF program resulted in an average weight loss of 3.67 kg and improved clinical parameters, regardless of initial BMI and gut microbiota status.

NPJ BIOFILMS AND MICROBIOMES (2023)

Article Ecology

Eighty million years of rapid evolution of the primate Y chromosome

Yang Zhou, Xiaoyu Zhan, Jiazheng Jin, Long Zhou, Juraj Bergman, Xuemei Li, Marjolaine Marie C. Rousselle, Meritxell Riera Belles, Lan Zhao, Miaoquan Fang, Jiawei Chen, Qi Fang, Lukas Kuderna, Tomas Marques-Bonet, Haruka Kitayama, Takashi Hayakawa, Yong-Gang Yao, Huanming Yang, David N. Cooper, Xiaoguang Qi, Dong-Dong Wu, Mikkel Heide Schierup, Guojie Zhang

Summary: A comparative analysis of Y chromosomes in 29 primate species reveals rapid evolution and different patterns of evolution in different regions. The Y chromosome plays a critical role in determining male sex and has unique sequence classes that have experienced distinct evolutionary trajectories. By generating and analyzing 19 new primate sex chromosome assemblies, along with 10 existing ones, this study reports the rapid evolution of the primate Y chromosome. Different primate lineages exhibit varying rates of gene loss, structural changes, and chromatin modifications on their Y chromosomes. The study also highlights the contribution of selection on Y-linked genes to the evolution of male traits across primates.

NATURE ECOLOGY & EVOLUTION (2023)

Article Biochemistry & Molecular Biology

Comparison of In-Frame Deletion, Homology-Directed Repair, and Prime Editing-Based Correction of Duchenne Muscular Dystrophy Mutations

Xiaoying Zhao, Kunli Qu, Benedetta Curci, Huanming Yang, Lars Bolund, Lin Lin, Yonglun Luo

Summary: Recent progress in CRISPR gene editing tools has expanded the possibilities for treating devastating genetic diseases. In this study, three methods of gene editing (NHBEJ, HDR, and PE) were compared for correcting loss-of-function mutations in Duchenne Muscular Dystrophy. The highest efficiency was achieved with NHBEJ, followed by HDR and PE2. The correction efficiency was increased with the use of PE3. This study demonstrates the potential for highly efficient correction of DMD mutations using CRISPR gene editing.

BIOMOLECULES (2023)

Article Multidisciplinary Sciences

A consortium of three-bacteria isolated from human feces inhibits formation of atherosclerotic deposits and lowers lipid levels in a mouse model

Zhuye Jie, Qian Zhu, Yuanqiang Zou, Qili Wu, Min Qin, Dongdong He, Xiaoqian Lin, Xin Tong, Jiahao Zhang, Zhu Jie, Wenwei Luo, Xiao Xiao, Shiyu Chen, Yonglin Wu, Gongjie Guo, Shufen Zheng, Yong Li, Weihua Lai, Huanming Yang, Jian Wang, Liang Xiao, Jiyan Chen, Tao Zhang, Karsten Kristiansen, Huijue Jia, Shilong Zhong

Summary: Through MWAS survey, a study found that individuals with ACVD have lower levels of Bacteroides cellulosilyticus, Faecalibacterium prausnitzii, and Roseburia intestinalis. Selected bacteria from healthy Chinese individuals were tested in Apoe(-/-) mice, showing that administration of these bacteria significantly improves cardiac function, reduces plasma lipid levels, and attenuates atherosclerotic plaque formation. Analysis of gut microbiota, plasma metabolome, and liver transcriptome revealed a beneficial modulation of the gut microbiota associated with a 7 alpha-dehydroxylation-LCA-FXR pathway. This study provides insights into the potential of specific bacteria for ACVD prevention and treatment.

ISCIENCE (2023)

Article Chemistry, Multidisciplinary

Integrated Human Skin Bacteria Genome Catalog Reveals Extensive Unexplored Habitat-Specific Microbiome Diversity and Function

Zhiming Li, Yanmei Ju, Jingjing Xia, Zhe Zhang, Hefu Zhen, Xin Tong, Yuzhe Sun, Haorong Lu, Yang Zong, Peishan Chen, Kaiye Cai, Zhen Wang, Huanming Yang, Jiucun Wang, Jian Wang, Yong Hou, Xin Jin, Tao Zhang, Wenwei Zhang, Xun Xu, Liang Xiao, Ruijin Guo, Chao Nie

Summary: This study used deep-shotgun sequencing to analyze 450 facial samples and 2069 publicly available skin metagenomic datasets, and constructed a Unified Human Skin Genome (UHSG) catalog containing 813 prokaryotic species. The core functions of the skin microbiome were described based on the UHSG, and differences in amino acid metabolism, carbohydrate metabolism, and drug resistance functions among different phyla were identified. Additionally, analysis of near-complete genomes revealed 1220 putative novel secondary metabolites. The UHSG provides a convenient reference database for studying the role of skin microorganisms in the skin.

ADVANCED SCIENCE (2023)

Article Biotechnology & Applied Microbiology

A catalog of bacterial reference genomes from cultivated human oral bacteria

Wenxi Li, Hewei Liang, Xiaoqian Lin, Tongyuan Hu, Zhinan Wu, Wenxin He, Mengmeng Wang, Jiahao Zhang, Zhuye Jie, Xin Jin, Xun Xu, Jian Wang, Huanming Yang, Wenwei Zhang, Karsten Kristiansen, Liang Xiao, Yuanqiang Zou

Summary: The study presents a Cultivated Oral Bacteria Genome Reference (COGR) consisting of 1089 high-quality genomes. COGR covers five phyla and contains 195 species-level clusters, with 315 genomes representing species with no taxonomic annotation. The oral microbiota differs between individuals, with person-specific clusters. The Streptococcus genus dominates COGR and many of these strains harbor quorum sensing pathways important for biofilm formation. Clusters containing unknown bacteria are enriched in individuals with rheumatoid arthritis, highlighting the importance of culture-based isolation for characterizing and exploiting oral bacteria.

NPJ BIOFILMS AND MICROBIOMES (2023)

Article Biology

Developmental dynamics of chromatin accessibility during post-implantation development of monkey embryos

Xi Dai, Honglian Shao, Nianqin Sun, Baiquan Ci, Jun Wu, Chuanyu Liu, Liang Wu, Yue Yuan, Xiaoyu Wei, Huanming Yang, Longqi Liu, Weizhi Ji, Bing Bai, Zhouchun Shang, Tao Tan

Summary: This study applied scATAC-seq technology to investigate the chromatin status of in vitro cultured cynomolgus monkey embryos. The findings provide insights into the chromatin reorganization and transcriptional regulatory mechanisms during early post-implantation development in primates, including the identification of regulatory factors and lineage specification.

GIGASCIENCE (2023)

Article Cell Biology

The complete and fully-phased diploid genome of a male Han Chinese

Chentao Yang, Yang Zhou, Yanni Song, Dongya Wu, Yan Zeng, Lei Nie, Panhong Liu, Shilong Zhang, Guangji Chen, Jinjin Xu, Hongling Zhou, Long Zhou, Xiaobo Qian, Chenlu Liu, Shangjin Tan, Chengran Zhou, Wei Dai, Mengyang Xu, Yanwei Qi, Xiaobo Wang, Lidong Guo, Guangyi Fan, Aijun Wang, Yuan Deng, Yong Zhang, Jiazheng Jin, Yunqiu He, Chunxue Guo, Guoji Guo, Qing Zhou, Xun Xu, Huanming Yang, Jian Wang, Shuhua Xu, Yafei Mao, Xin Jin, Jue Ruan, Guojie Zhang

Summary: Since the release of the complete human genome, efforts in human genomic study have shifted towards closing gaps in ethnic diversity. In this study, a fully phased diploid human genome from a Han Chinese male individual (CN1) was presented, achieving the telomere-to-telomere (T2T) level. Comparisons with the CHM13 haploid T2T genome revealed significant variations in the centromere and numerous novel structural variations outside the centromere. CN1 outperformed CHM13 as a reference genome for the East Asian population, impacting rare SNP calling and uncovering East Asian specific introgression sequences.

CELL RESEARCH (2023)

Article Ophthalmology

De Novo Mutations Contributes Approximately 7% of Pathogenicity in Inherited Eye Diseases

Wei Li, Xiang-Dong He, Zheng-Tao Yang, Dong-Ming Han, Yan Sun, Yan-Xian Chen, Xiao-Tong Han, Si-Cheng Guo, Yu-Ting Ma, Xin Jin, Huan-Ming Yang, Ya Gao, Zhuo-Shi Wang, Jian-Kang Li, Wei He

Summary: The aim of this study was to investigate the genetic characteristics and genotype-phenotype associations in a trio-based cohort of inherited eye diseases (IEDs). Through retrospective analysis of a large cohort of Chinese proband-parent trios, the researchers identified 108 IED-causative genes and found that the top 24 genes explained two-thirds of the genetically solved trios. The study also revealed the significant role of de novo mutations (DNMs) in IEDs and its association with paternal age at reproduction.

INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE (2023)

No Data Available