4.8 Article

Unknown Metabolite Identification Using Machine Learning Collision Cross-Section Prediction and Tandem Mass Spectrometry

期刊

ANALYTICAL CHEMISTRY
卷 95, 期 2, 页码 1047-1056

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.2c03749

关键词

-

向作者/读者索取更多资源

Ion mobility spectrometry (IM) provides valuable data for identifying unknown metabolites in non-targeted metabolomics. This study presents a workflow using de novo molecular formula annotation, MS/MS structure elucidation, and machine learning predictions to identify differential unknown metabolites in Caenorhabditis elegans mutant strains. However, the performance of this approach is limited by instrumentation and data analysis challenges, resulting in a relatively low success rate in filtering candidate structures.
Ion mobility (IM) spectrometry provides semi orthogonal data to mass spectrometry (MS), showing promise for identifying unknown metabolites in complex non-targeted metabolomics data sets. While current literature has showcased IM-MS for identifying unknowns under near ideal circumstances, less work has been conducted to evaluate the performance of this approach in metabolomics studies involving highly complex samples with difficult matrices. Here, we present a workflow incorporating de novo molecular formula annotation and MS/MS structure elucidation using SIRIUS 4 with experimental IM collision cross-section (CCS) measurements and machine learning CCS predictions to identify differential unknown metabolites in mutant strains of Caenorhabditis elegans. For many of those ion features, this workflow enabled the successful filtering of candidate structures generated by in silico MS/MS predictions, though in some cases, annotations were challenged by significant hurdles in instrumentation performance and data analysis. While for 37% of differential features we were able to successfully collect both MS/MS and CCS data, fewer than half of these features benefited from a reduction in the number of possible candidate structures using CCS filtering due to poor matching of the machine learning training sets, limited accuracy of experimental and predicted CCS values, and lack of candidate structures resulting from the MS/MS data. When using a CCS error cutoff of +/- 3%, on average, 28% of candidate structures could be successfully filtered. Herein, we identify and describe the bottlenecks and limitations associated with the identification of unknowns in non-targeted metabolomics using IM-MS to focus and provide insights into areas requiring further improvement.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据