4.5 Article

Bacterial species identification from MALDI-TOF mass spectra through data analysis and machine learning

期刊

SYSTEMATIC AND APPLIED MICROBIOLOGY
卷 34, 期 1, 页码 20-29

出版社

ELSEVIER GMBH
DOI: 10.1016/j.syapm.2010.11.003

关键词

MALDI-TOF MS; Bacteria; Species; Identification; Data processing; Machine learning techniques; Leuconostoc; Fructobacillus; Lactococcus

资金

  1. Federal Research Policy [C3/00/17]
  2. Belgian Science Policy [C3/00/12, IAP VI-PAI VI/06]
  3. Research Foundation - Flanders

向作者/读者索取更多资源

At present, there is much variability between MALDI-TOF MS methodology for the characterization of bacteria through differences in e.g., sample preparation methods, matrix solutions, organic solvents, acquisition methods and data analysis methods. After evaluation of the existing methods, a standard protocol was developed to generate MALDI-TOF mass spectra obtained from a collection of reference strains belonging to the genera Leuconostoc. Fructobacillus and Lactococcus. Bacterial cells were harvested after 24 h of growth at 28 degrees C on the media MRS or TSA. Mass spectra were generated, using the CHCA matrix combined with a 50:48:2 acetonitrile:water:trifluoroacetic acid matrix solution, and analyzed by the cell smear method and the cell extract method. After a data preprocessing step, the resulting high quality data set was used for PCA, distance calculation and multi-dimensional scaling. Using these analyses, species-specific information in the MALDI-TOF mass spectra could be demonstrated. As a next step, the spectra, as well as the binary character set derived from these spectra, were successfully used for species identification within the genera Leuconostoc, Fructobacillus, and Lactococcus. Using MALDI-TOF MS identification libraries for Leuconostoc and Fructobacillus strains, 84% of the MALDI-TOF mass spectra were correctly identified at the species level. Similarly, the same analysis strategy within the genus Lactococcus resulted in 94% correct identifications, taking species and subspecies levels into consideration. Finally, two machine learning techniques were evaluated as alternative species identification tools. The two techniques, support vector machines and random forests, resulted in accuracies between 94% and 98% for the identification of Leuconostoc and Fructobacillus species, respectively. (C) 2011 Elsevier GmbH. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据