4.6 Article

Determining Informative Microbial Single Nucleotide Polymorphisms for Human Identification

期刊

出版社

AMER SOC MICROBIOLOGY
DOI: 10.1128/aem.00052-22

关键词

hidSkinPlex; skin microbiome; microbial forensics; human identification; massively parallel sequencing; machine learning; multinomial logistic regression; Wright's fixation index

资金

  1. National Institute of Justice [2015-NE-BX-K006, 2020-R2-CX-0046]

向作者/读者索取更多资源

This study utilized SNPs from the skin microbiome for human identification, developing a machine learning framework that achieved 96% accuracy in classifying unknown samples in the test dataset, and predicted the correct host with 95% accuracy using the hidSkinPlex+ panel.
The skin microbiome is a highly abundant and relatively stable source of DNA that may be utilized for human identification (HID). In this study, a set of single nucleotide polymorphisms (SNPs) with a high mean estimated Wright's fixation index (F-ST) (>0.1) and widespread abundance (found in >= 75% of samples compared) were selected from a diverse set of markers in the hidSkinPlex panel. The least absolute shrinkage and selection operator (LASSO) was used in a novel machine learning framework to generate a SNP panel and predict the human host from skin microbiome samples collected from the hand, manubrium, and foot. The framework was devised to emulate a new unknown person introduced to the algorithm and to match samples from that person against a population database. Unknown samples were classified with 96% accuracy (Matthews correlation coefficient [MCC], 0.954) in the test (n = 225 samples) data set. A final panel of informative SNPs was determined for HID (hidSkinPlex+) using all 51 individuals sampled at three body sites in triplicate. The hidSkinPlex+ panel comprises 365 SNPs and yielded prediction accuracy for the correct host of 95% (MCC = 0.949). The accuracy of the hidSkinPlex+ panel may be somewhat overestimated due to using 26 individuals from the training data set for the selection of the final panel. However, this accuracy still provides an indication of performance when tested on new samples. IMPORTANCE One of the fundamental goals in forensic genetics is to identify the source of biological evidence. Methods for detecting human DNA have advanced and can be quite sensitive, but not all DNA samples are amenable to current methods. However, the human skin microbiome is a source of DNA with high copy numbers, and it has the potential for high discriminatory power. The hidSkinPlex panel has been used for HID; however, some aspects of it could be improved. Missing information is ambiguous, as it is unclear if marker drop-out is a by-product of a low-template sample or if the reasons for not observing a marker are biological. Such ambiguity may confound methods for HID, and as such, an improved marker set (hidSkinPlex+) was designed that is considerably smaller and more robust to drop-out (365 SNPs contained in 135 markers) yet still can be used to accurately predict the human host. One of the fundamental goals in forensic genetics is to identify the source of biological evidence. Methods for detecting human DNA have advanced and can be quite sensitive, but not all DNA samples are amenable to current methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Medicine, Legal

Copan microFLOQ® Direct Swab collection of bloodstains, saliva, and semen on cotton cloth

Allison J. Sherier, Rachel E. Kieser, Nicole M. M. Novroski, Frank R. Wendt, Jonathan L. King, August E. Woerner, Angie Ambers, Paolo Garofano, Bruce Budowle

INTERNATIONAL JOURNAL OF LEGAL MEDICINE (2020)

Article Biotechnology & Applied Microbiology

Population Informative Markers Selected Using Wright's Fixation Index and Machine Learning Improves Human Identification Using the Skin Microbiome

Allison J. Sherier, August E. Woerner, Bruce Budowle

Summary: Microbial DNA shed from human skin can be distinctive to its host, aiding in individualizing donors of forensic biological evidence. Genetic differentiation may be more suitable for individual identification.

APPLIED AND ENVIRONMENTAL MICROBIOLOGY (2021)

暂无数据