4.7 Article

The effect of sampling from subdivided populations on species identification with DNA barcodes using a Bayesian statistical approach

Journal

MOLECULAR PHYLOGENETICS AND EVOLUTION
Volume 65, Issue 2, Pages 765-773

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ympev.2012.07.033

Keywords

Geographical sampling; Population subdivision; Incomplete reference database; Isolation by distance; Segregating sites; DNA barcoding

Funding

  1. NSERC
  2. Government of Canada through Genome Canada
  3. Ontario Genomics Institute [OGI-050]

Ask authors/readers for more resources

Barcoding is an initiative to define a standard fragment of DNA to be used to assign sequences of unknown origin to existing known species whose sequences are recorded in databases. This is a difficult task when species are closely related and individuals of these species might have more than one origin. Using a previously introduced Bayesian statistical tree-less assignment algorithm based on segregating sites, we examine how it functions in the presence of hidden population subdivision with closely related species using simulations. Not surprisingly, adding samples to the database from a greater proportion of the species range leads to a consistently higher number of accurate results. Without such samples, query sequences that originate from outside of the sampled range are easily misinterpreted as coming from other species. However, we show that even the addition of a single sample from a different subpopulation is sufficient to greatly increase the probability of placement of unknown queries into the correct species group. This study highlights the importance of broad sampling, even with five reference samples per species, in the creation of a reference database. (C) 2012 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available