4.7 Article

Accounting for imperfect detection in data from museums and herbaria when modeling species distributions: combining and contrasting data-level versus model-level bias correction

Journal

ECOGRAPHY
Volume 44, Issue 9, Pages 1341-1352

Publisher

WILEY
DOI: 10.1111/ecog.05679

Keywords

Bayesian hierarchical model; citizen science; collection bias; occupancy model; phenology; species distribution model; specimen data

Funding

  1. Institute of Museum and Library Services [FAIN MG-30-15-0094-15]
  2. Alan Graham Fund in Global Change

Ask authors/readers for more resources

The study explores adapting occupancy models for biased opportunistic occurrence data using Anacardiaceae species in Florida as a case study. The best models incorporated background data filtration and collector covariates, with month, collection method, and previous collection of the focal species being important predictors of collection probability. Standardizing metadata associated with data collection will improve efforts for modeling the spatial distribution of various species.
The digitization of museum collections as well as an explosion in citizen science initiatives has resulted in a wealth of data that can be useful for understanding the global distribution of biodiversity, provided that the well-documented biases inherent in unstructured opportunistic data are accounted for. While traditionally used to model imperfect detection using structured data from systematic surveys of wildlife, occupancy models provide a framework for modelling the imperfect collection process that results in digital specimen data. In this study, we explore methods for adapting occupancy models for use with biased opportunistic occurrence data from museum specimens and citizen science platforms using seven species of Anacardiaceae in Florida as a case study. We explored two methods of incorporating information about collection effort to inform our uncertainty around species presence: 1) filtering the data to exclude collectors unlikely to collect the focal species and 2) incorporating collection covariates (collection type, time of collection and history of previous detections) into a model of collection probability. We found that the best models incorporated both the background data filtration step as well as collector covariates. Month, method of collection and whether a collector had previously collected the focal species were important predictors of collection probability. Efforts to standardize meta-data associated with data collection will improve efforts for modeling the spatial distribution of a variety of species.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available