4.6 Article

A method for measuring the relative information content of data from different monitoring protocols

Journal

METHODS IN ECOLOGY AND EVOLUTION
Volume 1, Issue 3, Pages 263-273

Publisher

WILEY
DOI: 10.1111/j.2041-210X.2010.00035.x

Keywords

cross-data validation; data efficiency ratio; data quality; eBird; North American Breeding Bird Survey; species distribution model

Categories

Funding

  1. Leon Levy Foundation
  2. Wolf Creek Foundation
  3. National Science Foundation [ITR-0427914, DBI-0542868, DUE-0734857, IIS-0612031, IIS-074826, IIS-0832782]
  4. Direct For Computer & Info Scie & Enginr
  5. Division Of Computer and Network Systems [0832782] Funding Source: National Science Foundation

Ask authors/readers for more resources

1. Species monitoring is an essential component of assessing conservation status, predicting effects of habitat change and establishing management and conservation priorities. The pervasive access to the Internet has led to the development of several extensive monitoring projects that engage massive networks of volunteers who provide observations following relatively unstructured protocols. However, the value of these data is largely unknown. 2. We develop a novel cross-data validation method for measuring the value of survey data from one source (e. g. an Internet checklist program) relative to a second, benchmark data source. The method fits a model to the data of interest and validates the model using benchmark data, allowing us to isolate the training data's information content from its biases. We also define a data efficiency ratio to quantify the relative efficiency of the data sources. 3. We apply our cross-data validation method to quantify the value of data collected in eBird - a western hemisphere, year-round citizen science bird checklist project - relative to data from the highly standardized North American Breeding Bird Survey (BBS). The results show that eBird data contain information similar in quality to that in BBS data, while the information per BBS datum is higher. 4. We suggest that these methods have more general use in evaluating the suitability of sources of data for addressing specific questions for taxa of interest.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available