Article

Sample size for the evaluation of presence-absence models

Journal

ECOLOGICAL INDICATORS
Volume 114

Publisher

ELSEVIER
DOI: 10.1016/j.ecolind.2020.106289

Keywords

AUC; Accuracy; ROC curve; Sample size; Sensitivity; Specificity

Funding

  1. Spanish Ramon y Cajal Program [RYC-2013-14441]
  2. Spanish Ministerio de Ciencia, Innovacion y Universidades [CGL2017-89000-P]

The size of the training dataset has been shown to have profound effects on the performance of species distribution models (SDMs). However, the effects that the size of the testing dataset can have on the assessment of a model's predictive capacity have received little attention. In this study, I used simulations to study how accurately two discrimination statistics, the AUC (the area under the receiver operating characteristic, or ROC, curve) and Se* (the probability of correctly classifying any case, calculated from the threshold that minimizes the difference between sensitivity and specificity), are estimated as a function of sample size. ROC curves with known discrimination ability were simulated, samples were randomly taken, the two discrimination statistics were estimated, and the differences between the two estimators and their respective true values were computed to understand how bias and precision were affected by sample size. In general, as sample size increased, the difference between reported and true discrimination capacity decreased. There were no important differences between the estimated AUC and Se* statistics in terms of bias and precision. Under realistic scenarios where the ROC points are not necessarily part of the true underlying ROC curve, the two discrimination statistics are both unbiased and equally precise, and the higher the true discrimination capacity, the more accurately they are estimated. A sample size of between 20 and 30 is a lower limit, since below this interval the accuracy of the estimates decreases considerably. Altogether, these results are very important because many interesting SDM applications involve rare and poorly known species for which sample sizes are unavoidably small.
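The kind of simulation described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the author's procedure: it assumes an equal-variance binormal model (absence scores drawn from N(0, 1), presence scores from N(d, 1)), balanced prevalence, and a hypothetical separation parameter `d`, none of which are stated in the abstract. Under those assumptions the true AUC is Phi(d / sqrt(2)) and the true Se* is Phi(d / 2). The function names (`empirical_auc`, `empirical_se_star`, `simulate`) are made up for this example, and taking Se* as the mean of sensitivity and specificity at the chosen threshold is also an assumption.

```python
import math
import numpy as np


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def empirical_auc(pos, neg):
    """Mann-Whitney estimate of the AUC: P(pos > neg) + 0.5 * P(tie)."""
    diff = pos[:, None] - neg[None, :]
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))


def empirical_se_star(pos, neg):
    """Accuracy at the threshold minimizing |sensitivity - specificity|.

    Assumption: Se* is reported as the mean of sensitivity and
    specificity at that threshold (they are nearly equal there).
    """
    thresholds = np.unique(np.concatenate([pos, neg]))
    se = np.array([(pos >= t).mean() for t in thresholds])
    sp = np.array([(neg < t).mean() for t in thresholds])
    best = int(np.argmin(np.abs(se - sp)))
    return float((se[best] + sp[best]) / 2.0)


def simulate(n, d, reps=200, seed=0):
    """Bias and spread of both estimators for a test set of size n.

    Assumed setup: balanced prevalence (n // 2 presences and absences)
    and an equal-variance binormal model with separation d.
    """
    rng = np.random.default_rng(seed)
    true_auc = normal_cdf(d / math.sqrt(2.0))
    true_se = normal_cdf(d / 2.0)
    half = n // 2
    auc_err, se_err = [], []
    for _ in range(reps):
        neg = rng.normal(0.0, 1.0, half)  # scores at absence sites
        pos = rng.normal(d, 1.0, half)    # scores at presence sites
        auc_err.append(empirical_auc(pos, neg) - true_auc)
        se_err.append(empirical_se_star(pos, neg) - true_se)
    return {
        "auc_bias": float(np.mean(auc_err)),
        "auc_sd": float(np.std(auc_err)),
        "se_bias": float(np.mean(se_err)),
        "se_sd": float(np.std(se_err)),
    }


# Compare a few test-set sizes for one assumed level of discrimination.
for n in (10, 20, 30, 100):
    print(n, simulate(n, d=1.0))
```

Running the loop for several values of `d` gives a rough feel for how the spread of the estimation error shrinks as the test sample grows. The numbers depend entirely on the assumed score distributions and should not be read as a reproduction of the paper's results.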

Recommended

Article Food Science & Technology

Amplification of 16S rDNA reveals important fish mislabeling in Madrid restaurants

Jose L. Horreo, Patrick S. Fitze, Alberto Jimenez-Valverde, Jorge Ari Noriega, Maria L. Pelaez

FOOD CONTROL (2019)

Article Zoology

Arthropod biodiversity patterns point to the Mesovoid Shallow Substratum (MSS) as a climate refugium

Enrique Ledesma, Alberto Jimenez-Valverde, Enrique Baquero, Rafael Jordana, Alberto de Castro, Vicente M. Ortuno

ZOOLOGY (2020)

Article Zoology

Niche differentiation between deeply divergent phylogenetic lineages of an endemic newt: implications for Species Distribution Models

Miguel Penalver-Alcazar, Alberto Jimenez-Valverde, Pedro Aragon

Summary: Intraspecific variation impacts the performance of species distribution models, with geographically structured phylogenetic lineages showing differences in predicted potential distribution and environmental factors. Model predictive capacity varies depending on the algorithm used, and lineages exhibit low niche overlap, occurring in different environmental niches. Improved predictivity is observed in lineage-level distribution models, with partial spatial agreement between niche overlap and reported secondary contact zones.

ZOOLOGY (2021)

Review Zoology

Diplura in caves: diversity, ecology, evolution and biogeography

Alberto Sendra, Ferran Palero, Alberto Jimenez-Valverde, Ana Sofia P. S. Reboleira

Summary: Diplurans are considered an exceptional group of animals adapted to cave environments, playing a crucial role in organic matter decomposition underground and being sensitive to human impacts. Comprehensive research is needed to fill knowledge gaps and aid in the conservation of cave ecosystems.

ZOOLOGICAL JOURNAL OF THE LINNEAN SOCIETY (2021)

Article Plant Sciences

Asian cave-adapted diplurans, with the description of two new genera and four new species (Arthropoda, Hexapoda, Entognatha)

Alberto Sendra, Ana Komericki, Josiane Lips, Yunxia Luan, Jesus Selfa, Alberto Jimenez-Valverde

Summary: This study identified two new genera and four new species of Diplura from caves in China and Myanmar, showcasing impressive morphological adaptation to cave ecosystems. The discovery highlights the biogeographical importance of biodiversity in East Asia, with vast karst regions still waiting to be explored and new species of diplurans waiting to be discovered.

EUROPEAN JOURNAL OF TAXONOMY (2021)

Article Biodiversity Conservation

Prevalence affects the evaluation of discrimination capacity in presence-absence species distribution models

Alberto Jimenez-Valverde

Summary: This study found that AUC and Se* are unbiased estimators and achieve the highest precision when prevalence is balanced. Increasing sample size leads to greater stability in the prevalence interval around 0.5. For sample sizes n <= 100, at least ten observations of the rare state should be considered, while a prevalence interval of [0.01, 0.99] is recommended for higher sample sizes.

BIODIVERSITY AND CONSERVATION (2021)

Review Biodiversity Conservation

Diversity, ecology, distribution and biogeography of Diplura

Alberto Sendra, Alberto Jimenez-Valverde, Jesus Selfa, Ana Sofia P. S. Reboleira

Summary: Diplura is a basal hexapod group related to insects, with unique entognathan mouthparts, widespread distribution in terrestrial habitats, and dependence on high humidity and moderate temperatures. They are potentially sensitive to anthropogenic pressures and climate change, making them important targets for ecophysiological studies and conservation efforts.

INSECT CONSERVATION AND DIVERSITY (2021)

Article Zoology

Cave-adapted campodeids (Hexapoda, Diplura, Campodeidae) from the Dinarides and adjacent karst regions

Alberto Sendra, Spela Borko, Alberto Jimenez-Valverde, Jesus Selfa, Marko Lukic, Kazimir Miculinic, Tonci Rada, Dragan Antic

Summary: This study describes five new cave-adapted campodeids species, highlighting the importance of the Dinarides karst region as a center of diversification for campodeids and cave animals in general. Four out of five subgenera were present in the region studied, with a monophyletic subgroup colonizing the Dinaric plate during the middle of the Cenozoic.

REVUE SUISSE DE ZOOLOGIE (2021)

Article Zoology

Climatic niche differences among Zootoca vivipara clades with different parity modes: implications for the evolution and maintenance of viviparity

J. L. Horreo, A. Jimenez-Valverde, P. S. Fitze

Summary: The research indicates that viviparous clades prefer habitats with less variable temperature and precipitation, while oviparous clades prefer habitats with more variable temperatures. Analysis using reproductive period climates confirms that viviparous lizards exhibit selfish behavior of mothers in relatively safer environments, giving them an adaptive advantage in survival.

FRONTIERS IN ZOOLOGY (2021)

Article Ecology

The uniform AUC: Dealing with the representativeness effect in presence-absence models

Alberto Jimenez-Valverde

Summary: This paper proposes a methodology to harmonize the distribution of suitability values in species distribution models, and harmonizes the area under the receiver operating characteristic curve (AUC) for comparison between different datasets. The author validates the method through simulations and empirical studies, and expects it to be useful in other research areas dealing with discrimination/classification problems.

METHODS IN ECOLOGY AND EVOLUTION (2022)

Article Plant Sciences

A new Diplura species from Georgia caves, Plusiocampa (Plusiocampa) imereti (Diplura, Campodeidae), with morphological and molecular data

Alberto Sendra, Ferran Palero, Alba Sanchez-Garcia, Alberto Jimenez-Valverde, Jesus Selfa, Eter Maghradze, Shalva Barjadze

Summary: The article describes a new dipluran species, Plusiocampa (Plusiocampa) imereti, discovered in caves in the Imereti region of Georgia, which is a new addition to four other known Diplura species in the Black Sea region. This study provides the first CO1 sequences for Plusiocampinae taxa and the first molecular data for cave-dwelling Plusiocampa species in the region.

EUROPEAN JOURNAL OF TAXONOMY (2021)

Article Biodiversity Conservation

rangemap: AN R PACKAGE TO EXPLORE SPECIES GEOGRAPHIC RANGES

Marlon E. Cobos, Vijay Barve, Narayani Barve, Alberto Jimenez-Valverde, Claudia Nunez-Penichet

Summary: This article introduces rangemap, an R package that provides tools for exploring species' geographic distributions. The package includes analysis tools and visualization tools that can generate simple summaries and figures of species' ranges. It also allows for the generation of extents of occurrence and areas of occupancy based on IUCN criteria.

BIODIVERSITY INFORMATICS (2022)

Article Ecology

Deconstructing the abundance-suitability relationship in species distribution modelling

Alberto Jimenez-Valverde, Pedro Aragon, Jorge M. Lobo

Summary: Estimating local suitability with species distribution models (SDMs) can indicate the maximum abundance attainable by species, but the abundance-suitability relationship is typically wedge-shaped. The shape of this relationship is directly related to maximum abundance and is influenced by SDM quality and species prevalence.

GLOBAL ECOLOGY AND BIOGEOGRAPHY (2021)

Article Biodiversity Conservation

Climate data source matters in species distribution modelling: the case of the Iberian Peninsula

Alberto Jimenez-Valverde, Marta Rodriguez-Rey, Pablo Pena-Aguilera

Summary: This study compared global WorldClim v.2 database (WC) with regional Iberian Climate Atlas (ICA) in the geographical context of the Iberian Peninsula, focusing on differences in climatic variables and their impact on woody plant distribution models. Significant discrepancies were found in precipitation values between the two databases, while temperature values also showed noticeable differences, especially in high elevation areas. The source of climate data influenced estimated suitability values, discrimination capacity, and the importance of variables in the distribution models. Additionally, the rarity of species was associated with increased uncertainty related to the climate data source.

BIODIVERSITY AND CONSERVATION (2021)

Article Biodiversity Conservation

Identification of critical ecological restoration and early warning regions in the five-lakes basin of central Yunnan

Yongcui Lan, Jinliang Wang, Qianwei Liu, Fang Liu, Lanfang Liu, Jie Li, Mengjia Luo

Summary: This study focuses on the five major plateau lake basins in central Yunnan, China, and constructs an ecological security pattern using the source-resistance surface-corridor-pinch point framework. The study simulates land use/cover change in the region and identifies early warning regions where future urban expansion poses a threat to current ecological source areas and corridors.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

Active microeukaryotes hold clues of effects of global warming on benthic diversity and connectivity in the coastal sediments

Pingping Huang, Feng Zhao, Bailing Zhou, Kuidong Xu

Summary: This study investigates the distribution of benthic microeukaryotes in the China Seas and finds that they can stride over the ecological barrier of 32 degrees N. The study also highlights the significant influence of depth, temperature, and latitude on communities in the China Seas.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

Which bird traits most affect the goodness-of-fit of species distribution models?

Federico Morelli, Yanina Benedetti, Jesse Stanford, Leszek Jerzak, Piotr Tryjanowski, Paolo Perna, Riccardo Santolini

Summary: Species distribution models (SDMs) are numerical tools used for predicting species' spatial distribution. This study found that ecological characteristics, such as habitat specialization, play a role in improving the accuracy of SDMs.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

Exploring the spatiotemporal evolution dynamic and influencing factor of green ecology transition for megacities: A case study of Chengdu, China

Xiaoxuan Wu, Hang Liu, Wei Liu

Summary: Global climate change, urbanization, and economic development have increased the need for sustainable human development, urban ecological governance, and low-carbon energy transformation. This study analyzes the green ecological transition in Chengdu based on panel data from 2010 to 2020, exploring its spatiotemporal evolution and key factors. The results show an overall upward trend in Chengdu's green ecological development and positive spatial autocorrelation in certain districts.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

A multi-indicator approach to compare the sustainability of organic vs. integrated management of grape production

Castaldi Simona, Formicola Nicola, Mastrocicco Micol, Morales Rodriguez Carmen, Morelli Raffaella, Prodorutti Daniele, Vannini Andrea, Zanzotti Roberto

Summary: Sustainable agricultural practices are increasingly important for global and national environmental policies and economy. This study compared the sustainability of grape production under integrated and organic management using multiple indicators. The results showed that organic management was more beneficial for most environmental aspects of the agroecosystem compared to integrated management, without affecting grape yield.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

Comparing ground below-canopy and satellite spectral data for an improved and integrated forest phenology monitoring system

Gaia Vaglio Laurin, Alexander Cotrina-Sanchez, Luca Belelli-Marchesini, Enrico Tomelleri, Giovanna Battipaglia, Claudia Cocozza, Francesco Niccoli, Jerzy Piotr Kabala, Damiano Gianelle, Loris Vescovo, Luca Da Ros, Riccardo Valentini

Summary: Phenology monitoring is important for understanding forest functioning and climate impacts. This research compares the phenological behavior of European beech forests using Tree-Talker (TT+) and Sentinel 2 satellite data. The study finds differences in the information derived by the two sensor types, particularly in terms of season length, phenology changepoints, and leaf period variability. TT+ with its higher temporal resolution demonstrates precision in capturing the phenological changepoints, especially when satellite image availability is limited.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

Assessing the coupling coordination dynamics between land use intensity and ecosystem services in Shanxi's coalfields, China

Huanhuan Pan, Ziqiang Du, Zhitao Wu, Hong Zhang, Keming Ma

Summary: The land use and cover changes resulting from coal mining activities and ecological restoration have had a significant impact on ecosystem services in mining areas. This study investigates the relationship between ecosystem services and land use intensity in coal mining areas, emphasizing the importance of understanding this interdependence for balanced human-land system development. The research examines the evolving relationship across different reclamation stages in Shanxi, China, using a coupling coordination degree model. The findings suggest the need for timely and judicious reclamation of coalfields, considering the land's bearing capacity.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

An investigation on the impact of blue and green spatial pattern alterations on the urban thermal environment: A case study of Shanghai

Jingjuan He, Yijun Shi, Lihua Xu, Zhangwei Lu, Mao Feng

Summary: This study examines the spatial interplay between changes in the blue-green spatial distribution and modifications in land surface temperature grades in Shanghai. The findings reveal that the transformation of the blue-green spatial pattern differs between different sectors of the city, and the impact on the thermal environment varies spatially.

ECOLOGICAL INDICATORS (2024)

Article Biodiversity Conservation

Prediction of phytoplankton biomass and identification of key influencing factors using interpretable machine learning models

Yi Xu, Di Zhang, Junqiang Lin, Qidong Peng, Xiaohui Lei, Tiantian Jin, Jia Wang, Ruifang Yuan

Summary: This study analyzed the response relationship between phytoplankton growth and water environmental parameters in the Middle Route of the South-to-North Water Diversion Project in China using long-term monitoring data and machine learning models. The results revealed the differences between monitoring sites and identified the key parameters that affect phytoplankton growth.

ECOLOGICAL INDICATORS (2024)