The continuing challenges of testing species distribution models

I. P. Vaughan; S. J. Ormerod

ArticleOPEN ACCESS

The continuing challenges of testing species distribution models

Journal of Applied Ecology

DOI: 10.1111/j.1365-2664.2005.01052.x

259Citations

741Readers

Abstract

1. Species distribution models could bring manifold benefits across ecology, but require careful testing to prove their reliability and guide users. Shortcomings in testing are often evident, failing to reflect recent methodological developments and changes in the way models are applied. We considered some of the fundamental issues. 2. Generalizability is a basic requirement for predictive models, describing their capacity to produce accurate predictions with new data, i.e. in real applications beyond model training. Tests of generalizability should be as rigorous as possible: ideally using a large number of independent test sites (≥ 200-300) that represent anticipated applications. Bootstrapping identifies the role of overfitting of the training data in limiting a model's generalizability. 3. Predictions from most distribution models are continuous variables. Their accuracy may be described by discrimination and calibration components. Discriminatory ability describes how well a model separates occupied from unoccupied sites. It is independent of species prevalence and is readily comparable between models. Rank correlation coefficients, such as the concordance index, are effective measures. 4. Calibration describes the numerical accuracy of predictions (e.g. whether 40% of sites with predicted probabilities of 0·40 are occupied) but is frequently overlooked in model testing. Poor calibration could mislead any conservation efforts utilizing models to estimate the 'value' of different sites for a given species. Effective assessments can be made using smoothed calibration plots. 5. The effects of species prevalence on nominal presence-absence predictions are well known. The currently preferred accuracy measure, Cohen's κ, has weaknesses. We argue that mutual information measures, based in information theory, may be more appropriate. 6. Synthesis and applications. Model evaluation must be informative and should ideally: (i) define generalizability in detail; (ii) separate the discrimination and calibration components of accuracy and test both; (iii) adopt assessment techniques that permit more valid intermodel comparisons; (iv) avoid nominal presence-absence evaluation where possible and consider information-theoretic measures; and (v) utilize the full range of techniques to help diagnose the causes of prediction problems. Few modellers in applied ecology and conservation biology satisfy these needs, making it difficult for others to evaluate models and identify potential misuses. The problems are real, and if uncorrected will damage conservation efforts through the inaccurate assessment of distribution and habitat preferences of important organisms. © 2005 British Ecological Society.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Vaughan, I. P., & Ormerod, S. J. (2005, August). The continuing challenges of testing species distribution models. Journal of Applied Ecology. https://doi.org/10.1111/j.1365-2664.2005.01052.x

Readers' Seniority

PhD / Post grad / Masters / Doc 286

51%

Researcher 205

36%

Professor / Associate Prof. 57

10%

Lecturer / Post doc 16

Readers' Discipline

Agricultural and Biological Sciences 357

61%

Environmental Science 184

32%

Earth and Planetary Sciences 31

Computer Science 9

Article Metrics

Mentions

References: 2

View details >

The continuing challenges of testing species distribution models

Abstract

Author supplied keywords

References Powered by Scopus

The measurement of observer agreement for categorical data

The meaning and use of the area under a receiver operating characteristic (ROC) curve

Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors

Cited by Powered by Scopus

Assessing the accuracy of species distribution models: Prevalence, kappa and the true skill statistic (TSS)

AUC: A misleading measure of the performance of predictive distribution models

Mapping species distributions: Spatial inference and prediction

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline

Article Metrics