Published in

Elsevier, Ecological Modelling, 2-3(157), p. 249-259, 2002

DOI: 10.1016/s0304-3800(02)00198-9

Links

Tools

Export citation

Search in Google Scholar

Should data be partitioned spatially before building large-scale distribution models?

Journal article published in 2002 by Patrick E. Osborne ORCID, Susana Suarez-Seoane
This paper was not found in any repository, but could be made available legally by the author.
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

There is growing interest in building predictive models of species distributions over large geographic areas. As larger areas are modelled, however, it is highly likely that heterogeneity in the predictors variable increases and that areas are included where animals respond to habitats in different ways, for example, due to social status. These effects (spatial non-stationary) may weaken model performance. This paper explores whether data partitioning prior to analysis can improve the tit of models and provide ecological insight into distribution patterns. Data on three bird species were modelled for the whole of Spain at 1 km(2) resolution using logistic regression analysis. Data were partitioned into geographic quarters, concentric rings around the centroid of the distribution, and into random samples for comparison. In all cases, data partitioning produced better models as assessed by Receiver Operating Characteristic curve (AUC) statistics than analysis of the global data set. Inclusion of latitude and longitude improved the global models only when added as smoothed splines but produced different probabilities to the partitioned data. Geographic partitioning is a very crude local modelling approach and we suggest that some form of geographically-weighted regression could offer the best solution to large-scale modelling but is computationally intensive on Geographical Information Systems (GIs) data. It is concluded that simple partitioning by geographic quarters may detect spatial non-stationary and alert the modeller to possible problems; that partitioning into more novel arrangements may be used to test ecological hypotheses; and that data should not be partitioned spatially to build and test models if non-stationary is suspected.