Published in

Cell Press, Trends in Ecology and Evolution, 6(29), p. 302-303

DOI: 10.1016/j.tree.2014.03.004

Links

Tools

Export citation

Search in Google Scholar

Data availability and model complexity, generality, and utility: a reply to Lonergan

This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Red circle
Preprint: archiving forbidden
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Lonergan [1] suggests that the claim made in our recent paper [2], that 'viewing simple models as the main way to achieve generality may be an obstacle to the progress of ecological research', will depend on the availability of data. We have discussed this issue in a recent article [3] in which we acknowledge that to obtain more robust predictions and better understanding of ecosystem processes there needs to be 'greater emphasis on constraining models with data and on hypothesis testing within models'. In that paper we also point out how the ability of ecologists to collect data has been revolutionised by technological advances. For exam-ple, in the past few days it has been shown how southern right whales can be surveyed by satellite [4]. Ecologists are taught that the path to generality lies via simple models. Our article [2] was aimed at providing a philosophically informed challenge to this view and sug-gested that the perceived generality of simple models can be superficial because as models become simpler they lose the ability to provide actual instances of explanation. They become instead demonstrations of possible explanations. In our paper we emphasised that this function of simple models is extremely useful. Our main argument was that the maxim 'simple = general = good' should not be accepted unquestioningly by ecological modellers. It will be the case that additional complexity will mean that more data are required for calibration, but one point of agreement among ecologists is that the real world is com-plex. This complexity does not simply vanish if one reduces multiple processes down to a summary variable: it has only been hidden [5]. For example, a term such as the intrinsic population growth rate in a logistic equation is a single parameter that can be estimated from data and uses up few degrees of freedom (potentially only one). However, such a portmanteau parameter gives little understanding of the processes that underlie population growth. For this it is necessary to decompose this one parameter into several. Both approaches are useful, but let us not pretend that we have fully understood the system if we have reduced everything down to the simplest model we can devise. If we take complexity to be the number of parameters estimated, then as complexity increases model fit to data cannot get worse. However, we dispute the suggestion that this makes our argument a tautology. Our claim was that the reason why simple models are perceived as having more generality than complex models is because the stan-dard of fidelity used to assess model fit is lower for simple models than for complex models. This results in them appearing to have the ability to fit a greater range of systems while not actually fitting any of them well. Note that our use of 'fidelity' refers to 'standards theorists use to evaluate a model's ability to represent phenomena' ([6], p. 39) rather than to how close the predictions of each model actually are to the data fitted to it, as claimed by Lonergan [1]. The final test of a model is its ability to make predictions about observations unused in model design and parameterisation. Developing complex models will often be demanding of both logical thought and data. One of the most useful products of a modelling exercise is to demonstrate poten-tially important mechanisms where data are lacking. A good example of this can be seen in the development of climate models, which have become progressively more complex with time and actually have wider confidence intervals now than in the past [7]. The increasing com-plexity has resulted from greater understanding of the climate system and a reduction in the number of simplifying assumptions. The desire to make climate models reflect the complexity of the climate system has stimulated research into the terrestrial carbon cycle, the role of aerosols, and the roles of clouds and ground-to-atmosphere feedback. Would these phenomena have been subjected to such active research without models that required their quantification? Lonergan offers a counsel of despair: complex models cannot be tested against data unless many data are avail-able. He infers that the search for general patterns across ecology will often be consigned to simple approaches. The result would be stagnation in the progress of ecology. We think that we need to embrace the complexity of the real world and reflect this in our models, and if necessary collect the appropriate data. Thirty years ago Medawar wrote that what 'sets the genuine sciences apart from those that arrogate to themselves the title without really earning