Maximizing citizen scientists’ contribution to automated species recognition

Koch, Wouter; Hogeweg, Laurens; Nilsen, Erlend B.; Finstad, Anders G.

Published in

Nature Research, Scientific Reports, 1(12), 2022

DOI: 10.1038/s41598-022-11257-x

Tools

Export citation

Search in Google Scholar

Maximizing citizen scientists’ contribution to automated species recognition

Journal article published in 2022 by Wouter Koch

, Laurens Hogeweg

, Erlend B. Nilsen

, Anders G. Finstad

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractTechnological advances and data availability have enabled artificial intelligence-driven tools that can increasingly successfully assist in identifying species from images. Especially within citizen science, an emerging source of information filling the knowledge gaps needed to solve the biodiversity crisis, such tools can allow participants to recognize and report more poorly known species. This can be an important tool in addressing the substantial taxonomic bias in biodiversity data, where broadly recognized, charismatic species are highly over-represented. Meanwhile, the recognition models are trained using the same biased data, so it is important to consider what additional images are needed to improve recognition models. In this study, we investigated how the amount of training data influenced the performance of species recognition models for various taxa. We utilized a large citizen science dataset collected in Norway, where images are added independently from identification. We demonstrate that while adding images of currently under-represented taxa will generally improve recognition models more, there are important deviations from this general pattern. Thus, a more focused prioritization of data collection beyond the basic paradigm that “more is better” is likely to significantly improve species recognition models and advance the representativeness of biodiversity data.

Published in

Links

Tools

Maximizing citizen scientists’ contribution to automated species recognition

Abstract