Published in

Wiley, Journal of Agricultural Economics, 1(74), p. 316-323, 2022

DOI: 10.1111/1477-9552.12505

Links

Tools

Export citation

Search in Google Scholar

A note on synthetic data for replication purposes in agricultural economics

Journal article published in 2022 by Stefan Wimmer ORCID, Robert Finger ORCID
This paper was not found in any repository, but could be made available legally by the author.
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

AbstractEmpirical studies in agricultural economics usually involve policy implications. In many cases, such studies rely on proprietary or confidential data that cannot be published along with the article, challenging the replicability and credibility of the results. To overcome this problem, the use of synthetic data—that is, data that do not contain a single unit of the original data—has been proposed. In this note, we illustrate the utility of synthetic data generation methods for replication purposes using a range of methods from agricultural production analysis. More specifically, we compare input elasticities and technical efficiency scores based on different farm‐level production data between original data and synthetic data. We generate synthetic data using a non‐parametric method of classification and regression trees (CART) and parametric linear regressions. We find synthetic data result in elasticities and technical efficiency distributions that are very similar to the original data, especially when generated with CART, and conclude with implications for the research community.