Full text: Download
Improved agricultural production systems, together with increased grain yield, are essential to feed the growing global population in the 21st century. Global gridded crop models (GGCMs) have been extensively used to assess crop production and yield simulation on a large geographical scale. However, GGCMs are less effective when they are used on a finer scale, significantly limiting the precision in capturing the yearly maize yield. To address this issue, we propose a relatively more advanced approach that downsizes GGCMs by combining machine learning and crop modeling to enhance the accuracy of maize yield simulations on a regional scale. In this study, we combined the random forest algorithm with multiple data sources, trained the algorithm on low-resolution maize yield simulations from GGCMs, and applied it to a finer spatial resolution on a regional scale in China. We evaluated the performance of the eight GGCMs by utilizing a total of 1046 county-level maize yield data available over a 30-year period (1980–2010). Our findings reveal that the downscaled models created for maize yield simulations exhibited a remarkable level of accuracy (R2 ≥ 0.9, MAE < 0.5 t/ha, RMSE < 0.75 t/ha). The original GGCMs performed poorly in simulating county-level maize yields in China, and the improved GGCMs in our study captured an additional 17% variability in the county-level maize yields in China. Additionally, by optimizing nitrogen management strategies, we identified an average maize yield gap at the county level in China ranging from 0.47 to 1.82 t/ha, with the south maize region exhibiting the highest yield gap. Our study demonstrates the high effectiveness of machine learning methods for the spatial downscaling of crop models, significantly improving GGCMs’ performance in county-level maize yield simulations.