nextGEMS: entering the era of kilometer-scale Earth system modeling

Geoscientific Model Development Copernicus Publications 18:20 (2025) 7735-7761

Authors:

Hans Segura, Xabier Pedruzo-Bagazgoitia, Philipp Weiss, Sebastian K M眉ller, Thomas Rackow, Junhong Lee, Edgar Dolores-Tesillos, Imme Benedict, Matthias Aengenheyster, Razvan Aguridan, Gabriele Arduini, Alexander J Baker, Jiawei Bao, Swantje Bastin, Eul脿lia Baulenas, Tobias Becker, Sebastian Beyer, Hendryk Bockelmann, Nils Br眉ggemann, Lukas Brunner, Suvarchal K Cheedela, Sushant Das, Jasper Denissen, Ian Dragaud, Piotr Dziekan, Madeleine Ekblom, Jan Frederik Engels, Monika Esch, Richard Forbes, Claudia Frauen, Lilli Freischem, Diego Garc铆a-Maroto, Philipp Geier, Paul Gierz, 脕lvaro Gonz谩lez-Cervera, Katherine Grayson, Matthew Griffith, Oliver Gutjahr, Helmuth Haak, Ioan Hadade, Kerstin Haslehner, Shabeh ul Hasson, Jan Hegewald, Lukas Kluft, Aleksei Koldunov, Nikolay Koldunov, Tobias K枚lling, Shunya Koseki, Sergey Kosukhin, Josh Kousal

Abstract:

Abstract. The Next Generation of Earth Modeling Systems (nextGEMS) project aimed to produce multidecadal climate simulations, for the first time, with resolved kilometer-scale (km-scale) processes in the ocean, land, and atmosphere. In only 3 years, nextGEMS achieved this milestone with the two km-scale Earth system models, ICOsahedral Non-hydrostatic model (ICON) and Integrated Forecasting System coupled to the Finite-volumE Sea ice-Ocean Model (IFS-FESOM). nextGEMS was based on three cornerstones: (1)聽developing km-scale Earth system models with small errors in the energy and water balance, (2)聽performing km-scale climate simulations with a throughput greater than 1 simulated year per day, and (3)聽facilitating new workflows for an efficient analysis of the large simulations with common data structures and output variables. These cornerstones shaped the timeline of nextGEMS, divided into four cycles. Each cycle marked the release of a new configuration of ICON and IFS-FESOM, which were evaluated at hackathons. The hackathon participants included experts from climate science, software engineering, and high-performance computing as well as users from the energy and agricultural sectors. The continuous efforts over the four cycles allowed us to produce 30-year simulations with ICON and IFS-FESOM, spanning the period 2020鈥2049 under the SSP3-7.0 scenario. The throughput was about 500 simulated days per day on the Levante supercomputer of the German Climate Computing Center (DKRZ). The simulations employed a horizontal grid of about 5 km resolution in the ocean and 10 km resolution in the atmosphere and land. Aside from this technical achievement, the simulations allowed us to gain new insights into the realism of ICON and IFS-FESOM. Beyond its time frame, nextGEMS builds the foundation of the Climate Change Adaptation Digital Twin developed in the Destination Earth initiative and paves the way for future European research on climate change.

Southern Annular Mode persistence and westerly jet: a reassessment using high-resolution global models

Weather and Climate Dynamics Copernicus Publications 6:4 (2025) 1179-1193

Authors:

TC Chen, H Goosse, C Davrinche, S Libera, C Roberts, M Aengenheyster, K Strommen, M Roberts, R Ghosh, JS Von Storch

Abstract:

This study evaluates the performance of high-resolution (grid sizes of 9-28 km for the atmosphere; 5-13 km for the ocean) global simulations from the EERIE project in representing the persistence of the Southern Annular Mode (SAM), a leading mode of Southern Hemisphere climate variability. Using the decorrelation timescale of the SAM index (蟿), we compare EERIE simulations with CMIP6 models and ERA5 reanalysis. EERIE simulations reduce long-standing biases in SAM persistence, especially in early summer, with 蟿 values of 9-20 d compared to CMIP6's 9-32 d and ERA5's 11 d. This improvement correlates with a more accurate climatological jet latitude (位0). EERIE atmosphere-only AMIP runs outperform the coupled simulations in both 蟿 and 位0, showing smaller biases and ranges of variability, underscoring the critical role of sea surface temperature (SST) representation in shaping atmospheric circulation. In these AMIP experiments, the atmospheric eddy feedback strength, combined with the damping timescale estimated via friction, correlates more strongly with 蟿 than 位0. We speculate that the well-captured jet position (biases < 1掳 relative to ERA5), due to prescribed SSTs, limits 位0's explanatory power for 蟿 differences, allowing other processes to dominate. Using a finer model grid (9 km vs. 28 km) of the same AMIP model reduces 蟿, though the mechanism remains unclear. Finally, motivated by the importance of oceanic eddies in the Southern Ocean, we conducted sensitivity experiments that filter transient mesoscale features from the SST boundary conditions. The results suggest that oceanic eddies may enhance summertime SAM persistence (by g1/4 2 d), though this signal is not statistically significant and is absent in the single 9 km run, pointing to a subtle role of mesoscale ocean-Atmosphere interaction that remains to be explored.

Balancing Informativity and Predictability in Circulation Type Forecasts: A Case Study of Energy Demand in Great Britain

Meteorological Applications Wiley 32:4 (2025) e70078

Authors:

Kristian Strommen, Hannah M Christensen, Hannah C Bloomfield

Abstract:

Weather regimes and weather patterns, here jointly referred to as circulation types, are used to generate forecasts for a variety of applications, such as energy demand and flood risk. However, there are usually many different choices available for precisely which circulation types to use. Ideally, one would like to use circulation types that are both highly informative for the application and also highly predictable, but in practice, there is often a tradeoff between informativity and predictability. We present a simple, general framework for how to construct a circulation type forecast that optimally balances these factors by segueing between different choices of circulation types at different lead times based on information鈥恡heoretic considerations. As an example, we apply this framework to the case of forecasting energy demand in Great British winters. We compare a set of 30 weather patterns produced by the UK Met Office with the much simpler two鈥恠tate framework consisting of a positive and negative North Atlantic Oscillation (NAO) regime and show how to optimally combine the two across a winter season.

Data-Driven Stochastic Parameterization of MCS Latent Heating in the Grey Zone

Copernicus Publications (2025)

Authors:

Zhixiao Zhang, Hannah Christensen, Robert Plant, Warren Tennant, Mark Muetzelfeldt, Michael Whitall, Tim Woollings, Alison Stirling

Abstract:

Mesoscale Convective Systems (MCSs), with length scales of 100 to 1000 km or more, fall into the "grey zone" of global models with grid spacings of 10s of km. Their under-resolved nature leads to model deficiencies in representing MCS latent heating, whose vertical structure critically shapes large-scale circulations. To address this challenge, we use analysis increments鈥攖he corrections applied by Data Assimilation (DA) to the model's prior state鈥攆rom a 10 km Met Office operational forecast model to inform the development of a stochastic parameterization for MCS latent heating. To focus on errors in MCS feedback rather than errors due to a missing MCS, we select analysis increments from 1037 MCS tracks that the model successfully captures at the start of the DA cycle.A Machine Learning鈥揵ased Gaussian Mixture Model reveals that the vertical structure of temperature analysis increments is probabilistically linked to the atmospheric environment. Bottom-heavy heating increments tend to occur in low Total Column Water Vapor (TCWV) conditions, suggesting that the model underestimates low-level convective heating in relatively dry environments. In contrast, top-heavy heating increments are linked to a moist layer overturning structure鈥攃haracterized by high TCWV and strong vertical wind shear鈥攊ndicating model underestimation of upper-level condensate detrainment in such environments. This probabilistic relationship is implemented in the Met Office operational forecast model as part of the MCS: PRIME stochastic scheme, which corrects MCS-related uncertainties during model integration. By enhancing top-heavy heating, the scheme backscatters kinetic energy from the mesoscale to larger scales, improving predictions of Indian seasonal rainfall and the Madden鈥揓ulian Oscillation (MJO). Future work will assess its impact on forecast busts and its potential to extend predictability.

Precipitation rate, convective diagnostics and spin-up compared across physics suites in the model uncertainty model intercomparison project (MUMIP)

Copernicus Publications (2025)

Authors:

Edward Groot, Hannah Christensen, Xia Sun, Kathryn Newman, Wahiba Lfarh, Romain Roehrig, Kasturi Singh, Hugo Lambert, Keith Williams, Jeff Beck, Ligia Bernadet, Judith Berner

Abstract:

A parameterisation suite is the combination of all parameterisation schemes that is used by a numerical model of the atmosphere. These parameterisation (or 鈥減hysics鈥) suites are widely seen as the most uncertain components of atmospheric models.聽聽In MUMIP we compare deterministic parameterisation suites from across different modelling centres under common prescribed large-scale dynamics. In the first MUMIP experiment, these dynamical tendencies have been derived by coarse-graining the convection-permitting ICON DYAMOND simulation to 0.2 degree resolution. We use these realistic spatiotemporal dynamical patterns to drive millions 聽of single column model simulations over the tropical Indian Ocean with prescribed SSTs. We use this data to estimate the uncertainty from their physics across four models, each using their default convection-parametrised physics suites. The models are: IFS, GFS, RAP and ARPEGE. The distributions of precipitation rate, convective available potential energy (CAPE), convective inhibition (CIN) and level of neutral buoyancy are analysed, as well as individual model tendencies and rate of change of CAPE and CIN as a function of lead time and, for instance, the diurnal cycle . We find notable differences across the physics suites and even more strongly between convection-parameterised physics suites and the convection-permitting ICON DYAMOND benchmark. Furthermore, we relate these diagnostics to biases in temperature and specific humidity. We also develop a framework for the detection of statistical relations among diagnostics and/or their change. The framework may for instance be used to quantify the impact of spin-up compared to persistence ("memory") and randomness within a dataset and to identify similarity in the physics across modelling centres. In this contribution some of the early results of the international MUMIP project will be presented and we hope to encourage other researchers to use and/or complement the data of MUMIP. Please refer to https://mumip.web.ox.ac.uk for details of how to get involved.