https://arxiv.org/api/icxKVxLeBvSYWsXvXE8OfUqRHbY2026-06-13T11:51:21Z836510515http://arxiv.org/abs/2605.21673v1From Licensing to Open Access: Designing a Sustainable Transition in Operational Weather Data2026-05-20T19:30:02ZThis translational article documents the European Centre for Medium-Range Weather Forecasts (ECMWF) transition from a restricted data licensing model to open access under CC BY 4.0, completed in October 2025. The policy context included EU open data requirements and alignment with international data exchange frameworks. The transition was implemented through a tiered service model that kept core forecast data open while offering operationally supported delivery as a cost-recovered service. Between 2020 and 2025, ECMWF executed an iterative planning cycle: setting an annual target for revenue reduction, specifying additions to the open tier under that target, provisioning infrastructure, and assessing outcomes to update assumptions. Drawing on internal administrative records (2014 - 2025), we describe design choices, operational constraints, and early outcomes. In the six months following the end of the transition, more than 93% of previously paying organisations retained a Service Agreement, while open endpoint download volumes increased substantially. We discuss trade-offs in defining the open tier (resolution, parameters, schedule), the reduction of compliance overheads formerly associated with redistribution restrictions, and the scalability implications of global distribution. We note an emerging sustainability question as AI-based forecast products become freely available. The early evidence is consistent with the view that a tiered service model can be designed to reconcile open-access obligations with operational sustainability, subject to monitoring over longer contract renewal cycles (typically annual).2026-05-20T19:30:02ZEmma PidduckUmberto ModiglianiVictoria L. BennettFabio VenutiFlorian PappenbergerFlorence Rabierhttp://arxiv.org/abs/2605.21196v1Effect of grid anisotropy, resolution, and subgrid-scale models in pseudo-spectral Large Eddy Simulations of low-level clouds2026-05-20T13:57:19ZWe investigate the effect due to grid resolution and subgrid-scale model on large-eddy simulations of low-level clouds using a novel framework that combines pseudo-spectral advection with the anisotropic minimum dissipation (AMD) subgrid-scale model. We use two field campaigns as reference, DYCOMS-II RF01 and ASTEX, which cover both non-precipitating and precipitating stratocumulus cloud regimes across different time scales. Our results demonstrate that the AMD model combined with pseudo-spectral advection produces robust and accurate predictions across varying grid resolutions without parameter tuning. We identify a recommended grid anisotropy where vertical spacing is approximately three times finer than horizontal spacing, balancing accuracy and computational efficiency. Finally, an error analysis based on cloud liquid water content and vertical velocity variance reveals good agreement with theoretical predictions for isotropic grids, while grid anisotropy effectively improves convergence rates.2026-05-20T13:57:19ZDavide SelvaticiRichard J. A. M. Stevenshttp://arxiv.org/abs/2605.21191v1Beyond Vorticity: An Angular Momentum Perspective on Fluid Flow2026-05-20T13:54:38ZWhile vorticity is the classical tool for analyzing rotational fluid kinematics, it inherently focuses on local, differential spin. This paper introduces a complementary framework based on the angular momentum density field, $\mathbf{L} = \mathbf{r} \times \mathbf{u}$, deriving generalized transport equations that explicitly balance macroscopic torque and rotational momentum. This $\mathbf{L}$ perspective offers several distinct theoretical advantages over traditional velocity/vorticity formulations. Specifically, this approach: (i) provides a novel decomposition of the viscous torque into a diffusive component and a local spin dissipative term; (ii) shows the mechanism by which lift is generated in viscous boundary layers by vorticity acting as a source of angular momentum; it also explains stall (iii) reformulates the hydrodynamic impulse to yield a remarkably clean separation of terms into dilatational, volumetric, and rotational flux components; The $\mathbf{L}$ formalism provides the kinematic closure necessary to unify non-circulatory added mass and circulatory lift within a single, dimensionally consistent budget. (iv) enables the direct calculation of the viscous added mass force, accounting for the inertial resistance of boundary layers and separated wakes; (v) simplifies geophysical fluid dynamics by absorbing the planet's rotation, traditionally treated as an artificial virtual vorticity term which directly gets absorbed into the conserved axial angular momentum $m$, revealing the fundamental physics of global circulation through explicit torque balances; (vi) identifies the rotlet as a fundamental Green's function for the $\mathbf{L}$ transport equation in the Stokes regime; and (vii) demonstrates that both oblique shocks and vortex sheets act as singular sources of $\mathbf{L}$ that turn the macroscopic flow.2026-05-20T13:54:38ZAhmed Farooqhttp://arxiv.org/abs/2503.11803v6Harnessing natural and mechanical airflows for surface-based atmospheric pollutant removal2026-05-20T10:56:33ZRemoval strategies for atmospheric pollutants are increasingly being considered to mitigate global warming and improve public health. However, the global potential of surface-based removal techniques has not yet been quantified based on limits of pollutant transport and removal rates. We evaluate the atmospheric pollutant transport to surfaces and assess the potential of surface-based removal technologies for global-scale deployment across a variety of configurations, including air interaction with the built environment, mechanical ventilation and convection systems, and over the global transportation fleet. Cities provide the highest transport-limited removal potential, with median annual atmospheric flow rates of 30 GtCO$_2$, 0.06 GtCH$_4$, 0.007 GtNO$_\text{x}$ and 0.0001 GtPM$_{2.5}$ to their total surface area. Cities, solar farms and HVAC systems have flow rates large enough to potentially remove more than 1 GtCO$_2$/y (1 GtCO$_2$e/y for CH$_4$, 20-year GWP), if laboratory-scale removal efficiencies from the literature are applied to their total surface area, however, achieving this would require technological advances. Based on their transport-limited upper bounds, HVAC filters have the potential to achieve costs as low as \$600 per tCO$_2$ removed (\$2000 per tCO$_2$e) if CO$_2$-sorption (CH$_4$-catalyst) technologies are incorporated into their surfaces and performance is maintained through routine replacement, compared with \$3000 per tCO$_2$ (\$10000 per tCO$_2$e) for city surfaces, using literature values for these technologies' material and application costs. These findings demonstrate that integrating surface-based pollutant removal technologies into infrastructure may offer a pathway to advance climate objectives, though further studies are needed to assess their feasibility in application, and application-implementation rates and cost.2025-03-14T18:43:40ZSamuel D. TomlinsonAliki M. TsopelakouTzia M. OnnSteven R. H. BarrettAdam M. BoiesShaun D. Fitzgeraldhttp://arxiv.org/abs/2605.20925v1Blending machine learning and physics-based approaches for weather and climate: a typology2026-05-20T09:10:45ZThe integration of machine learning (ML) with traditional physics-based models is reshaping the landscape of weather and climate prediction. On their own, ML-based and physics-based approaches each have significant benefits - but also challenges. Deploying both these approaches side by side has the potential to accelerate the pull through of emerging science in a trusted and practical way. But there are many choices that can be made to how we "blend" ML and established physics-based modelling systems to get the optimal benefits. This paper aims to provide a typology of blended modelling approaches and discusses some of the strategic benefits that come with them. It can be used not just to classify modelling systems, but also identify routes to gradual, incremental or wholesale development and implementation of new and emerging capabilities. These approaches provide a practical path to innovation by combining the speed and adaptability of machine learning with the robustness, trust, and interpretability of physics-based systems. By adopting a structured vocabulary and outlining the benefits and limitations of each approach, this framework supports informed decision-making and strategic planning, and can be used by the wider community to navigate the transition to next-generation prediction systems.2026-05-20T09:10:45ZSubmitted to Bull. Amer. Meteor. Soc. (BAMS)Benjamin J ShipwayCaroline BainDavid WaltersBen B. B. BoothIan BoutleRobin T. ClarkKatherine L. HillElizabeth KendonSimon B. Vosperhttp://arxiv.org/abs/2605.26130v1AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion2026-05-20T07:29:00ZOperational weather prediction at kilometer scales remains computationally prohibitive for traditional numerical weather prediction (NWP) models, limiting forecast access for applications in energy, agriculture, and disaster management that require fine-grained spatiotemporal detail. Here we introduce AirCast-SR, a foundation model for atmospheric super-resolution that downscales global AI weather forecasts from 0.25 degree (~28 km) to 1 km horizontal resolution at hourly temporal resolution, producing 67-hour forecasts of eight coupled surface variables simultaneously. EarthMind-SR employs a three-dimensional U-Net conditioned within a Latent Consistency Model (LCM) diffusion framework, trained on patch-based samples over the contiguous United States (CONUS) using GraphCast forecasts as input and NOAA's Analysis of Record for Calibration (AORC) as the target. The model achieves near-zero bias across all variables and lead times, and its radial power spectral density analysis demonstrates preservation of fine-scale atmospheric structure at wavelengths of 10 km to 100 km where coarser models lose spectral power. We validate EarthMind-SR across three CONUS case studies spanning winter, summer, and spring seasons, and demonstrate zero-shot global transferability over India and Germany using independent surface station observations without any retraining or fine-tuning. As an open-weights foundation model, EarthMind-SR establishes a new paradigm for kilometer-scale AI weather prediction and provides a platform for regional fine-tuning, distillation, and downstream applications in climate services and hazard forecasting.2026-05-20T07:29:00ZSomnath Luitel and Manmeet Singh are equal-contribution co-first authors, with Manmeet Singh (manmeet.singh@wku.edu) as corresponding authorSomnath LuitelManmeet SinghJoshua DurkeeAbdullah Al FahadNaveen SudharsanPrabhjot SinghCenlin HeHarsh KamathZong-Liang YangKrishnagopal HalderSandeep JunejaParthasarathi MukhopadhyaySaptarishi DhanukaAmit Kumar Srivastavahttp://arxiv.org/abs/2605.20494v1A 10,000-Year Global Stochastic Tropical Cyclone Catalog with Wind-Dependent Track Transitions (WHITS)2026-05-19T20:58:36ZReliable assessment of tropical cyclone (TC) risk is limited by the brevity and spatial sparsity of the historical record, particularly for the rare, high-intensity landfalls that dominate insured loss. We present WHITS (Wind-focused Hurricane Interactive Track Simulator), a non-parametric semi-Markov track generator that extends the HITS framework of Nakamura et al. (2015) in three ways: transitions between historical track segments are conditioned on local wind speed in addition to position, age, and forward vector; the kernel selection on the comparative-vector term is sharpened to suppress dynamically inconsistent jumps; and a short smoothing window is applied across each transition to remove the position and wind discontinuities reported by downstream surge users. WHITS is fit to the full available best-track record in each of six basins in IBTrACS, extending in the North Atlantic to 1851 and in other basins to the earliest year of reliable best-track data. The resulting 10,000-yr global synthetic catalog reproduces observed track density and the annual hurricane/typhoon-force wind-hit probability across all basins. The catalog is intended for catastrophe-risk applications where a large, low-bias sample of physically plausible tracks is more useful than a small, statistically corrected one.2026-05-19T20:58:36ZJennifer NakamuraUpmanu Lallhttp://arxiv.org/abs/2605.20455v1Benchmarking Cylindrical Blast Wave Theory Against the OSIRIS-REx Sample Return Capsule Reentry2026-05-19T20:07:09ZWeak shock theory based on cylindrical blast waves has been used to interpret meteor infrasound, but it has not been systematically benchmarked against a non-ablating hypersonic source with independently known parameters. The objective of this study is not to propose a new theoretical framework, but to evaluate the operational validity of the existing suite of blast radius formulations against a high-fidelity ground truth dataset. The OSIRIS-REx Sample Return Capsule reentry on 24 September 2023 provides such a benchmark because the capsule geometry, trajectory, and infrasound emission points are constrained from mission data and ray tracing, reducing source-side uncertainty associated with ablation. Using observations from 39 infrasound stations, this benchmarking study evaluates six published blast radius (R_0) formulations and three weak-shock transition coefficients (C) within a stratified atmospheric propagation model to predict signal period and peak overpressure. The benchmarking identifies the Sakurai formulation as the best-performing formulation for non-ablating bodies, with the Jones/Plooster formulation performing comparably when a physically appropriate C is adopted. Sakurai and Jones/Plooster yield linear-period median absolute percentage residuals of 9% and 11%, respectively. The period predictions show only weak sensitivity to C at these propagation distances. The Mach-diameter approximation commonly used in meteor studies overestimates R_0 by more than a factor of 3 in the absence of ablation. These results establish a performance baseline for applying cylindrical blast wave theory to non-ablating hypersonic bodies and demonstrate that the signal period is a robust observable for constraining R_0.2026-05-19T20:07:09Z70 pages,Elizabeth A. Silberhttp://arxiv.org/abs/2605.24009v1Improving Ensemble CAPE Forecasts with a Diffusion Model Incorporating Aerosol Information2026-05-19T18:32:55ZConvective available potential energy (CAPE) is an important variable for forecasting severe weather and understanding deep convection and precipitation. The latest versions of the Global Forecast System (GFS) and related Global Ensemble Forecast System (GEFS) have exhibited a bias towards underestimating CAPE values during the summertime. We train an artificial intelligence (AI) diffusion model to improve the skill and uncertainty quantification of afternoon 6-hour lead time ensemble forecasts over the United States. Our model takes a GFS CAPE forecast as input and outputs an ensemble that significantly outperforms both GFS and GEFS 6-hour forecasts on root mean square error, continuous ranked probability score, and Brier score. We propose a two-stage training pipeline to leverage both a larger historical GFS forecast dataset and a smaller historical GEFS dataset, despite the two using initialization and parameterization schemes that vary over time. We also show that classifier-free guidance can be used to control the skill and spread of the forecasts. We then demonstrate the versatility of our framework by adding aerosol optical depths (AODs) of black carbon, organic carbon, dust, sea salt, and sulfates as additional input features. Aerosols can invigorate or suppress convection depending on atmospheric conditions. Our AI models effectively incorporate aerosols to produce improved CAPE forecasts. We interpret the model components by using permutation feature importance to rank the influence of the different AODs and find that black carbon, organic carbon, and sulfate aerosols have a greater impact on the model's CAPE predictions than sea salt and dust aerosols.2026-05-19T18:32:55ZZachary JamesJoseph GuinnessArthur DeGaetanohttp://arxiv.org/abs/2605.06944v2AIMIP Phase 1: systematic evaluations of AI weather and climate models2026-05-19T16:41:41ZWe present the AI weather and climate model intercomparison project (AIMIP), phase 1. Drawing from the rich tradition of intercomparisons in climate model development, we specify a common experiment, output data format, and training constraints (namely, training against historical reanalysis data) for AIMIP Phase 1 models. We aim to identify differences in modeling frameworks and AI architectural choices that influence model behavior, and build trust in AI weather and climate models through open data and evaluation. AIMIP Phase 1 models must simulate the atmosphere given specified historical sea surface temperatures over 1979-2024. We evaluate the models' performance using five major evaluation criteria: biases, trends, response to El Niño-related sea surface temperature anomalies, temporal variability, and out-of-sample generalization tests. We find that the AI models are able to simulate the historical climate and response to forcing as well as a conventional physically-based model, but some AI models underestimate historical warming trends, and their predictions diverge in the out-of-sample generalization tests. We describe the AIMIP Phase 1 dataset that is publicly available for additional evaluations.2026-05-07T21:04:05Z48 pages, 25 figuresBrian HennChristopher S. BrethertonNikolay KoldunovChristian LessigMaria J. MolinaTroy ArcomanoOliver Watt-MeyerGuillaume CouaironRenu SinghRobert BrunsteinYana HassonAntonia JostNoah BrenowitzPeter ManshausenNathaniel Cresswell-ClayDale DurranKyle Joseph Chen HallJanni YuvalDmitrii KochkovStephan HoyerIgnacio Lopez-Gomezhttp://arxiv.org/abs/2605.20028v1Training-Free Bayesian Filtering with Generative Emulators2026-05-19T15:52:09ZBayesian filtering is a well-known problem that aims to estimate plausible states of a dynamical system from observations. Among existing approaches to solve this problem, particle filters are theoretically exact for non-linear dynamics and observations, but suffer from poor scalability in high dimensions. In this work, we show that diffusion-based emulators of dynamical systems can be used to implement, without additional training, an optimal variant of particle filters that has remained largely unexplored due to implementation challenges with classical numerical solvers. Experiments on nonlinear chaotic systems, including atmospheric dynamics, demonstrate that the proposed approach successfully scales particle filtering to high-dimensional settings.2026-05-19T15:52:09ZAccepted as a spotlight paper at the International Conference on Machine Learning 2026Thomas SavaryFrançois RozetGilles Louppehttp://arxiv.org/abs/2602.03924v2WIND: Weather Inverse Diffusion for Zero-Shot Atmospheric Modeling2026-05-19T13:45:59ZDeep learning has revolutionized weather forecasting, but many challenges remain, including climate modeling. Moreover, the current landscape remains fragmented: highly specialized models are typically trained individually for distinct tasks. To unify this landscape, we introduce WIND, a single pre-trained foundation model capable of replacing specialized baselines across a vast array of tasks. Crucially, in contrast to previous atmospheric foundation models, we achieve this without any task-specific fine-tuning. To learn a robust, task-agnostic prior of the atmosphere, we pre-train WIND with a self-supervised video reconstruction objective, utilizing an unconditional video diffusion model to iteratively reconstruct atmospheric dynamics from a noisy state. At inference, we frame diverse domain-specific problems strictly as inverse problems and solve them via posterior sampling. This unified approach allows us to tackle highly relevant weather and climate problems, including probabilistic forecasting, spatial and temporal downscaling, reconstruction of spatial fields from sparse observations and enforcing global dry air mass conservation. We further demonstrate how WIND can be applied to explore extreme weather events under prescribed out-of-distribution thermodynamic perturbations. By combining generative video modeling with inverse problem solving, WIND offers a computationally efficient alternative for AI-based atmospheric modeling.2026-02-03T18:58:10ZPublished at the 43rd International Conference on Machine Learning (ICML 2026)Michael AichAndreas FürstFlorian SestakCarlos Ruiz-GonzalezNiklas BoersJohannes Brandstetterhttp://arxiv.org/abs/2606.00055v1Viability of Tensor Train Methods for Geophysical Fluid Dynamics2026-05-18T20:27:18ZTensor train (TT) methods have recently gained popularity for accelerating the solving of systems of PDEs. Here, we evaluate the performance of TT methods in the context of geophysical fluid dynamics (GFD) using the shallow water equations and a discretization scheme employed by the ocean component of the Energy Exascale Earth System Model (E3SM). Through a suite of four test cases of increasing complexity, we evaluate TT methods in terms of how much TT is able to compress the model state, the error incurred by the TT approximation, and the speedup obtained by TT versus an optimal standard non-TT implementation in a representative subproblem. We show that though TT is able to effectively compress and speed up simple flows, it struggles to efficiently represent more complex states that are common in realistic GFD applications.2026-05-18T20:27:18ZJeremy LillyDerek DeSantisMark R. Petersenhttp://arxiv.org/abs/2512.04452v2NORi: An ML-Augmented Ocean Boundary Layer Parameterization2026-05-18T18:53:43ZNORi is a machine learning (ML) parameterization of ocean boundary layer turbulence that is physics-based and augmented with neural networks. NORi stands for neural ordinary differential equations (NODEs) Richardson number (Ri) closure. The physical parameterization is controlled by a Richardson number-dependent diffusivity and viscosity. The neural ODEs are trained to capture the entrainment through the base of the boundary layer, which cannot be represented with a local diffusive closure. The parameterization is trained using large-eddy simulations in an "a posteriori" fashion, where parameters are calibrated with a loss function that explicitly depends on the actual time-integrated variables of interest rather than the instantaneous subgrid fluxes, which are inherently noisy. NORi conserves tracers by design, uses realistic nonlinear thermodynamics, and demonstrates excellent prediction and generalization capabilities in capturing entrainment dynamics under different convective strengths, background stratifications, rotation, and wind forcings. NORi is shown to simulate the seasonal evolution of the boundary layer at Ocean Weather Station Papa with similar performance to the state-of-the-art two-equation $k$-$ε$ closure. When implemented in a double-gyre simulation, it is numerically stable for at least 100 years, despite only being trained on two-day horizons, and can be run with time steps as long as one hour. The highly expressive neural networks, combined with a physically rigorous base closure, prove to be a robust paradigm for designing parameterizations for climate models: data required and training cost are drastically reduced, inference performance can be directly optimized as a primary objective, and numerical stability is implicitly promoted through training.2025-12-04T04:49:52Z58 pages, 20 figures, submitted to Journal of Advances in Modeling Earth Systems (JAMES). This is version 2, updated based on reviews from 3 anonymous reviewers after initial submission to JAMES. The largest change from the previous version is the addition of comparisons with realistic observations from a long-term monitoring site in the Northeast PacificXin Kai LeeAli RamadhanAndre SouzaGregory LeClaire WagnerSimone SilvestriJohn MarshallRaffaele Ferrarihttp://arxiv.org/abs/2605.18523v1Spontaneous Zonal Symmetry Breaking of Tropical Rain Belt2026-05-18T15:10:36ZThe intertropical convergence zone (ITCZ) is a central component of tropical climate, but the conditions under which a tropical rain belt remains zonally extended or becomes unstable to zonal organization are not well understood. We investigate this problem using idealized nonrotating kilometer-scale simulations forced by a prescribed sea surface temperature (SST) distribution that varies only in the meridional direction. This setup produces an ITCZ-like rain belt while allowing spontaneous zonal convective self-aggregation (ZCSA) to emerge. A parameter sweep shows that ZCSA occurs preferentially when both the peak SST and the meridional SST amplitude are large. ZCSA cases exhibit a temporary weakening of the meridional near-surface convergence. Boundary-layer momentum and thermodynamic analyses link this weakening to enhanced lower-tropospheric stability over the cool subsiding region, a shallower boundary layer, and stronger effective frictional damping of the meridional inflow. However, weak convergence alone is not sufficient for ZCSA. Aggregating cases also have a large meridional contrast in moist static energy forcing, implying a strong demand for meridional energy transport. Consistently, ZCSA reorganizes meridional moist static energy transport, including enhanced stationary eddy export from the warm region, and is accompanied by growing zonal moisture variability and weakening meridional moisture contrast. These results suggest that zonal symmetry breaking of an ITCZ-like rain belt is favored when weakened meridional inflow coincides with a large imposed meridional MSE-forcing contrast.2026-05-18T15:10:36ZTomoro YanaseCathy Hohenegger