https://arxiv.org/api/8tlc+2vu9EugrxeA+YYhwJIi39w 2026-06-13T16:08:56Z 8365 165 15 http://arxiv.org/abs/2605.05108v1 Turbulent damping of fast tidal oscillations by three-dimensional Rayleigh-Bénard convection with a radiating free surface 2026-05-06T16:41:14Z We present three-dimensional Dedalus simulations of Rayleigh-Bénard convection with a blackbody-radiating free upper surface, subject to a low-amplitude oscillatory forcing that mimics tidal perturbations in convective envelopes of stars and planets. The forcing period is 10-100 times shorter than the convective timescale, $t_{\rm conv}$. Using a Reynolds decomposition of the velocity field averaged over one oscillation period, in which the tidal oscillations naturally constitute the fluctuating field and convection the mean flow, we elucidate the kinetic energy exchange between the two. Provided the oscillatory Reynolds number exceeds a modest threshold, we find that the oscillations systematically transfer kinetic energy to the mean flow at a volume-averaged rate $D_R \sim u'^2 t_{\rm conv}^{-1}$, where $u'$ is the rms fluctuation velocity. This reflects strong, order-unity correlations between the fluctuation velocities and the mean flow. These arise because the oscillatory forcing displaces fluid elements that are then redirected by buoyancy and incompressibility in the same manner as the mean flow. The transfer is dominated by correlations involving vertical velocity fluctuations and vertical gradients of the mean flow. The resulting energy transfer rate is consistent, within the equilibrium-tide framework, with the observed tidal circularisation of solar-type binaries and with the orbital evolution of moons of Jupiter and Saturn. This validates the formalism proposed by Terquem (2021) for the dissipation of fast tides, a longstanding problem. Replacing the free surface with a rigid upper boundary significantly and artificially modifies the correlations. 2026-05-06T16:41:14Z 23 pages, 8 figures, accepted for publication in MMRAS Caroline Terquem Alexander Boone Enrico Martinez http://arxiv.org/abs/2605.05052v1 Interpretable Neural Networks to Predict Momentum Fluxes of Orographic Gravity Waves 2026-05-06T15:49:01Z State-of-the-art Earth system models (ESMs) cannot explicitly resolve many small-scale atmospheric processes such as atmospheric gravity waves, and thus must represent, or parameterise, their effects on the resolved state. Machine learning (ML) has the potential to improve these parameterisations. In our study, we train neural networks (NNs) on ERA5 reanalysis data to predict momentum fluxes of orographic gravity waves as a function of the state variables at the resolution of a coarse ESM. Employing a full year of data, we extract inertia-gravity waves using the software MODES, which applies linear theory for wave filtering, and train ML models on data coarse-grained to the ESM's target resolution. We consider four different cases: the full spectrum of inertia-gravity waves resolved in ERA5, or just the part of the spectrum that is subgrid-scale in the target ESM, both over all land or just over mountainous terrain. Our NNs successfully predict momentum fluxes, with a global coefficient of determination ($R^2$) ranging from 0.72 to 0.56, depending on the case, when evaluated offline with data from another year. An analysis of our models using SHAP values, an explainable AI technique, suggests that the networks learned physically meaningful relationships. In addition, we give a comparison with the physics-based parameterisation scheme by Lott and Miller. This work forms the basis for the development of operational ML-based parameterisations to improve the representation of gravity waves and their effects in climate models. 2026-05-06T15:49:01Z Elias Haslauer Mierk Schwabe Andreas Dörnbrack Edwin P. Gerber Markus Rapp Nedjeljka Žagar Veronika Eyring http://arxiv.org/abs/2605.10960v1 Two Hebrew folk meteorological proverbs tested: rainfall on Rosh Chodesh and Shabbat Mevarechim as predictors of monthly precipitation (Israel, 1950-2024) 2026-05-06T14:28:33Z Folk meteorological proverbs encode centuries of empirical observation by agricultural communities. Two Hebrew proverbs link lunar calendar anchor days to monthly winter rainfall: (i) "If Rosh Chodesh is rainy, the whole month is rainy" and (ii) "If it rains on Shabbat Mevarechim, the whole month is rainy." Shabbat Mevarechim is the last Saturday before each new Hebrew month, preceding Rosh Chodesh by one to seven days. The first proverb is widely known; the second circulates in Hasidic oral tradition with no identified written source. Both have never been formally tested. We analyse 75 years (1950-2024) of daily precipitation data from seven Israeli cities across three climatic regions, comprising 191,758 station-days and 2,422 Hebrew-month observations during the winter rainy season (Marcheshvan-Adar). A rainy Rosh Chodesh increases the probability of a rainy month from 22.2% to 38.6% (lift +16.4 percentage points; chi-square = 57.8, p = 2.9e-14; Bayes factor 1.81). A rainy Shabbat Mevarechim produces a similar effect (lift +16.5 percentage points, p = 8.0e-13), despite preceding Rosh Chodesh by up to seven days. The effect decays with lag and mirrors daily rainfall autocorrelation (r = 0.35-0.44 at lag 1; ~0 at lag 7), consistent with Mediterranean cyclone persistence. A bootstrap permutation test (p < 1e-4) and a 15-year rolling analysis show declining predictive power (-0.20 percentage points per year, p < 0.001), consistent with shortening precipitation events under warming climate conditions. Both proverbs encode real but probabilistic meteorological signals whose reliability is decreasing over time. 2026-05-06T14:28:33Z Abraham Itzhak Weinberg http://arxiv.org/abs/2605.04881v1 From Classical to Quantum-Mechanical Data Assimilation: A Comparison between DATO and QMDA 2026-05-06T13:16:58Z Data assimilation provides a systematic framework for combining dynamical models with partial and noisy observations to infer the evolving state of a system. In this work, we undertake a comparative study of Data Assimilation with Transfer Operators (DATO) and Quantum Mechanical Data Assimilation (QMDA), focusing on their mathematical formulation, algorithmic structure, and empirical performance. Both methods are first cast within a common operator-theoretic framework, which makes it possible to compare, on a unified basis, their representations of uncertainty, forecast propagation, and assimilation updates. We then analyse their principal similarities and differences with respect to state-space structure, update mechanisms, structural preservation properties, and computational cost. To complement the theoretical analysis, we assess both approaches on benchmark dynamical systems across a range of observational settings, including noisy, sparse, and partially observed regimes. Our results show that, despite their shared operator-theoretic motivation, DATO and QMDA embody substantially different assimilation paradigms, leading to distinct advantages and limitations in terms of interpretability, robustness, and scalability. The present study helps delineate the regimes in which each framework is most effective and offers broader insight into the design of operator-based methodologies for data assimilation. 2026-05-06T13:16:58Z Emanuele Donno Giovanni Conti Paolo Oddo Silvio Gualdi Luca Mainetti Giovanni Aloisio http://arxiv.org/abs/2602.10136v2 Collective and nonlinear structure of wind power correlations 2026-05-06T08:51:55Z We describe the correlation structure of wind power fluctuations in a farm of 80 turbines, sampled over 5 years. We report the presence of universal, collective, and nonlinear correlations, responsible for the excess persistency and intermittency of farm-aggregated power output. A first cross-correlation analysis of turbine production reveals a dynamical scaling transition (à la Family-Vicszek) from local decoherence to large-scale turbulence-driven scaling, and responsible for the geographical smoothing effect, previously reported beyond farm scale [M. M. Bandi, Phys. Rev. Lett. 118, 028301 (2017)]. A second bivariate analysis shows the long-range correlation of non-Gaussian features, responsible for their amplification in total farm output. These findings provide a new perspective on wind power variability, highlighting the importance of nonlinear correlations in power production dynamics. By better characterising these fluctuations, our results can inform strategies for grid management, storage optimization, and wind farm design, ultimately improving the integration of wind energy into modern power systems. 2026-02-08T03:15:24Z 11 pages, 6 figures, supplemental in pdf file PRX Energy 5, 023007 (2026) Samy E. Lakhal J. E. Sardonia M. M. Bandi 10.1103/vms3-ng8z http://arxiv.org/abs/2605.01599v2 Cast3: Translating numerical weather prediction principles into data-driven forecasting 2026-05-06T07:51:56Z Data-driven weather models have made rapid advances in recent years, reaching and in some metrics surpassing the large-scale forecast skill of operational numerical weather prediction. This progress, however, has been built almost entirely on the reanalysis data that NWP produced, while the methodological knowledge that the NWP community distilled over decades of multi-scale atmospheric modelling remains largely unused. Here we present Cast3, a generative forecasting framework that systematically absorbs NWP meta-knowledge to close this gap. Cast3 operates on variable-resolution cubed-sphere grids for scale-aware representation and constructs structurally diverse super-ensembles that sample the complementary biases of different grid discretizations, delivering state-of-the-art ensemble prediction. It further introduces generative nudging, a posterior-sampling strategy that distils the collective information of the full ensemble into a single forecast possessing both the large-scale accuracy of the ensemble mean and the mesoscale realism of a high-resolution member. Evaluated across synoptic-scale skill, spectral fidelity, station-level surface verification, and tropical cyclone prediction, Cast3 outperforms established deterministic and generative baselines across various dimensions. More broadly, these results demonstrate that the design principles embedded in computational atmospheric science offer a rich and largely untapped foundation for the next generation of data-driven Earth system modelling. 2026-05-02T20:17:16Z 28 pages, 5 figures; corrected typos Congyi Nai Baoxiang Pan Yuan Liang Xi Chen http://arxiv.org/abs/2605.05255v1 Prediction of Drought and Flash Drought in Africa at the Seasonal-to-Subseasonal Scale using the Community Research Earth Digital Intelligence Twin Framework 2026-05-05T23:52:32Z Droughts and flash droughts (rapidly developing droughts; FDs) remain impactful events that are known to desiccate landscape and destroy crops. In particular, droughts in Africa are often more impactful than in other locations, such as the United States or Europe, due to many regions in Africa heavily depending on local agriculture for sustenance. In recent years, large machine learning (ML) models, such as GraphCast and AIFS, have emerged as effective tools for global weather prediction. However, sparse data observations and few ML studies in Africa have left it unclear if these ML models retain their skill when focused on Africa. As such, this project seeks to examine the predictability of drought and FD in Africa using a CrossFormer model based on the Community Research Earth Digital Intelligence Twin (CREDIT) framework developed by NSF NCAR. Our CrossFormer model, termed DroughtFormer, incorporates variables from the ERA5 and GLDAS2 reanalyses and the IMERG and MODIS satellite observations, and employs dry air mass and moisture conservation, to predict soil moisture, vegetation health, and other drought-related surface variables. While DroughtFormer displayed lower accuracy in predicting precipitation and FD indices, it showed significant skill in predicting the remaining variables, delivering stable and skillful forecasts out to 90-day lead times (either beating out or having comparable skill to climatology). In particular, DroughtFormer skillfully represented climate anomalies for key variables, such as soil moisture (though it struggled with the magnitude of the anomalies). Thus, DroughtFormer showed significant promise in representing and predicting agricultural level drought in a region that is heavily impacted by drought events. 2026-05-05T23:52:32Z Stuart Edris Amy McGovern Jason Hickey http://arxiv.org/abs/2605.04164v1 Enabling Real-Time Training of a Wildfire-to-Smoke Map with Multilinear Operators 2026-05-05T18:02:14Z Wildfires are a major producer of fine particulate matter, impacting human health and the electrical grid. Accurately forecasting smoke impacts over long time scales incorporates fuel treatment strategies, natural fuel succession, and stochastic events like lightning strikes. However, predicting smoke for each fuel distribution with a forward simulation of a coupled fire-atmosphere model is computationally infeasible. Moreover, relatively simple fire models are tractable to run in many long-time scenarios but do not capture smoke transport. We use data-driven multilinear operators to predict a smoke concentration field from knowledge of the time since ignition for two quantities of interest: aerosol optical depth and smoke detection. Our method first computes the principal components of time-since-ignition and smoke concentration fields and then learns a map from powers of the input coefficients to the output coefficients. We apply our learned operator to smoke prediction in the Upper Rio Grande Watershed. After collecting training data, learning the approximation weights on a CPU takes less than 30 seconds, and each forward call takes less than 1 ms. On a proxy for aerosol optical depth, we obtain equal accuracy to Monte Carlo sampling with fewer than half as many coupled model calls. For smoke detection, we obtain an intersection-over-union (IoU) of 65% and an area under the receiver operating characteristic curve (AUC) of 0.95 on holdout data. Our method is significantly more accurate than the most similar published smoke classifier, which obtains an IoU and AUC of 0.15 and 0.61, respectively, on a 2015 bushfire in Australia. 2026-05-05T18:02:14Z 27 pages Zachary Morrow Joseph Crockett John D. Jakeman Dan J. Krofcheck http://arxiv.org/abs/2601.08116v3 Learning a Stochastic Differential Equation Model of Tropical Cyclone Intensification from Reanalysis and Observational Data 2026-05-05T17:48:08Z Tropical cyclones are among the most consequential weather hazards, yet estimates of their risk are limited by the relatively short historical record. To extend these records, researchers often generate large ensembles of synthetic storms using simplified models of cyclone intensification. Developing such models, however, has traditionally required substantial theoretical effort. Here we explore whether equation-discovery methods, a class of data-driven techniques designed to infer governing equations, can accelerate the process of developing simplified intensification models. Using observational storm data (IBTrACS) together with environmental conditions from reanalysis (ERA5), we learn a compact stochastic differential equation describing tropical cyclone intensity evolution. We focus on TCs because their dynamics are well studied and a hierarchy of reduced-order models exist, enabling direct comparison of the learned model to physically-derived counterparts. We find that the learned model simulates synthetic TCs whose intensification statistics and hazard estimates are consistent with observations and competitive with a leading physics-based TC intensification model. Our model also reproduces known nonlinear dynamical behavior of tropical cyclones, including as a saddle node bifurcation as inner core ventilation is increased. This result shows that equation-discovery approaches, when applied directly to storm intensity, can recover not only realistic statistics but also physically meaningful dynamical structure. These findings highlight the potential for data-driven methods to complement existing theory and reduced-order models in the study of extreme weather. 2026-01-13T01:11:17Z Kenneth Gee Sai Ravela http://arxiv.org/abs/2605.04002v1 Aerosol memory in stratocumulus clouds leads to noise-induced patterns and non-ergodic sampling 2026-05-05T17:21:59Z Stratocumulus cloud decks exhibit bistability between patterns of high (closed cells) and low (open cells) cloud fraction. Localized transitions between these two states (pockets of open cells) have been observed but their underlying mechanism remains unclear. We model stratocumulus and their interaction with atmospheric aerosol as a data-driven and physics-informed stochastic dynamical system with time-dependent parameters. This allows us to show that pockets of open cells result from noise-induced transitions between the stratocumulus patterns. We find comparable timescales for these transitions, mesoscale self-organization into patterns and the evolution of large-scale parameters. This lack of timescale separation corresponds to an aerosol memory in cloud evolution and means that the sampling of stratocumulus states by polar-orbiting satellites lacks the encoding of process information that would be present for an asymptotic and ergodic sampling. 2026-05-05T17:21:59Z Benjamin Hernandez Franziska Glassmeier http://arxiv.org/abs/2605.03997v1 Uncertainty Quantification in Forecast Comparisons 2026-05-05T17:20:48Z Skill scores, which measure the relative improvement of a forecasting method over a benchmark via consistent scoring functions and proper scoring rules, are a standard tool in forecast evaluation, yet their sampling uncertainty is rarely rigorously quantified. With modern forecasting applications being increasingly multivariate and involving evaluations across multiple horizons, variables, spatial locations, and forecasting methods, standard tools like the pairwise Diebold-Mariano forecast accuracy test or pointwise confidence intervals fail to account for the multiple comparison problem, leading to inflated Type I error rates and invalid joint inference. To address the lack of a coherent, statistically rigorous framework for quantifying uncertainty across these multi-dimensional evaluation problems, we introduce simultaneous confidence bands for expected scores and skill scores. Our framework provides a versatile tool for joint inference that is applicable to any forecast type from mean and quantile to full distributional forecasts. We develop a bootstrap implementation and show that our bands are valid under multivariate extensions of the classical Diebold-Mariano assumptions. We demonstrate the practical utility of the approach in two case studies by quantifying the benefits of time-varying parameter models for macroeconomic forecasting, and by comparing data-driven and physics-based models in probabilistic weather forecasting. 2026-05-05T17:20:48Z Marc-Oliver Pohle Tanja Zahn Sebastian Lerch http://arxiv.org/abs/2603.12515v2 Recent Weakening of the Global Radiative Feedback 2026-05-05T14:41:51Z Earth's climate stability, characterized by the global radiative feedback parameter ($λ$), varies decadally due to changing surface temperature patterns. Recent variations in $λ$ are poorly understood as coordinated model simulations typically end in 2014. We apply a convolutional neural network trained on climate model simulations to observation-based surface temperature reconstructions to estimate variations in $λ$ up to 2025. We find that $λ$ reached a minimum (maximum stability) around the mid 1990s ($λ\simeq -3 {\rm Wm^{-2}/K}$), but has since weakened significantly ($λ\simeq -2\, {\rm Wm^{-2}/K}$). We confirm these results with climate model simulations extended to 2022. The recent $λ$ weakening is not significantly affected by El Niño Southern Oscillation or Pacific Decadal Oscillation. Attribution reveals that warming in the subtropical Northeast Pacific is an important driver of the recently weakened feedback, confirmed by targeted experiments in E3SMv2. Our approach enables near real-time monitoring of Earth's climate stability. 2026-03-12T23:24:04Z 7 pages, 3 figures; supplemental information (11 figures, 3 tables) Senne Van Loon Maria Rugenstein Mark D. Zelinka Timothy Andrews http://arxiv.org/abs/2605.03802v1 Towards accurate extreme event likelihoods from diffusion model climate emulators 2026-05-05T14:28:20Z ML climate model emulators are useful for scenario planning and adaptation, allowing for cost-efficient experimentation. Recently, the diffusion model Climate in a Bottle (cBottle) has been proposed for generation of atmospheric states compatible with boundary conditions of solar position and sea surface temperatures. Crucially, cBottle can be guided to generate extreme events such as Tropical Cyclones (TCs) over locations of interest. Diffusion models such as cBottle work by approximating the probability density of the training data. Here, we show use cases of the probability density estimates of atmospheric states obtained from this climate emulator. Most importantly, these estimates allow us to calculate likelihoods of extreme events under guidance. When guiding the model towards states including TCs, comparing the probability density under the guided and unguided model enables us to quantify how much more likely the guidance has made the TC. We show how these odds ratios allow us to importance-sample from the TC distribution, reducing the standard error of the probability estimate compared to simple Monte Carlo sampling. Furthermore, we discuss results and limitations of the application of model probability densities to extreme event attribution-like experiments. We present these early but encouraging results hoping they will spur more research into probabilistic information that can be gained from diffusion models of the atmosphere. 2026-05-05T14:28:20Z 19 pages, 6 figures Peter Manshausen Noah Brenowitz Julius Berner Karthik Kashinath Mike Pritchard http://arxiv.org/abs/2605.03399v1 PODiff: Latent Diffusion in Proper Orthogonal Decomposition Space for Scientific Super-Resolution 2026-05-05T06:21:04Z Probabilistic super-resolution of high-dimensional spatial fields using diffusion models is often computationally prohibitive due to the cost of operating directly in pixel space. We propose PODiff, a structured conditional generative framework that performs diffusion in a fixed, variance-ordered Proper Orthogonal Decomposition (POD) coefficient space, exploiting the orthogonality of POD modes to impose an interpretable, variance-ordered latent geometry. This design enables efficient ensemble generation, preserves dominant spatial structure, and yields spatially interpretable, well-calibrated uncertainty at substantially lower computational cost. We evaluate PODiff on sea surface temperature downscaling over the West Australian coast and on a controlled advection-diffusion benchmark. PODiff achieves reconstruction accuracy comparable to pixel-space diffusion while requiring significantly less memory and producing more reliable uncertainty estimates than deterministic and Monte Carlo Dropout baselines. 2026-05-05T06:21:04Z Accepted at ICML 2026 Onkar Jadhav Tim French Matthew Rayson Nicole L. Jones http://arxiv.org/abs/2602.00378v2 Parametrization of subgrid scales in long-term simulations of the shallow-water equations using machine learning and convex limiting 2026-05-05T00:15:03Z We present a method for parametrizing sub-grid processes in the Shallow Water equations. We define coarse variables and local spatial averages and use a feed-forward neural network to learn sub-grid fluxes. Our method results in a local parametrization that uses a four-point computational stencil, which has several advantages over globally coupled parametrizations. We demonstrate numerically that our method improves energy balance in long-term turbulent simulations and also accurately reproduces individual solutions. The long-term simulations refer to numerical studies where a fluid flow is simulated over a duration long enough to reach a statistical steady state. The neural network parametrization can be easily combined with flux limiting to reduce oscillations near shocks. More importantly, our method provides reliable parametrizations, even in dynamical regimes that are not included in the training data. 2026-01-30T22:57:32Z Md Amran Hossan Mojamder Zhihang Xu Min Wang Ilya Timofeyev