http://arxiv.org/abs/2512.14898v2 Predicting Forecast Error for the HRRR Using LSTM Neural Networks: A Comparative Study Using New York and Oklahoma State Mesonets 2026-05-14T15:58:34Z

Long Short-Term Memory (LSTM) models are trained to predict forecast errors for the High-Resolution Rapid Refresh (HRRR) model using the New York State Mesonet and Oklahoma State Mesonet near-surface weather observations as ground truth. When evaluated using mean absolute error and percent improvement relative to the HRRR, LSTMs predict precipitation error most accurately, providing, on average, a 48% improvement relative to the HRRR forecast, followed by wind error, with an average 25% improvement, and then temperature error, with an average 15% improvement. Precipitation errors exhibit an asymmetry, with overforecast precipitation detected more accurately than underforecast precipitation, while wind error predictions are consistent across over- and underforecasts. Temperature error predictions are relatively accurate but smoother, with respect to variance, than true observations. This paper presents an overview of LSTM performance with the express intent of providing forecasters with real-time predictions of forecast error at the point of use within the New York State and Oklahoma State Mesonets. In practice, the predicted errors can be used to adjust deterministic HRRR forecasts at the point of use, identify locations and variables with elevated uncertainty, and provide supplemental guidance for high-impact decision-making. This research demonstrates the potential of LSTM-based machine learning models to provide actionable, location-specific predictions of forecast error for high-resolution operational numerical weather prediction (NWP) systems. However, model performance is variable-dependent, and the approach relies on the availability of dense mesonet observations, which may limit applicability in data-sparse regions.
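
As a rough illustration of the evaluation described above, the sketch below computes mean absolute error and a percent improvement after subtracting a predicted error from a forecast. The sign convention (error = forecast minus observation), the exact definition of percent improvement, and all numbers are assumptions made for illustration, not values or definitions taken from the paper.

```python
from statistics import fmean

def mae(values, obs):
    """Mean absolute error of `values` against observations."""
    return fmean(abs(v - o) for v, o in zip(values, obs))

def percent_improvement(forecast, predicted_error, obs):
    """Percent reduction in MAE after removing the predicted error.

    Assumed convention: error = forecast - observation, so the
    corrected forecast is forecast - predicted_error.
    """
    corrected = [f - e for f, e in zip(forecast, predicted_error)]
    baseline = mae(forecast, obs)
    return 100.0 * (baseline - mae(corrected, obs)) / baseline

# Made-up numbers for illustration only (not from the paper).
obs = [10.0, 12.0, 9.0, 11.0]
hrrr = [12.0, 11.0, 10.0, 14.0]
predicted_error = [1.0, -1.0, 0.5, 2.0]
improvement = percent_improvement(hrrr, predicted_error, obs)
print(f"MAE improvement: {improvement:.1f}%")
```

A positive value means the corrected forecast has a lower MAE than the raw forecast; a negative value would mean the correction made things worse.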

2025-12-16T20:22:41Z This manuscript is a preprint and has been submitted for peer review to the Weather and Forecasting journal. The content is subject to change based on the outcome of the peer-review process and should not be considered final or definitive. Copyright in this Work may be transferred without further notice David Aaron Evans Kara J. Sulia Nick P. Bassill Chris D. Thorncroft Jay C. Rothenberger Lauriana C. Gaudet http://arxiv.org/abs/2605.14947v1 From Particles to Policy: Technical Building Blocks for Multi-State SAI Coordination 2026-05-14T15:19:27Z

Stratospheric aerosol injection (SAI) is a solar radiation modification technique, proposed as an interim measure to offset warming while greenhouse gas (GHG) emissions are reduced. This paper discusses a possible SAI implementation route - an alternative to sulfate aerosols formed in situ - based on engineered solid particles having dedicated properties such as size, composition, surface chemistry, and traceable origin, supporting safety, controllability, and functionality needed for SAI systems. These engineered properties also open up options for any future multi-state coordination of SAI through two technical building blocks: (1) the SAI-induced radiative forcing (SRF) - the magnitude of the cooling effect attributable specifically to the SAI layer - as an operator-independent quantity, derivable from direct aerosol-layer measurements; and (2) particle traceability through identifying signatures embedded at production. Both could feed into a shared, publicly accessible monitoring database open to independent interrogation, addressing several governance challenges by anchoring compliance assessments in measurable parameters. Drawing on precedents from the Montreal Protocol, IAEA safeguards, and other regimes, we show that shared technical metrics have historically enabled multi-state cooperation, and we argue the same could apply to SAI. We describe a phased pathway in which the technical capabilities and coordination practices that would use them are developed and tested together, at scales orders of magnitude below operational deployment. To be clear - we regard SAI deployment as premature; the conditions under which it might be considered have not been met. The paper does not propose a governance framework; rather, it identifies technical infrastructure that could support a wide range of such frameworks.

2026-05-14T15:19:27Z R. Yahav A. Spector D. Kushnir M. C. Waxman http://arxiv.org/abs/2601.21151v2 Learning to Advect: A Neural Semi-Lagrangian Architecture for Weather Forecasting 2026-05-14T14:21:49Z

Recent machine-learning approaches to weather forecasting often employ a monolithic architecture in which distinct physical mechanisms -- advection (long-range transport), diffusion-like mixing, thermodynamic processes, and forcing -- are represented implicitly within a single large network. This is particularly problematic for advection, where long-range transport typically requires expensive global interaction mechanisms or deep stacks of local convolutional layers. To mitigate this, we present PARADIS, a physics-inspired global weather prediction model that enforces inductive biases on network behavior through a functional decomposition into advection, diffusion, and reaction blocks acting on latent variables. We implement advection through a Neural Semi-Lagrangian operator that performs trajectory-based transport via differentiable interpolation on the sphere, enabling end-to-end learning of both the latent modes to be transported and their characteristic trajectories. Diffusion-like processes are modeled by depthwise-separable spatial mixing, whereas local source terms and vertical interactions are handled via pointwise channel interactions, yielding a physically structured operator decomposition. Evaluated on ERA5 benchmarks, PARADIS achieves competitive deterministic forecast skill, with particularly strong short-lead performance, while preserving substantially better spectral fidelity and forecast activity during medium-range rollouts.
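
To make the semi-Lagrangian idea concrete, here is a minimal classical sketch on a 1D periodic grid: each grid point traces back along the velocity to a departure point and takes the linearly interpolated value found there. This is a textbook toy under assumed names (`semi_lagrangian_step`, a prescribed `velocity`), not the PARADIS operator, which works on the sphere and learns both the transported latent fields and their trajectories.

```python
import math

def semi_lagrangian_step(field, velocity, dt, dx):
    """One semi-Lagrangian advection step on a periodic 1D grid."""
    n = len(field)
    out = []
    for i in range(n):
        # Departure point, in grid-index units, of the parcel arriving at i.
        x_dep = i - velocity[i] * dt / dx
        j = math.floor(x_dep)
        w = x_dep - j  # fractional distance between points j and j + 1
        # Linear interpolation; the modulo wraps indices periodically.
        out.append((1.0 - w) * field[j % n] + w * field[(j + 1) % n])
    return out

field = [0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
velocity = [1.0] * len(field)
shifted = semi_lagrangian_step(field, velocity, dt=1.0, dx=1.0)
half_shifted = semi_lagrangian_step(field, velocity, dt=0.5, dx=1.0)
```

Because the interpolation is a weighted sum, the step is differentiable with respect to both the field and the displacement, which is the property that makes a learned version trainable end to end.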

2026-01-29T01:20:21Z Carlos A. Pereira Stéphane Gaudreault Valentin Dallerit Christopher Subich Shoyon Panday Siqi Wei Sasa Zhang Siddharth Rout Eldad Haber Raymond J. Spiteri David Millard Emilia Diaconescu http://arxiv.org/abs/2605.03646v3 Turbophoresis of inertial particles in inhomogeneous turbulence produced by oscillating grids 2026-05-14T07:28:43Z

Turbophoresis in inhomogeneous turbulent flows leads to the formation of large-scale nonuniform particle number density distributions of inertial particles. This effect is associated with an effective drift velocity directed toward regions of lower turbulence intensity and proportional to the particle Stokes time and the spatial gradient of the turbulence intensity. In the present study, turbophoretic transport is experimentally investigated in air flows generated by one-grid and two-grid oscillating turbulence systems. The flow velocity field and particle spatial distribution are measured using Particle Image Velocimetry. To isolate the effect of particle accumulation due to turbophoresis from that associated with mean fluid flow, the measured particle number density of inertial particles is normalized by the corresponding distribution obtained for noninertial tracer particles under identical flow conditions. The measurements show preferential accumulation of inertial particles in regions of lower turbulence intensity, consistent with the expected behavior of turbophoretic transport.
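
The proportionality stated above, and the normalization used to isolate turbophoresis, can be written schematically as follows. The symbols are assumed notation introduced here for illustration: $\tau_p$ for the particle Stokes time, $\mathbf{u}'$ for the turbulent velocity fluctuation, and $n_{\mathrm{inertial}}$, $n_{\mathrm{tracer}}$ for the measured number densities of inertial and tracer particles. The prefactor is deliberately left unspecified.

```latex
% Assumed notation; the abstract states only the proportionality and its direction.
\mathbf{V}_{\mathrm{turb}} \;\propto\; -\,\tau_p \,\nabla \langle \mathbf{u}'^{2} \rangle ,
\qquad
\tilde{n}(\mathbf{x}) \;=\; \frac{n_{\mathrm{inertial}}(\mathbf{x})}{n_{\mathrm{tracer}}(\mathbf{x})}
```

The minus sign encodes the drift toward regions of lower turbulence intensity, so $\tilde{n}$ is expected to be larger where $\langle \mathbf{u}'^{2} \rangle$ is smaller.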

2026-05-05T11:20:11Z 12 pages, 14 figures, revtex4-2, revised paper E. Elmakies O. Shildkrot N. Kleeorin A. Levy I. Rogachevskii http://arxiv.org/abs/2605.14426v1 A plug-and-play generative framework for multi-satellite precipitation estimation 2026-05-14T06:18:53Z

Reliable precipitation monitoring is essential for disaster risk reduction, water resources management, and agricultural decision-making. Multi-source satellite observations, particularly the combination of geostationary infrared and passive microwave measurements, have become a primary means of precipitation detection. Traditional multi-source satellite precipitation estimation methods remain computationally inefficient, and many deep learning methods lack the flexibility to incorporate new sensors without retraining the full model. Here we introduce PRISMA (Precipitation Inference from Satellite Modalities via generAtive modeling), a plug-and-play latent generative framework for multi-sensor precipitation estimation. PRISMA learns an unconditional precipitation prior from IMERG Final fields and constrains it through independently trained, sensor-specific conditional branches, allowing new observation sources to be incorporated without retraining the generative backbone. Applied to FY-4B AGRI infrared and GPM GMI microwave observations, PRISMA improves Critical Success Index by up to 40.3% and reduces root-mean-square error by 22.6% relative to infrared-only estimation within microwave swaths, while also improving probabilistic skill and maintaining an average inference time of about 37 s. Independent rain-gauge validation across China confirms consistent gains, and typhoon case studies show that microwave conditioning restores eyewall and spiral rainband structures, reducing storm-core mean absolute error by up to 42.3%. PRISMA thus provides an extensible and efficient framework for multi-sensor precipitation estimation.
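
For readers unfamiliar with the two headline scores, the sketch below computes the Critical Success Index (hits divided by hits plus misses plus false alarms, for events at or above a rain-rate threshold) and the root-mean-square error. The threshold and sample values are made up for illustration and are not PRISMA results.

```python
import math

def critical_success_index(pred, obs, threshold):
    """CSI = hits / (hits + misses + false alarms) for events >= threshold."""
    hits = misses = false_alarms = 0
    for p, o in zip(pred, obs):
        p_event, o_event = p >= threshold, o >= threshold
        if p_event and o_event:
            hits += 1
        elif o_event:
            misses += 1
        elif p_event:
            false_alarms += 1
    total = hits + misses + false_alarms
    # With no observed or predicted events the score is undefined.
    return hits / total if total else float("nan")

def rmse(pred, obs):
    """Root-mean-square error between predictions and observations."""
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs))

# Illustrative rain rates (mm/h), not data from the paper.
obs = [0.0, 2.0, 5.0, 0.0, 3.0]
pred = [1.5, 2.5, 0.0, 0.0, 4.0]
csi_value = critical_success_index(pred, obs, threshold=1.0)
rmse_value = rmse(pred, obs)
```

Correct negatives do not enter the CSI, which is why it is often preferred for rare events such as heavy precipitation.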

2026-05-14T06:18:53Z Yunfan Yang Haofei Sun Xiuyu Sun Wei Han Xiaoze Xu Xingtao Song Jun Li Zhiqiu Gao Wei Huang Hao Li http://arxiv.org/abs/2605.14317v1 Guided Diffusion Sampling for Precipitation Forecast Interventions 2026-05-14T03:27:54Z

Extreme precipitation causes severe societal and economic damage, and weather control has long been discussed as a potential mitigation strategy. However, to the best of our knowledge, perturbation-based interventions for weather control using data-driven weather forecasting models have not yet been explored. While adversarial attacks also generate perturbations that alter forecasts, they aim to exploit model artifacts and do not account for physical plausibility. In this paper, we propose a gradient-based guidance framework for precipitation-reduction interventions through diffusion sampling in diffusion-based weather forecasting models. Instead of directly perturbing atmospheric states, our method steers the diffusion sampling trajectory, enabling precipitation reduction while maintaining consistency with the atmospheric distribution. To assess physical plausibility, we evaluate from three perspectives: (i) vertical and variable-wise perturbation profiles, (ii) latent-space trajectory deviation, and (iii) cross-model transferability. Experiments on extreme precipitation events from WeatherBench2 demonstrate that our method achieves effective precipitation reduction while yielding more physically plausible interventions than adversarial perturbations.

2026-05-14T03:27:54Z 12+7 pages, 7+2 figures Ayumu Ueyama Kazuhiko Kawamoto Hiroshi Kera http://arxiv.org/abs/2605.11968v2 Assessment of cloud and associated radiation fields from a GAN stochastic cloud subcolumn generator 2026-05-13T17:46:08Z

Modern Earth System Models (ESMs) operate on horizontal scales far larger than typical cloud features, requiring stochastic subcolumn generators to represent subgrid horizontal and vertical cloud variability. Traditional physically-based generators often rely on analytical cloud overlap paradigms, such as exponential-random decorrelation, which can struggle to capture the complex, anti-correlated behavior of non-contiguous cloud layers. In this study, we introduce a novel two-stage machine learning subcolumn generator for the GEOS atmospheric model, utilizing a Conditional Variational Autoencoder combined with a Generative Adversarial Network (CVAE-GAN) and a U-Net architecture. Trained on a merged CloudSat-CALIPSO height-resolved cloud optical depth dataset, the ML generator creates 56 stochastic subcolumns representing cloud occurrence and optical depth profiles. Evaluated against the established Räisänen generator, the ML approach accurately reproduces bimodal cloud overlap distributions, significantly reduces biases in grid-mean statistics, and halves the root-mean-square error in ISCCP-style cloud-top pressure and optical thickness joint histograms. The improvements brought by our deep generative models translate into more accurate offline radiative transfer calculations, reducing the global-mean shortwave top-of-atmosphere cloud radiative effect bias by a factor of three. Provided that the generator can be accelerated on CPUs, this offers a practical pathway to reduce structural errors at the cloud-radiation interface.

2026-05-12T11:19:57Z Dongmin Lee Lazaros Oreopoulos Nayeong Cho Daeho Jin http://arxiv.org/abs/2605.13654v1 Free-surface deformations induced by three-dimensional turbulence 2026-05-13T15:13:09Z

We report the experimental characterization of free-surface deformations generated by three-dimensional homogeneous and isotropic turbulence. Using Fourier transform profilometry in a jet-forced turbulent tank, we perform spatiotemporal measurements of the surface elevation field over a wide range of turbulence intensities. The standard deviation of surface deformations scales linearly with subsurface velocity fluctuations. The spectra of surface deformations highlight the coexistence of two mechanisms: transient coherent structures (e.g., upwelling) contributing to the low-frequency, large-scale spectral components, and a passive response to subsurface turbulent pressure fluctuations responsible for the power-law spectral scaling. The wavenumber and frequency spectra of surface deformations exhibit similar power-law exponents (-2.5), suggesting the advection of turbulent structures at the free surface. We develop a linear response model based on the transfer function from turbulent pressure fluctuations to the free surface, incorporating wave-turbulent damping. The model successfully predicts the main features of the turbulent surface: spatiotemporal spectrum shape, similar spectrum power-law exponents (-7/3), and dominance of passive response over wave generation. These findings provide new insights into free-surface turbulence in regimes where turbulent velocities remain below the surface-breaking threshold.

2026-05-13T15:13:09Z Michaël Berhanu Eric Falcon http://arxiv.org/abs/2605.13417v1 State-resolved multimodal contributions to stratospheric polar vortex predictability 2026-05-13T12:13:13Z

The dynamical basis of stratospheric polar vortex (SPV) predictability remains unclear, particularly the relative roles of persistence, structural variability, and cross-level coupling. Here we provide a state-resolved and quantitative framework using eigen microstate theory applied to ERA5 geopotential height fields, enabling attribution of predictability to dynamically coherent circulation states via a mesoscopic Granger-causality approach. We show that short-term predictability is dominated by persistence of the leading stratospheric state, whereas extended predictability arises from higher-order stratospheric structures and tropospheric variability. These contributions exhibit strong lead-time dependence and become more distributed during sudden stratospheric warming events. Our results unify SPV predictability within a multimodal, state-resolved framework and provide a physically interpretable pathway for improving subseasonal-to-seasonal forecasts.

2026-05-13T12:13:13Z 16 pages, 4 figures Shuo Yang Dan Zhao Tingting Xue Chunhua Zeng Yongwen Zhang Xiaosong Chen http://arxiv.org/abs/2604.16429v2 (Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models 2026-05-13T08:17:13Z

We introduce Mosaic, a probabilistic weather forecasting model that addresses two distinct failure modes of spectral degradation in ML-based weather prediction: (1) spectral damping caused by deterministic training against ensemble means; and (2) aliasing artifacts caused by compressive encoding onto a coarse latent grid. Mosaic generates ensemble members through learned functional perturbations and operates on native-resolution grids via mesh-aligned block-sparse attention, a hardware-aligned mechanism that captures long-range dependencies at linear cost by sharing keys and values across spatially adjacent queries. At 1.5° resolution with 214M parameters, Mosaic matches or outperforms models trained on 6$\times$ finer resolution on key variables and achieves state-of-the-art results among 1.5° models, producing well-calibrated ensembles whose individual members exhibit near-perfect spectral alignment across all resolved frequencies. A 24-member, 10-day forecast takes under 12 s on a single H100 GPU. Code is available at https://github.com/maxxxzdn/mosaic.

2026-04-06T08:50:42Z Accepted to ICML 2026 Maksim Zhdanov Ana Lucic Max Welling Jan-Willem van de Meent http://arxiv.org/abs/2603.28785v2 A Threshold Model for Micrometeoroid Atmospheric Entry: Filippov Dynamics, Survival Estimates, and Survivor-Only Inverse Limits 2026-05-13T06:30:00Z

Micrometeoroids enter Earth's atmosphere at hypervelocity speeds and experience rapid coupling between drag, heating, radiation, melting, ablation, and deceleration. This paper develops a reduced threshold model for the thermal survival boundary of spherical micrometeoroids. The model uses free molecular drag, an exponential atmosphere, projected-area heating, full-sphere radiative cooling, and a surplus-heat ablation rule at the melting temperature. The switching surface $T=T_m$ is treated as a Filippov/complementarity surface. Sustained melting occurs when the local heating-to-radiation ratio exceeds unity. Under the additional Allen--Eggers assumptions of constant radius, constant entry angle, negligible gravity during the main heating interval, and constant transport coefficients, this threshold yields the classical approximate survival scaling $r_0^{\rm crit}\sim v_0^{-3}$. An exact radius-loss identity is obtained along the prescribed Allen--Eggers trajectory, and a perturbative stability estimate explains when this expression approximates the full reduced model. The inverse problem is formulated through a transfer matrix from pre-atmospheric entry bins to observed survivor bins. Entry bins with zero survival probability lie in the survivor-only null space and require external information for reconstruction. The framework gives a compact analytical description of threshold entry survival and identifies the information lost when only surviving particles are observed.
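
The survivor-only null space mentioned above can be illustrated with a small toy transfer matrix. In this sketch, `matrix[i][j]` is taken to be the probability that a particle from entry bin `j` is observed in survivor bin `i`; the function names and all numbers are hypothetical and chosen only to show why entry bins with zero survival cannot be reconstructed from survivors alone.

```python
def apply_transfer(matrix, entry_counts):
    """Expected survivor counts: observed = matrix @ entry_counts."""
    return [sum(m * n for m, n in zip(row, entry_counts)) for row in matrix]

def survivor_null_bins(matrix):
    """Entry bins whose column is identically zero (no particle survives)."""
    n_entry = len(matrix[0])
    return [j for j in range(n_entry) if all(row[j] == 0.0 for row in matrix)]

# Hypothetical 2 survivor bins x 3 entry bins; entry bin 2 never survives.
transfer = [
    [0.8, 0.1, 0.0],
    [0.1, 0.3, 0.0],
]
population_a = [100.0, 50.0, 10.0]
population_b = [100.0, 50.0, 9999.0]  # differs only in the non-surviving bin

observed_a = apply_transfer(transfer, population_a)
observed_b = apply_transfer(transfer, population_b)
hidden_bins = survivor_null_bins(transfer)
```

Both populations produce identical survivor counts, so any estimate of the third entry bin has to come from information outside the survivor record.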

2026-03-19T22:21:42Z 9 pages, 7 figures, 2 tables Md Shahrier Islam Arham Prasun Panthi Min Heo http://arxiv.org/abs/2603.10305v3 Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning 2026-05-12T18:57:37Z

Machine learning models can represent climate processes that are nonlocal in horizontal space, height, and time, often by combining information across these dimensions in highly nonlinear ways. While this can improve predictive skill, it makes learned relationships difficult to interpret and prone to overfitting as the extent of nonlocal information grows. We address this challenge by introducing data-driven integration kernels, a framework that adds structure to nonlocal operator learning by explicitly separating nonlocal information aggregation from local nonlinear prediction. Each spatiotemporal predictor field is first integrated using learnable kernels (defined as continuous weighting functions over horizontal space, height, and/or time), after which a local nonlinear mapping is applied only to the resulting kernel-integrated features and optional local inputs. This design confines nonlinear interactions to a small set of integrated features and makes each kernel directly interpretable as a weighting pattern that reveals which horizontal locations, vertical levels, and past timesteps contribute most to the prediction. We demonstrate the framework for South Asian monsoon precipitation using a hierarchy of neural network models with increasing structure, including baseline, nonparametric kernel, and parametric kernel models. Across this hierarchy, kernel models achieve near-baseline performance with far fewer trainable parameters, indicating that much of the relevant nonlocal information can be captured through a small set of interpretable integrations when appropriate structural constraints are imposed.
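
A minimal sketch of the two-stage structure described above: a normalized weighting kernel first collapses a nonlocal predictor (here, values at several past time lags) into one integrated feature, and a local nonlinear map then acts only on that feature. The Gaussian kernel form, the `tanh` mapping, and every name and number below are illustrative assumptions, not the architecture or parameters used in the paper.

```python
import math

def gaussian_kernel(coords, center, width):
    """Parametric kernel: normalized Gaussian weights over coordinates."""
    raw = [math.exp(-0.5 * ((c - center) / width) ** 2) for c in coords]
    total = sum(raw)
    return [r / total for r in raw]

def integrate(field, weights):
    """Kernel-weighted integral of one predictor field."""
    return sum(w * f for w, f in zip(weights, field))

def predict(fields, kernels, coefs, bias):
    """Local nonlinear map applied to the kernel-integrated features only."""
    features = [integrate(f, k) for f, k in zip(fields, kernels)]
    return math.tanh(sum(c * z for c, z in zip(coefs, features)) + bias)

lags = [0.0, 1.0, 2.0, 3.0, 4.0]            # hypothetical time lags
weights = gaussian_kernel(lags, center=1.0, width=1.0)
humidity = [0.7, 0.9, 0.8, 0.4, 0.2]        # made-up predictor values
prediction = predict([humidity], [weights], coefs=[2.0], bias=-1.0)
```

Inspecting `weights` directly shows which lags dominate the prediction, which is the interpretability benefit the framework aims for. A parametric kernel like this one needs only two trainable numbers (`center`, `width`) per predictor.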

2026-03-11T01:05:16Z Presented at Climate Informatics 2026 (14 pages, 5 figures, 1 table) Savannah L. Ferretti Jerry Lin Sara Shamekh Jane W. Baldwin Michael S. Pritchard Tom Beucler http://arxiv.org/abs/2605.12311v1 Acidification of Water by CO2 2026-05-12T15:57:26Z

Fundamental inorganic chemistry shows that increasing concentrations of atmospheric CO2 will have no harmful effect on organisms that live in the natural waters of the Earth, and may well benefit them. Alkalinity and dissolved CO2 give high buffering capacity to most natural waters and minimize the change of pH from external influences. For example, doubling the atmospheric concentration of CO2 from 430 ppm to 860 ppm would reduce the pH of representative sea water at a temperature of 25 °C from pH = 8.18 to pH = 7.93. This change is comparable to diurnal pH changes in biologically productive surface waters, due to photosynthetic fixation of dissolved inorganic carbon during the day and respiration at night. The change is also less than the variations of pH with latitude, longitude and depth in the oceans. This paper includes a quantitative review of the carbonate chemistry of seawater and freshwater, the buffering capacity, the Revelle factor, the transport of calcium carbonate in ground water, the formation of flowstone, and the classic use of limewater to detect gaseous CO2. The paper concludes with a brief review of those parts of chemical thermodynamics that are involved in ocean acidification.
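
Because pH is the negative base-10 logarithm of hydrogen-ion activity, the quoted pH values can be converted into a ratio of hydrogen-ion activities. The short calculation below does only that conversion for the two numbers in the abstract; the variable names are ours.

```python
ph_before = 8.18  # pH at 430 ppm CO2, as quoted in the abstract
ph_after = 7.93   # pH at 860 ppm CO2, as quoted in the abstract

# pH = -log10(a_H), so the activity ratio is 10 ** (pH drop).
activity_ratio = 10 ** (ph_before - ph_after)
percent_increase = 100.0 * (activity_ratio - 1.0)

print(f"Hydrogen-ion activity ratio: {activity_ratio:.2f}")
print(f"Increase: {percent_increase:.0f}%")
```

A pH decrease of 0.25 therefore corresponds to hydrogen-ion activity rising by a factor of about 1.78.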

2026-05-12T15:57:26Z W. A. van Wijngaarden P. Ridd M. Cornell W. Happer http://arxiv.org/abs/2605.13895v1 Drag-Controlled Regime Transitions in the Eddy Saturation Mechanism of the Antarctic Circumpolar Current 2026-05-12T12:09:16Z

Eddy saturation -- the weak sensitivity of Antarctic Circumpolar Current (ACC) transport to wind stress -- is a fundamental feature of Southern Ocean dynamics, yet the processes that maintain this state remain debated. Previous studies have proposed different mechanisms, including adjustments of eddy diffusivity and standing meanders, but the conditions under which each mechanism dominates are unclear. Here we use an idealized reentrant channel model to examine how drag strength controls the eddy saturation. When the wind strength relative to friction is below a certain threshold, eddy saturation is governed by a combination of standing meander and eddy diffusivity adjustments; once the threshold is exceeded, it is governed solely by standing meander adjustment. These results suggest that changes in drag strength may account for the divergent eddy saturation mechanisms reported across studies.

2026-05-12T12:09:16Z Takuro Matsuta Yuki Tanaka Atsushi Kubokawa http://arxiv.org/abs/2605.11639v1 Machine Learning-Based Covariance Correction for Ensemble Kalman Filter with Limited Ensemble Size 2026-05-12T07:00:50Z

Data assimilation (DA) integrates numerical model forecasts with observations to obtain an optimal state estimate. Ensemble-based methods, such as the ensemble Kalman filter (EnKF), are widely used for state estimation in high-dimensional, nonlinear dynamic systems. However, their performance depends strongly on the ensemble size, which creates a tradeoff between analysis accuracy and computational cost. To address this problem, this study presents a machine learning-based EnKF framework that maintains high accuracy with a relatively small ensemble size. Specifically, a multilayer perceptron (MLP) is built to predict the difference between the forecast error covariances estimated from a limited ensemble and from a sufficiently large ensemble, with the latter assumed to be an accurate approximation of the underlying truth. This predicted covariance difference is then incorporated into the EnKF algorithm via an element-wise scaling strategy, resulting in an amended forecast covariance matrix that better approximates the true uncertainty level and in turn produces more accurate analysis results. To demonstrate the feasibility and robustness of the proposed algorithm, we perform a set of numerical experiments with the Lorenz-63 and Lorenz-96 systems under various configurations. The results consistently indicate that the proposed algorithm significantly outperforms the standard EnKF with the same limited ensemble size, achieving notably higher analysis accuracy while remaining computationally efficient. This approach provides a practical pathway to accurate and computationally efficient data assimilation for high-dimensional, nonlinear dynamic systems.
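
The sketch below shows where an element-wise covariance correction would enter a Kalman-type update. It computes a sample covariance from a small ensemble, multiplies it element-wise by a factor matrix, and updates the ensemble mean with a single scalar observation. The `factors` matrix is a hand-set, hypothetical stand-in for whatever a trained MLP would supply; how the paper converts its predicted covariance difference into scaling factors is not specified in the abstract, and the mean-only update is a simplification of a full EnKF.

```python
def sample_covariance(ensemble):
    """Ensemble mean and unbiased sample covariance (members are state lists)."""
    m, n = len(ensemble), len(ensemble[0])
    mean = [sum(member[i] for member in ensemble) / m for i in range(n)]
    cov = [[sum((mem[i] - mean[i]) * (mem[j] - mean[j]) for mem in ensemble) / (m - 1)
            for j in range(n)] for i in range(n)]
    return mean, cov

def scale_elementwise(cov, factors):
    """Element-wise (Hadamard) product of the covariance with correction factors."""
    return [[f * c for f, c in zip(f_row, c_row)] for f_row, c_row in zip(factors, cov)]

def analysis_mean(mean, cov, obs_index, obs_value, obs_var):
    """Kalman update of the mean for one directly observed state component."""
    k = obs_index
    gain = [cov[i][k] / (cov[k][k] + obs_var) for i in range(len(mean))]
    innovation = obs_value - mean[k]
    return [x + g * innovation for x, g in zip(mean, gain)]

# Toy 3-member ensemble of a 2-variable state (illustrative numbers only).
ensemble = [[0.0, 0.0], [2.0, 2.0], [4.0, 1.0]]
mean, cov = sample_covariance(ensemble)

no_correction = [[1.0, 1.0], [1.0, 1.0]]
damp_cross_terms = [[1.0, 0.0], [0.0, 1.0]]  # hypothetical factors

standard = analysis_mean(mean, scale_elementwise(cov, no_correction), 0, 4.0, 4.0)
corrected = analysis_mean(mean, scale_elementwise(cov, damp_cross_terms), 0, 4.0, 4.0)
```

With all factors equal to one the update reduces to the standard result. Setting the off-diagonal factors to zero removes the sampled cross-covariance, so the unobserved variable is left unchanged by the observation.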

2026-05-12T07:00:50Z Zhou Yao Zhilin Li Li Zhao Zeng Liu Zhaokuan Lu Seungnam Kim Guangyao Wang