https://arxiv.org/api/gHT+nRScZgBHz+VD7dCYNeq/Qug2026-06-13T12:41:39Z836512015http://arxiv.org/abs/2605.18477v1Global kilometre-scale tropical cyclone inner-core vector winds from sparse scalar CYGNSS observations2026-05-18T14:31:40ZTropical cyclone (TC) inner-core surface wind vectors underpin intensity forecasting and storm-surge prediction, yet direct observations remain scarce: routine aircraft reconnaissance is confined to the North Atlantic and Eastern Pacific and, even there, samples each storm only episodically. CYGNSS is the only satellite that penetrates heavy precipitation to measure inner-core surface winds, but delivers directionless scalar wind speeds and is assimilated by no operational analysis system. Here we show that the full 10 m vector wind field inside the TC inner core can be reconstructed globally at 1.5 km resolution from sparse CYGNSS scalar observations alone, by generalising score-based diffusion assimilation to a nonlinear observation operator and injecting three TC boundary-layer constraints; we further propose a CYGNSS-intrinsic Observation Coverage Sufficiency (OCS) criterion that flags reliable reconstructions without external references. Applied to 4,955 snapshots of 249 TCs across all six active basins (2020-2022), the reconstructions reduce systematic Vmax bias against IBTrACS best-track by ~79% and ~75% relative to ERA5 and CCMP. Independent Tail Doppler Radar validation (47 storms) yields a wind speed RMSE of 6.9 m/s on the 23 coverage-sufficient cases (7.5 m/s overall); ablation across the full sample shows that the physical constraints cut wind-direction RMSE by 60% without degrading speed accuracy. The framework further supports joint assimilation of heterogeneous observations: adding only 11 dropsonde vectors to CYGNSS for TC FIONA (2022) reduces the cross-eye profile RMSE by 42%, outlining a practical pathway for fusing CYGNSS with SFMR, SAR and scatterometer data. The result is a globally consistent, observation-anchored kilometre-scale description of TC inner-core vector winds across all six active basins, including those without routine aircraft reconnaissance.2026-05-18T14:31:40Z33 pages, 10 figures, 5 tables. Supplementary information available as an ancillary fileXinhai HanXiaohui LiJingsong YangZeyi NiuGuoqi HanJiuke WangWei HuangYunxia ZhengHanyue NiYiqi WangWei TaoLotfi AoufShaoliang PengDake Chenhttp://arxiv.org/abs/2604.06398v2Calibration of a neural network ocean closure for improved mean state and variability2026-05-17T22:34:10ZGlobal ocean models exhibit biases in the mean state and variability, particularly at coarse resolution, where mesoscale eddies are unresolved. To address these biases, parameterization coefficients are typically tuned ad hoc. Here, we formulate parameter tuning as a calibration problem using Ensemble Kalman Inversion (EKI). We optimize parameters of a neural network parameterization of mesoscale eddies in two idealized ocean models at coarse resolution. The calibrated parameterization reduces errors by factors of 1.7-3.3 in the time-averaged fluid interfaces and their variability compared to the unparameterized model, depending on the metric and configuration. The EKI method is robust to noise in time-averaged statistics arising from chaotic ocean dynamics. Furthermore, we propose an efficient calibration protocol that bypasses integration to statistical equilibrium by carefully choosing an initial condition. These results demonstrate that systematic calibration can substantially improve coarse-resolution ocean simulations and provide a practical pathway for reducing biases in global ocean models.2026-04-07T19:32:25ZPavel PerezhoginAlistair AdcroftLaure Zannahttp://arxiv.org/abs/2604.16429v3(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models2026-05-17T21:54:11ZWe introduce Mosaic, a probabilistic weather forecasting model that addresses three failure modes of spectral degradation in ML-based weather prediction: spectral damping (statistical), high-frequency aliasing (architectural), and residual high-frequency leakage (parametric). Mosaic generates ensemble members through learned functional perturbations and operates on native-resolution grids via mesh-aligned block-sparse attention, a hardware-aligned mechanism that captures long-range dependencies at linear cost by sharing keys and values across spatially adjacent queries. At 1.5° resolution with 214M parameters, Mosaic matches or outperforms models trained on 6$\times$ finer resolution on key variables and achieves state-of-the-art results among 1.5° models, producing well-calibrated ensembles whose individual members exhibit near-perfect spectral alignment across all resolved frequencies. A 24-member, 10-day forecast takes under 12s on a single H100~GPU. Code is available at https://github.com/maxxxzdn/mosaic.2026-04-06T08:50:42ZAccepted to ICML 2026Maksim ZhdanovAna LucicMax WellingJan-Willem van de Meenthttp://arxiv.org/abs/2605.17603v1Longwang: Zero-Shot Global Spatiotemporal Precipitation Downscaling with a Latent Generative Prior2026-05-17T19:01:47ZHigh-resolution precipitation information is essential for climate impact assessment, yet global climate models remain too coarse to resolve key small-scale processes. Existing machine learning downscaling methods often require paired low- and high-resolution data for supervised learning, are tied to fixed regions or scale factors during inference, and can be computationally expensive to train and run in physical space. Here we introduce Longwang, a zero-shot latent generative framework for global spatiotemporal precipitation downscaling. Longwang learns a context-conditioned latent generative prior and combines it with a physically informed observation operator through posterior sampling, enabling daily O(10 km) precipitation fields to be generated from monthly O(100 km) inputs. On ERA5 reanalysis, Longwang outperforms standard posterior sampling with an unconditional generative prior in reconstructing fine-scale spatial patterns, preserving temporal coherence, and recovering extreme precipitation intensities. The framework further generalizes to historical climate simulations and future climate projections under substantial distribution shift.2026-05-17T19:01:47ZYue WangDaniele Visionihttp://arxiv.org/abs/2605.14947v2From Particles to Policy: Technical Building Blocks for Multi-State SAI Coordination2026-05-17T16:52:14ZStratospheric aerosol injection (SAI) is a solar radiation modification technique, proposed as an interim measure to offset warming while greenhouse gas (GHG) emissions are reduced. This paper discusses a possible SAI implementation route - an alternative to sulfate aerosols formed in situ - based on engineered solid particles having dedicated properties such as size, composition, surface chemistry, and traceable origin, supporting safety, controllability, and functionality needed for SAI systems. These engineered properties also open up options for any future multi-state coordination of SAI through two technical building blocks: (1) the SAI-induced radiative forcing (SRF) - the magnitude of the cooling effect attributable specifically to the SAI layer - as an operator-independent quantity, derivable from direct aerosol-layer measurements; and (2) particle traceability through identifying signatures embedded at production. Both could feed into a shared, publicly accessible monitoring database open to independent interrogation, addressing several governance challenges by anchoring compliance assessments in measurable parameters. Drawing on precedents from the Montreal Protocol, IAEA safeguards, and other regimes, we show that shared technical metrics have historically enabled multi-state cooperation, and we argue the same could apply to SAI. We describe a phased pathway in which the technical capabilities and coordination practices that would use them are developed and tested together, at scales orders of magnitude below operational deployment. To be clear - we regard SAI deployment as premature; the conditions under which it might be considered have not been met. The paper does not propose a governance framework; rather, it identifies technical infrastructure that could support a wide range of such frameworks.2026-05-14T15:19:27ZR. YahavA. SpectorD. KushnirM. C. Waxmanhttp://arxiv.org/abs/2605.17493v1Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE2026-05-17T15:04:15ZDeep learning weather prediction models achieve remarkable predictive skill yet remain largely opaque: we know little about how they represent physical climate phenomena internally. Mechanistic interpretability through Sparse Autoencoders (SAEs) offers a principled route to decomposing these representations, but existing SAEs assume strictly linear feature superposition - a constraint ill-suited for the highly nonlinear atmospheric dynamics encoded in modern transformers. We introduce KAN-SAE, a sparse autoencoder whose encoder replaces the standard ReLU with learnable per-feature B-spline activations drawn from Kolmogorov-Arnold Networks (KANs), allowing each latent dimension to develop its own nonlinear gating profile. Applied to Sonny, KAN-SAE discovers 975 alive features (vs. 566 for a linear baseline, a 72% improvement) with 20% lower inter-feature redundancy and comparable reconstruction fidelity. Without any climate supervision, KAN-SAE identifies an interpretable European heatwave feature spatially concentrated over western Europe, and a western Pacific typhoon tracker confirmed by causal steering experiments. Our results demonstrate that nonlinear activations are essential for mechanistic interpretability of deep learning weather prediction models, recovering climate features that remain invisible to linear baselines.2026-05-17T15:04:15ZMinjong Cheonhttp://arxiv.org/abs/2510.04006v2Learning more physically realistic dynamics in machine-learning based weather forecasting with latent-space constraints2026-05-16T20:42:44ZData-driven machine learning (ML) models are reshaping weather forecasting and have shown the potential to accelerate and surpass traditional physics-based approaches, leading to a second revolution in the field after data assimilation. However, most ML forecast models are trained with weighted variable-wise losses on rollout forecasts that neglect cross-variable and spatial error covariance induced by physical coupling, often yielding overly smooth and physically unrealistic long-range forecasts. To address this, we reformulate model training as a four-dimensional variational data assimilation (4DVar) problem that treats reanalysis data as imperfect observations. This enables the loss function to incorporate cross-variable error covariance structures that capture multivariate dependencies and their associated errors. In practice, we approximate this objective by computing the loss in an autoencoder-learned latent space of global atmospheric states. By encoding complex nonlinear couplings among atmospheric variables, this representation allows the high-dimensional, complex error covariance matrix in model space to be approximated as nearly diagonal in latent space, substantially simplifying implementation. We show that rollout training with latent-space constraints improves long-term forecast skill, while better preserving fine-scale structures and physical realism than the widely used model-space loss. Finally, we extend this framework to accommodate heterogeneous data sources, enabling the forecast model to be trained jointly on reanalysis and multi-source observations within a unified theoretical formulation.2025-10-05T02:55:04ZHang FanYi XiaoYongquan QuJuan NathanielFenghua LingBen FeiLei BaiPierre Gentinehttp://arxiv.org/abs/2605.28851v1Towards a Foundation Model for the Martian Atmosphere2026-05-16T20:37:05ZThe martian atmosphere hosts dynamical phenomena ranging from planet-encircling dust storms to mesoscale orographic clouds and nocturnal low-level jets. General circulation model show capability to simulate these phenomena, but is computationally expensive at resolution needed to resolve mesoscale features. While assimilation of satellite remote sensing observation enable forecasting capabilities using such models, observation record is often sparse, short and fragmented across instrument generators. These constraints motivate the development of a data-driven foundation model for the Martian atmosphere.
Foundation models live in a complex design landscape. There is an interplay between the available data, the physics of the underlying processes and corresponding developments in AI. Even though the idea of a foundation model is to address multiple use cases in a data- and compute-efficient manner, it is important to have a clear picture what applications can sensibly addressed by a single model.
The purpose of this paper is to elucidate this design landscape. We discuss available data ranging from atmospheric retrievals to reanalysis datasets as well as existing physical models. Moreover, we identify a wide range of candidate downstream applications. Finally, we consider relevant recent developments in artificial intelligence (AI) that can be leveraged in this context. Here, we put a particular emphasis on AI models for atmospheric physics, data-driven approaches to data assimilation as well as methods to work in a limited data setting.2026-05-16T20:37:05ZSujit RoyUdayshankar NairYuling WuGeorgios PriftisLiping WangAnastasia GeorgiouAnne JonesBjörn LütjensJohannes SchmudeCampbell WatsonRachel A. SlankAnkur KumarAnirbit MukherjeeProcheta SenRamin LolachiHaonan ChenManil MaskeyJuan Bernabé-MorenoRahul Ramachandranhttp://arxiv.org/abs/2506.00473v3Cold pools, Breezes, and Monsoons: Propagating Convection over New Guinea2026-05-16T12:16:14ZThe diurnal cycle of precipitation near New Guinea involves intricate land-ocean-atmosphere interactions, posing substantial challenges for tropical weather and climate simulations. Using over two decades of GPM satellite observations and convection-permitting WRF simulations, this study examines the physical mechanisms governing the pronounced offshore propagation of diurnal convection over New Guinea. We identify two distinct convective propagation modes: (1) a "ridge-to-coast" mode originated over elevated terrain and migrating toward the coastline, and (2) an "over-ocean" mode initiated near the coast, separated by a spatial gap of approximately 100 km. Our findings highlight the critical role of multi-scale thermally driven flow in shaping boundary-layer dynamics over warm ocean waters. Specifically, the afternoon sea-breeze front advects cooler air onshore, stabilizing the lower atmosphere and interrupting the continuous propagation of the first mode. At night, the hybrid land breeze, strengthened by cold pools, generates offshore moist patches that facilitate the convective regeneration and propagation of the second mode. These offshore convective systems interact with monsoonal background winds, sustaining precipitation well beyond 200~600 km from the coast. Sensitivity experiments indicate that even a modest increase in sea surface temperature can enhance convective intensity and extend offshore propagation. These results shed light on the mechanisms that enable diurnal offshore convection to persist overnight and propagate far from the coastline, highlighting the importance of moist-boundary-layer density currents and offering insights for improving precipitation forecasts and global model performance over the Maritime Continent.2025-05-31T08:44:27ZSubmitted to JGR: Atmospheres; under reviewMingyue TangJimy DudhiaChanghai LiuGiuseppe Torrihttp://arxiv.org/abs/2605.28847v1New class of quantum transitions exhibiting large-scale intercorrelations: Color of the sky2026-05-16T05:45:30ZThe absolute value of the transition probability of the Rayleigh scattering is computed for the first time and applied to the scattering of solar light with molecules in the atmosphere and to the laser scattering with nanopartilces. The probability has a new contribution of unique properties from long-range correlations specific to the quantum mechanics. The magnitude is sufficient to resolve longstanding puzzle on diffusion lights in the sky and anomalous photon spectrum in laser experiments. The earth's albedo from the new calculations on Rayleigh scattering agrees with observations with satelites.2026-05-16T05:45:30Z22 pages,2 figuresKenzo IshikawaMasaki Takesadahttp://arxiv.org/abs/2605.16574v1Data-driven analysis of metastability in a stochastic bistable system2026-05-15T19:25:01ZWe study the metastability properties of a simple prototypical bistable system using the formalism of the Koopman operator. Instead of studying noise-induced transitions by following the trajectories of the system, we track them by studying the time evolution and the decay rate of the subdominant mode of the Koopman operator, thus in a geometry-agnostic framework. We find agreement with the predictions - both the exponential and subexponential ones - of large deviation theory in the weak-noise limit for the statistics of escape time, both in equilibrium and nonequilibrium conditions. The subdominant Koopman mode also allows for an accurate reconstruction of the competing basins of attraction. Going deeper in the Koopman spectrum, we are able to recognise modes that are associated with intrawell variability as well as with the escape of trajectories from the saddle towards the attractor, both in the equilibrium and nonequilibrium case. Our methodology, being grounded in purely data-driven techniques, could be helpful for studying high-dimensional metastable systems.2026-05-15T19:25:01Z15 pages, 8 figures, 2 tablesAnkan BanerjeeManuel Santos GutierrezJohn MoroneyValerio Lucarinihttp://arxiv.org/abs/2605.16178v1Probabilistic Seasonal Streamflow Forecasting Across California's Sierra Nevada Watersheds with Agentic AI2026-05-15T16:59:29ZAccurate seasonal runoff forecasts are critical for managing California's reservoirs and water supply for millions of its residents. Winter snow accumulation provides a strong source of predictability of snowmelt-based runoff in the spring and summer months, but progressive hydroclimatic changes in the Sierra Nevada are altering its timing and volume. These changes reduce the skill of statistical forecasts trained on historical data, highlighting the need for improved forecasting systems that can capture the changing dynamics of snowmelt. Here we demonstrate that a collaborative workflow between an agentic AI assistant and an automated code-mutation system, both powered by large language models, can accelerate the development of competitive seasonal runoff forecasting systems. In our framework, the AI agent discovers relevant datasets, synthesizes domain knowledge from prior forecasting competitions and the scientific literature, and explores the space of model architectures, while the code-mutation system refines each of the solutions explored by the agent through Monte Carlo Tree Search over the code space. The resulting system forecasts monthly Full Natural Flow (FNF) at 1- to 6-month lead times across 23 Sierra Nevada watersheds using an adaptive ensemble of three XGBoost quantile regression sub-models with physics-informed feature engineering. Evaluated against California's operational Bulletin 120 forecasts over 2021-2025, the agent-evolved model achieves superior skill for early-season cumulative April-July runoff predictions, reducing watershed-averaged quantile forecast error by up to 29%, and offering a new paradigm for AI-driven scientific model development in the geosciences.2026-05-15T16:59:29ZIgnacio Lopez-GomezMichael P. BrennerTapio Schneiderhttp://arxiv.org/abs/2605.16082v1An efficient multi-GPU implementation for the Discontinuous Galerkin ocean model SLIM2026-05-15T15:44:48ZUnstructured-mesh ocean models are increasingly used for coastal applications due to their ability to represent complex geometries and apply local grid refinement where needed. However, their broader use has been hindered by their high computational cost, particularly for models based on the Discontinuous Galerkin finite element (DG-FE) method, which involves significantly more degrees of freedom than traditional finite volume or continuous finite element approaches. The rapid emergence of GPU-based high-performance computing architectures now offers a pathway to address this limitation, as DG-FE formulations are inherently well suited to massively parallel, element-wise computations. Here, we present a full 3D DG-FE ocean model implementation optimized for both single- and multi-GPU systems, with support for both NVIDIA and AMD architectures. We detail the computational strategies employed to achieve high performance, including memory layout optimization, kernel-level parallelization, and matrix-free solvers for key vertical processes. Benchmark results demonstrate that a single HPC-grade GPU (e.g. NVIDIA A100) delivers performance equivalent to approximately 1500 CPU cores, while replacing a 128-core CPU node with a 4xA100 GPU node yields a speedup of around 50x. Weak-scaling efficiency is maintained up to 1024 GPUs. We further demonstrate the model's capabilities on a real-world application in the Great Barrier Reef, achieving a spatial resolution five times finer than the most accurate existing model while maintaining a physical-to-numerical time ratio of 100. These results highlight how GPU-accelerated DG-FE methods can dramatically advance the capabilities of unstructured-mesh ocean modeling, enabling ultra-high-resolution coastal simulations that were previously infeasible.2026-05-15T15:44:48ZMiguel De Le CourtVincent LegatAnge P. IshimweColin ScherpereelEmmanuel HanertJonathan Lambrechtshttp://arxiv.org/abs/2605.15958v1Bridging the climate to energy data gap: simulated annealing for representative climate year selection2026-05-15T13:52:08ZEnergy system models are increasingly dependent on representative climate input. Yet, a fundamental mismatch persists between the hundreds of simulated years often used in climate science and the handful of years that computationally demanding power system models can process. Current practice, including ENTSO-E's European Resource Adequacy Assessment, relies on climate year selections that have not been validated against explicit representativeness criteria. This risks biased investment decisions and blind spots for plausible weather conditions. This study proposes simulated annealing as an optimisation method for selecting representative subsets of complete climate years from large climate ensembles. Representativeness is quantified using the seasonal sliced Wasserstein distance, a metric from optimal transport theory that captures representativeness on marginal distributions, inter-variable correlations, and seasonal structure simultaneously. We evaluate simulated annealing against the alternative methods random search, filtered random search, and K-Medoids clustering across three test cases spanning the Netherlands and Europe, using 180 climate years from the Pan-European Climate Database as a reference. Simulated annealing consistently produces the most representative subsets and outperforms all compared methods. Simulated annealing achieves an effective sample size four to five times the actual subset size. The resulting subsets are roughly 2.5--3.5 times more representative than current ENTSO-E practice. The method is application-agnostic and its output can serve as a validated climate data input to any subsequent (energy) impact study.2026-05-15T13:52:08Z33 pages, 13 figures, submitted to Applied EnergyBram van DuinenKarin van der WielJean ThoreyLaurens Stoophttp://arxiv.org/abs/2605.15879v1Coarse-grained local available potential energy2026-05-15T11:57:18ZThe available potential energy (APE) of a fluid can be defined locally in space, providing useful insights into both the energetics and dynamics of stratified flows ranging from three-dimensional turbulence to planetary scale circulations. Here we develop a framework for considering the multi-scale evolution of the local APE using a spatial filtering, or coarse-graining, approach. Evolution equations for the APE at scales larger, and smaller, than the filtering scale are derived -- including the cross-scale APE flux term. These results can be paired with existing frameworks for coarse-grained kinetic energy, offering the potential for examining a complete energy cycle that accounts for conversions between both spatial scales and energy reservoirs. An illustrative example of the application of this approach to a simulation of two-dimensional Kelvin-Helmholtz instability is provided.2026-05-15T11:57:18ZJacob O. WenegratTomas ChorRoy Barkan