https://arxiv.org/api/fS8dND52+0X6MSQERuCGTFB2Yb42026-06-18T08:34:36Z838133015http://arxiv.org/abs/2509.17526v2Key role of the Madden-Julian Oscillation on tropical and subtropical humid heat and heatwaves2026-04-03T08:40:39ZHumid heat stress and heatwaves pose significant risks for living organisms, from humans and wildlife to insects. These threats have wide-ranging health, ecological, and socio-economic impacts that are expected to worsen with climate change. How large-scale climate modes drive the week-to-month variability of humid heat remains poorly understood at the global scale. This limitation hinders the development of accurate forecasts necessary for risk-management measures, notably in the heavily populated and ecologically fragile regions of the tropics and subtropics. With forecast lead times up to several weeks, the Madden-Julian Oscillation (MJO), a global-scale intraseasonal tropical atmospheric disturbance circumnavigating earth in around 30-60 days, provides considerable predictability for weather conditions, and meteorological and oceanic extremes. Here we show that the MJO, and the associated boreal summer intraseasonal oscillation (BSISO), have a significant influence on humid heat and heatwaves over much of the tropics and subtropics across all seasons, both over terrestrial and marine regions. Humid heatwave likelihood can double or halve, depending on the MJO phase, in large areas of the Earth. The MJO/BSISO's influence on wet-bulb temperature is primarily via specific humidity rather than dry-bulb temperature anomalies. In the subtropics and other regions where we typically do not find a strong signal of the convection, we find that intraseasonal anomalies of specific humidity and dry-bulb temperature are influenced by horizontal advection in the planetary boundary layer. Particularly in the subtropics where advection of the climatological moisture and temperature gradient by MJO-related anomalous winds is an important term.2025-09-22T08:49:08ZVersion 1 of manuscript submitted to Nature Communications the 25/07/2024Version 2 of manuscript submitted to Journal of Climate the 19/08/2025 Version 3 of manuscript submitted to Journal of Climate the 02/03/2026 (under 2nd review)Claire RocuetUPF, SECOPOLTakeshi IzumoIRD [Polynésie], SECOPOLBastien PagliUPF, IRD [Polynésie], SECOPOLNeil J HolbrookIMASSophie CravatteIRD [Nouvelle-Calédonie], LEGOSMarania HopuareUPF, GePaSUDMaxime ColinZMThttp://arxiv.org/abs/2604.02850v1High-resolution probabilistic estimation of three-dimensional regional ocean dynamics from sparse surface observations2026-04-03T08:10:55ZThe ocean interior regulates Earth's climate but remains sparsely observed due to limited in situ measurements, while satellite observations are restricted to the surface. We present a depth-aware generative framework for reconstructing high-resolution three-dimensional ocean states from extremely sparse surface data. Our approach employs a conditional denoising diffusion probabilistic model (DDPM) trained on sea surface height and temperature observations with up to 99.9 percent sparsity, without reliance on a background dynamical model. By incorporating continuous depth embeddings, the model learns a unified vertical representation of the ocean states and generalizes to previously unseen depths. Applied to the Gulf of Mexico, the framework accurately reconstructs subsurface temperature, salinity, and velocity fields across multiple depths. Evaluations using statistical metrics, spectral analysis, and heat transport diagnostics demonstrate recovery of both large-scale circulation and multiscale variability. These results establish generative diffusion models as a scalable approach for probabilistic ocean reconstruction in data-limited regimes, with implications for climate monitoring and forecasting.2026-04-03T08:10:55ZSupplementary information: https://drive.google.com/file/d/12FPQujokmSOUktTftfYjPFVNnSYHfszv/view?usp=sharingNiloofar AsefiTianning WuRuoying HeAshesh Chattopadhyayhttp://arxiv.org/abs/2604.02818v1MAG-Net: Physics-Aware Multi-Modal Fusion of Geostationary Satellite and Radar for Severe Convective Precipitation Nowcasting2026-04-03T07:34:08ZRadar-based convective precipitation nowcasting suffers from rapid performance degradation beyond 30 minutes due to missing thermodynamic variables. Existing deep learning models also face blurring effects, training instability, and limited interpretability. To address this, we propose MAG-Net, a Physics-Aware Multi-modal Attention-guided Generator Network. It integrates radar dynamics with selected geostationary satellite channels (IR 10.8, WV 7.1, BTD) to incorporate thermodynamic and microphysical precursors. MAG-Net features a Dual-Stream Encoder for heterogeneous modalities and a Symmetric Dual-Head Decoder optimizing reflectivity regression and event probability via an uncertainty-weighted multi-task strategy. Furthermore, an inference-time Gradient-Preserving Fusion (GPF) strategy combines probabilistic constraints with regression details for better high-frequency texture retention. Experiments on a large-scale dataset (2018-2023) over southeastern China show MAG-Net outperforms deterministic (e.g., CPrecNet) and generative (e.g., DGMR) baselines. Specifically, it improves CSI40 by 0.083 (0.172 to 0.255) over CPrecNet, enhancing intense convective echo detection. Finally, Integrated Gradients (IG) analysis reveals the model's reliance on satellite inputs increases with forecast lead time and convective intensity, confirming that satellite data captures critical precursors for severe weather prediction.2026-04-03T07:34:08ZDandan ChenYaqiang WangAnyuan XiongEnda Zhuhttp://arxiv.org/abs/2604.02519v1On the White-Noise Limit of the Colored Linear Inverse Model2026-04-02T21:21:26ZA recent paper by Lien et al. (2025) introduces the "colored linear inverse model" (colored LIM), in which stochastic forcing is modeled using Ornstein-Uhlenbeck colored noise rather than idealized white noise. In that work, it is shown that the derivative-based identification formulas used to estimate model parameters do not admit a regular white-noise limit due to the loss of differentiability of the lag-correlation function at zero lag. Here we revisit the white-noise limit from the perspective of the underlying stochastic differential equations. Treating the colored LIM as an augmented Ornstein-Uhlenbeck system, we show that as the correlation time tau -> 0 the colored-noise-driven system reduces to the classical LIM, and the corresponding stationary covariance satisfies the standard fluctuation-dissipation relation. Re-examining the same linear system used by Lien et al. (2025), we illustrate this convergence numerically. These results highlight a distinction between the singular behavior of derivative-based identification formulas and the regular limiting behavior of the underlying stochastic model. Taken together with recent results showing convergence of estimated parameters in the white-noise limit, they provide a consistent interpretation in which the colored LIM recovers the classical LIM at the level of stochastic dynamics even though certain estimation procedures become ill-defined in that limit.2026-04-02T21:21:26Z8 pages, 1 figureCristian Martinez-Villaloboshttp://arxiv.org/abs/2604.02283v1A proposal for the safety and controllability requirements that SRM systems should meet2026-04-02T17:21:30ZSolar Radiation Modification (SRM) may be the only way to limit global warming in the coming decades, leading to increased interest in the subject and to the expansion of related research & development (R&D) activity. Defining the safety and controllability requirements that any SRM system should meet is crucial for directing R&D activities and enabling governments to make informed decisions on the development and possible implementation of such systems. We present an initial proposal for this set of requirements, which also guides Stardust's R&D, as a basis for further discussion and consideration. While we focus on SRM systems based on Stratospheric Aerosol Injection (SAI), the proposed principles may be applicable more broadly.2026-04-02T17:21:30ZE. WaxmanA. SpectorY. LedererY. SegevT. KislevY. YedvabD. KushnirR. Yahavhttp://arxiv.org/abs/2604.02187v1Possible, Yes; Ignorant, Perhaps: A Scorecard for Possibilistic Forecasts2026-04-02T15:46:47ZProbabilistic forecasts must sum to unity and cannot express ``I don't know.'' Possibility theory relaxes this constraint: a subnormal distribution explicitly measures how much of the plausibility budget remains unassigned, ignorance signal that probability cannot represent. This paper develops a verification framework for such forecasts, centred on a five-number scorecard that separately diagnoses whether the forecast pointed at the right outcome (depth-of-truth), how sharply (diffuseness, support margin), how confidently (ignorance), and how dominantly (conditional necessity). A possibility-to-probability conversion preserves ignorance for familiar frequency-based scoring; categorical threshold scores (POD, FAR, CSI, etc.) connect to operational practice. Together, these three complementary facets -- possibilistic, probabilistic, and categorical -- expose failure modes invisible to any single metric. Storm Prediction Center convective outlook categories serve as the running example throughout; a synthetic reforecast demonstrates diagnostic visualisations and scorecard interpretation. Ignorance is better expressed than repressed.2026-04-02T15:46:47Z11 figures; 7 sections;19 pages on PDF as-isJohn R. Lawsonhttp://arxiv.org/abs/2602.19233v2On Using Medium-Range Ensemble Forecasts for Storm Transposition of Synoptic-Scale Systems in Probable Maximum Precipitation Estimation2026-04-02T00:21:40ZMost methods for estimating probable maximum precipitation (PMP) -- the greatest depth of precipitation that is physically possible over a given area and duration -- rely on storm transposition (ST), the process of transporting a storm, either historically observed or simulated, from its original location to a target basin. Existing ST approaches, whether classical or physically based, involve assumptions and manipulations that can introduce inconsistencies, leaving the physical validity of the transposed storm uncertain. In this study, the internal variability leveraging (IVL) approach is used to transpose an atmospheric river cluster that affected the U.S. West Coast during 20-29 October 2021. Steering the storm toward the target basin and determining its transposition region are achieved by considering an ensemble of plausible storm evolutions and trajectories obtained from archived ECMWF medium-range forecasts. The Willamette River and Nass River watersheds, located approximately 6 deg N, 2 deg W and 16 deg N, 8 deg W, respectively, from the area most affected by the observed precipitation, were selected as target basins. For each basin, the IVL realization yielding the largest 24-h basin-average precipitation depth was identified, and the initial and boundary condition shifting method was subsequently applied to further enhance its impact, producing 24-h precipitation depths of 119 mm for the Willamette and 98 mm for the Nass.2026-02-22T15:24:00ZMathieu Mure-Ravaudhttp://arxiv.org/abs/2604.01454v1Assessing the ability of a stretched-grid deep-learning weather prediction model to capture physical balances2026-04-01T23:01:25ZWeather forecasting has traditionally relied on Numerical Weather Prediction (NWP) models, which simulate weather by solving the governing fluid equations. Recently, the emergence of Deep Learning Weather Prediction (DLWP) models has opened a new era in weather forecasting, offering a data-driven alternative to classical NWP approaches. Regional DLWP models such as the stretched-grid model Bris developed by Met Norway, have demonstrated performance on par with, or even slightly better than regional NWP models across a range of standard forecast metrics. By overcoming the coarse horizontal resolution that constrained earlier global data-driven models, the operational use of regional DLWP systems now appears increasingly promising. Nevertheless, the performance of such models during extreme events is generally inferior to that of regional NWP models, and comprehensive evaluations of their ability to generate physically realistic forecasts are still lacking. Here, we present a study comparing the physical consistency of the deterministic version of Bris with the control run of the operational MetCoOp Ensemble Prediction System (MEPS) in forecasting the severe extratropical cyclone Poly, which hit the Netherlands on 5 July 2023. We examine whether Bris accurately represents deviations from key atmospheric balances and whether it reproduces expected dynamics of the storm. We show that, despite its relatively good performance in terms of RMSE, Bris struggles to capture important mesoscale features of the event and that it significantly disrupts several atmospheric balances. This unrealistic disruption is mainly linked to the fine-scale noise evidenced in its output fields, which leads to incorrect and unrealistic spatial gradients. These results raise critical questions for improving AI-based models to better represent extreme events and how to ensure physical consistency in their predictions.2026-04-01T23:01:25Z21 pages, 13 figuresFrancesco PasquiniMichiel BaatsenBastien FrançoisNatalie TheeuwesMaurice Schmeitshttp://arxiv.org/abs/2604.01215v1The Recipe Matters More Than the Kitchen:Mathematical Foundations of the AI Weather Prediction Pipeline2026-04-01T17:53:51ZAI weather prediction has advanced rapidly, yet no unified mathematical framework explains what determines forecast skill. Existing theory addresses specific architectural choices rather than the learning pipeline as a whole, while operational evidence from 2023-2026 demonstrates that training methodology, loss function design, and data diversity matter at least as much as architecture selection. This paper makes two interleaved contributions. Theoretically, we construct a framework rooted in approximation theory on the sphere, dynamical systems theory, information theory, and statistical learning theory that treats the complete learning pipeline (architecture, loss function, training strategy, data distribution) rather than architecture alone. We establish a Learning Pipeline Error Decomposition showing that estimation error (loss- and data-dependent) dominates approximation error (architecture-dependent) at current scales. We develop a Loss Function Spectral Theory formalizing MSE-induced spectral blurring in spherical harmonic coordinates, and derive Out-of-Distribution Extrapolation Bounds proving that data-driven models systematically underestimate record-breaking extremes with bias growing linearly in record exceedance. Empirically, we validate these predictions via inference across ten architecturally diverse AI weather models using NVIDIA Earth2Studio with ERA5 initial conditions, evaluating six metrics across 30 initialization dates spanning all seasons. Results confirm universal spectral energy loss at high wavenumbers for MSE-trained models, rising Error Consensus Ratios showing that the majority of forecast error is shared across architectures, and linear negative bias during extreme events. A Holistic Model Assessment Score provides unified multi-dimensional evaluation, and a prescriptive framework enables mathematical evaluation of proposed pipelines before training.2026-04-01T17:53:51ZPiyush GargDiana R. GergelAndrew E. ShaoGalen J. Yacalishttp://arxiv.org/abs/2604.00348v1Gray Swan Factory: Making Extreme Events from Ordinary Cyclones2026-04-01T00:53:04ZGray swans, plausible but unobserved extreme events, broaden our understanding of the range of hazards beyond those observed during the short observational record. They are useful for dynamical studies, synthetic training data, emergency planning, infrastructure design, and insurance hazard assessment. We propose a method to produce gray swans from the observational record using gradient descent on a loss function with a differentiable weather prediction model. Minimizing the loss corresponds to perturbed initial conditions that produce a measurable outcome at a future time, subject to constraints, such as the size of the initial perturbations. We illustrate the method by altering hurricane Fiona (2022), which tracked northward over the Atlantic Ocean, to produce a gray-swan outcome similar to hurricane Sandy (2012), which made landfall on the East Coast of the United States after a unique westward turn. The Fiona gray-swan solution, involving small perturbations to reanalysis initial conditions, produces an extratropical cyclone with a Sandy-like track, a warm core, and a minimum sea-level pressure more than 20 hPa lower than Sandy. Perturbations to the extratropical state are more important than to the hurricane, leading to interactive strengthening, and merger, of an upper-level trough and the hurricane. Similar gray swans are found for four other Atlantic hurricanes. A major weakness of this work is that the hurricane core is not resolved by the model used for optimization, and the impact of this is unknown. Furthermore, although these solutions present plausible outcomes, they do not inform on their probability of occurrence.2026-04-01T00:53:04Z14 pages, 10 figuresGregory J. HakimAishwarya Agrawalhttp://arxiv.org/abs/2602.01416v2Convolution Based Self Attraction and Loading2026-03-31T19:46:04ZSelf Attraction and Loading (SAL), which includes the deformation of the solid Earth under the load of the ocean tide and the self-gravitation of the so-deformed Earth as well as of the ocean tides themselves, is an important term to include in numerical models of the ocean tides. Computing SAL is a challenging problem that is usually tackled using spherical harmonics. The spherical harmonic approach has several drawbacks which limit its accuracy. In this work, we propose an alternative technique based on a spherical convolution. We implement the convolution technique in the Modular Ocean Model, version 6, and demonstrate that it allows for more accurate tides when measured against tidal datasets based upon satellite altimetry. The convolution based SAL reduces the error by reducing spurious oscillations associated with the Gibbs phenomenon. These oscillations are large in coastal regions under the traditional spherical harmonic approach.2026-02-01T19:43:03ZAnthony ChenHe WangBrian ArbicRobert Krasnyhttp://arxiv.org/abs/2604.00082v1Deep-Learned Observation Operators for Artificial Intelligence Weather Forecasting Models2026-03-31T17:29:01ZSatellite observation operators play an essential role in atmospheric data assimilation by translating model state variables into observation space. Previous work has shown that deep-learned emulators can effectively predict the outputs of classic observation operators, like the Community Radiative Transfer Model (CRTM), with reduced inference time. This study expands previous work to show the potential for integrating observation operators into artificial intelligence (AI) weather forecasting models. Specifically, this study shows that (1) deep-learned models can effectively predict the innovations (or differences between the simulated and observed radiances) used by data assimilation models and (2) deep-learned observation models suffer only minor degradations in performance when the model state is represented with fewer vertical levels, as is commonly used by AI forecasting models. Experiments were performed using the Unified Forecast System (UFS) replay dataset, including Gridpoint Statistical Interpolation (GSI) observational data for the Advanced Technology Microwave Sounder (ATMS) sensor from 2022 and 2023. Code is available at https://github.com/mitre/deep-obs.2026-03-31T17:29:01ZCode is available at https://github.com/mitre/deep-obsKelsey LiebermanLaura SlivinskiMatt BenderChris MillerJosh DaRosaNick KrallMohammad Ridhwaan AlamNick SilvermanSergey Frolovhttp://arxiv.org/abs/2604.03311v1PollutionNet: A Vision Transformer Framework for Climatological Assessment of NO$_2$ and SO$_2$ Using Satellite-Ground Data Fusion2026-03-31T15:39:26ZAccurate assessment of atmospheric nitrogen dioxide (NO$_2$) and sulfur dioxide (SO$_2$) is essential for understanding climate-air quality interactions, supporting environmental policy, and protecting public health. Traditional monitoring approaches face limitations: satellite observations provide broad spatial coverage but suffer from data gaps, while ground-based sensors offer high temporal resolution but limited spatial extent. To address these challenges, we propose PollutionNet, a Vision Transformer-based framework that integrates Sentinel-5P TROPOMI vertical column density (VCD) data with ground-level observations. By leveraging self-attention mechanisms, PollutionNet captures complex spatiotemporal dependencies that are often missed by conventional CNN and RNN models. Applied to Ireland (2020-2021), our case study demonstrates that PollutionNet achieves state-of-the-art performance (RMSE: 6.89 $μ$g/m$^3$ for NO$_2$, 4.49 $μ$g/m$^3$ for SO$_2$), reducing prediction errors by up to 14% compared to baseline models. Beyond accuracy gains, PollutionNet provides a scalable and data-efficient tool for applied climatology, enabling robust pollution assessments in regions with sparse monitoring networks. These results highlight the potential of advanced machine learning approaches to enhance climate-related air quality research, inform environmental management, and support sustainable policy decisions.2026-03-31T15:39:26ZThis manuscript is currently under review at Theoretical and Applied Climatology (Springer)Prasanjit DeySoumyabrata DevBianca Schoen-Phelanhttp://arxiv.org/abs/2603.29478v130-meter Land Surface Temperature from Landsat via Progressive Self-Training Downscaling2026-03-31T09:24:58ZLand surface temperature (LST) is a critical parameter for characterizing surface energy balance and hydrothermal processes. While Landsat provides invaluable LST observations at medium spatial resolution for over 40 years, its native spatial resolution of thermal bands (e.g., 100 m) remains insufficient compared to its 30 m optical bands, failing to meet the demands of fine-scale studies. To address this issues, this study proposes a progressive self-training framework for downscaling Landsat LST to 30 m without relying on fine-scale ground truth, while maintaining minimal data dependence. The framework progressively optimizes a cross-modal fusion network to refine thermal details in a coarse-to-fine manner, characterized by one pre-training and two fine-tuning stages. Spatial validation against SDGSAT-1 30 m LST and temporal validation using in situ measurements confirm its reliability and accuracy, with both station-averaged MAE and RMSE outperforming the official cubic product by approximately 0.4 K. Further performance comparison experiments demonstrate that the proposed framework consistently reconstructs coherent fine-scale thermal patterns while preserving spatial heterogeneity. Multi spatial resolution evaluations and ablation studies verify the effectiveness of the proposed strategy and network design. Overall, the framework provides a stable pathway for enhancing the spatial resolution of Landsat LST, providing fine-resolution data support for fine-scale surface process studies and localized environmental monitoring.2026-03-31T09:24:58ZHuanfeng ShenChan LiMenghui JiangPenghai WuGuanhao ZhangTian Xiehttp://arxiv.org/abs/2604.09661v1Multistability and intermingledness in complex high-dimensional data2026-03-30T18:02:03ZMultistability is a phenomenon prevalent in many natural systems. In climate, for example, it allows the possibility of irreversible consequences on planetary scale as a result of climate change. Indeed, a climate ``tipping element'' is a multistable component that can undergo a transition to an alternative steady state due to an external perturbation. Despite the potential impact, multistability in realistic, complex simulations (e.g. climate models) remains poorly understood. Arguably a reason for this the lack of applicable methodology that explicitly targets finite yet high-dimensional datasets. In this work we utilize recent progress in computational nonlinear dynamics to formulate a workflow that analyses potentially multistable simulation data and decides algorithmically what are the alternative steady states contained within, if any. The framework undergoes an optimization routine that showcases which observables in the data best differentiate the alternative states, and which ones do not differentiate at all, which could be used to guide monitoring and early-warning for multistable components in climate or ecosystems. Finally, once the alternate states have been found, we define an indicator called ``intermingledness''. It quantifies differences and similarities between alternate states, as well as for their basins of attraction, across various diagnostic variables. We analyse and present results using three diverse climate datasets: Atlantic ocean circulation, atmospheric midlatitude flow, and habitability of exoplanets. We also provide easy-to-use open source code for applying the workflow to new data.2026-03-30T18:02:03ZGeorge DatserisJohannes LohmannOisín HamiltonJacob Haqq-Misra