https://arxiv.org/api/rKDjfGquc/wYSWa0705WV+oWYvE 2026-06-18T23:03:34Z 8381 525 15 http://arxiv.org/abs/2602.01236v1 Radar-Based Raindrop Size Distribution Prediction: Comparing Analytical, Neural Network, and Decision Tree Approaches 2026-02-01T13:49:43Z Reliable estimation of the raindrop size distribution (RSD) is important for applications including quantitative precipitation estimation, soil erosion modelling, and wind turbine blade erosion. While in situ instruments such as disdrometers provide detailed RSD measurements, they are spatially limited, motivating the use of polarimetric radar for remote retrieval of rain microphysical properties. This study presents a comparative evaluation of analytical and machine-learning approaches for retrieving RSD parameters from polarimetric radar observables. One-minute OTT Parsivel2 disdrometer measurements collected between September 2020 and May 2022 at Sheepdrove Farm, UK, were quality-controlled using collocated weighing and tipping-bucket rain gauges. Measured RSDs were fitted to a normalised three-parameter gamma distribution, from which a range of polarimetric radar variables were analytically simulated. Analytical retrievals, neural networks, and decision tree models were then trained to estimate the gamma distribution parameters across multiple radar feature sets and model architectures. To assess robustness and equifinality, each model configuration was trained 100 times using random 70/30 train-test splits, yielding approximately 17,000 trained models in total. Machine-learning approaches generally outperform analytical methods; however, no single model class or architecture is uniformly optimal. Model performance depends strongly on both the target RSD parameter and the available radar observables, with decision trees showing particular robustness in reduced-feature regimes. These results highlight the importance of aligning retrieval model structure with operational data constraints rather than adopting a single universal approach. 2026-02-01T13:49:43Z 14 pages R. J. Humphreys http://arxiv.org/abs/2602.00622v1 "What is a realistic forecast?" Assessing data-driven weather forecasts, a journey from verification to falsification 2026-01-31T09:22:19Z The artificial intelligence revolution is fueling a paradigm shift in weather forecasting: forecasts are generated with machine learning models trained on large datasets rather than with physics-based numerical models that solve partial differential equations. This new approach proved successful in improving forecast performance as measured with standard verification metrics such as the root mean squared error. At the same time, the realism of data-driven weather forecasts is often questioned and considered as an Achilles' heel of machine learning models. How 'forecast realism' can be defined and how this forecast attribute can be assessed are the two questions simultaneously addressed here. Inspired by the seminal work of Murphy (1993) on the definition of 'forecast goodness', we identify 3 types of realism and discuss methodological paths for their assessment. In this framework, falsification arises as a complementary process to verification and diagnostics when assessing data-driven weather models. 2026-01-31T09:22:19Z Zied Ben Bouallègue http://arxiv.org/abs/2602.00421v1 Observational Evidence for Wind-Driven Low-Pass Filtering of Infrasound at Short Range 2026-01-31T00:18:38Z Infrasound from controlled explosions provide a unique opportunity to isolate atmospheric effects on propagation. We report observations from two campaigns in May and October 2024, each featuring 10-ton TNT-equivalent controlled surface chemical explosions recorded by a dense network of 31 single-sensor stations within 23 km. Despite identical sources, the observed wavefields were very different. October signals followed a near-unimodal period-distance trend, whereas May signals exhibited a pronounced azimuthal bifurcation in both period and celerity. Downwind paths largely preserved the short-period baseline observed in October, while upwind paths showed systematically longer periods caused by wind-driven low-pass filtering. This study provides the first direct observational evidence that tropospheric winds can impose azimuth-dependent low-pass filtering at local ranges, without the influence of measured temperature inversions. Thus, the structure of the atmosphere can modify the spectral characteristics of low-frequency acoustic waves even at a distance of only a few kilometers. 2026-01-31T00:18:38Z 25 pages, 4 figures, supplemental materials. Geophysical Research Letters (2026) Elizabeth A. Silber Daniel C. Bowman Sasha Egan Lawrence Burkett Michael Fleigle Keehoon Kim Tesla Newton Loring P. Schaible Richard Sonnenfeld Nora Wynn Jonathan Snively 10.1029/2025GL120042 http://arxiv.org/abs/2501.12402v3 Rain from Solar Scattering 2026-01-30T23:31:05Z Herein we propose a method to mimic natural processes for the creation of precipitation, in a safe, economically feasible manner anywhere in the world. We propose this is accomplishable via changing the target of the well established field of aerosol dispersal for large scale climate cooling from long term cooling to short term, locally targeted dispersal. We show that such methods could induce precipitation anywhere with sufficient humidity and other conditions, and could be accomplishable at low cost with low or no safety concerns. 2025-01-12T22:26:52Z 25 pages, 14 figures Aya Thompson 10.1016/j.jastp.2026.106784 http://arxiv.org/abs/2602.00331v1 Prototype-based Explainable Neural Networks with Channel-specific Reasoning for Geospatial Learning Tasks 2026-01-30T21:34:45Z Explainable AI (XAI) is essential for understanding machine learning (ML) decision-making and ensuring model trustworthiness in scientific applications. Prototype-based XAI methods offer an intrinsically interpretable alternative to post-hoc approaches which often yield inconsistent explanations. Prototype-based XAI methods make predictions based on the similarity between inputs and learned prototypes that represent typical characteristics of target classes. However, existing prototype-based models are primarily designed for standard RGB image data and are not optimized for the distinct, variable-specific channels commonly found in geoscientific image and raster datasets. In this study, we develop a prototype-based XAI approach tailored for multi-channel geospatial data, where each channel represents a distinct physical environmental variable or spectral channel. Our approach enables the model to identify separate, channel-specific prototypical characteristics sourced from multiple distinct training examples that inform how these features individually and in combination influence model prediction while achieving comparable performance to standard neural networks. We demonstrate this method through two geoscientific case studies: (1) classification of Madden Julian Oscillation phases using multi-variable climate data and (2) land-use classification from multispectral satellite imagery. This approach produces both local (instance-level) and global (model-level) explanations for providing insights into feature-relevance across channels. By explicitly incorporating channel-prototypes into the prediction process, we discuss how this approach enhances the transparency and trustworthiness of ML models for geoscientific learning tasks. 2026-01-30T21:34:45Z submitted to Environmental Data Science (preprint) Anushka Narayanan Karianne J. Bergen http://arxiv.org/abs/2601.22824v1 Rotational Spectroscopy as a Tool to Study Vibration-Rotation Interaction: Investigations of $^{13}$CH$_3$CN and CH$_3$$^{13}$CN up to $v_8 = 2$ and a Search for $v_8 = 2$ Transitions toward Sagittarius B2(N) 2026-01-30T10:51:10Z Methyl cyanide, CH$_3$CN, is present in diverse regions in space, in particular in the warm parts of star-forming regions where it is a common molecule. Rotational transitions of $^{13}$CH$_3$CN and CH$_3$$^{13}$CN in their $v_8 = 1$ lowest excited vibrational states ($E_{\rm vib} \approx 520$ K) are quite prominent in Sagittarius B2(N). In order to be able to search for transitions of the next higher vibrational state $v_8 = 2$, we recorded spectra of samples enriched in $^{13}$CH$_3$CN and CH$_3$$^{13}$CN up to $v_8 = 2$ in the 35 to 1091~GHz region and reinvestigated existing spectra of CH$_3$CN in its natural isotopic composition between 1085 and 1200 GHz. Perturbations caused by near-degeneracies in $K = 4$ of $v_8 = 2^0$ and $K = 2$ of $v_8 = 2^{-2}$ yielded accurate information on the energy spacing of 22.93 and 21.79 cm$^{-1}$ between the $l$-components of $^{13}$CH$_3$CN and CH$_3$$^{13}$CN, respectively. Fermi-type interaction between $K = 13$ and 14 of $v_8 = 1^{-1}$ and $v_8 = 2^{+2}$ probe the energy differences between the two states of both isotopomers. In addition, a $ΔK \pm2$, $Δl \mp1$ interaction between the ground vibrational state of $^{13}$CH$_3$CN and $v_8 = 1^{+1}$ provides information on their energy spacing. Furthermore, we obtained improved or extended ground state rotational transition frequencies of $^{13}$CH$_3$$^{13}$CN and extensive data for $^{13}$CH$_3$C$^{15}$N and CH$_3$$^{13}$C$^{15}$N. Finally, we report the results of our search for transitions of $^{13}$CH$_3$CN and CH$_3$$^{13}$CN in their $v_8 = 2$ states toward Sagittarius B2(N). 2026-01-30T10:51:10Z in press at ACS Earth and Space Chemistry; 18 pages, 5 Tables, 10 Figures ACS Earth Space Chem. 2026, 10, 2, 578-591 Holger S. P. Müller Arnaud Belloche Frank Lewen Stephan Schlemmer 10.1021/acsearthspacechem.5c00353 http://arxiv.org/abs/2601.22458v1 AI Decodes Historical Chinese Archives to Reveal Lost Climate History 2026-01-30T02:06:13Z Historical archives contain qualitative descriptions of climate events, yet converting these into quantitative records has remained a fundamental challenge. Here we introduce a paradigm shift: a generative AI framework that inverts the logic of historical chroniclers by inferring the quantitative climate patterns associated with documented events. Applied to historical Chinese archives, it produces the sub-annual precipitation reconstruction for southeastern China over the period 1368-1911 AD. Our reconstruction not only quantifies iconic extremes like the Ming Dynasty's Great Drought but also, crucially, maps the full spatial and seasonal structure of El Ni$ñ$o influence on precipitation in this region over five centuries, revealing dynamics inaccessible in shorter modern records. Our methodology and high-resolution climate dataset are directly applicable to climate science and have broader implications for the historical and social sciences. 2026-01-30T02:06:13Z 60 pages, 4 figures in the main text, 25 figures and 10 tables in the appendix Sida He Lingxi Xie Xiaopeng Zhang Qi Tian http://arxiv.org/abs/2601.22298v1 Conformal Prediction for Generative Models via Adaptive Cluster-Based Density Estimation 2026-01-29T20:23:33Z Conditional generative models map input variables to complex, high-dimensional distributions, enabling realistic sample generation in a diverse set of domains. A critical challenge with these models is the absence of calibrated uncertainty, which undermines trust in individual outputs for high-stakes applications. To address this issue, we propose a systematic conformal prediction approach tailored to conditional generative models, leveraging density estimation on model-generated samples. We introduce a novel method called CP4Gen, which utilizes clustering-based density estimation to construct prediction sets that are less sensitive to outliers, more interpretable, and of lower structural complexity than existing methods. Extensive experiments on synthetic datasets and real-world applications, including climate emulation tasks, demonstrate that CP4Gen consistently achieves superior performance in terms of prediction set volume and structural simplicity. Our approach offers practitioners a powerful tool for uncertainty estimation associated with conditional generative models, particularly in scenarios demanding rigorous and interpretable prediction sets. 2026-01-29T20:23:33Z Qidong Yang Qianyu Julie Zhu Jonathan Giezendanner Youssef Marzouk Stephen Bates Sherrie Wang http://arxiv.org/abs/2601.22111v1 Physics Informed Reconstruction of Four-Dimensional Atmospheric Wind Fields Using Multi-UAS Swarm Observations in a Synthetic Turbulent Environment 2026-01-29T18:40:32Z Accurate reconstruction of atmospheric wind fields is essential for applications such as weather forecasting, hazard prediction, and wind energy assessment, yet conventional instruments leave spatio-temporal gaps within the lower atmospheric boundary layer. Unmanned aircraft systems (UAS) provide flexible in situ measurements, but individual platforms sample wind only along their flight trajectories, limiting full wind-field recovery. This study presents a framework for reconstructing four-dimensional atmospheric wind fields using measurements obtained from a coordinated UAS swarm. A synthetic turbulence environment and high-fidelity multirotor simulation are used to generate training and evaluation data. Local wind components are estimated from UAS dynamics using a bidirectional long short-term memory network (Bi-LSTM) and assimilated into a physics-informed neural network (PINN) to reconstruct a continuous wind field in space and time. For local wind estimation, the bidirectional LSTM achieves root-mean-square errors (RMSE) of 0.064 and 0.062 m/s for the north and east components in low-wind conditions, increasing to 0.122 to 0.129 m/s under moderate winds and 0.271 to 0.273 m/s in high-wind conditions, while the vertical component exhibits higher error, with RMSE values of 0.029 to 0.091 m/s. The physics-informed reconstruction recovers the dominant spatial and temporal structure of the wind field up to 1000 m altitude while preserving mean flow direction and vertical shear. Under moderate wind conditions, the reconstructed mean wind field achieves an overall RMSE between 0.118 and 0.154 m/s across evaluated UAS configurations, with the lowest error obtained using a five-UAS swarm. These results demonstrate that coordinated UAS measurements enable accurate and scalable four-dimensional wind-field reconstruction without dedicated wind sensors or fixed infrastructure. 2026-01-29T18:40:32Z Abdullah Tasim Wei Sun http://arxiv.org/abs/2511.01515v2 Ocean neutral transport: sub-Riemannian geometry and hypoelliptic diffusion 2026-01-29T16:44:26Z Transport and mixing of tracers in the ocean is thought to be preferentially along neutral planes defined by the potential temperature and salinity fields. This gives rise to a conceptual model of ocean transport in which water parcel trajectories are everywhere neutral, that is, tangent to the neutral planes. Because the distribution of neutral planes is not integrable, neutral transport, while locally two dimensional, is globally three dimensional. We describe this form of transport, building on its connection with contact and sub-Riemannian geometry. We discuss a Lie-bracket interpretation of local dianeutral transport, the quantitative meaning of helicity and the implications of the accessibility theorem. We compute sub-Riemnanian geodesics for climatological neutral planes and put forward the use of the associated Carnot--Carathéodory distance as a diagnostic of the strong anisotropy of neutral transport. We propose a stochastic toy model of neutral transport which represents motion along neutral planes by a Brownian motion. The corresponding diffusion process is degenerate and not (strongly) elliptic. The non-integrability of the neutral planes however ensures that the diffusion is hypoelliptic. As a result, trajectories are not confined to surfaces but visit the entire three-dimensional ocean. The short-time behaviour is qualitatively different from that obtained with a non-degenerate highly anisotropic diffusion. We examine both short- and long-time behaviours using Monte Carlo simulations. The simulations provide an estimate for the time scale of ocean vertical transport implied by the constraint of neutrality. 2025-11-03T12:27:17Z Matthieu Chatelain Isambard Goodbody Nived Rajeev Saritha Jacques Vanneste http://arxiv.org/abs/2601.21913v1 Rapid estimation of global sea surface temperatures from sparse streaming in situ observations 2026-01-29T16:05:30Z Reconstructing high-resolution sea surface temperatures (SST) from staggered SST measurements is essential for weather forecasting and climate projections. However, when SST measurements are sparse, the resulting inferred SST fields are rather inaccurate. Here, we demonstrate the ability of Sparse Discrete Empirical Interpolation Method (S-DEIM) to reconstruct the high-resolution SST field from sparse in situ observations, without using a model. The S-DEIM estimate consists of two terms, one computed from instantaneous in situ observations using empirical interpolation, and the other learned from the historical time series of observations using recurrent neural networks (RNNs). We train the RNNs using the National Oceanic and Atmospheric Administration's weekly high-resolution SST dataset spanning the years 1989-2021 which constitutes the training data. Subsequently, we examine the performance of S-DEIM on the test data, comprising January 2022 to January 2023. For this test data, S-DEIM infers the high-resolution SST from 100 in situ observations, constituting only 0.2% of the high-resolution spatial grid. We show that the resulting S-DEIM reconstructions are about 40% more accurate than earlier empirical interpolation methods, such as DEIM and Q-DEIM. Furthermore, 91% of S-DEIM estimates fall within $\pm 1^\circ$C of the true SST. We also demonstrate that S-DEIM is robust with respect to sensor placement: even when the sensors are distributed randomly, S-DEIM reconstruction error deteriorates only by 1-2%. S-DEIM is also computationally efficient. Training the RNN, which is performed only once offline, takes approximately one minute. Once trained, the S-DEIM reconstructions are computed in less than a second. As such, S-DEIM can be used for rapid SST reconstruction from sparse streaming observational data in real time. 2026-01-29T16:05:30Z Cassidy All Kevin Ho Maya Magnuski Christopher Nicolaides Louisa B. Ebby Mohammad Farazmand http://arxiv.org/abs/2601.21890v1 Reddy: An open-source toolbox for analyzing eddy-covariance measurements in heterogeneous environments 2026-01-29T15:48:37Z Land-atmosphere exchange processes are determined by turbulent fluxes, which can be derived from eddy-covariance measurements. This method was established to quantify ecosystem-scale vertical atmosphere-vegetation exchange processes, but is also used to validate atmospheric turbulence theories with the ultimate aim to improve the representation of turbulence in numerical models. While the focus has long been on turbulence over idealized, homogeneous and flat surfaces, recent scientific developments are shifting towards investigating turbulent exchange processes in complex heterogeneous environments under non-idealized conditions, which pose particular challenges, e.g. advective fluxes between different surface types or non-stationarity of nighttime turbulence. This requires to rethink standard post-processing routines for determining turbulent fluxes from the high-frequency sonic and gas analyzer measurements. Here, we introduce the open-source R-package 'Reddy', which provides modular-built functions for post-processing, analysis and visualization of eddy-covariance measurements, including investigating spectra, coherent structures, anisotropy, flux footprints and surface energy balance closure. The 'Reddy' package is accompanied by a detailed documentation and a set of jupyter notebooks introducing new users hands-on to eddy-covariance data analysis. We showcase 'Reddy' based on measurements from three different sites in Norway: A case study during strong stratification over alpine tundra, for determining suitable averaging times during ice-cover transition at a boreal lake, and for fitting flux-variance relations for a permafrost peatland. 'Reddy' serves as extension of previously developed software packages, paving the way towards holistic turbulence data analysis in heterogeneous real-world environments. 2026-01-29T15:48:37Z Laura Mack Norbert Pirk http://arxiv.org/abs/2602.09030v1 UniPhy: Unifying Riemannian-Clifford Geometry and Biorthogonal Dynamics for Planetary-Scale Continuous Weather Modeling 2026-01-29T12:56:35Z While data-driven weather models have achieved remarkable deterministic accuracy, they fundamentally rely on discrete-time mappings and closed-system assumptions, failing to capture the multi-scale continuous dynamics and thermodynamic openness of the atmosphere. To address these limitations, we propose UniPhy, a continuous-time non-Hermitian neural stochastic partial differential equation (SPDE) solver. Geometrically, we employ Riemannian-Clifford gauge transformations to flatten planetary heterogeneity, enabling globally consistent operations. Dynamically, we construct non-Hermitian biorthogonal spectral operators integrated with a global flux tracker to capture transient energy growth and open-system exchange. Computationally, by identifying the algebraic associativity of the analytic solution, we reformulate adaptive physical integration as a parallel prefix-sum problem, achieving log-linear sequence parallelism. UniPhy establishes a physically complete foundation model architecture that unifies geometric adaptivity, thermodynamic consistency, and computational efficiency. Our code is available at <https://github.com/yrqUni/UniPhy>. 2026-01-29T12:56:35Z Ruiqing Yan Haoyu Deng Yuhang Shao Xingbo Du Jingyuan Wang Zhengyi Yang http://arxiv.org/abs/2410.00244v4 Multi-threshold time series analysis enables characterization of variable renewable energy droughts in Europe 2026-01-29T08:14:57Z Variable renewable energy droughts, so called Dunkelflaute events, emerge as a challenge for climate-neutral energy systems based on variable renewables. Here we characterize European drought events for on- and offshore wind power, solar photovoltaics, and renewable technology portfolios, using 38 historic weather years and an advanced identification method. Their characteristics heavily depend on the chosen drought threshold, questioning the usefulness of single-threshold analyses. Applying a multi-threshold framework, we quantify how the complementarity of wind and solar power temporally and spatially alleviates drought frequency, return periods, duration, and severity within (portfolio effect) and across countries (balancing effect). We identify the most extreme droughts, which drive major discharging periods of long-duration storage in a fully renewable European energy system, based on a policy-relevant decarbonization scenario. Such events comprise sequences of shorter droughts of varying severity. The most extreme event occurred in winter 1996/97 and lasted 55 days in an idealized, perfectly interconnected setting. The average renewable availability during this period was still 47% of its long-run mean. System planners must consider such events when planning for storage and other flexibility technologies. Methodologically, we conclude that using arbitrary single calendar years is not suitable for modeling weather-resilient energy scenarios. 2024-09-30T21:29:25Z Martin Kittel Wolf-Peter Schill 10.1038/s43247-026-03251-2 http://arxiv.org/abs/2503.03990v3 Data-Driven Probabilistic Air-Sea Flux Parameterization 2026-01-28T20:16:43Z Accurately quantifying air-sea fluxes is important for understanding air-sea interactions and improving coupled weather and climate systems. This study introduces a probabilistic framework to represent the highly variable nature of air-sea fluxes, which is missing in deterministic bulk algorithms. Assuming Gaussian distributions conditioned on the input variables, we use artificial neural networks and eddy-covariance measurement data to estimate the mean and variance by minimizing negative log-likelihood loss. The trained neural networks provide alternative mean flux estimates to existing bulk algorithms, and quantify the uncertainty around the mean estimates. Stochastic parameterization of air-sea turbulent fluxes can be constructed by sampling from the predicted distributions. Tests in a single-column forced upper-ocean model suggest that changes in flux algorithms influence sea surface temperature and mixed layer depth seasonally. The ensemble spread in stochastic runs is most pronounced during spring restratification. 2025-03-06T00:40:49Z add zenodo link Jiarong Wu Pavel Perezhogin David John Gagne Brandon Reichl Aneesh C. Subramanian Elizabeth Thompson Laure Zanna