https://arxiv.org/api/+u3wUiIzEVBr14bfDeciG3G1BbM 2026-06-21T13:46:26Z 23582 660 15 http://arxiv.org/abs/2604.27696v1 FoReco and FoRecoML: A Unified Toolbox for Forecast Reconciliation in R 2026-04-30T10:38:04Z Forecast reconciliation has become key to improving the accuracy and coherence of forecasts for linearly constrained multiple time series, such as hierarchical and grouped series. Yet, comprehensive software that jointly covers cross-sectional, temporal, and cross-temporal reconciliation has so far been lacking. The R packages FoReco and FoRecoML address this gap by offering a comprehensive and unified framework. The packages respectively implement classical and regression-based linear reconciliation approaches, and non-linear approaches based on machine learning for cross-sectional, temporal and cross-temporal frameworks. Designed for accessibility and flexibility, these packages provide sensible default options that allow new users to apply reconciliation methods with minimal effort, while still giving expert users full control to explore state-of-the-art extensions through customized settings. With this dual focus, FoReco and FoRecoML are versatile tools for practitioners and researchers working on forecast reconciliation. 2026-04-30T10:38:04Z Daniele Girolimetto Jeroen Rombouts Ines Wilms Yangzhuoran Fin Yang http://arxiv.org/abs/2502.14698v2 General Uncertainty Estimation with Delta Variances 2026-04-30T10:23:41Z Decision makers may suffer from uncertainty induced by limited data. This may be mitigated by accounting for epistemic uncertainty, which is however challenging to estimate efficiently for large neural networks. To this extent we investigate Delta Variances, a family of algorithms for epistemic uncertainty quantification, that is computationally efficient and convenient to implement. It can be applied to neural networks and more general functions composed of neural networks. As an example we consider a weather simulator with a neural-network-based step function inside -- here Delta Variances empirically obtain competitive results at the cost of a single gradient computation. The approach is convenient as it requires no changes to the neural network architecture or training procedure. We discuss multiple ways to derive Delta Variances theoretically noting that special cases recover popular techniques and present a unified perspective on multiple related methods. Finally we observe that this general perspective gives rise to a natural extension and empirically show its benefit. 2025-02-20T16:22:40Z Simon Schmitt John Shawe-Taylor Hado van Hasselt http://arxiv.org/abs/2510.19110v2 Signature Kernel Scoring Rule: A Spatio-Temporal Diagnostic for Probabilistic Weather Forecasting 2026-04-30T09:48:10Z Modern weather forecasting has increasingly transitioned from numerical weather prediction (NWP) to data-driven machine learning forecasting techniques. While these new models produce probabilistic forecasts to quantify uncertainty, their training and evaluation may remain hindered by conventional scoring rules, primarily MSE, which are designed for single time point predictions and ignore the highly correlated data structures present in weather behaviour. This work introduces the signature kernel scoring rule to the domain of weather forecasting, which reframes weather variables as continuous paths to encode temporal and spatial dependencies through iterated integrals. Validated as strictly proper through the use of path augmentations to guarantee uniqueness, the signature kernel provides a theoretically robust metric for forecast verification and model training. Empirical evaluations through weather scorecards on WeatherBench 2 models demonstrate the signature kernel scoring rule's high discriminative power and unique capacity to capture path-dependent interactions. Following previous demonstration of successful adversarial-free probabilistic training, we train sliding window generative neural networks using a predictive-sequential scoring rule on ERA5 reanalysis weather data. Using a lightweight model, we demonstrate that signature kernel based training outperforms climatology for forecast paths of up to fifteen timesteps. 2025-10-21T22:15:20Z Archer Dodson Ritabrata Dutta http://arxiv.org/abs/2605.02934v1 Statistical analysis of virion-cell interactions mediated by peptide nanofibrils and peptide amphiphiles using STEM tomography 2026-04-30T09:26:32Z Peptide nanofibrils (PNFs) and peptide amphiphiles (PAs) are promising tools for enhancing viral transduction and gene transfer. However, quantitative insight into how their supramolecular architecture governs virion-cell interactions is limited. Here, we introduce a framework for the acquisition, processing, and statistical analysis of scanning transmission electron microscopy (STEM) tomograms to objectively quantify peptide-virion-cell interactions. Using four transduction-enhancing peptides (D4, Vectofusin-1, palmitic acid-PA (pal-PA), and eicosapentaenoic-PA (eic-PA)), peptide aggregate morphology, interfacial contact areas, and the spatial organization of virions with respect to peptides and cells were analyzed using advanced geometric descriptors. All peptides efficiently captured virions, resulting in few free virions, but they differ in how strictly virions were spatially confined near the cell surface. These differences reflect alternative spatial organization strategies, which are likely crucial factors influencing transduction-enhancing efficacy. Our approach provides a novel, generalizable method to evaluate infection-enhancing nanomaterials and guides the rational design of next-generation peptide assemblies for therapeutic viral delivery. 2026-04-30T09:26:32Z 26 pages, 10 figures Philipp Rieder Julia La Roche Orkun Furat Annalena Kuhn Lena Rauch-Wirth Kübra Kaygisiz Fabian Zech Jan Münch Clarissa Read Rüdiger Groß Volker Schmidt http://arxiv.org/abs/2604.08632v2 Why Network Segmentation Projects Fail 2026-04-30T05:21:21Z Network segmentation is a foundational enterprise security control. Despite its recognized benefits, segmentation initiatives frequently fail in practice, and the field lacks a systematic empirical explanation for why these projects do not achieve their intended outcomes. This paper presents an empirical study of failed segmentation projects based on a survey of 400 U.S.-based\ network security practitioners. The survey was grounded in a two-part failure framework that separately measures general IT project failure factors and segmentation-specific technical and operational barriers. Clustering analysis of the responses reveals four distinct failure archetypes. Surprisingly, practitioners across all four archetypes propose general IT project management fixes over segmentation-specific fixes in the same ratio. 2026-04-09T17:00:23Z Rohit Dube http://arxiv.org/abs/2604.27409v1 Robust inference methods of diagnostic test accuracy meta-analysis for influential outlying studies via density power divergence 2026-04-30T04:20:28Z In diagnostic test accuracy meta-analysis (DTA-MA), standard inference methods using bivariate random-effects models for jointly synthesizing sensitivity and specificity can be sensitive to outlying studies and may yield misleading conclusions. In this article, we propose frequentist outlier-robust statistical inference methods for DTA-MA based on density power divergence. The proposed methods automatically downweight influential outlying studies by modifying the estimating function using the robust divergence with a tuning parameter. To achieve robust yet statistically efficient inference in the presence of outlying studies, the proposed methods incorporate practical strategies for selecting the tuning parameter, including a data-adaptive criterion based on the Hyvärinen score. We also quantify the contributions of individual studies to the robust pooled estimates, facilitating interpretation of how outlying studies affect the results. We illustrate the effectiveness of the proposed methods through an application to a DTA-MA of the Mini-Mental State Examination. Simulation studies showed that the proposed methods reduced bias and root mean squared error relative to existing methods and improved coverage probability in the presence of outliers. The proposed methods enable a sensitivity analysis to assess whether the main results obtained using standard methods are driven by outlying studies. 2026-04-30T04:20:28Z 20 pages with 4 figures Kotaro Sasaki Hisashi Noma Theodoros Evrenoglou http://arxiv.org/abs/2604.22200v3 Formalizing Galaxy Population Evolution: Drift and Mergers as Transport Processes on Manifolds 2026-04-30T04:09:44Z Galaxy evolution is commonly described through the time evolution of observational statistics such as luminosity functions and stellar mass functions. However, these quantities are projections of an underlying multivariate galaxy state space rather than fundamental dynamical variables. We develop a unified framework in which galaxy evolution is formulated as the time evolution of a probability measure on the galaxy manifold. Representing galaxy states by latent variables $θ\in\mathcal{M}$ and the population by a density $ρ(θ,t)$, the evolution is governed by a general equation containing continuous transport and nonlocal jump processes. By reinterpreting manifold learning as the pushforward of measures, we distinguish observational, representation, and physical measures, and emphasize that manifold coordinates themselves need not carry direct physical meaning. In this picture, luminosity functions and stellar mass functions arise as projected observables of a single underlying dynamics, and generally do not form closed equations in observational space. The framework contains existing models as limiting cases: reduction to a single mass variable yields continuity-equation models, while additive post-merger states recover the Smoluchowski coagulation equation. We further show that luminosity-function evolution is naturally described within the Schechter family, whose apparent stability is interpreted as an effective consequence of projection. Since observables are projections of measures, inference of galaxy evolution becomes a statistical inverse problem of recovering manifold dynamics from data. This framework shifts the focus from fitting observed statistics directly to inferring the underlying state-space dynamics, thereby bridging manifold learning and physical theory. 2026-04-24T04:10:06Z 31 pages, 3 figure, to be submitted Tsutomu T. Takeuchi Nagoya University, Institute of Statistical Mathematics http://arxiv.org/abs/2604.27338v1 Estimating Population Viral Load Contextual Exposure Using GPS-Derived Activity Spaces in Rural South Africa 2026-04-30T02:30:55Z This article introduces novel methodologies for estimating contextual exposure to HIV population viral load using GPS data. We propose a comprehensive analytical framework comprising (i) local (grid-cell level) estimation of HIV population viral load, (ii) derivation of individual activity spaces from GPS trajectories, and (iii) quantification of contextual exposure to HIV within these activity spaces. We integrate HIV surveillance and sociodemographic survey data with GPS-based mobility data collected in rural KwaZulu-Natal, South Africa, to characterize mobility patterns among young adults aged 20-30 years. Using derived measures of mobility and contextual exposure, we assess whether participants' sex and age systematically influence the magnitude, configuration, and heterogeneity of their mobility patterns. Furthermore, we describe analytical approaches to examine how contextual exposure to HIV evolves as activity spaces extend beyond static residential locations, outlining procedures to identify GPS-tracked participants at elevated risk of HIV acquisition. KEYWORDS: Population viral load exposure; GPS-based mobility analysis; Activity space 2026-04-30T02:30:55Z 22 pages, 5 figures Zhaoxing Wu Haoyang Wu Thulile Mathenjwa Elphas Okango Khai Hoan Tram Margot Otto Maxime Inghels Paul Mee Diego Cuadros Hae-Young Kim Till Bärnighausen Frank Tanser Adrian Dobra http://arxiv.org/abs/2604.24587v2 Bayesian inference for hidden Markov models under genuine multimodality with application to ecological time series 2026-04-30T02:18:06Z Bayesian inference in hidden Markov models (HMMs) can be challenging due to the presence of multimodality in the likelihood function, and consequently in the joint posterior distribution, even after correcting for label switching. The parallel tempering (PT) algorithm, a state-space augmentation method, is a widely used approach for dealing with multimodal distributions. Nevertheless, standard implementation of the PT algorithm may not always be sufficient to effectively explore the high-dimensional, complex multimodal posterior distributions that arise in HMMs. In this work, we demonstrate common pitfalls when implementing the PT algorithm for HMMs, approaches to remedy them, and introduce new non-informative prior distributions that facilitate effective posterior distribution exploration. We analyse time series of blue whale dive data with two 3-state HMMs in a Bayesian framework, one of which includes a categorical covariate in the transition probability matrix to account for the effect of sound stimuli on the whale's behavior. We demonstrate how effective implementation of the modified PT algorithm for Bayesian inference leads to effective exploration of the resultant multimodal posterior distribution and how that affects inference for the underlying movement patterns of the blue whales. 2026-04-27T15:11:09Z 37 pages, 11 figures, to be submitted to Bayesian Analysis, corrected author affiliations Marco A. Gallegos-Herrada Vianey Leos-Barajas Jeffrey S. Rosenthal http://arxiv.org/abs/2604.27282v1 The Likelihood Ratio Wall: Structural Limits on Accurate Risk Assessment for Rare Violence 2026-04-30T00:32:52Z Pretrial risk assessment tools are used on over one million U.S. defendants each year, yet their use for predicting rare violent re-offense faces a basic statistical barrier. We derive a universal precision bound -- the Likelihood Ratio Wall -- showing that when violent re-arrest rates are low (2-5%), achieving even a 50% hit rate among people labeled "high risk" (positive predictive value, or PPV) would require tools far more discriminative than current instruments appear to be. For rare outcomes, a tool can have respectable-looking performance metrics and still be wrong most of the time it flags someone as "high risk for violence." We show that post-hoc score recalibration cannot solve this problem because it does not improve the tool's underlying ability to separate true positives from false positives. We further prove a Surveillance Ceiling: when over-policing inflates recorded "risk factors" among those who would not re-offend, the maximum achievable precision is structurally lower for over-policed groups, even at equal offense rates. We translate these results into the Number Needed to Detain (how many people must be detained to prevent one violent offense), and propose that risk reports should communicate this uncertainty explicitly. Our findings suggest that for rare violent outcomes, debates about fairness metrics alone are incomplete: under current data regimes, the available features may not support high-confidence individualized detention decisions. 2026-04-30T00:32:52Z 16 pages, 2 figures, 8 tables. Accepted to the 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT '26) Marco Pollanen 10.1145/3805689.3812215 http://arxiv.org/abs/2509.16115v3 A Korean Macroeconomic Database for Data-Rich Policy Analysis and U.S.--Korea Dependence 2026-04-30T00:24:38Z We introduce KRED (Korea Research Economic Database), a FRED-MD-compatible monthly macroeconomic database for Korea designed for data-rich policy analysis and cross-country comparison. KRED contains 125 monthly series from ECOS, KOSIS, and administrative labor-market sources, with coverage back to 1960. Using a balanced panel of 104 series over 2009:06--2025:12, principal-components analysis extracts four factors that explain about 30% of total variation. These factors correspond to financial conditions, real activity, housing and real-estate credit, and labor-market and price pressures, and their diffusion indices summarize major Korean macroeconomic episodes. We then use KRED in two empirical applications. First, factor-augmented VARs show that U.S. monetary tightening transmits strongly to Korea and that factor augmentation yields a more coherent inflation response than a low-dimensional VAR. Second, a grouped U.S.--Korea tensor autoregression shows that cross-country dependence is concentrated in financially oriented blocks, with stronger transmission from the U.S. financial block to Korea than in the reverse direction, while spillovers in real activity and housing are much weaker. KRED thus provides a transparent public database for Korean macroeconomic research and a useful building block for comparative work on macro-financial dependence in Asia. 2025-09-19T16:03:29Z Changryong Baek Seunghyun Moon Seunghyeon Lee http://arxiv.org/abs/2604.27243v1 Estimating Decision Uncertainty from Preference Uncertainty: Application to Ground Vehicle Design 2026-04-29T22:30:10Z Engineering design problems are often modeled as multi-objective optimization tasks in which a scalarized utility function selects an optimal design from the Pareto set. In practice, preferences are imperfectly known, so uncertainty in the preference model leads to uncertainty in the resulting optimal design. This paper proposes a probabilistic framework that treats preference parameters as random variables and examines how preference uncertainty propagates to decision uncertainty. A random preference vector induces a probability distribution over optimal designs, allowing us to identify which regions of the Pareto front are most likely to be selected and to assess recommendation stability under preference variability. To explain the sources of this variability, we apply variance-based global sensitivity analysis to the induced optimal solutions, using Sobol' indices and Shapley values to quantify the contributions of individual design variables and their dependencies. We further summarize the overall dispersion of the optimal-design distribution using the Fréchet variance, which provides a scalar measure of decision stability under a given preference model. Two vehicle design case studies demonstrate how problem structure can lead to discrete versus continuous decision distributions and show how the proposed quantities support preference-aware design analysis. 2026-04-29T22:30:10Z Chia-Ruei Liu Yongjia Song Qiong Zhang Cameron Turner http://arxiv.org/abs/2605.00056v1 Smart Ensemble Learning Framework for Predicting Groundwater Heavy Metal Pollution 2026-04-29T21:40:18Z Groundwater in the Densu Basin is increasingly threatened by heavy metal contamination, but conventional methods fail to capture the statistical complexity and spatial heterogeneity of pollution indicators. A key challenge is modelling the Heavy Metal Pollution Index (HPI), which is typically skewed and affected by correlated contaminants, leading to biased predictions without transformation. This study develops a predictive framework integrating response transformations with nested cross-validated ensemble machine learning. Three transformations (raw, log, and Gaussian copula) were applied to HPI and evaluated across six learners: support vector regression (SVM), $k$-nearest neighbours (k-NN), CART, Elastic Net, kernel ridge regression, and a stacked Lasso ensemble. Raw-scale models produced deceptively high fits (Elastic Net and stacked ensemble $R^2 \approx 1.0$), suggesting over-optimism. The log transformation stabilised variance (SVM: $R^2 = 0.93$, RMSE $= 0.18$; k-NN: $R^2 = 0.92$, RMSE $= 0.20$). The Gaussian copula gave the most reliable results: stacked ensemble $R^2 = 0.96$ (RMSE $= 0.19$), with other learners maintaining high accuracy. Copula-based models improved residuals and produced spatially plausible maps. DBSCAN clustering revealed Fe and Mn as primary HPI contributors, consistent with regional hydrogeochemistry. Limitations include reliance on random (not spatial) cross-validation and basin-specific scope. Future work should explore spatial validation and other geological settings. Overall, distribution-aware ensembles with clustering diagnostics offer robust, interpretable assessments of groundwater contamination. 2026-04-29T21:40:18Z 53 pages, 16 figures, accepted for publication in Earth Systems and Environment (2026) T. Ansah-Narh G. Y. Afrifa J. B. Tandoh K. Asare M. Addi K. E. Yorke D. M. A. Akpoley K. Aidoo S. K. Fosuhene http://arxiv.org/abs/2605.10949v1 AlphaEarth Satellite Embeddings for Modelling Climate Sensitive Diseases Towards Global Health Resilience 2026-04-29T21:14:31Z Malaria, childhood acute respiratory infection, and child undernutrition together account for over two million deaths annually in children under five, with the burden concentrated in low and middle-income countries where climate variability modulates transmission, exposure, and nutritional outcomes. Routine health surveillance in these settings remains sparse and reactive. Satellite-derived representations of the Earth's surface offer a scalable, low-cost complement to traditional covariates, yet their utility as predictors of population health outcomes is poorly characterised. We summarise findings from three studies evaluating AlphaEarth Foundations 64-dimensional satellite embeddings as predictors of population health outcomes, focusing on vulnerable populations. The studies span infectious disease (malaria, respiratory infection) and stunting. In each study, embeddings provide predictive value at sufficient spatial granularity: (i) malaria prediction across Nigeria shows consistent per-region R^2 gains; (ii) childhood acute respiratory infection prediction across 11 DHS countries increases pooled R^2 from 0.157 to 0.206 across three tree-based estimators; (iii) stunting prediction across 35 countries is neutral at country level due to collinearity with fixed effects. The stunting case is currently limited by lack of DHS cluster-level coordinates, which is the next key experiment. 2026-04-29T21:14:31Z Visualising Climate 2026 Usman Nazir I-Han Cheng Sara Khalid http://arxiv.org/abs/2604.27198v1 Bayesian Nonparametric Causal Inference for Quantile Residual Life: An Application to Alzheimer's Disease 2026-04-29T21:01:31Z In Alzheimer's disease research, for individuals who remain dementia-free through a given follow-up time, an important clinical question is how much longer they are likely to remain dementia-free. Quantiles of this remaining time provide clinically interpretable prognostic milestones and can help characterize prognostic heterogeneity across baseline groups. We address this question in the Alzheimer's Disease Neuroimaging Initiative (ADNI), focusing on baseline amyloid status as the exposure. Estimation is challenging because amyloid status is observed rather than randomized, requiring adjustment for confounding, and because time to dementia onset is heterogeneous and heavily right-censored. We estimate causal contrasts in quantile residual life using a Bayesian nonparametric enriched Dirichlet process mixture model for the joint distribution of event times, exposure, and baseline covariates, with inference via Bayesian g-computation. The approach accommodates ignorable missing baseline covariates through data augmentation, supports inference across clinically relevant landmark times, and allows sensitivity analysis for residual unmeasured confounding. Simulation studies show good performance under complex heterogeneity and heavy censoring. In ADNI, elevated baseline amyloid was associated with shorter quantiles of remaining dementia-free time than non-elevated baseline amyloid among individuals who remained dementia-free through relevant landmark times, overall and within baseline diagnostic subgroups. 2026-04-29T21:01:31Z Woojung Bae Taekwon Hong Sang Kyu Lee Dongrak Choi Jong-Hyeon Jeong