https://arxiv.org/api/fBwcu5DKUSFIFtXLnu0Zvh89jK0 2026-03-24T09:44:17Z 22812 30 15 http://arxiv.org/abs/2603.20546v1 On the Limits of Prediction: Forecastability Profiles and Information Decay in Time Series 2026-03-20T22:28:16Z Forecasting accuracy is bounded by the information available about the future. This paper makes that statement precise using information-theoretic tools. Under logarithmic loss, the expected performance of any probabilistic forecast decomposes into two parts: an irreducible component and an approximation component. The irreducible term is the conditional entropy of the future given the available information, while the approximation term is the divergence between the true conditional distribution and the forecasting method. The gap between this conditional-entropy limit and an unconditional baseline is exactly the mutual information between the future observation and the declared information set. This leads to a definition of forecastability as the maximum achievable reduction in expected log loss. Evaluated across horizons, forecastability forms a profile that describes how predictive information varies with lead time. This profile reflects the dependence structure of the process and need not be monotone: predictive information may be concentrated at particular lags, including seasonal horizons, even when intermediate horizons contain little useful signal. From this profile, the paper defines the informative horizon set: the horizons at which forecastability exceeds a practical threshold. At horizons not in this set, the achievable gain over the unconditional baseline is necessarily small, regardless of the forecasting method used. The framework therefore separates what is learnable from what is not, and distinguishes limits imposed by the data from errors introduced by modelling. The result is a pre-modelling diagnostic that identifies where meaningful prediction is feasible before any model is chosen, providing a principled basis for allocating modelling effort across forecast horizons. 2026-03-20T22:28:16Z Peter Maurice Catt http://arxiv.org/abs/2603.20518v1 Multi-dimensional Mortality: Sex-Age-Specific Model Life Tables, Fitting, Prediction from Summary Mortality Indicators, and Forecasting 2026-03-20T21:35:35Z Demographers rely on a variety of tools and methods to work with mortality schedules - model life tables, fitting methods, summary-indicator prediction, and forecasting - largely developed independently and not providing structurally coherent sex-specific outputs. The multi-dimensional mortality model (MDMx) unifies all four within one Tucker tensor decomposition demonstrated using the Human Mortality Database. Period life tables from the Human Mortality Database are organized as a four-way tensor of logit(1qx) indexed by sex, age, country, and year. Shared factor matrices for sex and age make every output schedule structurally coherent by construction. From this decomposition four capabilities emerge: model life tables via clustering and smooth within-regime trajectories; life table fitting via a three-stage algorithm with Bayes-factor disruption detection; summary-indicator prediction mapping child or adult mortality to complete schedules, reformulating SVD-Comp in tensor coordinates; and forecasting via a damped local linear trend Kalman filter on PCA-reduced core matrices with hierarchical drift. 2026-03-20T21:35:35Z Samuel J. Clark http://arxiv.org/abs/2603.03004v2 eTFCE: Exact Threshold-Free Cluster Enhancement via Fast Cluster Retrieval 2026-03-20T19:03:41Z Threshold-free cluster enhancement (TFCE) is a popular method for cluster extent inference but is computationally intensive. Existing TFCE implementations often rely on discretized approximation that introduces numerical errors. Also, we identified a long-standing scaling error in the FSL implementation of TFCE (version 6.0.7.19 and earlier). As an alternative implementation, we present eTFCE, an efficient framework that computes exact TFCE scores using an optimized cluster retrieval algorithm, which, though exact, reduces computation time by approximately 50% compared to standard approximated implementations. In addition, the proposed framework enables simultaneous computation of TFCE and generalized cluster statistics, formulated similarly to TFCE, within a single nonparametric run, with negligible additional computational cost. This, in turn, facilitates systematic method comparisons, and enables a more complete characterization of spatial activation patterns. As a result, eTFCE establishes a mathematically exact and computationally efficient framework for comprehensive and informative nonparametric inference in neuroimaging. 2026-03-03T13:56:57Z Withdrawn by the authors after identifying aspects of the analysis and interpretation that require further validation. To avoid potentially misleading readers, we chose to withdraw the manuscript while conducting additional analyses Xu Chen Wouter Weeda Thomas E. Nichols Jelle J. Goeman http://arxiv.org/abs/2512.04366v7 Sequential Randomization Tests Using e-values: Applications for trial monitoring 2026-03-20T18:09:11Z Sequential monitoring of randomized trials traditionally relies on parametric assumptions or asymptotic approximations. We discuss a family of nonparametric sequential tests - collectively called e-RT - for binary, deaths-only, continuous, time-to-event, and multi-state endpoints. All variants derive validity solely from the randomization mechanism. Using a betting framework, each test constructs a test martingale by sequentially wagering on treatment assignments given observed outcomes. Under the null hypothesis of no treatment effect, the expected wealth cannot grow, guaranteeing anytime-valid Type I error control regardless of stopping rule. We prove validity for each variant, present simulation studies demonstrating calibration and power, and discuss the principled asymmetry in betting strategies across outcome types. These methods provide a conservative, assumption-free complement to model-based sequential analyses. 2025-12-04T01:24:17Z Fernando G Zampieri http://arxiv.org/abs/2504.04143v4 The Rhythm of Aging: Stability and Drift in the Individual Rate of Senescence 2026-03-20T17:24:46Z Human aging is marked by a steady rise in the risk of dying with age-a process demographers call senescence. Over the past century, life expectancy has risen dramatically, but is this because we are aging slower, or simply starting it later? Vaupel hypothesizes that the pace at which individuals age may be constant, with gains in longevity coming from the delayed onset of senescence rather than its slowing down. We test this idea using a new framework that decomposes the pace of senescence into three components: a biological baseline, a long-term trend, and the cumulative impact of period shocks. Applying this to cohort mortality data above age 80 from 12 countries, we find that once period shocks are accounted for, there is no statistical evidence of a long-term trend, consistent with Vaupel's hypothesis. Analyses using lower starting ages yield the same qualitative conclusion. Rather than indicating a change in the process that drives senescence, these variations are consistent with echoes of shared historical events. These results suggest that while longevity has shifted, the rhythm of human aging may be conserved. 2025-04-05T11:31:02Z Silvio Cabral Patricio http://arxiv.org/abs/2510.01803v3 The Perceived Impact of Environment on Health in Italy: a Penalized Ordinal Regression Approach 2026-03-20T16:25:30Z Understanding how individuals perceive their living environment is a complex task, as it reflects both personal and contextual determinants. In this paper, we address this task by analyzing the environmental module of the Italian nationwide health surveillance system PASSI (Progressi delle Aziende Sanitarie per la Salute in Italia), integrating it with contextual information at the municipal level, including socio-economic indicators, pollution exposure, and other geographical characteristics. Methodologically, we adopt a penalized semi-parallel cumulative ordinal regression model to analyze how subjective perceptions are shaped by both personal and territorial determinants. The approach balances flexibility and interpretability by allowing both parallel and non-parallel effects while regularizing estimates to address multicollinearity and separation issues. We use the model as an analytical tool to uncover the determinants of positivity and neutrality in environmental perceptions, defined as factors that contribute the most to improving perception or increasing the sense of neutrality. The results are diverse. First, results reveal significant heterogeneity across Italian territories, indicating that local characteristics strongly shape environmental perception. Second, various individual factors interact with contextual influences to shape perceptions. Third, hazardous environmental factors, such as higher PM2.5 levels, appear to be associated with poorer environmental perception, suggesting a tendency among respondents to recognize specific environmental issues. Overall, the approach demonstrates strong potential for application and provides useful insights for environmental policy planning. 2025-10-02T08:44:39Z Mattia Stival Angela Andreella Gaia Bertarelli Catarina Midões Stefano Federico Tonellato Stefano Campostrini http://arxiv.org/abs/2603.20052v1 Uncertainty in wind and solar projections depends on global and regional climate models 2026-03-20T15:36:34Z Ensembles of regional-global climate model combinations show substantial spread in projected wind and solar resources. Using 31 RCM-GCM pairs, we quantify the sources of this spread with a spatially and seasonally resolved variance decomposition, separating contributions from RCMs and GCMs. For both wind speed and solar radiation, RCMs dominate the variability in the absolute historical fields. In contrast, projected changes in wind speed are largely controlled by the driving GCMs, except in mountainous regions where RCM-induced variance becomes larger than that induced by GCMs. For solar radiation, contributions are strongly season-dependent, with RCMs dominating in summer and GCMs in winter. Our findings support that GCM and RCM variability together define the uncertainty of wind and solar climate projections. This provides guidance for designing climate model ensembles that better support uncertainty-aware energy system decisions under climate change. 2026-03-20T15:36:34Z Nina Effenberger Reto Knutti http://arxiv.org/abs/2603.16982v3 Trajectory Stability and Signature Diagnostics for Comet-Based Interstellar Navigation 2026-03-20T15:18:46Z Interstellar objects (ISOs) motivate a coupled mission-design and inference question relevant to spacecraft dynamics and control in extreme environments: if volatile-rich, rotating comet-like bodies were used for sustained deep-space navigation by exploiting pre-existing hyperbolic motion and in-situ propellant, what stability requirements arise under non-gravitational forcing, and what astrometric signatures might distinguish active stabilization from uncontrolled natural dynamics? We develop a stability-theoretic framework for trajectory tracking with jet-actuated correction, and show that high-speed transit geometry -- including debris-belt avoidance and encounter phasing -- tightly constrains feasible trajectories, making long-horizon tracking stability mission-critical. We model tracking residuals as the balance of disturbances and corrective action, and derive stability conditions across four levels: disturbance-energy stability, outer-loop contraction, actuator-memory stability, and rotation-mediated (Floquet) stability. The analysis implies residual diagnostics that can motivate empirical tests: under comparable forcing, effective stabilization is expected to strengthen short-horizon error correction, reduce event-conditioned persistence and variance clustering, regularize standardized innovations, and yield bounded post-shock recovery. More broadly, the framework provides a reference for deep-space guidance and control under nonlinear, multi-field disturbances and for planetary-defense concepts involving attitude shaping or impulsive kinetic impact. 2026-03-17T15:30:15Z 31 pages, 2 figures, 4 added references Bo Pieter Johannes Andrée http://arxiv.org/abs/2603.20015v1 On the Calibration of Bayesian Success Criteria and Operating Characteristics for Clinical Trials 2026-03-20T14:57:37Z Recently, the U.S. Food and Drug Administration (FDA) released draft guidance \citep{FDA2026} signaling a paradigm shift that facilitates the use of Bayesian methodology as the primary analysis and decision framework for drug approval. The cornerstone and fundamental challenge of this framework is the specification and calibration of Bayesian success criteria to control decision errors, ensuring reliable clinical and regulatory outcomes. In this work, we systematically investigate various Bayesian decision-error metrics, their theoretical interrelationships, and their alignment with conventional Frequentist counterparts. This investigation provides critical theoretical insights and practical guidance on calibrating Bayesian success criteria and operating characteristics to ensure robust decision-making and the integrity of public health decisions. We illustrate this framework using a clinical trial evaluating revascularization strategies for cardiogenic shock. A Shiny application will be available at www.trialdesign.org to assist sponsors and regulators in evaluating calibration strategies consistent with recent regulatory perspectives. 2026-03-20T14:57:37Z Peng Yang Li Wang Ying Yuan http://arxiv.org/abs/2603.19986v1 Probabilistic Estimation of Hidden Migrant Fatalities Along the Central Mediterranean Route 2026-03-20T14:33:46Z Estimating the number of migrants who die or go missing along dangerous routes such as the Central Mediterranean remains challenging as available records are incomplete. Some incidents are never documented, and fatalities associated with such unobserved incidents are absent from observed totals. We propose a Bayesian approach for probabilistic estimation of total migrant fatalities in such settings. Building on recent developments in multiple-systems estimation, we develop a time-stratified latent-class framework that accommodates missing fatality counts for unobserved incidents. We apply the method to recoded incident-level data from the Missing Migrants Project for the Central Mediterranean route from 2014 to 2025, encompassing 25,712 fatalities across 1,562 incidents. Our model yields 95% credible intervals of 30,426-39,172 fatalities and 2,200-2,591 deadly incidents, indicating that approximately 66%-85% of fatalities and 60%-71% of incidents are reflected in the available data. We estimate that unreported fatalities were concentrated between 2014 and 2016. Furthermore, we document that reporting likelihood increases with incident severity, implying that smaller incidents are most likely to remain undetected. While contingent on modeling assumptions and incomplete data, our method provides a broadly applicable and principled alternative to naive data adjustment methods. 2026-03-20T14:33:46Z Gregor Zens Zoe Sigman http://arxiv.org/abs/2509.01597v2 Statistics-Friendly Confidentiality Protection for Establishment Data, with Applications to the QCEW 2026-03-20T14:30:40Z Confidentiality for business data is an understudied area of disclosure avoidance, where legacy methods struggle to provide acceptable results. Standard formal privacy techniques for person-level data, like differential privacy, are designed to protect against membership inference and hence do not provide suitable confidentiality/utility trade-offs due to the highly skewed nature of business data and because extreme outlier records are often important contributors to query answers. Prior proposals, therefore, took a personalized differential privacy approach that allowed privacy parameters to degrade for the outlying records -- larger establishments get weaker membership inference guarantees. However, providing guarantees to some entities that are strictly weaker than guarantees for others is problematic from a policy standpoint. In this paper, we propose a novel confidentiality framework for business data with a focus on interpretability for policy makers. Instead of protecting against membership inference, which is often not a concern in business data, we protect against attribute inferences that are too precise. In our framework, data curators specify a neighbor function that is used to define uncertainty interval bands around an establishment's attribute values and the privacy parameters govern the strength of indistinguishability between values within the same uncertainty interval.We propose two query-answering mechanisms under this framework and evaluate them on: (1) a confidential Quarterly Census of Employment and Wages (QCEW) dataset produced by the U.S. Bureau of Labor Statistics (this was done through a cooperative agreement), and (2) a substitute dataset that we created from public sources (and will publicly release). 2025-09-01T16:29:54Z 42 pages (13 main text, 2 references, and 27 appendix pages), 13 figures (4 in main text) Kaitlyn Webb Prottay Protivash John Durrell Daniell Toth Aleksandra Slavković Daniel Kifer http://arxiv.org/abs/2508.15954v2 A Heuristic Framework of Variable Neighborhood Descent Methods for the Large-Scale Multi-Level Facility Location Problem in Supply Chain Networks 2026-03-20T14:22:55Z This paper addresses the single-assignment, uncapacitated, multi-level facility location (MFL) problem, a strategic decision-making process critical to the design of long-term supply chain networks. Specifically, we examine four- and five-level facility location structures (k-LFL), modeled as a location-allocation problem where demand nodes must be assigned to open facilities across hierarchical levels. Although the MFL has been addressed in the literature, solutions to large-scale, realistic problems involving thousands of nodes are lacking. This paper proposes a heuristic framework based on the Variable Neighborhood Descent (VND) metaheuristic with a multi-start strategy. We develop and compare four variants: Basic Variable Neighborhood Descent (BVND), Pipe Variable Neighborhood Descent (PVND), Cyclic Variable Neighborhood Descent (CVND), and Union Variable Neighborhood Descent (UVND). In each case, a multi-start strategy with strong diversification components is employed. Extensive computational experiments compare the methods on large-scale instances involving up to 10,000 customers, 150 distribution centers, 50 warehouses, and 30 plants. Each algorithm settled into a unique, statistically significant computational time when solving these problems. Sensitivity analyses, supported by non-parametric statistical methods, validate the effectiveness of the proposed heuristic framework. 2025-08-21T20:46:34Z 48 pages 3 figures Haibo Wang Bahram Alidaee http://arxiv.org/abs/2603.19899v1 Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects 2026-03-20T12:31:08Z Autocorrelation is a defining characteristic of time-series data, where each observation is statistically dependent on its predecessors. In the context of deep time-series forecasting, autocorrelation arises in both the input history and the label sequences, presenting two central research challenges: (1) designing neural architectures that model autocorrelation in history sequences, and (2) devising learning objectives that model autocorrelation in label sequences. Recent studies have made strides in tackling these challenges, but a systematic survey examining both aspects remains lacking. To bridge this gap, this paper provides a comprehensive review of deep time-series forecasting from the perspective of autocorrelation modeling. In contrast to existing surveys, this work makes two distinctive contributions. First, it proposes a novel taxonomy that encompasses recent literature on both model architectures and learning objectives -- whereas prior surveys neglect or inadequately discuss the latter aspect. Second, it offers a thorough analysis of the motivations, insights, and progression of the surveyed literature from a unified, autocorrelation-centric perspective, providing a holistic overview of the evolution of deep time-series forecasting. The full list of papers and resources is available at https://github.com/Master-PLC/Awesome-TSF-Papers. 2026-03-20T12:31:08Z Hao Wang Licheng Pan Qingsong Wen Jialin Yu Zhichao Chen Chunyuan Zheng Xiaoxi Li Zhixuan Chu Chao Xu Mingming Gong Haoxuan Li Yuan Lu Zhouchen Lin Philip Torr Yan Liu http://arxiv.org/abs/2603.20349v1 Prediction intervals for overdispersed multinomial data with application to historical controls 2026-03-20T12:14:41Z In pharmaceutical and toxicological research, historical control data are increasingly used to validate concurrent control groups, typically via the construction of historical control limits. While methods have been described for continuous and dichotomous endpoints, approaches for overdispersed multinomial data, common in developmental and reproductive toxicology or histopathology, are currently lacking. This article introduces and compares methods for constructing simultaneous prediction intervals for future multinomial observations subject to overdispersion. We investigate a range of frequentist approaches, including asymptotic approximations and bootstrap techniques (incorporating symmetric, asymmetric, and marginal calibration, as well as rank-based methods), alongside Bayesian hierarchical models. Extensive simulation studies assessing simultaneous coverage probability and the balance of lower and upper tail error probabilities show that standard asymptotic methods and simple Bonferroni adjustments yield liberal intervals, especially for small sample sizes or rare event categories. In contrast, bootstrap methods, specifically the Marginal Calibration and Rank-Based Simultaneous Confidence Sets, provide reliable error control and equal tail probabilities across diverse scenarios involving varying cluster sizes and degrees of overdispersion. These methods fill an important gap for multinomial endpoints and support the validation of concurrent controls using historical control data, in line with the recent European Food Safety Authority scientific opinion on the use and reporting of historical control data. 2026-03-20T12:14:41Z Sören Budig Frank Schaarschmidt Max Menssen http://arxiv.org/abs/2508.18716v3 Dynamic Count Models with Flexible Innovation Processes for Irregular Maritime Migration 2026-03-20T10:41:01Z Motivated by the challenge of analyzing the dynamics of weekly sea border crossings in the Mediterranean (2015-2025) and the English Channel (2018-2025), we develop a Bayesian dynamic framework for modeling heteroskedastic count time series. Building on theoretical considerations and empirical stylized facts, our approach utilizes a Poisson random walk model that allows for heavy-tailed innovations or stochastic volatility dynamics, while incorporating an explicit mechanism to separate structural from sampling zeros. Posterior inference is carried out via a straightforward Markov chain Monte Carlo algorithm. Applying this methodology to Mediterranean and English Channel data, we compare alternative model specifications through a comprehensive out-of-sample forecasting exercise. Using log predictive scores and empirical coverage at predictive quantiles to evaluate each model, we find strong evidence for stochastic volatility in migration innovations. These models deliver the strongest out-of-sample forecasts with empirical coverage close to nominal levels up to the 99th percentile. Our framework can be used to develop risk indicators with direct policy implications for improving governance and preparedness for migration surges. More broadly, the methodology extends to other zero-inflated non-stationary count time series applications, including epidemiological surveillance and public safety incident monitoring. 2025-08-26T06:27:41Z Gregor Zens Jakub Bijak