https://arxiv.org/api//kf+O7v3seK1e6Iv486fLEkvQbY 2026-03-20T10:52:21Z 22783 0 15 http://arxiv.org/abs/2603.19143v1 The Uncertain Policy Price of Scaling Direct Air Capture 2026-03-19T16:59:11Z Direct air carbon capture and storage (DACCS) is a promising CO2 removal technology, but its deployment at scale remains speculative. Yet, its technological, economic, and policy-related uncertainties have often been overlooked in mitigation pathways. This paper conducts the first uncertainty quantification and global sensitivity analysis of DACCS on technological, market, financial and public support drivers, using a detailed-process Integrated Assessment Model and newly developed sensitivity algorithms. We find that DACCS deployment exhibits a fat-tailed distribution: most scenarios show modest technology uptake, but there is a small but non-zero probability (4-6%) of achieving gigaton-scale removals by mid-century. Scaling DACCS to gigaton levels requires subsidies that always exceed 200-330 USD/tCO2 and are sustained for decades, resulting in a public support programme of 900-3000 USD Billions. Such an effort pays back by mid-century, but only if accompanied by strong emission reduction policies. These findings highlight the critical role of climate policies in enabling a robust and economically sustainable CO2 removal strategy. 2026-03-19T16:59:11Z Leonardo Chiani Pietro Andreoni Laurent Drouet Tobias Schmidt Katrin Sievert Bjerne Steffen Massimo Tavoni http://arxiv.org/abs/2603.19055v1 Probabilistic multivariate statistical process control via kernel parameter uncertainty propagation 2026-03-19T15:48:48Z Kernel-based multivariate statistical process control (K-MSPC) extends classical monitoring to nonlinear industrial processes. Its performance depends critically on kernel parameters such as lengthscales and variance terms. In current practice these parameters are typically selected by heuristics or deterministic optimisation, and then treated as fixed, despite being inferred from finite and noisy data. This can lead to overconfident control limits and unstable alarm behaviour when the kernel choice is uncertain. This work proposes a probabilistic K-MSPC framework that quantifies and propagates kernel parameter uncertainty to the monitoring statistics. The approach follows a two-stage workflow: (i) deterministic kernel calibration using supervised or unsupervised models, and (ii) Bayesian inference of kernel parameters via Markov chain Monte Carlo. Posterior samples are propagated through kernel Principal Component Analysis to produce probabilistic $T^2$ and squarred prediction error control charts, together with uncertainty-aware contribution plots. The framework is evaluated on the Tennessee Eastman Process benchmark. Results show that posterior-mean monitoring often improves fault detection compared to deterministic prior-mean charts for the squared exponential kernel, while credible bands remain narrow in-control and widen under faults, reflecting amplified epistemic uncertainty in abnormal regimes. The automatic relevance determination kernel reduces posterior uncertainty and yields performance close to the deterministic baseline, whereas unsupervised calibration produces wider posterior bands but still robust fault detection. 2026-03-19T15:48:48Z Zina-Sabrina Duma Victoria Jorry Ayesha Safraz Maria Paola di Crosta Tuomas Sihvonen Lassi Roininen Satu-Pia Reinikainen http://arxiv.org/abs/2603.18781v1 SRRM: Improving Recursive Transport Surrogates in the Small-Discrepancy Regime 2026-03-19T11:32:28Z Recursive partitioning methods provide computationally efficient surrogates for the Wasserstein distance, yet their statistical behavior and their resolution in the small-discrepancy regime remain insufficiently understood. We study Recursive Rank Matching (RRM) as a representative instance of this class under a population-anchored reference. In this setting, we establish consistency and an explicit convergence rate for the anchored empirical RRM under the quadratic cost. We then identify a dominant mismatch mechanism responsible for the loss of resolution in the small-discrepancy regime. Based on this analysis, we introduce Selective Recursive Rank Matching (SRRM), which suppresses the resulting dominant mismatches and yields a higher-fidelity practical surrogate for the Wasserstein distance at moderate additional computational cost. 2026-03-19T11:32:28Z 29 pages,20 figures Yufei Zhang Tao Wang Jingyi Zhang http://arxiv.org/abs/2603.16982v2 Trajectory Stability and Signature Diagnostics for Comet-Based Interstellar Navigation 2026-03-19T10:42:10Z Interstellar objects (ISOs) motivate a coupled mission-design and inference question relevant to spacecraft dynamics and control in extreme environments: if volatile-rich, rotating comet-like bodies were used for sustained deep-space navigation by exploiting pre-existing hyperbolic motion and in-situ propellant, what stability requirements arise under non-gravitational forcing, and what astrometric signatures might distinguish active stabilization from uncontrolled natural dynamics? We develop a stability-theoretic framework for trajectory tracking with jet-actuated correction, and show that high-speed transit geometry -- including debris-belt avoidance and encounter phasing -- tightly constrains feasible trajectories, making long-horizon tracking stability mission-critical. We model tracking residuals as the balance of disturbances and corrective action, and derive stability conditions across four levels: disturbance-energy stability, outer-loop contraction, actuator-memory stability, and rotation-mediated (Floquet) stability. The analysis implies residual diagnostics that can motivate empirical tests: under comparable forcing, effective stabilization is expected to strengthen short-horizon error correction, reduce event-conditioned persistence and variance clustering, regularize standardized innovations, and yield bounded post-shock recovery. More broadly, the framework provides a reference for deep-space guidance and control under nonlinear, multi-field disturbances and for planetary-defense concepts involving attitude shaping or impulsive kinetic impact. 2026-03-17T15:30:15Z 31 pages, 2 figures Bo Pieter Johannes Andrée http://arxiv.org/abs/2507.04303v3 Forecasting age distribution of deaths across countries: Life expectancy and annuity valuation 2026-03-18T22:47:54Z In this paper, we provide a comprehensive cross-country validation study of compositional mortality modeling and forecasting methods. Thus, we consider two one-to-one transformations: the cumulative distribution function and the centered log-ratio transformation in compositional data analysis. Between the two transformations, the cumulative distribution function provides a scale-free way to visualize the gender gap and cross-country heterogeneity in the probability of dying by sex and country. Drawing on age-specific period life-table death counts from 24 countries in the Human Mortality Database (2025), we assess and compare the point and interval forecast accuracy of the two transformations, using the same forecasting method. Enhancing the forecast accuracy of period life-table death counts is of significant value to demographers, who rely on such forecasts to estimate survival probabilities and life expectancy, and to actuaries, who use them to price annuities across various entry ages and maturities. 2025-07-06T09:03:48Z 34 pages, 15 figures, 5 tables Han Lin Shang Steven Haberman http://arxiv.org/abs/2603.18279v1 Covariate-Dependent Functional Principal Component Analysis for SHM 2026-03-18T20:54:13Z In Structural Health Monitoring (SHM), sensor measurements and derived features such as eigenfrequencies often exhibit systematic daily patterns and can therefore be naturally represented as functional data. Furthermore, these patterns are typically influenced by environmental factors, particularly temperature, which can substantially affect the observed system response. While most existing methods for removing environmental effects assume that confounding influences affect only the mean response, it has been shown that environmental and operational factors may also alter the covariance structure of the residual process. To address this limitation in a functional data monitoring framework, we incorporate so-called covariate-dependent functional principal component analysis (CD-FPCA), which allows eigenfunctions and eigenvalues of the residual process to vary smoothly with covariates such as temperature. The proposed methodology is illustrated using an extended version of the KW51 railway bridge eigenfrequency dataset. This case study suggests that accounting for covariate effects beyond the functional mean can improve the robustness of the monitoring procedure, in particular by reducing environmentally induced (false) alarms under challenging low-temperature conditions. 2026-03-18T20:54:13Z 10 pages, 3 figures, conference Philipp Wittenberg Lizzie Neumann Kristof Maes Jan Gertheiss http://arxiv.org/abs/2510.20012v2 AI Pose Analysis and Kinematic Profiling of Range-of-Motion Variations in Resistance Training 2026-03-18T20:21:17Z This study develops an AI-based pose estimation pipeline for quantifying movement kinematics in resistance training. Using videos from Wolf et al. (2025), comprising 303 recordings of 26 participants performing eight upper-body exercises under full (fROM) and lengthened partial (pROM) conditions, we extract joint-angle trajectories using five distinct deep-learning pose estimation models and a unified signal-processing framework. From these trajectories, we derive repetition-level metrics including range of motion (ROM) and repetition duration. We use these outputs as dependent variables in a crossed random-effects model that accounts for participant-, exercise-, and model-level variability to assess systematic differences between ROM conditions. Results indicate that pROM reduces range of motion without significantly affecting repetition duration. Variance decomposition shows that pROM increases both between-participant and between-exercise variability, suggesting reduced consistency in execution. To enable cross-exercise comparison, we model ROM on a logarithmic scale and define %ROM as the proportion of fROM achieved under pROM. While the estimated mean is approximately 56\%, significant heterogeneity across exercises indicates that lengthened partials are not characterized by a fixed proportion of full ROM. The results demonstrate that AI-based motion analysis can provide reliable kinematic insights to inform evidence-based training recommendations. 2025-10-22T20:27:45Z Adam Diamant http://arxiv.org/abs/2602.07684v2 Quantifying resilience for distribution system customers with SALEDI 2026-03-18T19:27:09Z The impact of routine smaller outages on distribution system customers in terms of customer minutes interrupted can be tracked using conventional reliability indices. However, the customer minutes interrupted in large blackout events are extremely variable, and this makes it difficult to quantify the customer impact of these extreme events with resilience metrics. We solve this problem with the System Average Large Event Duration Index SALEDI that logarithmically transforms the customer minutes interrupted. We explain how this new resilience metric works, compare it with alternatives, quantify its statistical accuracy, and illustrate its practical use with standard outage data from five utilities. 2026-02-07T20:10:43Z Arslan Ahmad Ian Dobson http://arxiv.org/abs/2412.02484v2 Vector Optimization with Gaussian Process Bandits 2026-03-18T19:03:13Z We study black-box vector optimization with Gaussian process bandits, where there is an incomplete order relation on objective vectors described by a polyhedral convex cone. Existing black-box vector optimization approaches either suffer from high sample complexity or lack theoretical guarantees. We propose Vector Optimization with Gaussian Process (VOGP), an adaptive elimination algorithm that identifies Pareto optimal solutions sample efficiently by exploiting the smoothness of the objective function. We establish theoretical guarantees, deriving information gain-based and kernel-specific sample complexity bounds. Finally, we conduct a thorough empirical evaluation of VOGP and compare it with the state-of-the-art multi-objective and vector optimization algorithms on several real-world and synthetic datasets, emphasizing VOGP's efficiency (e.g., $\sim18\times$ lower sample complexity on average). We also provide heuristic adaptations of VOGP for cases where the design space is continuous and where the Gaussian process model lacks access to the true kernel hyperparameters. This work opens a new frontier in sample-efficient multi-objective black-box optimization by incorporating preference structures while maintaining theoretical guarantees and practical efficiency. 2024-12-03T14:47:46Z İlter Onat Korkmaz Yaşar Cahit Yıldırım Çağın Ararat Cem Tekin http://arxiv.org/abs/2603.18195v1 The Role of Data and Metrics in Measuring Inequality Worldwide. A Tribute to Giovanni Andrea Cornia's Lifelong Work on the World Ginis 2026-03-18T18:45:31Z This paper pays tribute to Professor Giovanni Andrea Cornia's lifelong contributions to the measurement of global inequality. We review twelve world and regional databases of the Gini coefficient, illustrate their coverage, overlapping, and data gaps, and analyse the major sources of discrepancy among published Ginis. Merging all databases into a unified collection of over 122,000 observations spanning 222 countries from 1867 to 2024, we document how differences in welfare metrics, reference units, sub-metric definitions, post-survey adjustments, and survey design produce Gini estimates that diverge considerably -- sometimes by as much as 50 percentage points -- for the same country and year. We quantify pairwise cross-database discordance, document the income-consumption Gini gap by region and income group, and discuss the contributions of welfare metric and equivalence scale choices to cross-database dispersion. We extend the analysis with a dedicated discussion of comparability across time and across measurement dimensions, showing how multiple layers of methodological choice interact to make any single Gini figure a product of a complex chain of decisions that are rarely fully disclosed. Our analysis confirms that the choice of welfare metric remains the single most important source of cross-country non-comparability, while sub-metric definitions and equivalence scales introduce further systematic differences that are routinely overlooked in comparative work. 2026-03-18T18:45:31Z 26 Pages, 5 Figures, 7 Tables Lidia Ceriani Paolo Verme http://arxiv.org/abs/2603.18190v1 Starting Off on the Wrong Foot: Pitfalls in Data Preparation 2026-03-18T18:37:33Z When working with real-world insurance data, practitioners often encounter challenges during the data preparation stage that can undermine the statistical validity and reliability of downstream modeling. This study illustrates that conventional data preparation procedures such as random train-test partitioning, often yield unreliable and unstable results when confronted with highly imbalanced insurance loss data. To mitigate these limitations, we propose a novel data preparation framework leveraging two recent statistical advancements: support points for representative data splitting to ensure distributional consistency across partitions, and the Chatterjee correlation coefficient for initial, non-parametric feature screening to capture feature relevance and dependence structure. We further integrate these theoretical advances into a unified, efficient framework that also incorporates missing-data handling, and embed this framework within our custom InsurAutoML pipeline. The performance of the proposed approach is evaluated using both simulated datasets and datasets often cited in the academic literature. Our findings definitively demonstrate that incorporating statistically rigorous data preparation methods not only significantly enhances model robustness and interpretability but also substantially reduces computational resource requirements across diverse insurance loss modeling tasks. This work provides a crucial methodological upgrade for achieving reliable results in high stakes insurance applications. 2026-03-18T18:37:33Z 42 pages, 37 references Jiayi Guo Panyi Dong Zhiyu Quan http://arxiv.org/abs/2111.06390v4 Theoretical Foundations of δ-margin Majority Voting 2026-03-18T17:52:02Z In high-stakes ML applications such as fraud detection, medical diagnostics, and content moderation, practitioners rely on consensus-based approaches to control prediction quality. A particularly valuable technique -- δδδ-margin majority voting -- collects votes sequentially until one label exceeds alternatives by a threshold δδδ, offering stronger confidence than simple majority voting. Despite widespread adoption, this approach has lacked rigorous theoretical foundations, leaving practitioners reliant on heuristics for key metrics like expected accuracy and cost. This paper establishes a comprehensive theoretical framework for δδδ-margin majority voting by formulating it as an absorbing Markov chain and leveraging Gambler's Ruin theory. Our contributions form a practical \emph{design calculus} for δδδ-margin voting: (1)~Closed-form expressions for consensus accuracy, expected voting duration, variance, and the stopping-time PMF, enabling model-based design rather than trial-and-error. (2)~A Bayesian extension handling uncertainty in worker accuracy, supporting real-time monitoring of expected quality and cost as votes arrive, with single-Beta and mixture-of-Betas priors. (3)~Cost-calibration methods for achieving equivalent quality across worker pools with different accuracies and for setting payment rates accordingly. We validate our predictions on two real-world datasets, demonstrating close agreement between theory and observed outcomes. The framework gives practitioners a rigorous toolkit for designing δδδ-margin voting processes, replacing ad-hoc experimentation with model-based design where quality control and cost transparency are essential. 2021-11-11T18:58:09Z Margarita Boyarskaya Panos Ipeirotis http://arxiv.org/abs/2603.17866v1 Bayesian multilevel step-and-turn models for evaluating player movement in American football 2026-03-18T15:54:28Z In sports analytics, player tracking data have driven significant advancements in the task of player evaluation. We present a novel generative framework for evaluating the observed frame-by-frame player positioning against a distribution of hypothetical alternatives. We illustrate our approach by modeling the within-play movement of an individual ball carrier in the National Football League (NFL). Specifically, we develop Bayesian multilevel models for frame-level player movement based on two components: step length (distance between successive locations) and turn angle (change in direction between successive steps). Using the step-and-turn models, we perform posterior predictive simulation to generate hypothetical ball carrier steps at each frame during a play. This enables comparison of the observed player movement with a distribution of simulated alternatives using common valuation measures in American football. We apply our framework to tracking data from the first nine weeks of the 2022 NFL season and derive novel player performance metrics based on hypothetical evaluation. 2026-03-18T15:54:28Z Quang Nguyen Ronald Yurko http://arxiv.org/abs/2603.17717v1 Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation 2026-03-18T13:35:02Z Supervised detection of network attacks has always been a critical part of network intrusion detection systems (NIDS). Nowadays, in a pivotal time for artificial intelligence (AI), with even more sophisticated attacks that utilize advanced techniques, such as generative artificial intelligence (GenAI) and reinforcement learning, it has become a vital component if we wish to protect our personal data, which are scattered across the web. In this paper, we address two tasks, in the first unified multi-modal NIDS dataset, which incorporates flow-level data, packet payload information and temporal contextual features, from the reprocessed CIC-IDS-2017, CIC-IoT-2023, UNSW-NB15 and CIC-DDoS-2019, with the same feature space. In the first task we use machine learning (ML) algorithms, with stratified cross validation, in order to prevent network attacks, with stability and reliability. In the second task we use adversarial learning algorithms to generate synthetic data, compare them with the real ones and evaluate their fidelity, utility and privacy using the SDV framework, f-divergences, distinguishability and non-parametric statistical tests. The findings provide stable ML models for intrusion detection and generative models with high fidelity and utility, by combining the Synthetic Data Vault framework, the TRTS and TSTR tests, with non-parametric statistical tests and f-divergence measures. 2026-03-18T13:35:02Z Iakovos-Christos Zarkadis Christos Douligeris http://arxiv.org/abs/2603.17599v1 Prediction with Missing Data: Target Probabilities and Missingness Mechanisms 2026-03-18T11:13:41Z Conditions ensuring optimal parameter estimation in the presence of missing data are well established in inference, typically relying on the Missing-at-Random (MAR) assumption. In prediction, similar principles are often assumed to apply. However, methods considered biased in inference, such as pattern sub-modelling or unconditional imputation, have been shown to achieve optimal predictive performance under any missingness mechanism, including non-MAR (MNAR). To explain this apparent contradiction, we introduce a new formal framework for describing missingness in prediction. Central to this framework is a distinction between two prediction targets, defined according to whether or not the indicator of observation of the predictors is exploited to predict the outcome. This distinction leads to a classification of the missingness mechanisms describing the conditions under which these targets are equal, and when consistent prediction of each is achievable. A key result is that both targets may be consistently predicted under conditions weaker than MAR. We discuss the implications of this paradigm for handling missing data in prediction, distinguishing between missingness at development, validation and deployment of a forecaster. The findings are illustrated using simulated data and a real-world application with the prediction of significant injury after trauma upon arrival at the emergency department. 2026-03-18T11:13:41Z 55 pages (including 40 pages for the main article and 15 pages for the supplementary material) Pierre Catoire Robin Genuer Cecile Proust-Lima