https://arxiv.org/api/Z7dcekDr2t465G/oD71oCsVxc7M 2026-06-14T03:03:52Z 23522 225 15 http://arxiv.org/abs/2502.04867v5 Invariant Image Reparameterisation: Bridging Symbolic and Numerical Methods for Identifiability Analysis, Model Reduction, and Prediction 2026-05-28T05:03:50Z

Structural and practical parameter non-identifiability issues are common when mathematical models are used to interpret data. Such issues motivate model reparameterisation and reduction methods. Here, we consider Invariant Image Reparameterisation (IIR), which asks when symbolic reparameterisation conditions can be replaced by numerical derivative calculations at a single reference point. The central object is the invariant image: a reduced, basis-independent representation of the parameter combinations controlling observable model behaviour. We show that when a one-to-one componentwise transformation makes observable behaviour depend only on fixed linear combinations of the transformed parameters, a single numerical Jacobian determines the associated lower-dimensional reparameterisation space. This includes models depending on monomial combinations of the original parameters. We also give a first-order invariance condition that distinguishes minimal from non-minimal but exact reductions via the invariant part of the local null space. In structurally identifiable but practically weakly informed settings, the same calculations separate strongly and weakly informed parameter combinations. The invariant image admits multiple coordinate representations: the SVD gives a default orthonormal basis ordered by local identifiability, while sparse monomial bases are often more interpretable. Treating these coordinates as interest parameters in Profile-Wise Analysis gives likelihood-based uncertainty quantification and prediction. We demonstrate the method on parameterised normal models with Poisson-limit, extended Poisson-limit, and non-limit cases, and on the repressilator, a nonlinear differential equation model of gene regulation. A Julia implementation of IIR, with these and further examples, is available at https://github.com/omaclaren/reparam.

2025-02-07T12:13:42Z 41 pages incl. supplementary material (main text approx. 28 pages) Oliver J. Maclaren Ruanui Nicholson Joel A. Trent Joshua Rottenberry Matthew Simpson http://arxiv.org/abs/2605.29296v1 Conformal prediction for functional time series: Application to age-specific mortality rates 2026-05-28T03:26:11Z

In demographic literature, forecast uncertainty is often quantified with a statistical model. This model-based approach may potentially suffer from drawbacks, namely model misspecification, selection effect, and lack of finite-sample validity. We introduce a model-agnostic and distribution-free procedure, conformal prediction, for constructing prediction intervals for a functional time series. In the family of conformal prediction, split conformal prediction divides the data into training, validation, and test sets. Within the validation set, we can select optimal tuning parameters by calibrating the empirical coverage probabilities to match their nominal values. With the selected optimal tuning parameters, we then construct the prediction intervals using the same forecasting model for the holdout data in the testing set. Without sample splitting, sequential conformal prediction sequentially updates the predicted quantiles via an autoregressive process. Using Australian age- and sex-specific log mortality rates, we evaluate and compare the interval forecast accuracy, as measured by empirical coverage probability, coverage probability difference and mean interval score, between the two variants of conformal prediction.

2026-05-28T03:26:11Z 27 pages, 4 figures, 7 tables Han Lin Shang http://arxiv.org/abs/2605.29284v1 Rapid Approximation Prediction for Kriging 2026-05-28T03:11:05Z

Exact Kriging and conditional simulation (CS) for uncertainty quantification are computationally infeasible for modern spatial analyses with large numbers of observations and dense prediction grids. We present a rapid approximation to the Kriging prediction step for stationary Gaussian processes for a regular prediction grid by approximating each off-grid covariance vector by a sparse linear combination of on-grid covariances within a local $L$-order neighborhood of $M = (2L)^2$ neighboring grid points. This reformulation reduces complexity from $O(N n^3)$ to $O(N \log N + nM + M^3)$ while preserving accuracy. A factorial study shows that approximation error decreases systematically with increased Matérn smoothness, neighbor order $L$, and grid resolution, aligning with bounds from kernel approximation theory. In a North American summer-rainfall application ($n=1368$), our method produces predictions visually indistinguishable from exact Kriging with point-wise errors on the order of $10^{-5}$ inches and achieves more than $150$ times speedups at a $350\times350$ grid, also outperforming Vecchia and LatticeKrig predictions. Embedded in a fast CS scheme, the approach reproduces Kriging standard errors and scales favorably with both $n$ and $N$. We recommend a practical workflow that uses a fast method for parameter estimation followed by our rapid predictor for fine-grid mapping and uncertainty quantification.

2026-05-28T03:11:05Z 11 figures, 38 pages Ziyu Li Gregory Fasshauer Douglas Nychka http://arxiv.org/abs/2605.29196v1 Coating Breakdown Prediction for Ships and Inspection Planning 2026-05-28T00:15:00Z

Marine corrosion significantly reduces a ship's availability, increases costs of operation and could impact safety. Protective coatings mitigate these risks, but their effectiveness deteriorates over time. Early detection of coating breakdown is crucial to prevent costly repairs and safety concerns. While corrosion itself is well-understood, coating degradation remains under-investigated due to insufficient long-term data. This work addresses this knowledge gap by enhancing coating defect prediction and optimizing inspection planning for ships. The Power Law Non-Homogeneous Poisson Process (PL-NHPP) is utilized for modeling coating defect arrivals. Unlike prior studies, we employ a hierarchical Bayesian approach for parameter fitting, effectively addressing limitations associated with scarce real-world data. Furthermore, we optimize inspection planning by incorporating out-of-service costs and potential costs increases due to delayed repairs. The efficacy of these methods is evaluated through a comprehensive case study involving a recently commissioned fleet with limited historical data. This research contributes to the advancement of condition-based maintenance (CBM) strategies for ships by enabling more accurate prediction of coating breakdowns and optimizing inspection schedules early in the life of the fleet. This approach ultimately improves operational efficiency and reduces life-cycle costs.

2026-05-28T00:15:00Z Huy Truong-Ba Michael E. Cholette Geoffrey Will Marc Hartmann http://arxiv.org/abs/2605.29193v1 Bayesian reversal of the liquid level trajectory in a draining tank for pollution forensics 2026-05-28T00:08:50Z

Storage tanks for hazardous liquids are common in industry and agriculture. During a pollution incident, liquid may drain from a storage tank through a small hole, crack, or pipe. After containing the leak, estimating the discharged volume of liquid is essential for public safety, regulatory assessment, and remediation. When the original inventory of liquid is unknown, this constitutes an inverse problem. In this work, we present a framework for inferring the initial liquid level in a partially drained tank from the observed final liquid level after a pollution incident and an estimate of the drainage duration. Because the drainage dynamics, model parameters, and observations are uncertain, we employ Bayesian statistical inversion to combine prior physical knowledge with experimental liquid level time series data to predict the initial liquid level with quantified uncertainty. We use a physics-based model based on Torricelli's law to describe the tank-draining dynamics and augment it with an empirical discrepancy function to account for missing or imperfectly modeled physics. In our experiments with a tank draining of water, we found that our inferred initial liquid level was accurate, although uncertainty increased with drainage duration. Beyond its application to pollution forensics, this work may also serve as a hands-on classroom project illustrating dynamic modeling, model discrepancy, and Bayesian inference.

2026-05-28T00:08:50Z Kyla D. Jones Gbenga Fabusola Alexander W. Dowling Cory M. Simon http://arxiv.org/abs/2506.08028v2 Sensor Fusion for Track Geometry Monitoring: Integrating On-Board Condition Monitoring and Degradation Models via Kalman Filtering 2026-05-28T00:00:55Z

Track geometry monitoring is essential for maintaining the safety and efficiency of railway operations. While Track Recording Cars (TRCs) provide accurate measurements of track geometry indicators, their limited availability and high operational costs restrict frequent monitoring across large rail networks. Recent advancements in on-board sensor systems installed on in-service trains offer a cost-effective alternative by enabling high-frequency, albeit less accurate, data collection. This study proposes a method to enhance the reliability of track geometry predictions by integrating low-accuracy sensor vibration signals with degradation models through a Kalman filter framework. An experimental campaign using a low-cost sensor system mounted on a TRC evaluates the proposed approach. The results demonstrate that incorporating frequent sensor data significantly reduces prediction uncertainty, even when the data is noisy. The study also investigates how the frequency of data recording influences the size of the credible prediction interval, providing guidance on the optimal deployment of on-board sensors for effective track monitoring and maintenance planning.

2025-06-02T00:31:53Z Huy Truong-Ba Jacky Chin Michael E. Cholette Pietro Borghesani http://arxiv.org/abs/2605.28974v1 Algorithm to check Maximum Likelihood Estimate Existence for integrated PCA 2026-05-27T18:23:10Z

Being encouraged by [AKRS] that provides an amazing bridge between Statistics and Invariant Theory, and especially by [FM], where quiver semi-invariant techniques apply to verify the existence of MLE for a recent iPCA model, we provide an enhancement to [FM]. Our Theorem 5.2 yields necessary and sufficient conditions for MLE to exist generically for any dimension vector. The conditions can be easily checked with our software [T] based on Derksen-Weyman algorithm and simplifying the application for statistics practitioners and non-specialists in quivers. For those deep in quiver Representation Theory, Theorem 5.2 relates the MLE existence to the local semi-simplicity of representations as introduced in [Sh07]. We also hope that our elementary and short text can serve for the experts in both domains as a warm start in a new category.

2026-05-27T18:23:10Z 6 pages Dmitri Shmelkin http://arxiv.org/abs/2605.28762v1 Deep Neural Networks for Doubly Robust Estimation with Nonprobability Survey Samples 2026-05-27T17:21:50Z

Integrating probability and nonprobability survey samples is an important problem in modern survey sampling. Nonprobability samples often contain rich outcome information but may lack population representativeness, whereas probability samples provide design-based auxiliary information but may not contain the study variable. We propose a deep neural network (DNN)-assisted doubly robust framework for estimating the finite population mean from these two data sources. The proposed method models the logit sampling score for the nonprobability sample as an unknown nonparametric function and estimates it by maximizing a pseudo-likelihood that combines information from the nonprobability sample and a reference probability sample. The DNN parameters are optimized using the ADAM algorithm. The resulting DNN-estimated sampling scores are incorporated into a DNN-assisted inverse-probability weighted estimator and a deep doubly robust estimator. We establish consistency and convergence rates under regularity conditions and evaluate the finite-sample performance of the proposed estimators through simulation studies and an empirical application using Pew Research Center and Behavioral Risk Factor Surveillance System data. The results suggest that the proposed estimators can improve robustness to parametric propensity-score misspecification, especially when the true selection mechanism is nonlinear.

2026-05-27T17:21:50Z 29 pages, 1 figure Yufang Dai Shihua Luo Wendy Lou Zilin Wang Xuewen Lu http://arxiv.org/abs/2411.13479v4 Conformal Prediction for Hierarchical Data 2026-05-27T16:36:39Z

We consider conformal prediction for multivariate data and focus on hierarchical data, where some components are linear combinations of others. Intuitively, the hierarchical structure can be leveraged to reduce the size of prediction regions for the same coverage level. We implement this intuition by including a projection step (also called a reconciliation step) in the split conformal prediction [SCP] procedure, and prove that the resulting prediction regions are indeed globally smaller. We do so both under the classic objective of joint coverage and under a new and challenging task: component-wise coverage, for which efficiency results are more difficult to obtain. The associated strategies and their analyses are based both on the literature of SCP and of forecast reconciliation, which we connect. We also illustrate the theoretical findings, for different scales of hierarchies on simulated data.

2024-11-20T17:26:26Z 39 pages, 4 figures Guillaume Principato Gilles Stoltz Yvenn Amara-Ouali Yannig Goude Bachir Hamrouche Jean-Michel Poggi http://arxiv.org/abs/2605.28344v1 Capturing the Curve: Functional Data Analysis for Validated Digital Outcome Measures 2026-05-27T11:48:45Z

Digital health technologies enable high-frequency collection of data in near-continuous time and capture rich information about the health of individuals. The raw data collected by these devices often have a hierarchical functional structure: repeated physiological functions are observed over time and on multiple time scales (seconds, days, weeks). While many summaries can be derived from digital data, typically, only a small subset of pre-defined scalars is validated as outcome measures in clinical trials. We explore data-driven summaries based on between-subject scores from Multilevel Functional Principal Component Analysis (MFPCA), which are low-dimensional representations of functional data with robust statistical properties. Specifically, we compute MFPCA projection scores with respect to a reference population, summarising how individuals differ from the dominant directions of variation at each hierarchical level. Through a simulation study based on smartwatch electrocardiogram (ECG) signals, we compare MFPCA scores with pre-specified summaries in terms of validation criteria, including test-retest reliability and known-groups discrimination. We demonstrate that MFPCA scores generally have high reliability and can discriminate between groups across simulated scenarios of change. This offers an advantage when digital tools enable the measurement of novel physiological signals and the characteristics of the change are not yet defined. Finally, using knee flexion-extension data from individuals living with Parkinson's disease, we demonstrate that one of the MFPCA scores more strongly correlates with established gold-standard metrics and can detect clinical change, compared to a pre-specified scalar. We conclude that MFPCA-derived scores retain more information than typical outcome measures and open the door to using learning representation strategies in clinical trial settings.

2026-05-27T11:48:45Z Mia S. Tackney Marcos Matabuena Marco Palma Michael Wester Claire Maassen Thomas Krammer Julian Mustroph Peter H. Charlton James Carpenter Sofia S. Villar http://arxiv.org/abs/2605.28212v1 How to measure intra-physician variability in clinical decision-making? 2026-05-27T09:30:45Z

Intra-physician prescribing variability, the probability that one physician issues discordant decisions for two patients deemed comparable on observed covariates, holds great impact in quality of care, safety and cost. However, there are no known validated measurement methods. Here, we benchmark eight methods (Euclidean, Mahalanobis, Learned-Weights, Genetic Mahalanobis, Random Forest proximity, Mutual-Information-weighted, Latent Profile Analysis and Bayesian binomial generalized linear mixed model) against a synthetic ground truth across 94 experimental conditions. Learned-Weights matching achieves the lowest mean absolute error (0.027), followed by Mutual-Information-weighted matching (0.028) and RF Proximity (0.034). All eight discordance-analysis methods preserve the physician rank ordering with high fidelity (Spearman > 0.89 versus the ground truth on the SCORE2 experiment), as long as the physician variability groups are well separated. Under a continuous-heterogeneity physician model, rank preservation degrades substantially for unsupervised methods (Spearman = [0.28, 0.35]) but is retained by supervised feature-weighted methods and the GLMM (Spearman = [0.62, 0.68]). This controlled methodological evaluation is a foundation for validation on observational prescribing data. Once validated on observational prescribing data, these evaluated open-source estimators could turn prescribing inconsistency into a routinely measurable clinician-level quality metric, systematically complementing the existing literature on between-physician variation.

2026-05-27T09:30:45Z 24 pages, 7 tables, 3 figures Alaedine Benani Pierre Meneton Emmanuel Messas Liza Hettal Sai Sagireddy Damien Grosgeorge Jérôme Salomon Sylvain Bodard Xavier Tannier http://arxiv.org/abs/2603.08276v2 A Unified Framework for Density Estimation under Right-Censored Point-Centred Quarter Sampling 2026-05-27T06:24:13Z

While the point-centred quarter method (PCQM) is widely used for density estimation, existing methods for handling right-censored data from truncated search radii rely primarily on a Poisson model assuming complete spatial randomness (CSR), leaving a critical gap for spatially aggregated populations. To address this limitation, we develop a unified likelihood- and moment-based framework for right-censored point-centred quarter sampling under both Poisson and negative binomial distribution (NBD) models. In particular, the proposed NBD-based estimators explicitly account for spatial aggregation and censoring simultaneously, extending distance-based inference beyond the CSR setting. Extensive simulations and applications to fully mapped forest plots reveal that the NBD-based MLE delivers the most robust overall performance across diverse ecological scenarios. Across more than 100 species from fully mapped forest plots, the proposed NBD-based MLE approximately reduced absolute relative bias by a median of 0.10 compared with existing censored estimators, representing a relative improvement of over 30%. Ultimately, our framework provides a rigorously validated and practically useful toolkit for analysing censored point-to-tree distance data.

2026-03-09T11:47:55Z 42 pages, 28 figures, 4 table Wenzhe Huang Guochun Shen Dingliang Xing Jiangyan Zhao http://arxiv.org/abs/2601.07299v2 Cauchy-Gaussian Overbound for Heavy-tailed GNSS Measurement Errors 2026-05-27T01:37:24Z

Overbounds of heavy-tailed measurement errors are essential to meet stringent navigation requirements in integrity monitoring applications. This paper proposes to leverage the bounding sharpness of the Cauchy distribution in the core and the Overbounds of heavy-tailed measurement errors are essential for meeting stringent navigation requirements in integrity-monitoring applications. This paper proposes to leverage the bounding sharpness of the Cauchy distribution in the core and the Gaussian distribution in the tails to tightly bound heavy-tailedglobal navigation satellite system measurement errors. We develop a procedure to determine the overbounding parameters for both symmetric unimodal (SU)and non-symmetric unimodal (NSU) heavy-tailed errors and prove that the over-bounding property is preserved through convolution. Experiment results on both simulated and real-world data sets reveal that our method can sharply boundheavy-tailed errors in both the core and tail regions. In the position domain, the proposed method reduces the average vertical protection level by 15% for SU heavy-tailed errors compared with the single-cumulative-density-function Gaussian overbound and by 21%-47% for NSU heavy-tailed errors compared with the navigation discrete envelope and two-step Gaussian overbounds.

2026-01-12T08:21:14Z Published in NAVIGATION: Journal of the Institute of Navigation Zhengdao Li Penggao Yan Weisong Wen Li-Ta Hsu 10.33012/navi.749 http://arxiv.org/abs/2605.27796v1 Benchmarking Ultrasound Foundation Models for Fetal Plane Classification 2026-05-27T00:32:40Z

Ultrasound is widely used in obstetric care due to its safety, accessibility, and real-time imaging. However, interpretation remains operator-dependent and susceptible to noise and artifacts. Deep learning models have shown strong performance to solve these problem, but they typically require large annotated datasets that are difficult to obtain in clinical ultrasound. Foundation models (FMs) offer an alternative, using a large number of ultrasound images to learn transferable representations that can generalize with limited labeled data. This work presents a comprehensive benchmark of ultrasound-specific FMs for fetal plane classification. We evaluated four ultrasound FMs (USFM, MOFO, UltraSAM, FetalCLIP) against two CNN baselines (ResNet50, EfficientNet-V2) and a ViT (DINOv3) pretrained on natural images. We trained all models under two complementary settings: full fine-tuning and linear probing with a frozen encoder. All models were trained using 5-fold patient-level cross-validation on a Spanish fetal ultrasound dataset and tested on both in-domain data and an external African cohort to assess cross-population generalization. We found that FetalCLIP achieved the best results in the linear probing setting (F1 = 0.9261 for in-domain, F1 = 0.9731 for out-of-domain), while USFM performed best in the full fine-tuning setting (F1 = 0.9476 for in-domain, F1 = 0.9515 for out-of-domain). MOFO and UltraSAM degraded most in both settings, underperforming natural image pretrained models in some cases. These findings highlight how the choice of pretrained model strongly affects fetal plane classification performance, since different pretraining objectives lead to different levels of transferability.

2026-05-27T00:32:40Z Leya Barrientos Yuexi Du Nicha C. Dvornek http://arxiv.org/abs/2605.27781v1 Day-Ahead Electricity Price Forecasting Using a Multivariate Group Lasso Method 2026-05-27T00:08:44Z

Electricity price signals in modern power systems exhibit complex dependence structures that render forecasting inherently challenging. Our analysis of real-world pricing signals from the California Independent System Operator (CAISO) reveals complex temporal group effects, whereby the influence of explanatory variables on electricity prices persists across consecutive blocks of time due to underlying economic and operational drivers. In response, we propose a multivariate statistical method based on a Group Lasso formulation to forecast the vector of day-ahead electricity prices, by leveraging multi-feature temporal group effects. Our approach is evaluated on two full years of electricity prices from CAISO, demonstrating considerable improvements in point and probabilistic forecast metrics compared to a wide array of statistical and deep learning methods. Theoretical and empirical analyses confirm the effectiveness of the proposed approach in modeling realistic group effects, maintaining both interpretability and low computational complexity. When retrospectively evaluated on test data from a recent international electricity price forecasting challenge, the proposed method ranked in second place, despite having access to significantly less information than competing approaches. Finally, the proposed method is independently validated against two operational electricity price forecasting systems in CAISO, demonstrating competitive predictive performance and practical relevance.

2026-05-27T00:08:44Z Keyi Wang Jiaxiang Ji Mahan Mansouri Ahmed Aziz Ezzat