https://arxiv.org/api/Z7dcekDr2t465G/oD71oCsVxc7M2026-06-14T03:03:52Z2352222515http://arxiv.org/abs/2502.04867v5Invariant Image Reparameterisation: Bridging Symbolic and Numerical Methods for Identifiability Analysis, Model Reduction, and Prediction2026-05-28T05:03:50ZStructural and practical parameter non-identifiability issues are common when mathematical models are used to interpret data. Such issues motivate model reparameterisation and reduction methods. Here, we consider Invariant Image Reparameterisation (IIR), which asks when symbolic reparameterisation conditions can be replaced by numerical derivative calculations at a single reference point. The central object is the invariant image: a reduced, basis-independent representation of the parameter combinations controlling observable model behaviour. We show that when a one-to-one componentwise transformation makes observable behaviour depend only on fixed linear combinations of the transformed parameters, a single numerical Jacobian determines the associated lower-dimensional reparameterisation space. This includes models depending on monomial combinations of the original parameters. We also give a first-order invariance condition that distinguishes minimal from non-minimal but exact reductions via the invariant part of the local null space. In structurally identifiable but practically weakly informed settings, the same calculations separate strongly and weakly informed parameter combinations. The invariant image admits multiple coordinate representations: the SVD gives a default orthonormal basis ordered by local identifiability, while sparse monomial bases are often more interpretable. Treating these coordinates as interest parameters in Profile-Wise Analysis gives likelihood-based uncertainty quantification and prediction. We demonstrate the method on parameterised normal models with Poisson-limit, extended Poisson-limit, and non-limit cases, and on the repressilator, a nonlinear differential equation model of gene regulation. A Julia implementation of IIR, with these and further examples, is available at https://github.com/omaclaren/reparam.2025-02-07T12:13:42Z41 pages incl. supplementary material (main text approx. 28 pages)Oliver J. MaclarenRuanui NicholsonJoel A. TrentJoshua RottenberryMatthew Simpsonhttp://arxiv.org/abs/2605.29296v1Conformal prediction for functional time series: Application to age-specific mortality rates2026-05-28T03:26:11ZIn demographic literature, forecast uncertainty is often quantified with a statistical model. This model-based approach may potentially suffer from drawbacks, namely model misspecification, selection effect, and lack of finite-sample validity. We introduce a model-agnostic and distribution-free procedure, conformal prediction, for constructing prediction intervals for a functional time series. In the family of conformal prediction, split conformal prediction divides the data into training, validation, and test sets. Within the validation set, we can select optimal tuning parameters by calibrating the empirical coverage probabilities to match their nominal values. With the selected optimal tuning parameters, we then construct the prediction intervals using the same forecasting model for the holdout data in the testing set. Without sample splitting, sequential conformal prediction sequentially updates the predicted quantiles via an autoregressive process. Using Australian age- and sex-specific log mortality rates, we evaluate and compare the interval forecast accuracy, as measured by empirical coverage probability, coverage probability difference and mean interval score, between the two variants of conformal prediction.2026-05-28T03:26:11Z27 pages, 4 figures, 7 tablesHan Lin Shanghttp://arxiv.org/abs/2605.29284v1Rapid Approximation Prediction for Kriging2026-05-28T03:11:05ZExact Kriging and conditional simulation (CS) for uncertainty quantification are computationally infeasible for modern spatial analyses with large numbers of observations and dense prediction grids. We present a rapid approximation to the Kriging prediction step for stationary Gaussian processes for a regular prediction grid by approximating each off-grid covariance vector by a sparse linear combination of on-grid covariances within a local $L$-order neighborhood of $M = (2L)^2$ neighboring grid points. This reformulation reduces complexity from $O(N n^3)$ to $O(N \log N + nM + M^3)$ while preserving accuracy. A factorial study shows that approximation error decreases systematically with increased Matérn smoothness, neighbor order $L$, and grid resolution, aligning with bounds from kernel approximation theory. In a North American summer-rainfall application ($n=1368$), our method produces predictions visually indistinguishable from exact Kriging with point-wise errors on the order of $10^{-5}$ inches and achieves more than $150$ times speedups at a $350\times350$ grid, also outperforming Vecchia and LatticeKrig predictions. Embedded in a fast CS scheme, the approach reproduces Kriging standard errors and scales favorably with both $n$ and $N$. We recommend a practical workflow that uses a fast method for parameter estimation followed by our rapid predictor for fine-grid mapping and uncertainty quantification.2026-05-28T03:11:05Z11 figures, 38 pagesZiyu LiGregory FasshauerDouglas Nychkahttp://arxiv.org/abs/2605.29196v1Coating Breakdown Prediction for Ships and Inspection Planning2026-05-28T00:15:00ZMarine corrosion significantly reduces a ship's availability, increases costs of operation and could impact safety. Protective coatings mitigate these risks, but their effectiveness deteriorates over time. Early detection of coating breakdown is crucial to prevent costly repairs and safety concerns. While corrosion itself is well-understood, coating degradation remains under-investigated due to insufficient long-term data. This work addresses this knowledge gap by enhancing coating defect prediction and optimizing inspection planning for ships. The Power Law Non-Homogeneous Poisson Process (PL-NHPP) is utilized for modeling coating defect arrivals. Unlike prior studies, we employ a hierarchical Bayesian approach for parameter fitting, effectively addressing limitations associated with scarce real-world data. Furthermore, we optimize inspection planning by incorporating out-of-service costs and potential costs increases due to delayed repairs. The efficacy of these methods is evaluated through a comprehensive case study involving a recently commissioned fleet with limited historical data. This research contributes to the advancement of condition-based maintenance (CBM) strategies for ships by enabling more accurate prediction of coating breakdowns and optimizing inspection schedules early in the life of the fleet. This approach ultimately improves operational efficiency and reduces life-cycle costs.2026-05-28T00:15:00ZHuy Truong-BaMichael E. CholetteGeoffrey WillMarc Hartmannhttp://arxiv.org/abs/2605.29193v1Bayesian reversal of the liquid level trajectory in a draining tank for pollution forensics2026-05-28T00:08:50ZStorage tanks for hazardous liquids are common in industry and agriculture. During a pollution incident, liquid may drain from a storage tank through a small hole, crack, or pipe. After containing the leak, estimating the discharged volume of liquid is essential for public safety, regulatory assessment, and remediation. When the original inventory of liquid is unknown, this constitutes an inverse problem. In this work, we present a framework for inferring the initial liquid level in a partially drained tank from the observed final liquid level after a pollution incident and an estimate of the drainage duration. Because the drainage dynamics, model parameters, and observations are uncertain, we employ Bayesian statistical inversion to combine prior physical knowledge with experimental liquid level time series data to predict the initial liquid level with quantified uncertainty. We use a physics-based model based on Torricelli's law to describe the tank-draining dynamics and augment it with an empirical discrepancy function to account for missing or imperfectly modeled physics. In our experiments with a tank draining of water, we found that our inferred initial liquid level was accurate, although uncertainty increased with drainage duration. Beyond its application to pollution forensics, this work may also serve as a hands-on classroom project illustrating dynamic modeling, model discrepancy, and Bayesian inference.2026-05-28T00:08:50ZKyla D. JonesGbenga FabusolaAlexander W. DowlingCory M. Simonhttp://arxiv.org/abs/2506.08028v2Sensor Fusion for Track Geometry Monitoring: Integrating On-Board Condition Monitoring and Degradation Models via Kalman Filtering2026-05-28T00:00:55ZTrack geometry monitoring is essential for maintaining the safety and efficiency of railway operations. While Track Recording Cars (TRCs) provide accurate measurements of track geometry indicators, their limited availability and high operational costs restrict frequent monitoring across large rail networks. Recent advancements in on-board sensor systems installed on in-service trains offer a cost-effective alternative by enabling high-frequency, albeit less accurate, data collection. This study proposes a method to enhance the reliability of track geometry predictions by integrating low-accuracy sensor vibration signals with degradation models through a Kalman filter framework. An experimental campaign using a low-cost sensor system mounted on a TRC evaluates the proposed approach. The results demonstrate that incorporating frequent sensor data significantly reduces prediction uncertainty, even when the data is noisy. The study also investigates how the frequency of data recording influences the size of the credible prediction interval, providing guidance on the optimal deployment of on-board sensors for effective track monitoring and maintenance planning.2025-06-02T00:31:53ZHuy Truong-BaJacky ChinMichael E. CholettePietro Borghesanihttp://arxiv.org/abs/2605.28974v1Algorithm to check Maximum Likelihood Estimate Existence for integrated PCA2026-05-27T18:23:10ZBeing encouraged by [AKRS] that provides an amazing bridge between Statistics and Invariant Theory, and especially by [FM], where quiver semi-invariant techniques apply to verify the existence of MLE for a recent iPCA model, we provide an enhancement to [FM]. Our Theorem 5.2 yields necessary and sufficient conditions for MLE to exist generically for any dimension vector. The conditions can be easily checked with our software [T] based on Derksen-Weyman algorithm and simplifying the application for statistics practitioners and non-specialists in quivers. For those deep in quiver Representation Theory, Theorem 5.2 relates the MLE existence to the local semi-simplicity of representations as introduced in [Sh07]. We also hope that our elementary and short text can serve for the experts in both domains as a warm start in a new category.2026-05-27T18:23:10Z6 pagesDmitri Shmelkinhttp://arxiv.org/abs/2605.28762v1Deep Neural Networks for Doubly Robust Estimation with Nonprobability Survey Samples2026-05-27T17:21:50ZIntegrating probability and nonprobability survey samples is an important problem in modern survey sampling. Nonprobability samples often contain rich outcome information but may lack population representativeness, whereas probability samples provide design-based auxiliary information but may not contain the study variable. We propose a deep neural network (DNN)-assisted doubly robust framework for estimating the finite population mean from these two data sources. The proposed method models the logit sampling score for the nonprobability sample as an unknown nonparametric function and estimates it by maximizing a pseudo-likelihood that combines information from the nonprobability sample and a reference probability sample. The DNN parameters are optimized using the ADAM algorithm. The resulting DNN-estimated sampling scores are incorporated into a DNN-assisted inverse-probability weighted estimator and a deep doubly robust estimator. We establish consistency and convergence rates under regularity conditions and evaluate the finite-sample performance of the proposed estimators through simulation studies and an empirical application using Pew Research Center and Behavioral Risk Factor Surveillance System data. The results suggest that the proposed estimators can improve robustness to parametric propensity-score misspecification, especially when the true selection mechanism is nonlinear.2026-05-27T17:21:50Z29 pages, 1 figureYufang DaiShihua LuoWendy LouZilin WangXuewen Luhttp://arxiv.org/abs/2411.13479v4Conformal Prediction for Hierarchical Data2026-05-27T16:36:39ZWe consider conformal prediction for multivariate data and focus on hierarchical data, where some components are linear combinations of others. Intuitively, the hierarchical structure can be leveraged to reduce the size of prediction regions for the same coverage level. We implement this intuition by including a projection step (also called a reconciliation step) in the split conformal prediction [SCP] procedure, and prove that the resulting prediction regions are indeed globally smaller. We do so both under the classic objective of joint coverage and under a new and challenging task: component-wise coverage, for which efficiency results are more difficult to obtain. The associated strategies and their analyses are based both on the literature of SCP and of forecast reconciliation, which we connect. We also illustrate the theoretical findings, for different scales of hierarchies on simulated data.2024-11-20T17:26:26Z39 pages, 4 figuresGuillaume PrincipatoGilles StoltzYvenn Amara-OualiYannig GoudeBachir HamroucheJean-Michel Poggihttp://arxiv.org/abs/2605.28344v1Capturing the Curve: Functional Data Analysis for Validated Digital Outcome Measures2026-05-27T11:48:45ZDigital health technologies enable high-frequency collection of data in near-continuous time and capture rich information about the health of individuals. The raw data collected by these devices often have a hierarchical functional structure: repeated physiological functions are observed over time and on multiple time scales (seconds, days, weeks). While many summaries can be derived from digital data, typically, only a small subset of pre-defined scalars is validated as outcome measures in clinical trials. We explore data-driven summaries based on between-subject scores from Multilevel Functional Principal Component Analysis (MFPCA), which are low-dimensional representations of functional data with robust statistical properties. Specifically, we compute MFPCA projection scores with respect to a reference population, summarising how individuals differ from the dominant directions of variation at each hierarchical level. Through a simulation study based on smartwatch electrocardiogram (ECG) signals, we compare MFPCA scores with pre-specified summaries in terms of validation criteria, including test-retest reliability and known-groups discrimination. We demonstrate that MFPCA scores generally have high reliability and can discriminate between groups across simulated scenarios of change. This offers an advantage when digital tools enable the measurement of novel physiological signals and the characteristics of the change are not yet defined. Finally, using knee flexion-extension data from individuals living with Parkinson's disease, we demonstrate that one of the MFPCA scores more strongly correlates with established gold-standard metrics and can detect clinical change, compared to a pre-specified scalar. We conclude that MFPCA-derived scores retain more information than typical outcome measures and open the door to using learning representation strategies in clinical trial settings.2026-05-27T11:48:45ZMia S. TackneyMarcos MatabuenaMarco PalmaMichael WesterClaire MaassenThomas KrammerJulian MustrophPeter H. CharltonJames CarpenterSofia S. Villarhttp://arxiv.org/abs/2605.28212v1How to measure intra-physician variability in clinical decision-making?2026-05-27T09:30:45ZIntra-physician prescribing variability, the probability that one physician issues discordant decisions for two patients deemed comparable on observed covariates, holds great impact in quality of care, safety and cost. However, there are no known validated measurement methods. Here, we benchmark eight methods (Euclidean, Mahalanobis, Learned-Weights, Genetic Mahalanobis, Random Forest proximity, Mutual-Information-weighted, Latent Profile Analysis and Bayesian binomial generalized linear mixed model) against a synthetic ground truth across 94 experimental conditions. Learned-Weights matching achieves the lowest mean absolute error (0.027), followed by Mutual-Information-weighted matching (0.028) and RF Proximity (0.034). All eight discordance-analysis methods preserve the physician rank ordering with high fidelity (Spearman > 0.89 versus the ground truth on the SCORE2 experiment), as long as the physician variability groups are well separated. Under a continuous-heterogeneity physician model, rank preservation degrades substantially for unsupervised methods (Spearman = [0.28, 0.35]) but is retained by supervised feature-weighted methods and the GLMM (Spearman = [0.62, 0.68]). This controlled methodological evaluation is a foundation for validation on observational prescribing data. Once validated on observational prescribing data, these evaluated open-source estimators could turn prescribing inconsistency into a routinely measurable clinician-level quality metric, systematically complementing the existing literature on between-physician variation.2026-05-27T09:30:45Z24 pages, 7 tables, 3 figuresAlaedine BenaniPierre MenetonEmmanuel MessasLiza HettalSai SagireddyDamien GrosgeorgeJérôme SalomonSylvain BodardXavier Tannierhttp://arxiv.org/abs/2603.08276v2A Unified Framework for Density Estimation under Right-Censored Point-Centred Quarter Sampling2026-05-27T06:24:13ZWhile the point-centred quarter method (PCQM) is widely used for density estimation, existing methods for handling right-censored data from truncated search radii rely primarily on a Poisson model assuming complete spatial randomness (CSR), leaving a critical gap for spatially aggregated populations. To address this limitation, we develop a unified likelihood- and moment-based framework for right-censored point-centred quarter sampling under both Poisson and negative binomial distribution (NBD) models. In particular, the proposed NBD-based estimators explicitly account for spatial aggregation and censoring simultaneously, extending distance-based inference beyond the CSR setting. Extensive simulations and applications to fully mapped forest plots reveal that the NBD-based MLE delivers the most robust overall performance across diverse ecological scenarios. Across more than 100 species from fully mapped forest plots, the proposed NBD-based MLE approximately reduced absolute relative bias by a median of 0.10 compared with existing censored estimators, representing a relative improvement of over 30%. Ultimately, our framework provides a rigorously validated and practically useful toolkit for analysing censored point-to-tree distance data.2026-03-09T11:47:55Z42 pages, 28 figures, 4 tableWenzhe HuangGuochun ShenDingliang XingJiangyan Zhaohttp://arxiv.org/abs/2601.07299v2Cauchy-Gaussian Overbound for Heavy-tailed GNSS Measurement Errors2026-05-27T01:37:24ZOverbounds of heavy-tailed measurement errors are essential to meet stringent navigation requirements in integrity monitoring applications. This paper proposes to leverage the bounding sharpness of the Cauchy distribution in the core and the Overbounds of heavy-tailed measurement errors are essential for meeting stringent navigation requirements in integrity-monitoring applications. This paper proposes to leverage the bounding sharpness of the Cauchy distribution in the core and the Gaussian distribution in the tails to tightly bound heavy-tailedglobal navigation satellite system measurement errors. We develop a procedure to determine the overbounding parameters for both symmetric unimodal (SU)and non-symmetric unimodal (NSU) heavy-tailed errors and prove that the over-bounding property is preserved through convolution. Experiment results on both simulated and real-world data sets reveal that our method can sharply boundheavy-tailed errors in both the core and tail regions. In the position domain, the proposed method reduces the average vertical protection level by 15% for SU heavy-tailed errors compared with the single-cumulative-density-function Gaussian overbound and by 21%-47% for NSU heavy-tailed errors compared with the navigation discrete envelope and two-step Gaussian overbounds.2026-01-12T08:21:14ZPublished in NAVIGATION: Journal of the Institute of NavigationZhengdao LiPenggao YanWeisong WenLi-Ta Hsu10.33012/navi.749http://arxiv.org/abs/2605.27796v1Benchmarking Ultrasound Foundation Models for Fetal Plane Classification2026-05-27T00:32:40ZUltrasound is widely used in obstetric care due to its safety, accessibility, and real-time imaging. However, interpretation remains operator-dependent and susceptible to noise and artifacts. Deep learning models have shown strong performance to solve these problem, but they typically require large annotated datasets that are difficult to obtain in clinical ultrasound. Foundation models (FMs) offer an alternative, using a large number of ultrasound images to learn transferable representations that can generalize with limited labeled data. This work presents a comprehensive benchmark of ultrasound-specific FMs for fetal plane classification. We evaluated four ultrasound FMs (USFM, MOFO, UltraSAM, FetalCLIP) against two CNN baselines (ResNet50, EfficientNet-V2) and a ViT (DINOv3) pretrained on natural images. We trained all models under two complementary settings: full fine-tuning and linear probing with a frozen encoder. All models were trained using 5-fold patient-level cross-validation on a Spanish fetal ultrasound dataset and tested on both in-domain data and an external African cohort to assess cross-population generalization. We found that FetalCLIP achieved the best results in the linear probing setting (F1 = 0.9261 for in-domain, F1 = 0.9731 for out-of-domain), while USFM performed best in the full fine-tuning setting (F1 = 0.9476 for in-domain, F1 = 0.9515 for out-of-domain). MOFO and UltraSAM degraded most in both settings, underperforming natural image pretrained models in some cases. These findings highlight how the choice of pretrained model strongly affects fetal plane classification performance, since different pretraining objectives lead to different levels of transferability.2026-05-27T00:32:40ZLeya BarrientosYuexi DuNicha C. Dvornekhttp://arxiv.org/abs/2605.27781v1Day-Ahead Electricity Price Forecasting Using a Multivariate Group Lasso Method2026-05-27T00:08:44ZElectricity price signals in modern power systems exhibit complex dependence structures that render forecasting inherently challenging. Our analysis of real-world pricing signals from the California Independent System Operator (CAISO) reveals complex temporal group effects, whereby the influence of explanatory variables on electricity prices persists across consecutive blocks of time due to underlying economic and operational drivers. In response, we propose a multivariate statistical method based on a Group Lasso formulation to forecast the vector of day-ahead electricity prices, by leveraging multi-feature temporal group effects. Our approach is evaluated on two full years of electricity prices from CAISO, demonstrating considerable improvements in point and probabilistic forecast metrics compared to a wide array of statistical and deep learning methods. Theoretical and empirical analyses confirm the effectiveness of the proposed approach in modeling realistic group effects, maintaining both interpretability and low computational complexity. When retrospectively evaluated on test data from a recent international electricity price forecasting challenge, the proposed method ranked in second place, despite having access to significantly less information than competing approaches. Finally, the proposed method is independently validated against two operational electricity price forecasting systems in CAISO, demonstrating competitive predictive performance and practical relevance.2026-05-27T00:08:44ZKeyi WangJiaxiang JiMahan MansouriAhmed Aziz Ezzat