https://arxiv.org/api/NqyrkpiGgoQDcpGlyBnRVAXeHx02026-06-10T11:29:37Z3612419515http://arxiv.org/abs/2504.13291v2Estimating equations for causal survival analysis with pooled logistic regression2026-06-03T15:16:16ZBackground: Pooled logistic regression models are commonly applied in survival analysis. However, the standard implementation can be computationally demanding, which is further exacerbated when using the nonparametric bootstrap for inference. To ease these computational burdens, investigators often coarsen time intervals or assume a parametric models for time. These approaches impose restrictive assumptions, which may not always have a well-motivated substantive justification. Methods: Here, the pooled logistic regression model is re-framed using estimating equations to simplify computations and allow for inference via the empirical sandwich variance estimator, thus avoiding the more computationally demanding bootstrap. The proposed implementation is demonstrated using two examples with publicly available data. The performance of the empirical sandwich variance estimator is illustrated using a Monte Carlo simulation study. Results: As shown in the applied examples, the proposed implementation substantially reduced run-times and could be applied without needing to coarsen the data. In the simulation study, the empirical sandwich variance estimator results in nominal confidence interval coverage. Conclusions: The implementation proposed here offers an improved alternative to the standard implementation of pooled logistic regression without needing to impose restrictive constraints on time.2025-04-17T19:04:07ZPaul N ZivichStephen R ColeBonnie E Shook-SaJustin B DeMonteJessie K Edwardshttp://arxiv.org/abs/2510.07559v3A coupling-based approach to f-divergences diagnostics for Markov chain Monte Carlo2026-06-03T15:10:09ZA long-standing gap exists between the theoretical analysis of Markov chain Monte Carlo convergence, which is often based on statistical divergences, and the diagnostics used in practice. We introduce the first general convergence diagnostics for Markov chain Monte Carlo based on any $f$-divergence, allowing users to directly monitor, among others, the Kullback-Leibler and the $χ^2$ divergences as well as the Hellinger and the total variation distances. Our approach rests on a coupling-based "weight harmonization" scheme that produces direct, computable, and consistent importance weights for interacting Markov chains with respect to their target distribution. Beyond their use as convergence diagnostics, these weights are consistent estimates of the Radon-Nikodym derivative $\mathrm{d}π/\mathrm{d} μ_t$, a richer object than the convergence bounds alone, with natural applications to importance-weighted inference. We show how such weightings can provide upper bounds to any $f$-divergence, prove that these bounds tighten over time and converge to zero as the chains approach stationarity, and demonstrate that, while more conservative than existing coupling-based total variation estimators, our method remains a practical and broadly applicable diagnostic tool.2025-10-08T21:14:15Z15 pages + 23 pages of appendix comprising mostly proofs. 8 figures. Main differences w.r.t. v1 are: - the addition of a theorem on the almost sure convergence our the weights system to 1/N under minimal assumptions. - fixed numerical simulationsAdrien CorenflosHai-Dang Dauhttp://arxiv.org/abs/2605.25934v2Weighted NPMLE for the Marginal Mean of Recurrent Events with a Competing Terminal Event2026-06-03T14:37:34ZRegression modeling of recurrent and terminal events continues to present methodological challenges in survival analysis. Existing approaches either make unverifiable assumptions about the dependency structure between the two event types or rely on the proportional intensity assumption for the marginal mean. A semiparametric regression model is proposed that is based on a novel weighted likelihood function, thereby targeting directly the marginal mean of the recurrent event. Our general model captures a large class of semiparametric regression models and accommodates external time-dependent covariate effects on the marginal mean intensity. We establish the consistency and asymptotic normality of the estimators and propose a sandwich estimator of the variance. We propose a novel simulation procedure that directly targets the marginal mean intensity of the recurrent events. In simulation studies, we demonstrate a strong performance of the weighted NPMLE under independent right-censoring. The practical utility of the proposed methodology is demonstrated through application to data from the STATCOPE trial, a large randomized clinical trial that investigated the efficacy of simvastatin for COPD exacerbations. We provide personalized predictions for the number of exacerbations and reassess the effect of simvastatin treatment, accounting for death as a competing terminal event for patients with GOLD stage 4.2026-05-25T15:13:47ZAnna BellachMichael R. Kosorokhttp://arxiv.org/abs/2503.21358v3Inference in stochastic differential equations using the Laplace approximation: Demonstration and examples2026-06-03T14:09:52ZStochastic differential equations are a natural framework for dynamic systems and time series in ecology, because they allow for non-linear first-principle knowledge and uncertainty in the dynamics, and can be combined with measurement errors. However, estimation methods are often technically and computationally challenging. Here, we demonstrate that the Laplace approximation is useful for estimating states and parameters in these models, when done correctly. We give special attention to non-linear dynamics, state-dependent noise intensities, and non-Gaussian measurement errors. Our technique adds states between times of observations, approximates transition densities using discretization methods - in the simplest case, the Euler-Maruyama method - and eliminates unobserved states using the Laplace approximation. We demonstrate that consistency requires a particular form of the approximation, and provide different approaches to implementation. Using simulated case studies, we demonstrate that transition probabilities are well approximated, that inference is computationally feasible, and that the framework leads to simple and flexible implementations.2025-03-27T10:50:05Z40 pages, 7 figures, 1 tableUffe Høgsbro ThygesenKasper Kristensenhttp://arxiv.org/abs/2606.04879v1Bootstrap-based Hypothesis Test of 2D Contours using Elastic Shape Analysis2026-06-03T13:43:38ZShapes of objects in images are often complex, high-dimensional, and vary in ways not captured by standard Euclidean geometry and statistics. Statistical shape analysis encompasses methods for flexible and interpretable measurement of intrinsic shape and shape variability in geometric objects. Elastic Shape Analysis (ESA) is one such method that measures shape differences between objects, represented by contours, in a way that is invariant to rotation, scale, translation, and parameterization. Although ESA is useful for quantifying shape of objects in many image applications, formal methods for statistical inference in image-based ESA remain limited. This work introduces a hypothesis test procedure based on empirical confidence intervals for the elastic shape distance (ESD) between a proposed underlying true shape and an estimated shape. The confidence intervals are created using a bootstrap procedure for non-smooth functionals, which accounts for the non-differentiability of the ESD. The effectiveness of the method is illustrated through both numerical studies and real world image examples from inertial confinement fusion (ICF).2026-06-03T13:43:38Z35 pages, 11 figuresSusan GlennJustin StraitKelly MoranChris DanlyMatthew P Selwoodhttp://arxiv.org/abs/2606.01760v2HS3: A Descriptive, Interoperable Serialization Standard for Statistical Models in High-Energy Physics2026-06-03T13:41:05ZStatistical models in high-energy physics formally encode the relationship between observed data, physics parameters of interest, and experimental and theoretical uncertainties. Likelihood-based inference is the central tool for precision measurements, effective field theory fits, and cross-analysis combinations. Consequently, there is an increasing need for machine-readable, descriptive, and portable model representations. Existing formats such as ROOT workspaces, pyhf JSON, and CMS DataCards provide valuable capabilities but remain tied to specific software stacks and offer no universal standard for exchange, validation, or long-term preservation. We introduce HS3, the High-Energy Physics Statistics Serialization Standard, an implementation-agnostic, human-readable, and extensible serialization format for statistical models. HS3 is designed such that new statistical constructs can be incorporated through backward-compatible extensions, while inference procedures and implementation-specific execution details remain the responsibility of downstream frameworks. HS3 represents likelihoods as computational graphs composed of named distributions, functions, datasets, domains, and analysis prescriptions. It supports binned and unbinned likelihoods as well as hierarchical composite models. HS3 is convertible from and to ROOT/RooFit and is a superset of pyhf. We describe the design principles, structure, and semantics of HS3 and summarize existing implementations in C++, Python, and Julia. We also present early applications to public likelihoods on HEPData, cross-framework validation, and reproducibility efforts. HS3 provides a foundation for FAIR (Findable, Accessible, Interoperable, Reusable), long-lived statistical models at the LHC and beyond. The standard is intended to serve the broader scientific community and to evolve over time for application across a wide range of domains.2026-06-01T06:31:41Z18 pages, 3 figures, 3 code listingsCarsten BurgardOliver SchulzGiordon StarkJonas RembserSimon CelloCornelius Grunwaldhttp://arxiv.org/abs/2606.04859v1Stein's method for the Wishart distribution2026-06-03T13:26:03ZIn this work, we develop Stein's method for the Wishart distribution on the cone of positive definite matrices. We establish the basic ingredients of a Wishart Stein framework: we derive an extended-generator-based Stein characterization from the Wishart diffusion process, identify the corresponding transition semigroup through the noncentral Wishart law, provide an explicit semigroup representation for the solution of the Stein equation, and obtain regularity estimates for the solution. The new methodology is demonstrated in four applications: (i) an order $n^{-1}$ bound, for smooth test functions, for the Wishart approximation of uncentered group-mean scatter matrices in MANOVA; (ii) a quantitative multivariate Satterthwaite approximation; (iii) local/integrated De Bruijn identities and logarithmic Sobolev inequalities for the Wishart measure; and (iv) Stein's method of moments for the shape and scale parameters, including structured scale estimation.2026-06-03T13:26:03Z93 pages, 6 tables, 2 figuresGabriel BaillyRobert E. GauntFrédéric OuimetDonald RichardsRainer von Sachshttp://arxiv.org/abs/2509.23935v3RAPSEM: Identifying Latent Mediators Without Sequential Ignorability via a Rank-Preserving Structural Equation Model2026-06-03T13:15:38ZStandard structural equation models (SEMs) are often used to identify latent mediators. However, valid inference typically relies on the strong, frequently violated Sequential Ignorability assumption. We introduce the Rank-Preserving Structural Equation Model (RAPSEM), which increases robustness through G-estimation while maintaining the measurement model's integrity through a two-stage method of moments (2SMM) for factor score corrections. RAPSEM replaces the no unmeasured mediator-outcome confounding with the weaker no unobserved effect modification assumption. By leveraging treatment randomization, RAPSEM achieves identification in a manner equivalent to instrumental variable estimation through structurally emerging instruments. Specifically, identification relies on treatment-covariate interactions that influence the mediator but have no direct effect on the outcome, allowing researchers to utilize natural heterogeneity in treatment response as a testable source of identification. We provide a robustness assessment for the core identifying assumption and establish the consistency and asymptotic normality of the resulting estimator. Simulation studies demonstrate that RAPSEM remains unbiased under unobserved confounding, whereas standard SEM yields biased results. RAPSEM achieves reasonable power for sample sizes above 500, depending on the strength of the structural instruments. The method is implemented in the accompanying rapsem R package, and its practical utility is illustrated through an empirical example from educational research. The code is available at https://github.com/PsychometricsMZ/RAPSEM.2025-09-28T15:15:49Z31 pages, 8 figures, 8 tables, submitted to Psychometrika, Cambridge University PressSofia MorelliRoberto FalehHolger Brandthttp://arxiv.org/abs/2606.03415v2A Better Comparison under right-censoring: ABC Statistic for Equivalence Testing and Quantification2026-06-03T12:20:14ZThe ABC (area between curves) statistic is an $L^1$-distance which targets an easy-to-interpret estimand. Defined as the (normalized) integrated absolute distance between two survival curves it is a meaningful quantity even when survival functions are crossing. Based on right-censored time-to-event data, estimation is based on Kaplan-Meier curves obtained from two independent sample groups. In the present paper, we develop the large sample properties of the ABC statistic and investigate various resampling options for approximating the statistic's distribution which is possibly non-normal in the limit. These breakthroughs enable the construction of equivalence tests which can be used to establish that differences between two survival functions are practically irrelevant. Alternatively, the point estimator can be accompanied with confidence intervals that comprehensibly quantify the difference between the curves. An extensive simulation study explores these inferential methods under various scenarios: proportional, crossing, and partially equal survival functions. An application to data on overall and progression-free survival in a lung cancer trial illustrates the methods' benefits and some points of consideration.2026-06-02T09:58:38Z27 pages, 6 tables, 4 figures; version2: Added previously incomplete summary of the contents of the appendixSimon MackKathrin MöllenhoffDennis Doblerhttp://arxiv.org/abs/2408.08630v3Spatial Principal Component Analysis and Moran Statistics for Multivariate Functional Areal Data2026-06-03T09:58:19ZThe paper introduces a multivariate functional areal spatial principal component analysis (mfasPCA) framework, together with multivariate functional Moran's I statistics, to enable the assessment of spatial autocorrelation and dimension reduction for multivariate functional data observed over areal units. The proposed framework is spatial-functional in scope: the functional argument may represent time, age, wavelength, or another ordered continuum, while spatial dependence is introduced across areal units through a spatial weight matrix. The principal component method is defined through a Moran-type spatially weighted criterion. We propose eigenvalue-based permutation tests to assess the significance of spatially structured components. The testing framework includes omnibus tests, componentwise tests with Holm adjustment, and sequential rank-wise tests based on tail sums of eigenvalues. Simulation studies show that mfasPCA captures positive and negative spatial-functional structures and concentrates them in the leading components under the respective autocorrelation regimes. A real-data application illustrates how mfasPCA identifies spatially structured modes of multivariate functional variation.2024-08-16T09:49:34ZDharini PathmanathanIssa-Mbenard DaboTzung Hsuen KhooAlaa Ali-HassanSophie Dabo-Nianghttp://arxiv.org/abs/2606.04673v1Improving Longitudinal Targeted Maximum Likelihood Estimation in Target Trial Emulation using Joint Calibrated Weights2026-06-03T09:55:45ZIn target trial emulation (TTE), marginal structural models (MSMs) can be used to characterise per-protocol treatment effects over time. The MSM parameters are often estimated by inverse probability weighting (IPW), with weights estimated by maximum likelihood. However, IPW-based estimators can be unstable in small samples and are sensitive to misspecification of the weight models. An alternative method for estimating the MSM parameters is longitudinal targeted maximum likelihood estimation (LTMLE). LTMLE is double robust and potentially more efficient than IPW. Nevertheless, LTMLE also relies on inverse probability weights and may therefore share the instability of IPW-based estimators. We propose joint calibrated LTMLE, which integrates LTMLE with joint calibrated weights tailored for per-protocol effect estimation in TTE. This calibration of weights improves finite-sample performance by enforcing covariate balance in both the treatment and censoring processes simultaneously. Simulations show that the proposed method has improved efficiency and robustness to weight model misspecification, compared to standard LTMLE. We illustrate the method using a case study to evaluate the effect of highly active antiretroviral therapy on CD4 cell count among HIV-positive women.2026-06-03T09:55:45ZMain text: 34 pages, 3 figures, 8 tables. Supplementary Materials includedJuliette M. LimozinShaun R. SeamanLi Suhttp://arxiv.org/abs/2503.18721v3Differentially Private Joint Independence Test2026-06-03T09:01:24ZIdentification of joint dependence among several random vectors plays an important role in many statistical applications, where the data may contain sensitive or confidential information. In this paper, we consider the $d$-variable Hilbert-Schmidt independence criterion (dHSIC) in the context of differential privacy. Given that the limiting distribution of the empirical estimate of dHSIC is a complicated Gaussian chaos, constructing tests in the non-private regime is typically based on permutation and bootstrap methods. To detect joint dependence under privacy constraints, we propose a dHSIC-based testing procedure employing a differentially private permutation methodology. We show that our method enjoys privacy guarantees, a valid level, and pointwise consistency, whereas the bootstrap counterpart suffers from inconsistent power. We further investigate the uniform power of the proposed test under the dHSIC and $L_2$ metrics, showing that the proposed test attains the minimax optimal power across different privacy regimes. As a byproduct, we show that the non-private permutation dHSIC test proposed in Pfister et al. (2018) is a special case of our differentially private permutation test, and our results also establish its pointwise and uniform power--thus resolving an open problem from that work. Both numerical simulations and real data analysis in causal inference suggest that our proposed test performs well empirically.2025-03-24T14:32:05Z57 pages, 7 figuresXingwei LiuYuexin ChenJin-Ting ZhangWangli Xuhttp://arxiv.org/abs/2606.03863v2Assessing the Impact of Intercurrent Events on Power and Sample Size for Estimands with Time-to-Event Endpoints2026-06-03T08:21:13ZThe precise definition of a primary estimand, accounting for intercurrent events (IEs) as per the ICH E9(R1) addendum, is fundamental to the design and interpretation of clinical trials. Conventional power and sample size calculations, however, often do not adequately incorporate the impact of IEs and their corresponding handling strategies, creating a risk of over- or under-powered studies. While simulation-based approaches can address this complexity, they are often computationally intensive and may only explore a limited set of scenarios. In this paper, we introduce a set of formulae for calculating power for estimands with time-to-event endpoints, applied to trials with fixed follow-up durations. We focus on estimands that use treatment policy, hypothetical, composite, or a combination of strategies for handling IEs, under the assumption that IEs occur independently of each other and the primary endpoint. Validation against simulation-based estimates shows strong agreement, and we explore deviations in power estimates in scenarios where outcomes and IEs are dependent. We illustrate the practical application of our approach through a case study in nasal polyposis, examining the sensitivity of sample size requirements to varying IE rates and their impacts on post-IE outcomes. The proposed formulae facilitate rapid and accurate power and assurance calculations, enabling clinical trial designs to be more closely aligned with the estimand of interest.2026-06-02T16:37:45ZDaniel J BrattonFiona GuillardSunita RehalThomas Druryhttp://arxiv.org/abs/2606.04546v1Bivariate inverse Gaussian degradation processes with shared random effects and an application to fatigue cracks2026-06-03T07:28:08ZThe inverse Gaussian (IG) process is a widely used model for univariate degradation data. For bivariate degradation data involving two performance characteristics (PCs), dependence is often introduced through an unobserved shared frailty factor combined with IG processes. Previous studies typically assume a specific frailty distribution, such as normal or gamma, although such choices are difficult to justify because the frailty is unobserved. This paper proposes a general IG GG framework for modeling bivariate degradation data with dependent PCs. Each degradation process is modeled using an IG process, while the shared frailty follows the generalized gamma (GG) family, which includes exponential, gamma, Weibull, and lognormal distributions as special cases. The proposed framework allows flexible selection of an appropriate frailty distribution within the GG family, leading to improved model fitting. Convenient parameter estimation procedures are developed and evaluated through simulation studies, demonstrating satisfactory performance. The proposed model is applied to fatigue crack data and compared with several existing frailty based and copula based models. Results show that the IG GG model provides a superior fit. System reliability estimation under the IG GG framework is also discussed.2026-06-03T07:28:08ZYuvraj DuttaSandip BaruiDebanjan MitraNarayanaswamy Balakrishnanhttp://arxiv.org/abs/2606.04523v1Bias Correction for Scalar-on-Density Regression Models2026-06-03T07:02:18ZIn one extension of scalar-on-function regression modeling, the covariate is taken to be a density that is estimated from a finite number of measurements gathered for each observational unit. When this number of measurements is relatively small, the estimated coefficient function suffers from attenuation bias. This paper studies how the bias depends on the number of measurements per unit and proposes a bias-correction method based on simulation extrapolation (SIMEX). We establish that the bias decreases monotonically as the number of measurements per unit increases. The proposed SIMEX procedure applies bootstrap resampling to simulate smaller measurement counts and then extrapolates to infinitely many measurements, thereby correcting finite-measurement bias. A comprehensive simulation study, conducted over a range of sample sizes and noise levels, shows that the mean integrated squared error of the coefficient function decreases with more measurements per unit and that the SIMEX-extrapolated estimates achieve lower bias than the naive estimates based on the full set of measurements. The practical utility of the method is further illustrated through an application to the National Health and Nutrition Examination Survey, for which we relate 24-hour physical activity profiles to all-cause mortality. This example supports the validity of the method and demonstrates its ability to detect and correct for finite-measurement bias.2026-06-03T07:02:18Z26 pages, 7 figures, 1 tableFenglin XieTodd Ogden