https://arxiv.org/api/8/+zNRnoRSfstY1wMtLyMwJKE38 2026-06-18T21:05:06Z 36296 930 15 http://arxiv.org/abs/2605.20604v1 Conditional regularized halfspace depth for sparse functional data and its applications 2026-05-20T01:48:32Z

Many functional datasets are observed sparsely and irregularly. Ordering such data is challenging because only limited information is available from each observation, while the underlying trajectories remain infinite-dimensional. This paper develops a novel depth notion for sparse functional data, called the conditional regularized halfspace depth (CRHD). CRHD is defined as the infimum of conditional halfspace probabilities of the underlying trajectory given the observed sparse measurements, thereby enabling depth evaluation directly at sparse observations without requiring trajectory reconstruction. We study several basic theoretical properties of CRHD that clarify its behavior as a depth measure. The proposed depth is applicable even to extremely sparsely observed functional data, overcoming key limitations of existing sparse functional depths that often rely on reconstructed curves. In addition, CRHD induces meaningful rankings for complex functional data. Its numerical performance is demonstrated through rank-based tests, and its practical utility is illustrated using an infant growth dataset.

2026-05-20T01:48:32Z Hyemin Yeon Xiongtao Dai Sara Lopez-Pintado http://arxiv.org/abs/2602.04092v2 Time-to-Event Estimation with Unreliably Reported Events in Medicare Health Plan Payment 2026-05-20T01:38:36Z

OBJECTIVE: To propose time-to-event estimators that help evaluate incident diagnostic coding and possible upcoding in Medicare as well as introduce an open-source software package that enables more reproducible methods development relevant to Medicare billing behavior. STUDY SETTING AND DESIGN: Observational analysis of simulated upcoding based on coding by insurers or providers that may be incentivized by Medicare Advantage risk adjustment. DATA SOURCES AND ANALYTIC SAMPLE: Two years of separately simulated incident health condition coding data for a Medicare Advantage population and a Traditional Medicare population where coding patterns are aligned with known practices in each program. PRINCIPAL FINDINGS: We propose several novel time-to-event estimators of incident coding intensity and possible upcoding in Medicare Advantage, including accounting for unreliable reporting. We demonstrate estimator performance in simulated data leveraging the National Institutes of Health's All of Us study and also develop an open source R package to simulate longitudinal realistic labeled upcoding data, which were not previously available for researchers. In simulations, our novel estimators recovered differences in upcoding within and across monitoring periods. Undercoding had a limited effect on our novel estimators while an existing estimator was more sensitive to undercoding. CONCLUSIONS: Our proposed estimators can help researchers and policymakers track new coding behaviors (e.g., as may be incentivized by risk adjustment formula updates) earlier and at scale while accounting for several real-world data considerations. Further, the R package we provide can be used to improve the development, accessibility, and reproducible evaluation of coding intensity and upcoding methodology.

2026-02-04T00:04:44Z 44 pages, 10 figures Oana M. Enache Sherri Rose http://arxiv.org/abs/2605.21535v1 An Old Look at Empirical Bayes 2026-05-20T01:32:03Z

Dennis Lindley once said that there is only one thing worse than a frequentist, and that is an empirical Bayesian. The quip has the air of caricature, but its technical content is serious: empirical Bayes uses the same data twice, conflates levels of a hierarchy, and produces posterior-shaped summaries whose uncertainty quantification differs from what a fully hierarchical model delivers. David Blei's 2026 IMS Medallion Lecture, "A Fresh Look at Empirical Bayes," revives the program under three new banners: empirical Bayes via probabilistic symmetries (rebranded "Bayesian empirical Bayes"), empirical Bayes with implicit likelihoods through simulation-based inference, and empirical Bayes for combining experimental and observational data through calibration studies. This is a continuation of Blei and Kucukelbir's earlier "population empirical Bayes" (PopEB, 2015). We argue, in the spirit of Lindley, I. J. Good, William DuMouchel, Thomas Louis, and our own recent work with Datta, that Blei's machinery targets inferential objects distinct from the posterior conditional on the realized data, and that the cost of maintaining the full hierarchical discipline has fallen low enough that the computational trade-off no longer favors the shortcut. The case study is the Tweedie formula. Efron's f-modeling empirical Bayes plugs an estimated score function into a posterior-mean identity, but a smoothed score need not arise from any prior. The horseshoe Tweedie formula does. We conclude by recommending that the impressive computational machinery of modern empirical Bayes (variational inference, neural amortization, simulation-based inference) be redeployed in service of properly hierarchical Bayes.

2026-05-20T01:32:03Z 23 pages Nicholas G. Polson Vadim O. Sokolov Daniel Zantedeschi http://arxiv.org/abs/2605.20572v1 Minimax unbiased estimation for finite populations with bounded outcomes 2026-05-20T00:08:21Z

We study design-unbiased estimation of the finite-population total $\sum_{i=1}^N y_i$ when each outcome satisfies known bounds $y_i\in[a_i,b_i]$. For any sampling design with inclusion probabilities $π_i>0$, we prove a sharp lower bound on the worst-case squared error over the rectangular parameter space. This bound is attained if and only if the unit inclusion indicators are pairwise independent, in which case the minimax estimator is the midpoint-differenced Horvitz-Thompson estimator $\sum_{i=1}^N m_i+\sum_{i\in S}(y_i-m_i)/π_i$, with $m_i=(a_i+b_i)/{2}$. We then solve the joint design-and-estimation problem under the constraint $\sum_i π_i\le n$. We find that a minimax strategy samples units independently with probabilities $π_i^\ast=\min(1,c (b_i-a_i))$ where $c>0$ is chosen so that $\sum_i π_i^\ast=n$, and uses the midpoint-differenced estimator. This extends Gabler (1990)'s linear minimax result to the full class of design-unbiased estimators. We also show that the estimator is admissible among unbiased estimators and affine equivariant.

2026-05-20T00:08:21Z 14 pages P. M. Aronow Patrick Lopatto http://arxiv.org/abs/2605.20567v1 Meta-analysis and network meta-analysis of time-to-event outcomes with non-proportional hazards: a Bayesian time-varying hazard ratio approach 2026-05-19T23:56:34Z

Background: Often when undertaking meta-analyses of time-to-event (TTE) outcomes, especially in a Health Technology Assessment context, a hazard ratio (HR) scale is used. However, issues arise when there is evidence of non-proportional hazards in some of the studies included. A number of methods have been advocated, but their use has been limited by either their complexity and/or the ease with which their results can be used in HTA. An alternative approach is to assume a treatment-log(time) interaction within a Cox proportional hazards model for each study, and to then undertake a bivariate meta-analysis of the resulting treatment and interaction coefficients, so that an overall time-varying HR (TVHR) can be obtained. Methods: A TVHR approach was applied to a meta-analysis of chemotherapy compared to Standard of Care for advanced recurrent gastric cancer, and in which Progression-Free Survival (PFS) was an outcome. The approach was also applied to a network meta-analysis (NMA) evaluating overall survival (OS) in advanced BRAF-mutated melanoma. Results: Five trials in the advanced gastric cancer meta-analysis displayed evidence of non-proportional hazards for PFS. Using a TVHR model produced HRs ranging from 0.83 (CrI:0.75-0.91) at 0.5 years to 0.99 (CrI:0.79-1.23) at 3.5 years. Three studies showed evidence of non-proportional hazards in the advanced BRAF-mutated melanoma NMA for OS. Using a TVHR model, nivolumab plus ipilimumab demonstrated consistent superiority from month 7 onwards, with a HR improving from 0.37 (CrI:0.26-0.51) at one year to 0.24 (CrI:0.12-0.45) at five years. Conclusions: A TVHR approach to the meta-analysis or NMA of TTE outcomes when the proportional hazards assumption appears not to hold, produces an intuitive solution which can be readily used in HTA.

2026-05-19T23:56:34Z 23 pages, 13 figures, 3 tables & Presented as an Oral Contribution at International Society for Clinical Biostatistics (ISCB) Conference (ISCB-46), Basel, August 27, 2025 Rhiannon K Owen Keith R Abrams http://arxiv.org/abs/2605.20559v1 Group-Aware Matrix Estimation and Latent Subspace Recovery 2026-05-19T23:22:32Z

Modern matrix completion problems often involve heterogeneous data whose rows simultaneously belong to many meta-categories, such as demographic and age groups in recommendation systems, or region and recording session labels in neural electrophysiological experiments. Standard low-rank estimators impose a single global latent geometry, which can recover average structure but may smooth away subgroup-specific variation, especially when observations are unevenly distributed across groups. We introduce Group-Aware Matrix Estimation (GAME), a convex estimator for overlapping subgroup-wise low-rank matrix estimation. GAME regularizes category-specific submatrices through overlapping nuclear-norm penalties, allowing related groups to borrow information while preserving local latent structure in a shared coordinate system. We provide finite-sample guarantees for both reconstruction error and subgroup-specific subspace recovery, showing how performance depends on sampling density, subgroup rank, and overlap structure. Experiments on synthetic, recommendation, ecological, and neuroscience datasets show that GAME is most beneficial in structured missingness regimes, where subgroup-aware regularization improves both reconstruction accuracy and latent subspace fidelity. Across these benchmarks, GAME is competitive or best among global low-rank, side-information, and modern imputation baselines, with the largest gains when subgroups exhibit distinct low-rank structure.

2026-05-19T23:22:32Z 12 pages, 6 main figures, 1 main algorithm Hamza Golubovic Matthew Shen Genevera I. Allen Tarek M. Zikry http://arxiv.org/abs/2512.02182v3 Two-phase validation sampling via principal components to improve efficiency in multi-model estimation from error-prone biomedical databases 2026-05-19T21:52:15Z

Two-phase sampling offers a cost-effective way to validate error-prone covariate measurements in biomedical databases. Inexpensive or easy-to-obtain information is collected for the entire study in Phase I. Then, a subset of patients undergoes cost-intensive validation (e.g., expert chart review) to collect more accurate data in Phase II. When balancing primary and secondary analyses, competing models and priorities can result in poorly defined objectives for the most informative Phase II sampling criterion. Extreme tail sampling (ETS), wherein patients with the smallest and largest values of a particular quantity (like a covariate or residual) are selected, can offer great statistical efficiency in two-phase studies when focusing on a single analytic objective by targeting observations with the biggest contributions to the Fisher information. We propose an intuitive, easy-to-use approach that extends ETS to balance and prioritize explaining the largest amount of variability across multiple models of interest. Using principal components analysis, we succinctly summarize the inherent variability of all models' error-prone exposures. Then, we sample patients with the most extreme values of the first principal component for validation. Through extensive simulations and an application to the National Health and Nutrition Examination Survey (NHANES), the proposed strategy offered simultaneous efficiency gains across multiple models of interest. Its advantages persisted across various real-world scenarios, including correlated or heterogeneous measurement error. When designing a validation study, concentrating on a single model may be short-sighted. Strategically allocating resources more broadly balances multiple analytical goals simultaneously. Employing dimension reduction before sampling will allow this strategy to scale up well to big-data applications with many error-prone exposures.

2025-12-01T20:22:34Z 22 pages, 5 figures, 2 tables, GitHub repositories with R package and simulation/analysis code Sarah C. Lotspeich Cole Manschot http://arxiv.org/abs/2605.20508v1 Compensator-Based Inference for Signal Detection Under Unknown Background 2026-05-19T21:24:45Z

The problem of detecting new signals in the presence of an unknown background is ubiquitous in scientific discoveries and is especially prominent in the physical sciences. Most solutions proposed thus far to address the problem focus on estimating the background distribution and using that estimate to infer the signal. By studying the geometry of the problem, this article demonstrates that estimating the background distribution is somewhat unnecessary for inferring the signal intensity. Instead, it suffices to estimate a single parameter, referred to as the compensator, to account for the incomplete knowledge on the background, substantially simplifying the problem's complexity and enabling proper uncertainty propagation. Such a compensator is shown to govern the conservativeness of the inference, both in the proposed setup and in likelihood-based approaches.

2026-05-19T21:24:45Z Aritra Banerjee Sara Algeri http://arxiv.org/abs/2605.21530v1 Pairwise Distance-Diffusion Analysis (PDDA): A Geometric Framework for Estimating Hurst Exponents in Multivariate Long-Memory Processes 2026-05-19T20:22:37Z

We introduce Pairwise Distance-Diffusion Analysis (PDDA), a geometric framework for estimating the Hurst exponent from distance plots of long-memory stochastic processes. A single construction yields two complementary routes: R/S-PDDA, a geometric reformulation of the classical rescaled-range definition, and MSD-PDDA, based on mean-squared-displacement scaling, classically used in anomalous diffusion. We extend PDDA to multivariate isotropic and anisotropic processes and derive an explicit link between temporal persistence, range dimension, and recurrence statistics, providing a unified distance-based foundation for Hurst analysis.

2026-05-19T20:22:37Z Supplemental PDF available via ancillary links Diogo C. Soriano Frederique Vanheusden Slawomir J. Nasuto http://arxiv.org/abs/2412.06114v5 Randomized interventional effects in semicompeting risks, with application to a hematopoietic cell transplantation study 2026-05-19T20:19:28Z

In clinical studies, the risk of the primary (terminal) event may be modified by intermediate events, resulting in semicompeting risks. To study the treatment effect on the terminal event mediated by the intermediate event, researchers wish to decompose the total effect into direct and indirect effects. In this article, we extend the randomized interventional approach to time-to-event outcomes, where both intermediate and terminal events are subject to right censoring. We envision a random draw for the intermediate event process from a reference distribution, either marginally over time-varying confounders or conditionally given the observed history. We present the identification formula for interventional effects. We also discuss some variants of the identification assumptions. We estimate the treatment effects using nonparametric maximum likelihood estimation and propose a sensitivity analysis that incorporates a latent frailty. As an illustration, we study the effect of matched unrelated donor versus haploidentical donor on death mediated by relapse in a hematopoietic cell transplantation study with graft-versus-host disease (GVHD) as the time-varying confounder. We find that matched unrelated donor transplantation is preferable in terms of survival rates under the use of post-transplant PTCy GVHD prophylaxis for lymphoma patients.

2024-12-09T00:27:36Z Yuhao Deng Rui Wang Tao Zhang Xiang Zhan 10.1002/sim.70628 http://arxiv.org/abs/2504.01355v3 A Practical Guide to Estimating Conditional Marginal Effects: Modern Approaches 2026-05-19T19:46:26Z

This Element offers a practical guide to estimating conditional marginal effects-how treatment effects vary with a moderating variable-using modern statistical methods. Commonly used approaches, such as linear interaction models, often suffer from unclarified estimands, limited overlap, and restrictive functional forms. This guide begins by clearly defining the estimand and presenting the main identification results. It then reviews and improves upon existing solutions, such as the semiparametric kernel estimator, and introduces robust estimation strategies, including augmented inverse propensity score weighting with Lasso selection (AIPW-Lasso) and double machine learning (DML) with modern algorithms. Each method is evaluated through simulations and empirical examples, with practical recommendations tailored to sample size and research context. All tools are implemented in the accompanying \texttt{interflex} package for \texttt{R}.

2025-04-02T05:00:14Z Jiehan Liu Ziyi Liu Yiqing Xu http://arxiv.org/abs/2605.20399v1 A duration-augmented binary Markov chain for rainfall occurrence with long dry spells 2026-05-19T18:53:55Z

Simulating realistic wet and dry spells is central in weather generators and climate-impact studies. While finite-order Markov chains are standard, they often fail to reproduce persistent dry conditions due to their inherent subexponential decay. We model rainfall occurrence by introducing a duration-augmented binary Markov chain. We establish a link with alternating renewal chains, enabling flexible parametric modelling of wet and dry spell duration distribution. We model those using two regime-adapted specifications from the general class of extended Generalized Pareto Distributions, yielding flexible tail behaviour across various climates. We use estimation methods adapted to each specification. Our model is applied to around 200 stations in the South of Europe spanning diverse Mediterranean and continental climates. We compare this framework to standard Markov models in characterising persistence and high-quantile extrapolation. The approach is generic, extending naturally to multi-state settings or other binary sequence applications in environmental statistics.

2026-05-19T18:53:55Z Antoine Doizé LPSM, SU Denis Allard BioSP Philippe Naveau LSCE, ESTIMR Olivier Wintenberger LPSM, SU http://arxiv.org/abs/2603.07312v3 Predictive Power Analysis of Multiple Test Procedures Under Arbitrary Dependence 2026-05-19T18:20:10Z

Many statistical problems can be addressed by applying a multiple testing procedure (MTP) that controls either the Family-wise Error Rate (FWER) or False Discovery Rate (FDR) under unknown arbitrarily-interdependent $p$-values, without explicitly modeling these inter-correlations. They include the FWER-controlling Bonferroni (1936) MTP and Holm (1979) MTP; the FDR-controlling Benjamini and Yekutieli (2001) MTP; and the DP-MTP (Karabatsos, 2025), based on a Dirichlet process (DP) prior distribution supporting the entire space of MTPs that control either the FWER or FDR. For such an MTP, this study introduces a new and congenial method for Bayesian predictive power analysis, for power calculation and sample size determination for any given planned future (e.g., replication or interim) study. This novel MTP predictive power analysis method is based on a joint prior distribution defining a scale matrix mixture of asymmetric multivariate normal mean-variance mixture distributions, factorized as a general prior distribution for effect sizes (e.g., obtained from expert judgment or results of prior studies), and a uniform prior distribution for correlation matrices representing arbitrary dependencies between $p$-values of test statistics of given multiple hypothesis tests under their alternative hypotheses. The new MTP power analysis method also results in $p$-value weights which can be used to minimize the relative impacts of and assess for significance-chasing biases (e.g., publication bias, $p$-hacking, etc.) in multiple testing, without needing to assume that $p$-values (effect sizes) are independent. The new simulation-based MTP predictive power analysis method is illustrated through the analysis of $p$-values obtained by a famous study of lead exposure and re-analyzed by the previous MTP literature, using R package bnpMTP.

2026-03-07T19:03:01Z George Karabatsos http://arxiv.org/abs/2605.20359v1 The Harmonic Synthetic Control Method 2026-05-19T18:13:18Z

Synthetic control methods can produce misleading counterfactual predictions when outcome series contain unit-specific stochastic trends, a common feature of nonstationary macroeconomic data. Existing remedies, such as pre-filtering or differencing, reduce spurious matching but may discard shared nonstationary variation that helps estimate donor weights. We propose Harmonic Synthetic Control (HSC), which replaces this binary choice with a soft allocation mechanism. HSC jointly estimates donor weights and a treated-unit-specific smooth residual component, then extrapolates this component into post-treatment periods using a time-series forecaster. A tuning parameter, selected by rolling-origin cross-validation, governs the division between donor matching and forecasting. As it varies, HSC continuously interpolates between synthetic control applied to differenced outcomes and synthetic control applied to raw outcomes with an intercept or trend. We provide a spectral interpretation showing how HSC downweights low-frequency residual components in donor matching and assigns them to the forecasting branch. A prediction-error decomposition separates weight-estimation distortion from residual-forecasting error. Monte Carlo exercises show that HSC adapts across regimes, performing well when stochastic trends are predominantly common or idiosyncratic, while estimators fixed to one regime can fail in the other.

2026-05-19T18:13:18Z Ziyi Liu Yiqing Xu http://arxiv.org/abs/2605.20325v1 Explainable Outlier Detection for Multivariate Functional Data 2026-05-19T18:00:01Z

This work addresses the challenges of robust covariance estimation and interpretable outlier detection for multivariate functional data with separable covariance structure. We develop a method that simultaneously improves robustness and interpretability in this context by establishing a connection between stochastic processes with separable covariance structures and the corresponding matrix-variate distribution of their basis representations. Leveraging this connection, we employ the recently developed matrix-variate counterpart of the Minimum Covariance Determinant estimator (MMCD) in conjunction with a truncated multivariate functional Mahalanobis semi-distance to robustly estimate mean and covariance for multivariate functional data. For interpretable outlier detection, we generalize multivariate outlier explanations based on Shapley values to decompose overall multivariate functional outlyingness into time-coordinate-specific contributions. Importantly, we reduce the otherwise exponential computational complexity (relative to the number of components) to linear complexity, while retaining the key properties of the Shapley value. This integrated framework combines robust Mahalanobis distances, MMCD estimators, and Shapley value-based outlyingness decomposition to provide a robust and interpretable approach for analyzing multivariate functional data with separable covariance structures. The effectiveness of this approach is demonstrated through both theoretical analysis and practical applications, including simulations and real-world examples.

2026-05-19T18:00:01Z Marcus Mayrhofer Una Radojičić Horst Lewitschnig Peter Filzmoser