https://arxiv.org/api/DQ7vxNoUVVt408y5YHVA1T6Y2CU2026-06-21T15:04:27Z36316106515http://arxiv.org/abs/2605.17474v1Multivariate EDF tests for uniformity, normality,spherical and elliptical symetry, and independence based on a Brownian sheet deconstruction2026-05-17T14:22:46ZThis paper extends a recently proposed family of EDF-based goodness-of-fit procedures for the hypercube $[0,1]^p$ - the m-test and the s-test - which are based on a unique deconstruction of the $p$-parameter Brownian sheet into independent Gaussian processes.
We use the fact that whenever a null hypothesis implies a joint distribution that factorizes into independent continuous components after a suitable mapping, the problem can be reduced to a uniformity test on the hypercube via componentwise probability integral transforms. Specifically, we introduce and analyze new procedures derived from these principles for testing uniformity on the hypersphere $S^p$, as well as multivariate normality, spherical and elliptical symmetry, and independence in $R^p$. The methodology is based on the decomposition of finite signed measures into zero-marginal components to isolate coordinate interactions. Empirical power comparisons show that these extended procedures are highly competitive with existing methods in the statistical literature, demonstrating particular sensitivity to coordinate-based dependencies and joint dependency structures.2026-05-17T14:22:46ZAcompanying R package: https://github.com/emcabana/MuniCandSAlejandra CabañaEnrique M. Cabañahttp://arxiv.org/abs/2308.05534v4Collective Outlier Detection and Enumeration with Conformalized Closed Testing2026-05-17T07:44:41ZThis paper develops a flexible distribution-free method for collective outlier detection and enumeration, designed for situations in which the presence of outliers can be detected powerfully even though their precise identification may be challenging due to the sparsity, weakness, or elusiveness of their signals. This method builds upon recent developments in conformal inference and integrates classical ideas from other areas, including multiple testing, locally most powerful and adaptive rank tests, and non-parametric large-sample asymptotics. The key innovation lies in developing a principled and effective approach for automatically choosing the most appropriate machine learning classifier and two-sample testing procedure for a given data set. The performance of our method is investigated through extensive empirical demonstrations, including an analysis of the LHCO high-energy particle collision data set.2023-08-10T12:23:51ZChiara G. MagnaniMatteo SesiaAldo Solarihttp://arxiv.org/abs/2410.20319v2High-dimensional partial linear model with trend filtering2026-05-17T07:21:22ZUnderstanding the links between diet, metabolic changes, and health outcomes is a key focus in nutritional science and broader biological research. Analyzing relationships, such as those between ultra-processed food (UPF) intake and metabolites, offers insights into potential biomarkers for diet-related diseases and public health applications. However, these analyses are challenging due to high-dimensional data structures and complex, often nonlinear associations between covariates and health outcomes. Traditional linear models and conventional nonparametric methods often lack the flexibility to accurately capture such complexities in biological data. To address these challenges, we propose a high-dimensional partial linear regression model that captures both linear and nonlinear effects, combining the interpretability of linear models with the adaptability of nonparametric approaches. Our model leverages trend filtering to handle local smoothness variations effectively and achieves minimax optimal rates, making it suitable for complex biological datasets. We apply this model to data from the Interactive Diet and Activity Tracking in AARP (IDATA) Study, demonstrating its utility in identifying biomarkers associated with UPF intake and illustrating its potential for broader applications in dietary, metabolic, and health-related research.2024-10-27T03:13:09Z52 pages, 8 figuresLee, S. K., Loftfield, E., Hong, H. G., and Weng, H. (2026) High-dimensional partial linear model with trend filtering, Electronic Journal of Statistics, 20(1), 1800-1850Sang Kyu LeeErikka LoftfieldHyokyoung G. HongHaolei Weng10.1214/26-EJS2522http://arxiv.org/abs/2605.17240v1The FORSS Framework for Sample Size and Power Calculations With Win Statistics for Hierarchical Endpoints2026-05-17T03:44:41ZWin statistics have gained increasing popularity as primary analysis methods for clinical trials with hierarchical endpoints (HEs) as primary endpoints. However, existing sample size and power calculation approaches in trial design still face several limitations and challenges: simulation-based approaches are computationally intensive, while existing formula-based methods often rely on simplifying assumptions such as independence among HEs, or require specification of overall win statistics and tie probability that are difficult to elicit a priori in practice. To address these challenges, we propose the FORSS framework, a FORmula-based Super-Sample approach that allows investigators to specify marginal treatment effects using familiar metrics (e.g., hazard ratios, mean differences, and risk differences) together with a flexible joint working distribution for the HEs. Rather than repeatedly simulating full trials at each candidate sample size, FORSS uses super-samples to estimate the population-level plug-in quantities required by analytical formulas for both power and sample size calculation. We evaluated the performance of the proposed FORSS through extensive simulation studies. The results show that the formula-based FORSS closely matches empirical power across a wide range of scenarios while maintaining Type~I error rates near the nominal 5\% level. An illustration based on the HEART-FID trial further shows that endpoint-dependence specifications can materially affect projected power and required sample size when planning trials with HEs.2026-05-17T03:44:41ZBaoshan ZhangHuiman X. BarnhartYuan WuRoland A. Matsouakahttp://arxiv.org/abs/2605.07285v2Transporting treatment effects by calibrating large-scale observational outcomes2026-05-17T03:11:48ZA high-quality experimental dataset is often much smaller than a corresponding observational dataset. When this holds with possibly biased measurements of the outcome of interest in the latter, we propose an estimation and inference procedure for a transported treatment effect. Our point estimator can be computed as follows. First, we estimate the conditional average treatment effect (CATE) by calibrating a treatment-control contrast estimated using the observational outcomes to the experimental dataset using ordinary least squares (OLS). Then, we compute the sample average of this estimated CATE over the observational dataset. We show that the limiting estimand is a weighted transported average treatment effect even when the OLS calibration is misspecified. Furthermore, our inference for this estimand is asymptotically valid and semiparametrically efficient when the size of the experimental dataset grows more slowly than the size of the observational dataset, regardless of the existence of positivity (overlap) between the two datasets. We illustrate the stable empirical performance of our method under varying degrees of positivity using numerical simulations and a data example using field experiments and satellite-based yield estimates to estimate the average effect of crop rotation on maize (corn) yields over a large area of the Midwestern United States.2026-05-08T05:48:07Z37 pages, 5 figuresHarrison H Lihttp://arxiv.org/abs/2505.12181v2Reliable fairness auditing with semi-supervised inference2026-05-17T01:27:16ZMachine learning (ML) models often exhibit bias that can exacerbate inequities in biomedical applications. Fairness auditing, the process of evaluating a model's performance across subpopulations, is critical for identifying and mitigating these biases. However, audits typically rely on large volumes of labeled data, which are costly and labor-intensive to obtain. To address this challenge, we introduce $\textit{Infairness}$, a unified framework for auditing a wide range of fairness criteria using semi-supervised inference. Our approach combines a small labeled dataset with a large unlabeled dataset by imputing missing outcomes via regression with carefully selected nonlinear basis functions. Through extensive theoretical and empirical analyses, we show that our proposed estimator is (i) robust to specification of the ML or imputation model and (ii) substantially more efficient than supervised estimation based solely on the labeled data. In two real-world fairness audits using electronic health record and medical imaging data, Infairness reduces variance by approximately 50% compared to supervised estimation, underscoring its value for reliable fairness auditing with limited labeled data.2025-05-18T00:42:21ZJianhui GaoJessica Gronsbellhttp://arxiv.org/abs/2501.09015v4Family-wise Error Rate Control with E-values2026-05-17T00:53:26ZThe closure principle is a standard tool for achieving strong family-wise error rate (FWER) control in multiple testing problems. We develop an e-value-based closed testing framework that inherits nice properties of e-values, which are common in settings of sequential hypothesis testing or universal inference for irregular parametric models. We prove that e-value-based closed testing strongly controls the post-hoc FWER in the static setting, and has stronger anytime-valid and always-valid FWER-controlling properties in the sequential setting. Furthermore, we extend the celebrated graphical approach for FWER control (Bretz et al. 2009), using the weighted average of e-values for the local test, a strictly more powerful approach than weighted Bonferroni local tests with inverse e-values as p-values. In general, the computational cost for closed testing can be exponential in the number of hypotheses. Although the computational shortcuts for the p-value-based graphical approach are not applicable, we develop an efficient polynomial-time algorithm using dynamic programming for e-value-based graphical approaches with any directed acyclic graph, and tailored algorithms for the e-Holm procedure previously studied by Vovk and Wang (2021) and the e-Fallback procedure.2025-01-15T18:57:33Z32 pages, 12 figures, 4 algorithmsWill HartogLihua Leihttp://arxiv.org/abs/2604.13276v2Addressing Confounding by Indication Through (Un)Measured Centre Characteristics in Learn-As-you-GO(LAGO) Trials2026-05-16T21:09:13ZThe Learn-As-you-Go (LAGO) design is an adaptive clinical trial design that allows modifications to multicomponent intervention packages across stages. Centers participate in more than one stage, as is common in large-scale implementation trials. In LAGO trials, center characteristics may act as confounders, predicting both the intervention package and the outcomes. We extend the LAGO theory by introducing fixed center effects to control for confounding by indication through measured and unmeasured center characteristics. Conditioning on center characteristics by including fixed center effects ensures asymptotic results hold without requiring explicit characterization of unmeasured confounders. Our methods apply even with small numbers of centers. LAGO theory is established for continuous outcomes following a generalized linear model and binary outcomes following a logistic regression model, unifying theory across outcome types. Point- and interval estimators are derived, and consistency and asymptotic normality are established. Valid hypothesis tests for the overall intervention effect are provided, and the optimal intervention package minimizing cost subject to a target outcome mean is obtained via constrained optimization.2026-04-14T20:13:26ZMinh Thu BuiChristopher T. LongeneckerAnte BingDonna SpiegelmanAllison R. WebelHayden B. BosworthJudith J. Lokhttp://arxiv.org/abs/2605.17154v1Learning Gaussian Graphical Models under Total Positivity via Spectral Graph Sparsification2026-05-16T20:59:06ZMany practical data analysis tasks reduce to learning, from observed samples, how a collection of variables depend on each other. A widely used approach is to fit a Gaussian graphical model, which represents the dependence structure as a graph connecting the variables. In a number of important applications, such as financial returns, gene co-expression, and climate or network analysis, the dependencies tend to be positive: variables move together rather than offset each other. Encoding this positivity through the constraint of multivariate total positivity of order two (MTP2) yields an attractive estimator that produces accurate fits with no tuning required. The resulting graphs are, however, typically much denser than the underlying ground-truth model, which makes them hard to interpret and slow to use in any downstream task that operates on the graph. In this work, we propose a novel highly-scalable approach for learning Gaussian graphical models from data using spectral sparsification; we call it Spectral-MTP2. Spectral graph sparsification is a fundamental method which aims to preserve meaningful properties of a dense graph with a sparser subgraph. We theoretically and empirically investigate and validate our method, and show that learning Gaussian Graphical Models under MTP2 using spectral sparsification preserves MTP2 and approximates well the original model in terms of Kullback-Leibler divergence and Gaussian log-likelihood. In simulations and applications to equity returns and gene expression, we observe that Spectral-MTP2 retains most of the fit quality of the denser MTP2 baseline, while producing substantially sparser and more interpretable graphs.2026-05-16T20:59:06Z16 pagesIgnacio Echave-Sustaeta RodríguezAida AbiadFrank Röttgerhttp://arxiv.org/abs/2605.17050v1Single World Intervention Graphs as Distributions: A Framework for Causal Identification2026-05-16T15:44:37ZCausal inference seeks to estimate the effect of an intervention on an outcome using observed data, typically via Rubin's potential-outcome framework or Pearl's do-calculus. Following section 9 of Richardson and Robins (2013), this essay treats single-world intervention graphs (SWIGs) as representations of both the observed-data distribution and the interventional distribution, rather than as a bridge to potential outcomes. We demonstrate that this perspective provides a systematic way to derive identifying expressions for estimands defined by interventions on selected variables. Back-door derivations mirror those in existing literature, while front-door derivations offer a distinct pathway that extends more readily to complex settings. Conceptually, the method is simultaneously related to and distinct from Rubin's framework and Pearl's calculus.2026-05-16T15:44:37ZChristian Bartelshttp://arxiv.org/abs/2605.14943v2Piece-wise linear isotonic regression2026-05-16T12:06:21ZIsotonic regression provides a flexible, tuning-free approach to estimating monotonic functions without imposing global curvature constraints, yet the estimated regression function is inherently a step function. This paper addresses a key limitation of such estimators: their inability to provide meaningful marginal properties, such as shadow prices or elasticities. We propose a novel piece-wise linear smoothing framework that recovers meaningful marginal estimates even in non-convex settings. Building on the concept of conditional convexity originally developed in deterministic frontier analysis, we formulate the smoothing process as a bilevel optimization problem that fits a continuous, monotonic, piece-wise linear function to the initial isotonic regression predictions. Monte Carlo simulations demonstrate that the proposed approach can significantly improve estimation accuracy in both convex and non-convex settings for univariate and multivariate data. We apply this approach to analyze agglomeration economies in Finnish municipalities, illustrating its practical value.2026-05-14T15:16:44ZTimo KuosmanenJuan F. MongeJosé L. RuizXun Zhouhttp://arxiv.org/abs/2603.14942v3A System-Theoretic Approach to Hawkes Process Identification with Guaranteed Positivity and Stability2026-05-16T11:57:10ZThe Hawkes process models self-exciting event streams, requiring a strictly non-negative and stable stochastic intensity. Standard identification methods enforce these properties using non-negative causal bases, yielding conservative parameter constraints and severely ill-conditioned least-squares Gram matrices at higher model orders. To overcome this, we introduce a system-theoretic identification framework utilizing the sign-indefinite orthonormal Laguerre basis, which guarantees a well-conditioned asymptotic Gram matrix independent of model order. We formulate a constrained least-squares problem enforcing the necessary and sufficient conditions for positivity and stability. By constructing the empirical Gram matrix via a Lyapunov equation and representing the constraints through a sum-of-squares trace equivalence, the proposed estimator is efficiently computed via semidefinite programming.2026-03-16T07:47:56Z6 pages, 2 figuresXinhui RongGirish N. Nairhttp://arxiv.org/abs/2605.16906v1Differentially private hypothesis testing in survival analysis2026-05-16T09:41:49ZSurvival analysis is widely used in applications involving sensitive individual-level data, yet differentially private hypothesis testing for right-censored data remains largely undeveloped. We initiate a finite-sample theory of private hypothesis testing in survival analysis applications. For Cox regression coefficients, we develop private partial-likelihood-ratio and score-type tests, including a private calibration procedure for the rejection threshold. For cumulative hazard functions, we propose a private distributed two-sample test. Across these problems, we prove differential privacy and finite-sample testing guarantees, as well as minimax lower bounds. Our results identify when privacy is statistically negligible, when it dominates the testing rate, and where optimal private rates for testing in semiparametric survival models remain open. This theoretical analysis is accompanied by numerical experiments on simulated data.2026-05-16T09:41:49ZElly K. H. HungYi Yuhttp://arxiv.org/abs/2605.16885v1A Workflow for Evaluating Regional Treatment Effect Heterogeneity in Multi-Regional Clinical Trials2026-05-16T08:58:35ZMulti-regional clinical trials (MRCTs) enable efficient global drug development by assessing treatment effects across regions within a single protocol. While powered for overall efficacy, MRCTs are typically not designed to provide confirmatory evidence on regional differences, making an assessment of observed regional heterogeneity largely exploratory and susceptible to sampling variability. Despite this challenge, understanding regional heterogeneity remains important for interpretation and regulatory decision-making. This paper proposes a structured, question-driven framework to guide exploratory assessments of regional heterogeneity in MRCTs. We formulate four key questions to clarify the objectives of such analyses and propose a set of statistical methods to address them. Simulation studies evaluate performance under scenarios with no heterogeneity and heterogeneity driven by observed or unobserved treatment effect modifiers, illustrating how a structured approach can support transparent and cautious interpretation.2026-05-16T08:58:35ZCong ZhangMeihua LongTianyu ZhengKonstantinos SechidisXiaoni LiuSophie SunYao ChenXinyi ZhangShuhei KanekoBjörn BornkampYan Houhttp://arxiv.org/abs/2605.10088v2Sample size and power calculations for causal inference with time-to-event outcomes2026-05-16T06:18:55ZThis paper develops power and sample size formulas for causal inference with time-to-event outcomes. The target estimand is the marginal hazard ratio: the coefficient of a marginal structural Cox proportional hazard model with treatment as the only predictor. We extend the robust sandwich variance theory and derive the analytical form of the asymptotic variance for the inverse probability weighted partial likelihood estimator. Building on this, we derive a new analytical sample size formula valid at any prespecified effect size, applicable to both randomized trials and observational studies. For randomized trials, the formula requires only the canonical inputs of treatment proportion, effect size, and event rate. The new formula corrects the mischaracterization of classic log-rank-based formulas. For observational studies, one additional input suffices: an overlap coefficient summarizing covariate similarity between comparison groups. We further develop a variance inflation approach applicable to any propensity score balancing weights, anchored to the corrected baseline variance. We provide an online calculator and an R package 'PSpower' to implement the method.2026-05-11T07:07:23ZChengxin YangBo LiuFan Li