http://arxiv.org/abs/2605.17474v1 Multivariate EDF tests for uniformity, normality, spherical and elliptical symmetry, and independence based on a Brownian sheet deconstruction 2026-05-17T14:22:46Z

This paper extends a recently proposed family of EDF-based goodness-of-fit procedures for the hypercube $[0,1]^p$ (the m-test and the s-test), which are based on a unique deconstruction of the $p$-parameter Brownian sheet into independent Gaussian processes. We use the fact that whenever a null hypothesis implies a joint distribution that factorizes into independent continuous components after a suitable mapping, the problem can be reduced to a uniformity test on the hypercube via componentwise probability integral transforms. Specifically, we introduce and analyze new procedures derived from these principles for testing uniformity on the hypersphere $S^p$, as well as multivariate normality, spherical and elliptical symmetry, and independence in $\mathbb{R}^p$. The methodology is based on the decomposition of finite signed measures into zero-marginal components to isolate coordinate interactions. Empirical power comparisons show that these extended procedures are highly competitive with existing methods in the statistical literature, demonstrating particular sensitivity to coordinate-based dependencies and joint dependency structures.
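
The reduction to the hypercube described above can be illustrated with a minimal sketch. It assumes, purely for illustration, a null hypothesis of independent standard normal coordinates; the function names are hypothetical and are not taken from the paper or its R package. The m-test and s-test themselves are not implemented here.

```python
import math
import random


def std_normal_cdf(x: float) -> float:
    """CDF of the standard normal distribution, via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def pit_to_hypercube(sample):
    """Apply the null CDF to every coordinate of every observation.

    Assumption for this sketch: under the null, the coordinates are
    independent N(0, 1), so the transformed points are uniform on [0, 1]^p.
    """
    return [[std_normal_cdf(x) for x in row] for row in sample]


# Simulate data under the assumed null and map it to the hypercube.
random.seed(0)
p, n = 3, 500
sample = [[random.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
u = pit_to_hypercube(sample)
```

Any uniformity test on $[0,1]^p$ could then be applied to `u`; for other null hypotheses, the mapping that produces independent components would have to be chosen accordingly.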

2026-05-17T14:22:46Z Accompanying R package: https://github.com/emcabana/MuniCandS Alejandra Cabaña Enrique M. Cabaña http://arxiv.org/abs/2308.05534v4 Collective Outlier Detection and Enumeration with Conformalized Closed Testing 2026-05-17T07:44:41Z

This paper develops a flexible distribution-free method for collective outlier detection and enumeration, designed for situations in which the presence of outliers can be detected powerfully even though their precise identification may be challenging due to the sparsity, weakness, or elusiveness of their signals. This method builds upon recent developments in conformal inference and integrates classical ideas from other areas, including multiple testing, locally most powerful and adaptive rank tests, and non-parametric large-sample asymptotics. The key innovation lies in developing a principled and effective approach for automatically choosing the most appropriate machine learning classifier and two-sample testing procedure for a given data set. The performance of our method is investigated through extensive empirical demonstrations, including an analysis of the LHCO high-energy particle collision data set.

2023-08-10T12:23:51Z Chiara G. Magnani Matteo Sesia Aldo Solari http://arxiv.org/abs/2410.20319v2 High-dimensional partial linear model with trend filtering 2026-05-17T07:21:22Z

Understanding the links between diet, metabolic changes, and health outcomes is a key focus in nutritional science and broader biological research. Analyzing relationships, such as those between ultra-processed food (UPF) intake and metabolites, offers insights into potential biomarkers for diet-related diseases and public health applications. However, these analyses are challenging due to high-dimensional data structures and complex, often nonlinear associations between covariates and health outcomes. Traditional linear models and conventional nonparametric methods often lack the flexibility to accurately capture such complexities in biological data. To address these challenges, we propose a high-dimensional partial linear regression model that captures both linear and nonlinear effects, combining the interpretability of linear models with the adaptability of nonparametric approaches. Our model leverages trend filtering to handle local smoothness variations effectively and achieves minimax optimal rates, making it suitable for complex biological datasets. We apply this model to data from the Interactive Diet and Activity Tracking in AARP (IDATA) Study, demonstrating its utility in identifying biomarkers associated with UPF intake and illustrating its potential for broader applications in dietary, metabolic, and health-related research.

2024-10-27T03:13:09Z 52 pages, 8 figures Lee, S. K., Loftfield, E., Hong, H. G., and Weng, H. (2026) High-dimensional partial linear model with trend filtering, Electronic Journal of Statistics, 20(1), 1800-1850 Sang Kyu Lee Erikka Loftfield Hyokyoung G. Hong Haolei Weng 10.1214/26-EJS2522 http://arxiv.org/abs/2605.17240v1 The FORSS Framework for Sample Size and Power Calculations With Win Statistics for Hierarchical Endpoints 2026-05-17T03:44:41Z

Win statistics have gained increasing popularity as primary analysis methods for clinical trials with hierarchical endpoints (HEs) as primary endpoints. However, existing sample size and power calculation approaches in trial design still face several limitations and challenges: simulation-based approaches are computationally intensive, while existing formula-based methods often rely on simplifying assumptions such as independence among HEs, or require specification of overall win statistics and tie probability that are difficult to elicit a priori in practice. To address these challenges, we propose the FORSS framework, a FORmula-based Super-Sample approach that allows investigators to specify marginal treatment effects using familiar metrics (e.g., hazard ratios, mean differences, and risk differences) together with a flexible joint working distribution for the HEs. Rather than repeatedly simulating full trials at each candidate sample size, FORSS uses super-samples to estimate the population-level plug-in quantities required by analytical formulas for both power and sample size calculation. We evaluated the performance of the proposed FORSS through extensive simulation studies. The results show that the formula-based FORSS closely matches empirical power across a wide range of scenarios while maintaining Type I error rates near the nominal 5% level. An illustration based on the HEART-FID trial further shows that endpoint-dependence specifications can materially affect projected power and required sample size when planning trials with HEs.
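
For readers unfamiliar with win statistics, the following sketch shows how the win ratio, win odds, and net benefit are commonly computed from all treatment-control pairs on hierarchical endpoints. It is not the FORSS procedure: it assumes fully observed endpoints (no censoring), endpoints listed in priority order, and larger values being better. All names and the toy data are hypothetical.

```python
def compare_hierarchical(t, c):
    """Compare two patients endpoint by endpoint, in priority order.

    Returns 1 if the treated patient wins, -1 if the control patient wins,
    and 0 if the pair is tied on every endpoint.
    Assumption: larger values are better and nothing is censored.
    """
    for a, b in zip(t, c):
        if a > b:
            return 1
        if a < b:
            return -1
    return 0


def win_statistics(treatment, control):
    """Count wins, losses, and ties over all treatment-control pairs."""
    wins = losses = ties = 0
    for t in treatment:
        for c in control:
            result = compare_hierarchical(t, c)
            if result > 0:
                wins += 1
            elif result < 0:
                losses += 1
            else:
                ties += 1
    total = wins + losses + ties
    odds_denominator = losses + 0.5 * ties
    return {
        "wins": wins,
        "losses": losses,
        "ties": ties,
        "win_ratio": wins / losses if losses else float("inf"),
        "win_odds": (wins + 0.5 * ties) / odds_denominator
        if odds_denominator else float("inf"),
        "net_benefit": (wins - losses) / total,
    }


# Toy data: each tuple is (first-priority endpoint, second-priority endpoint).
treatment = [(5, 2), (3, 1), (3, 4)]
control = [(3, 1), (4, 0), (2, 2)]
stats = win_statistics(treatment, control)
```

The pairwise loop is quadratic in the number of patients, which hints at why repeated full-trial simulation at many candidate sample sizes becomes expensive.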

2026-05-17T03:44:41Z Baoshan Zhang Huiman X. Barnhart Yuan Wu Roland A. Matsouaka http://arxiv.org/abs/2605.07285v2 Transporting treatment effects by calibrating large-scale observational outcomes 2026-05-17T03:11:48Z

A high-quality experimental dataset is often much smaller than a corresponding observational dataset. When this holds with possibly biased measurements of the outcome of interest in the latter, we propose an estimation and inference procedure for a transported treatment effect. Our point estimator can be computed as follows. First, we estimate the conditional average treatment effect (CATE) by calibrating a treatment-control contrast estimated using the observational outcomes to the experimental dataset using ordinary least squares (OLS). Then, we compute the sample average of this estimated CATE over the observational dataset. We show that the limiting estimand is a weighted transported average treatment effect even when the OLS calibration is misspecified. Furthermore, our inference for this estimand is asymptotically valid and semiparametrically efficient when the size of the experimental dataset grows more slowly than the size of the observational dataset, regardless of the existence of positivity (overlap) between the two datasets. We illustrate the stable empirical performance of our method under varying degrees of positivity using numerical simulations and a data example using field experiments and satellite-based yield estimates to estimate the average effect of crop rotation on maize (corn) yields over a large area of the Midwestern United States.

2026-05-08T05:48:07Z 37 pages, 5 figures Harrison H Li http://arxiv.org/abs/2505.12181v2 Reliable fairness auditing with semi-supervised inference 2026-05-17T01:27:16Z

Machine learning (ML) models often exhibit bias that can exacerbate inequities in biomedical applications. Fairness auditing, the process of evaluating a model's performance across subpopulations, is critical for identifying and mitigating these biases. However, audits typically rely on large volumes of labeled data, which are costly and labor-intensive to obtain. To address this challenge, we introduce $\textit{Infairness}$, a unified framework for auditing a wide range of fairness criteria using semi-supervised inference. Our approach combines a small labeled dataset with a large unlabeled dataset by imputing missing outcomes via regression with carefully selected nonlinear basis functions. Through extensive theoretical and empirical analyses, we show that our proposed estimator is (i) robust to specification of the ML or imputation model and (ii) substantially more efficient than supervised estimation based solely on the labeled data. In two real-world fairness audits using electronic health record and medical imaging data, Infairness reduces variance by approximately 50% compared to supervised estimation, underscoring its value for reliable fairness auditing with limited labeled data.

2025-05-18T00:42:21Z Jianhui Gao Jessica Gronsbell http://arxiv.org/abs/2501.09015v4 Family-wise Error Rate Control with E-values 2026-05-17T00:53:26Z

The closure principle is a standard tool for achieving strong family-wise error rate (FWER) control in multiple testing problems. We develop an e-value-based closed testing framework that inherits nice properties of e-values, which are common in settings of sequential hypothesis testing or universal inference for irregular parametric models. We prove that e-value-based closed testing strongly controls the post-hoc FWER in the static setting, and has stronger anytime-valid and always-valid FWER-controlling properties in the sequential setting. Furthermore, we extend the celebrated graphical approach for FWER control (Bretz et al. 2009), using the weighted average of e-values for the local test, a strictly more powerful approach than weighted Bonferroni local tests with inverse e-values as p-values. In general, the computational cost for closed testing can be exponential in the number of hypotheses. Although the computational shortcuts for the p-value-based graphical approach are not applicable, we develop an efficient polynomial-time algorithm using dynamic programming for e-value-based graphical approaches with any directed acyclic graph, and tailored algorithms for the e-Holm procedure previously studied by Vovk and Wang (2021) and the e-Fallback procedure.
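
A brute-force sketch of closed testing with e-values may help fix ideas. It assumes an equal-weight arithmetic mean of e-values as the local test for each intersection hypothesis, rejecting when the mean is at least $1/\alpha$; the paper's weighted graphical procedures and its polynomial-time dynamic programming algorithm are not reproduced. The function name is hypothetical.

```python
from itertools import combinations


def closed_test_evalues(e_values, alpha=0.05):
    """Closed testing by exhaustive enumeration (exponential cost).

    Hypothesis i is rejected only if every intersection hypothesis that
    contains i is rejected by its local test. The local test here is the
    unweighted mean of the e-values in the subset compared with 1 / alpha
    (an equal-weight assumption made for this sketch).
    """
    m = len(e_values)
    threshold = 1.0 / alpha

    def local_reject(subset):
        mean_e = sum(e_values[i] for i in subset) / len(subset)
        return mean_e >= threshold

    rejected = []
    for i in range(m):
        others = [j for j in range(m) if j != i]
        # Enumerate every subset that contains hypothesis i.
        if all(
            local_reject((i,) + extra)
            for k in range(m)
            for extra in combinations(others, k)
        ):
            rejected.append(i)
    return rejected


strong_first = closed_test_evalues([100.0, 1.0, 1.0])
diluted = closed_test_evalues([30.0, 25.0, 0.5])
```

In the second call, each of the first two e-values exceeds 20 on its own, yet neither hypothesis is rejected because averaging with the small third e-value pulls an intersection below the threshold. The enumeration over all subsets is what makes the naive approach exponential in the number of hypotheses.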

2025-01-15T18:57:33Z 32 pages, 12 figures, 4 algorithms Will Hartog Lihua Lei http://arxiv.org/abs/2604.13276v2 Addressing Confounding by Indication Through (Un)Measured Centre Characteristics in Learn-As-you-GO (LAGO) Trials 2026-05-16T21:09:13Z

The Learn-As-you-Go (LAGO) design is an adaptive clinical trial design that allows modifications to multicomponent intervention packages across stages. Centers participate in more than one stage, as is common in large-scale implementation trials. In LAGO trials, center characteristics may act as confounders, predicting both the intervention package and the outcomes. We extend the LAGO theory by introducing fixed center effects to control for confounding by indication through measured and unmeasured center characteristics. Conditioning on center characteristics by including fixed center effects ensures asymptotic results hold without requiring explicit characterization of unmeasured confounders. Our methods apply even with small numbers of centers. LAGO theory is established for continuous outcomes following a generalized linear model and binary outcomes following a logistic regression model, unifying theory across outcome types. Point and interval estimators are derived, and consistency and asymptotic normality are established. Valid hypothesis tests for the overall intervention effect are provided, and the optimal intervention package minimizing cost subject to a target outcome mean is obtained via constrained optimization.

2026-04-14T20:13:26Z Minh Thu Bui Christopher T. Longenecker Ante Bing Donna Spiegelman Allison R. Webel Hayden B. Bosworth Judith J. Lok http://arxiv.org/abs/2605.17154v1 Learning Gaussian Graphical Models under Total Positivity via Spectral Graph Sparsification 2026-05-16T20:59:06Z

Many practical data analysis tasks reduce to learning, from observed samples, how a collection of variables depend on each other. A widely used approach is to fit a Gaussian graphical model, which represents the dependence structure as a graph connecting the variables. In a number of important applications, such as financial returns, gene co-expression, and climate or network analysis, the dependencies tend to be positive: variables move together rather than offset each other. Encoding this positivity through the constraint of multivariate total positivity of order two (MTP2) yields an attractive estimator that produces accurate fits with no tuning required. The resulting graphs are, however, typically much denser than the underlying ground-truth model, which makes them hard to interpret and slow to use in any downstream task that operates on the graph. In this work, we propose a novel highly-scalable approach for learning Gaussian graphical models from data using spectral sparsification; we call it Spectral-MTP2. Spectral graph sparsification is a fundamental method which aims to preserve meaningful properties of a dense graph with a sparser subgraph. We theoretically and empirically investigate and validate our method, and show that learning Gaussian Graphical Models under MTP2 using spectral sparsification preserves MTP2 and approximates well the original model in terms of Kullback-Leibler divergence and Gaussian log-likelihood. In simulations and applications to equity returns and gene expression, we observe that Spectral-MTP2 retains most of the fit quality of the denser MTP2 baseline, while producing substantially sparser and more interpretable graphs.
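
As a small illustration of the MTP2 constraint in the Gaussian case: a Gaussian distribution is MTP2 exactly when its precision matrix has no positive off-diagonal entries, and the edges of the graph correspond to the nonzero off-diagonal entries. The sketch below checks this sign condition and counts edges. It assumes the input is already a symmetric positive definite precision matrix, uses hypothetical function names, and does not implement the Spectral-MTP2 estimator or any sparsification step.

```python
def is_mtp2_precision(K, tol=1e-12):
    """Check the Gaussian MTP2 sign condition on a precision matrix K.

    Assumption: K is symmetric positive definite (not verified here).
    The condition is that every off-diagonal entry is non-positive.
    """
    n = len(K)
    return all(K[i][j] <= tol for i in range(n) for j in range(n) if i != j)


def edge_count(K, tol=1e-12):
    """Number of graph edges, i.e. nonzero entries above the diagonal."""
    n = len(K)
    return sum(
        1 for i in range(n) for j in range(i + 1, n) if abs(K[i][j]) > tol
    )


# A chain graph on three variables with positive partial correlations.
K_chain = [
    [2.0, -1.0, 0.0],
    [-1.0, 2.0, -1.0],
    [0.0, -1.0, 2.0],
]
# Flipping one sign introduces a negative partial correlation.
K_mixed = [
    [2.0, 1.0, 0.0],
    [1.0, 2.0, -1.0],
    [0.0, -1.0, 2.0],
]
```

A sparsifier that preserves MTP2 would need its output to keep passing `is_mtp2_precision` while reducing `edge_count`.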

2026-05-16T20:59:06Z 16 pages Ignacio Echave-Sustaeta Rodríguez Aida Abiad Frank Röttger http://arxiv.org/abs/2605.17050v1 Single World Intervention Graphs as Distributions: A Framework for Causal Identification 2026-05-16T15:44:37Z

Causal inference seeks to estimate the effect of an intervention on an outcome using observed data, typically via Rubin's potential-outcome framework or Pearl's do-calculus. Following section 9 of Richardson and Robins (2013), this essay treats single-world intervention graphs (SWIGs) as representations of both the observed-data distribution and the interventional distribution, rather than as a bridge to potential outcomes. We demonstrate that this perspective provides a systematic way to derive identifying expressions for estimands defined by interventions on selected variables. Back-door derivations mirror those in existing literature, while front-door derivations offer a distinct pathway that extends more readily to complex settings. Conceptually, the method is simultaneously related to and distinct from Rubin's framework and Pearl's calculus.

2026-05-16T15:44:37Z Christian Bartels http://arxiv.org/abs/2605.14943v2 Piece-wise linear isotonic regression 2026-05-16T12:06:21Z

Isotonic regression provides a flexible, tuning-free approach to estimating monotonic functions without imposing global curvature constraints, yet the estimated regression function is inherently a step function. This paper addresses a key limitation of such estimators: their inability to provide meaningful marginal properties, such as shadow prices or elasticities. We propose a novel piece-wise linear smoothing framework that recovers meaningful marginal estimates even in non-convex settings. Building on the concept of conditional convexity originally developed in deterministic frontier analysis, we formulate the smoothing process as a bilevel optimization problem that fits a continuous, monotonic, piece-wise linear function to the initial isotonic regression predictions. Monte Carlo simulations demonstrate that the proposed approach can significantly improve estimation accuracy in both convex and non-convex settings for univariate and multivariate data. We apply this approach to analyze agglomeration economies in Finnish municipalities, illustrating its practical value.
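
The step-function nature of the isotonic fit is easy to see from the standard pool-adjacent-violators algorithm (PAVA) for univariate least-squares isotonic regression, sketched below. This is the initial estimator only; the paper's bilevel piece-wise linear smoothing is not implemented, and the function name is hypothetical.

```python
def pava(y, w=None):
    """Weighted least-squares fit of a nondecreasing sequence to y.

    Adjacent blocks are pooled into their weighted mean whenever they
    violate monotonicity, so the fitted values are constant within blocks.
    """
    n = len(y)
    weights = [1.0] * n if w is None else [float(v) for v in w]
    blocks = []  # each block is [mean, total weight, number of points]
    for yi, wi in zip(y, weights):
        blocks.append([float(yi), wi, 1])
        # Merge backwards while the last two blocks are out of order.
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            m2, w2, c2 = blocks.pop()
            m1, w1, c1 = blocks.pop()
            total = w1 + w2
            blocks.append([(m1 * w1 + m2 * w2) / total, total, c1 + c2])
    fitted = []
    for mean, _, count in blocks:
        fitted.extend([mean] * count)
    return fitted


fit = pava([1, 3, 2, 4])
```

Within each pooled block the fitted function is flat, so its slope is zero there and undefined at the jumps, which is why marginal quantities such as elasticities cannot be read off the raw isotonic fit.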

2026-05-14T15:16:44Z Timo Kuosmanen Juan F. Monge José L. Ruiz Xun Zhou http://arxiv.org/abs/2603.14942v3 A System-Theoretic Approach to Hawkes Process Identification with Guaranteed Positivity and Stability 2026-05-16T11:57:10Z

The Hawkes process models self-exciting event streams, requiring a strictly non-negative and stable stochastic intensity. Standard identification methods enforce these properties using non-negative causal bases, yielding conservative parameter constraints and severely ill-conditioned least-squares Gram matrices at higher model orders. To overcome this, we introduce a system-theoretic identification framework utilizing the sign-indefinite orthonormal Laguerre basis, which guarantees a well-conditioned asymptotic Gram matrix independent of model order. We formulate a constrained least-squares problem enforcing the necessary and sufficient conditions for positivity and stability. By constructing the empirical Gram matrix via a Lyapunov equation and representing the constraints through a sum-of-squares trace equivalence, the proposed estimator is efficiently computed via semidefinite programming.
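
To make the positivity and stability requirements concrete, the sketch below evaluates a Hawkes intensity with a single exponential kernel $\alpha e^{-\beta t}$, a textbook non-negative parameterization chosen here as an assumption. It is not the Laguerre-basis estimator proposed in the paper. With this kernel, stability reduces to the branching ratio $\alpha/\beta$ being below one.

```python
import math


def hawkes_intensity(t, events, mu, alpha, beta):
    """Intensity mu + sum over past events s < t of alpha * exp(-beta * (t - s)).

    Assumption: a single exponential kernel, used only for illustration.
    """
    return mu + sum(
        alpha * math.exp(-beta * (t - s)) for s in events if s < t
    )


def is_stable(alpha, beta):
    """Stability for the exponential kernel: its integral alpha / beta < 1."""
    return alpha >= 0.0 and beta > 0.0 and alpha / beta < 1.0


baseline = hawkes_intensity(1.0, [], mu=0.5, alpha=0.8, beta=1.0)
excited = hawkes_intensity(1.0, [0.0], mu=0.5, alpha=0.8, beta=1.0)
```

Requiring `alpha >= 0` is sufficient for a non-negative intensity here but is more restrictive than necessary in richer bases, which is the kind of conservatism the abstract attributes to non-negative causal bases.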

2026-03-16T07:47:56Z 6 pages, 2 figures Xinhui Rong Girish N. Nair http://arxiv.org/abs/2605.16906v1 Differentially private hypothesis testing in survival analysis 2026-05-16T09:41:49Z

Survival analysis is widely used in applications involving sensitive individual-level data, yet differentially private hypothesis testing for right-censored data remains largely undeveloped. We initiate a finite-sample theory of private hypothesis testing in survival analysis applications. For Cox regression coefficients, we develop private partial-likelihood-ratio and score-type tests, including a private calibration procedure for the rejection threshold. For cumulative hazard functions, we propose a private distributed two-sample test. Across these problems, we prove differential privacy and finite-sample testing guarantees, as well as minimax lower bounds. Our results identify when privacy is statistically negligible, when it dominates the testing rate, and where optimal private rates for testing in semiparametric survival models remain open. This theoretical analysis is accompanied by numerical experiments on simulated data.

2026-05-16T09:41:49Z Elly K. H. Hung Yi Yu http://arxiv.org/abs/2605.16885v1 A Workflow for Evaluating Regional Treatment Effect Heterogeneity in Multi-Regional Clinical Trials 2026-05-16T08:58:35Z

Multi-regional clinical trials (MRCTs) enable efficient global drug development by assessing treatment effects across regions within a single protocol. While powered for overall efficacy, MRCTs are typically not designed to provide confirmatory evidence on regional differences, making an assessment of observed regional heterogeneity largely exploratory and susceptible to sampling variability. Despite this challenge, understanding regional heterogeneity remains important for interpretation and regulatory decision-making. This paper proposes a structured, question-driven framework to guide exploratory assessments of regional heterogeneity in MRCTs. We formulate four key questions to clarify the objectives of such analyses and propose a set of statistical methods to address them. Simulation studies evaluate performance under scenarios with no heterogeneity and heterogeneity driven by observed or unobserved treatment effect modifiers, illustrating how a structured approach can support transparent and cautious interpretation.

2026-05-16T08:58:35Z Cong Zhang Meihua Long Tianyu Zheng Konstantinos Sechidis Xiaoni Liu Sophie Sun Yao Chen Xinyi Zhang Shuhei Kaneko Björn Bornkamp Yan Hou http://arxiv.org/abs/2605.10088v2 Sample size and power calculations for causal inference with time-to-event outcomes 2026-05-16T06:18:55Z

This paper develops power and sample size formulas for causal inference with time-to-event outcomes. The target estimand is the marginal hazard ratio: the coefficient of a marginal structural Cox proportional hazard model with treatment as the only predictor. We extend the robust sandwich variance theory and derive the analytical form of the asymptotic variance for the inverse probability weighted partial likelihood estimator. Building on this, we derive a new analytical sample size formula valid at any prespecified effect size, applicable to both randomized trials and observational studies. For randomized trials, the formula requires only the canonical inputs of treatment proportion, effect size, and event rate. The new formula corrects the mischaracterization of classic log-rank-based formulas. For observational studies, one additional input suffices: an overlap coefficient summarizing covariate similarity between comparison groups. We further develop a variance inflation approach applicable to any propensity score balancing weights, anchored to the corrected baseline variance. We provide an online calculator and an R package 'PSpower' to implement the method.
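
For context, the classic log-rank-based calculation that the abstract refers to is usually given by Schoenfeld's formula for the required number of events, $d = (z_{1-\alpha/2} + z_{1-\beta})^2 / \{p(1-p)(\log \mathrm{HR})^2\}$, where $p$ is the treatment proportion. The sketch below implements that classic baseline only. It is not the corrected formula from the paper and does not use the PSpower package; function and argument names are hypothetical.

```python
import math
from statistics import NormalDist


def schoenfeld_events(hazard_ratio, treat_prop=0.5, alpha=0.05, power=0.8):
    """Required number of events under the classic Schoenfeld formula.

    Two-sided test at level alpha; treat_prop is the treatment proportion.
    """
    z = NormalDist()
    z_alpha = z.inv_cdf(1.0 - alpha / 2.0)
    z_beta = z.inv_cdf(power)
    log_hr = math.log(hazard_ratio)
    return (z_alpha + z_beta) ** 2 / (
        treat_prop * (1.0 - treat_prop) * log_hr ** 2
    )


def schoenfeld_sample_size(hazard_ratio, event_rate, treat_prop=0.5,
                           alpha=0.05, power=0.8):
    """Total sample size: required events divided by the event rate."""
    events = schoenfeld_events(hazard_ratio, treat_prop, alpha, power)
    return math.ceil(events / event_rate)


events_needed = schoenfeld_events(0.5)
n_total = schoenfeld_sample_size(0.5, event_rate=0.6)
```

The inputs match the canonical ones named in the abstract for randomized trials (treatment proportion, effect size, event rate), so the two formulas can be compared on equal footing.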

2026-05-11T07:07:23Z Chengxin Yang Bo Liu Fan Li