https://arxiv.org/api/nP/tfEnT+of//XBbndd383JuPM42026-03-20T17:32:03Z346346015http://arxiv.org/abs/2603.17086v1Topological inference on brain networks with application to lesion symptom mapping2026-03-17T19:14:56ZPersistent homology (PH) characterizes the shape of brain networks through persistence features. Group comparison of persistence features from brain networks can be challenging as they are inherently heterogeneous. A recent scale-space representation of persistence diagrams (PDs) through heat diffusion reparameterizes them using a finite number of Fourier coefficients with respect to the Laplace--Beltrami (LB) eigenfunction expansion of the domain, providing a powerful vectorized algebraic representation for group comparisons. In this study, we develop a transposition-based permutation test for comparing multiple groups of PDs using heat-diffusion estimates. We evaluate the empirical performance of the spectral transposition test in capturing within- and between-group similarity and dissimilarity under varying levels of topological noise and cycle location variability. In application, we propose a topological lesion symptom mapping (TLSM) method based on the proposed framework. The method is applied to resting-state functional brain networks of individuals with post-stroke aphasia to identify characteristic cycles associated with varying levels of speech-language impairment.2026-03-17T19:14:56ZarXiv admin note: substantial text overlap with arXiv:2311.01625Yuan WangJian YinNicholas RiccardiDrik-Bart Den OudenJulius FridrikssonRutvik H. Desaihttp://arxiv.org/abs/2603.17066v1Improving RCT-Based CATE Estimation Under Covariate Mismatch via Double Calibration2026-03-17T18:51:12ZWe develop estimators that improve precision of heterogeneous treatment effect estimates that allow borrowing information from observational studies when the available covariates in each data source do not perfectly match. Standard data-borrowing methods often assume perfectly matched covariates. We propose MR-OSCAR, an RCT-calibrated, two-stage estimation approach that first predicts the trial-missing variables using the observational data via imputation and then calibrates observational outcome predictions to the randomized trial, preserving the causal contrast, unlike the results for generalization, where imputation does not improve performance. Our theory gives finite-sample guarantees with a transparent error decomposition including an imputation error that shrinks as the observational mapping becomes more predictable. Simulations show that imputation almost always outperforms naively using only the shared covariates and clarifies when borrowing helps (strong predictability of the missing block, moderate trial size) and when it does not (poor predictability or dominant trial-only moderators). We motivate the approach with the Greenlight Plus trial on early childhood obesity and outline a forthcoming EHR analysis at Vanderbilt, highlighting the use of our method in common scenarios where data do not perfectly align.2026-03-17T18:51:12ZSamhita PalJared D. HulingAmir Asiaeehttp://arxiv.org/abs/2603.17041v1Dependence Fidelity and Downstream Inference Stability in Generative Models2026-03-17T18:24:31ZRecent advances in generative AI have led to increasingly realistic synthetic data, yet evaluation criteria remain focused on marginal distribution matching. While these diagnostics assess local realism, they provide limited insight into whether a generative model preserves the multivariate dependence structures governing downstream inference. We introduce covariance-level dependence fidelity as a practical criterion for evaluating whether a generative distribution preserves joint structure beyond univariate marginals. We establish three core results. First, distributions can match all univariate marginals exactly while exhibiting substantially different dependence structures, demonstrating marginal fidelity alone is insufficient. Second, dependence divergence induces quantitative instability in downstream inference, including sign reversals in regression coefficients despite identical marginal behavior. Third, explicit control of covariance-level dependence divergence ensures stable behavior for dependence-sensitive tasks such as principal component analysis. Synthetic constructions illustrate how dependence preservation failures lead to incorrect conclusions despite identical marginal distributions. These results highlight dependence fidelity as a useful diagnostic for evaluating generative models in dependence-sensitive downstream tasks, with implications for diffusion models and variational autoencoders. These guarantees apply specifically to procedures governed by covariance structure; tasks requiring higher-order dependence such as tail-event estimation require richer criteria.2026-03-17T18:24:31Z22 pages, 7 figures. Poster presentation at MathAI 2026 (International Conference on Mathematics of Artificial Intelligence), March 30 - April 3, 2026Nazia Riasathttp://arxiv.org/abs/2505.22659v2A General Marked Point Process Framework For Self-Exciting Network Evolution2026-03-17T18:18:27ZWe propose a novel modeling framework for time-evolving networks allowing for long-term dependence in network features that update in continuous time. Dynamic network growth is functionally parameterized via the conditional intensity of a marked point process. This characterization enables flexible, joint modeling of both update timing and the network updates themselves, dependent on the entire left-continuous sample path. We propose a path dependent nonlinear marked Hawkes process as an expressive platform for modeling such data; its dynamic mark space embeds the time-evolving network. We prove well-posedness and establish sufficient stability conditions, demonstrate simulation and subsequent feasible likelihood-based inference through numerical study, and illustrate the methodology with an application to conference attendee social network data. The proposed formulation provides a flexible and principled foundation for statistical inference on complex network evolution in continuous time.2025-05-28T17:59:29ZDuncan A ClarkConor J. KresinCharlotte M. Jones-Toddhttp://arxiv.org/abs/2603.17031v1Minimizing Type 2 Errors in an Experiment-Rich Regime via Optimal Resource Allocation2026-03-17T18:13:21ZRandomized experiments (often known as "A/B tests") are widely used to evaluate product and service innovations. We study how to allocate limited experimentation resources across M concurrent experiments in an experiment-rich regime. Existing work on allocation has predominantly focused on minimizing the worst-case mean squared error (MSE) of estimated treatment effects, which favors experiments with larger (and typically unknown) outcome variance. While appropriate for controlling estimation accuracy, this objective does not directly capture a common managerial priority in screening stages: detecting practically meaningful treatment effects with high probability.
Motivated by this, we consider the objective of minimizing the worst-case Type II error across all experiments. When the standard deviations are known, we characterize the power-optimal allocation and show that MSE-based allocations can be highly inefficient for detection, even though the two objectives align asymptotically. When the standard deviations are unknown and must be learned from pilot data, we show that a naive plug-in approach, treating pilot standard deviations as truth, can suffer substantial power loss.
We propose inflating pilot estimates via correction factors and develop three optimization-based frameworks for selecting them, each reflecting a different risk criterion with distinct managerial implications. Although the resulting stochastic programs are computationally challenging at scale, we derive tractable surrogate reformulations inspired by robust optimization and establish favorable theoretical properties. We further propose Surrogate-S, a fully data-dependent and implementable procedure that computes correction factors using only pilot variance estimates and achieves near-oracle performance in numerical experiments.2026-03-17T18:13:21ZFenghua YangDae Woong HamStefanus Jasinhttp://arxiv.org/abs/2603.10272v2An operator-level ARCH Model2026-03-17T18:01:54ZAutoRegressive Conditional Heteroscedasticity (ARCH) models are standard for modeling time series exhibiting volatility, with a rich literature in univariate and multivariate settings. In recent years, these models have been extended to function spaces. However, functional ARCH and generalized ARCH (GARCH) processes established in the literature have thus far been restricted to model ``pointwise'' variances. In this paper, we propose a new ARCH framework for data residing in general separable Hilbert spaces that accounts for the full evolution of the conditional covariance operator. We define a general operator-level ARCH model. For a simplified Constant Conditional Correlation version of the model, we establish conditions under which such models admit strictly and weakly stationary solutions, finite moments, and weak serial dependence. Additionally, we derive consistent Yule--Walker-type estimators of the infinite-dimensional model parameters. The practical relevance of the model is illustrated through simulations and a data application to high-frequency cumulative intraday returns.2026-03-10T23:04:20Z48 pages, 8 Figures, 2 TablesAlexander AueSebastian KühnertGregory RiceJeremy VanderDoeshttp://arxiv.org/abs/2603.16854v1Spatial Causal Tensor Completion for Multiple Exposures and Outcomes: An Application to the Health Effects of PFAS Pollution2026-03-17T17:57:12ZPer- and polyfluoroalkyl substances (PFAS) are typically encountered as mixtures of distinct chemicals with distinct effects on multiple health outcomes. Estimating joint causal effects using spatially-dependent observed data is challenging. We propose a spatial causal tensor completion framework that jointly models multiple exposures and outcomes within a low-rank tensor structure, while adjusting for observed confounders and latent spatial confounders. This method combines a low-rank tensor representation to pool information across exposures and outcomes with a spectral adjustment step that incorporates graph-Laplacian eigenvectors to approximate unmeasured spatial confounders, implemented via a projected-gradient descent algorithm. This framework enables causal inference in the presence of unmeasured spatial confounding and pervasive missingness of potential outcomes. We establish theoretical guarantees for the estimator and evaluate its finite-sample performance through extensive simulations. In an application to national PFAS monitoring data, our approach yields more conservative and credible causal relationships between PFOA and PFOS exposure and 13 chronic disease outcomes compared with existing alternatives.2026-03-17T17:57:12ZXiaodan ZhouBrian J ReichShu Yanghttp://arxiv.org/abs/2603.12454v2Rank-based methods for estimating landmark win probability in longitudinal randomized controlled trials with missing data2026-03-17T17:54:32ZThe primary analysis for longitudinal randomized controlled trials (RCTs) often compares treatment groups at the last timepoint, referred to as the landmark time. Assuming data are normally distributed and missing at random, the mixed model for repeated measures (MMRM) is widely used to conduct inference in terms of a mean difference. When outcomes violate normality assumption and/or the mean difference lacks a clear interpretation, we may quantify treatment effects using the probability that a treated participant would have a better outcome than (or win over) a control participant. For RCTs with missing data, one may apply the generalized pairwise comparison (GPC) procedure, which carries forward the results of a pairwise comparison from a previous timepoint. We propose first using ranks to converts each observation at a timepoint into a win fraction, reflecting the proportion of times that the observation is better than every observation in the comparison group. Then, we conduct inference for the win probability based on the win fractions using the MMRM to obtain the point and variance estimates. Simulation results suggest that our method performed much better than the GPC procedure. We illustrate our proposed procedure in SAS and R using data from two published trials.2026-03-12T21:12:47ZGuangyong ZouShi-Fang QuiJoshua ZouEmma Davies SmithYun-Hee ChoiYuhan Bihttp://arxiv.org/abs/2603.16833v1Semiparametric Inference under Dual Positivity Boundaries:Nested Identification with Administrative Censoring and Confounded Treatment2026-03-17T17:39:24ZWhen a long-term outcome is administratively censored for a substantial fraction of a study cohort while a short-term intermediate variable remains broadly available, the target causal parameter can be identified through a nested functional that integrates the outcome regression over the conditional intermediate distribution, avoiding inverse censoring weights entirely. In observational studies where treatment is also confounded, this nested identification creates a semiparametric structure with two distinct positivity boundaries -- one from the censoring mechanism and one from the treatment assignment -- that enter the efficient influence function in fundamentally different roles. The censoring boundary is removed from the identification by the nested functional but remains in the efficient score; the treatment boundary appears in both. We develop the inference theory for this dual-boundary structure. Three results are established.2026-03-17T17:39:24ZRWD analysis is still pending, in that the section 7 is empty for nowLin Lihttp://arxiv.org/abs/2603.16829v1Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing2026-03-17T17:35:32ZBeyond conditional average treatment effects, treatments may impact the entire outcome distribution in covariate-dependent ways, for example, by altering the variance or tail risks for specific subpopulations. We propose a novel estimand to capture such conditional distributional treatment effects, and develop a doubly robust estimator that is minimax optimal in the local asymptotic sense. Using this, we develop a test for the global homogeneity of conditional potential outcome distributions that accommodates discrepancies beyond the maximum mean discrepancy (MMD), has provably valid type 1 error, and is consistent against fixed alternatives -- the first test, to our knowledge, with such guarantees in this setting. Furthermore, we derive exact closed-form expressions for two natural discrepancies (including the MMD), and provide a computationally efficient, permutation-free algorithm for our test.2026-03-17T17:35:32ZSaksham JainAlex Luedtkehttp://arxiv.org/abs/2506.19015v3Principal stratification with recurrent events truncated by a terminal event: A nested Bayesian nonparametric approach2026-03-17T17:08:13ZRecurrent events often serve as key endpoints in clinical studies but may be prematurely truncated by terminal events such as death, creating selection bias and complicating causal inference. To address this challenge, we develop a Bayesian nonparametric framework to address potential selection bias due to truncation by death within the continuous-time principal stratification framework. We introduce causal estimands for recurrent events in the presence of a terminal event and derive a partial identification result for the estimand under a dual-frailty framework, enabling transparent sensitivity analysis for non-identifiable parameters. We then propose a flexible Bayesian nonparametric prior, the enriched dependent Dirichlet process, specifically designed for joint modeling of recurrent and terminal events, addressing a limitation where standard Dirichlet process priors create random partitions dominated by recurrent events, yielding poor predictive performance for terminal events. Simulations are carried out to show that our method has superior performance compared to existing methods. We apply the proposed new Bayesian nonparametric methods to infer the causal effect of a structured exercise program on rehospitalizations, which are subject to truncation by death.2025-06-23T18:10:14Z58 pagesYuki OhnishiMichael O. HarhayGuangyu TongFan Lihttp://arxiv.org/abs/2602.16933v2M-estimation under Two-Phase Multiwave Sampling with Applications to Prediction-Powered Inference2026-03-17T16:59:02ZIn two-phase multiwave sampling, inexpensive measurements are collected on a large sample and expensive, more informative measurements are adaptively obtained on subsets of units across multiple waves. Adaptively collecting the expensive measurements can increase efficiency but complicates statistical inference. We give valid estimators and confidence intervals for M-estimation under adaptive two-phase multiwave sampling. We focus on the case where proxies for the expensive variables -- such as predictions from pretrained machine learning models -- are available for all units and propose a Multiwave Predict-Then-Debias estimator that combines proxy information with the expensive, higher-quality measurements to improve efficiency while removing bias. We establish asymptotic linearity and normality and propose asymptotically valid confidence intervals. We also develop an approximately greedy sampling strategy that improves efficiency relative to uniform sampling. Data-based simulation studies support the theoretical results and demonstrate efficiency gains.2026-02-18T22:54:32ZDan M. KlugerStephen Bateshttp://arxiv.org/abs/2603.16530v1Estimation and Hypothesis Testing of Fixed Effects Models-Based Uncertainty for Factor Designs2026-03-17T13:53:31ZTo analyze the uncertain data frequently encountered in practice, this paper proposes novel fixed-effects models that incorporate an uncertain measure to investigate variables of interest and nuisance variables in factor designs. First, an uncertain fixed-effects (UFE) model of a single-factor design is established, and uncertain estimation and hypothesis testing are conducted. We then extend the UFE model to two-factor designs with and without interactions and classify them as balanced or unbalanced based on the equality of replicates within each combination. In the above UFE models, the effectiveness and practicality of estimation and hypothesis methods are demonstrated through three real-world cases, including both balanced and unbalanced designs. These examples highlight the models' ability to handle uncertain experimental data.2026-03-17T13:53:31Z24 pages, 10 tablesFan ZhangZhiming Lihttp://arxiv.org/abs/2508.11814v3Simulation-based validation of Bayes factor computation2026-03-17T13:25:00ZWe propose and evaluate two methods that validate the computation of Bayes factors: one based on an improved variant of simulation-based calibration checking (SBC) and one based on calibration metrics for binary predictions. We show that in theory, binary prediction calibration is equivalent to a special case of SBC, but with limited resources, binary prediction calibration is typically more sensitive to the problems we investigated. With well-designed test quantities, SBC can however detect all possible problems in computation, including some that cannot be uncovered by binary prediction calibration.
Previous work on Bayes factor validation includes checks based on the data-averaged posterior and the Good check method. We demonstrate that both checks miss many problems in Bayes factor computation detectable with SBC and binary prediction calibration. Moreover, we find that the Good check as originally described fails to control its error rates. Our proposed checks also typically use simulation results more efficiently than data-averaged posterior checks. Finally, we show that a special approach based on posterior SBC is necessary when checking Bayes factor computation under improper priors and we validate several models with such priors.
We recommend that novel methods for Bayes factor computation be validated with SBC, binary prediction calibration and data-averaged posterior with at least several hundred simulations. For all the models we tested, the bridgesampling and BayesFactor R packages satisfy all available checks and thus are likely safe to use in standard scenarios.2025-08-15T21:41:28Z49 pages, 14 figuresMartin ModrákSebastian StroppelPaul-Christian Bürknerhttp://arxiv.org/abs/2509.09965v2Confidence Intervals for Extinction Risk: Validating Population Viability Analysis with Limited Data2026-03-17T13:08:53ZQuantitative assessment of extinction risk requires confidence intervals (CIs) that remain informative with limited data. Their usefulness has long been debated because short observation spans can make uncertainty so large that population viability analysis appears impractical. I derive new CIs for extinction probability under the drift-Wiener process, a canonical model of extinction dynamics, by introducing transformed parameters $w$ and $z$ whose maximum-likelihood estimators follow noncentral $t$ distributions. The resulting $w$-$z$ method yields CIs with coverage close to the nominal level and shows that precision depends not only on data length but also on effect size: extinction probabilities that are sufficiently low or high can often be estimated reliably even from limited time series. I also propose an observation-error-and-autocovariance-robust (OEAR) estimator for settings with additive observation error and short-run dependence. Applied to two 64-year national harvest indices for Japanese eel (Anguilla japonica), the method gives Criterion E extinction probabilities far below the IUCN threatened-category thresholds, with narrow CIs, despite the species being listed as Endangered under Criterion A. These results show that extinction-risk CIs can be both statistically rigorous and practically informative for conservation assessment under limited data.2025-09-12T04:51:42Z151 pages, 32 figures, 30 tablesHiroshi Hakoyama10.1111/2041-210X.70294