https://arxiv.org/api/EtqszER2wtVf1kR+oUoRQq+H21w
2026-04-04T09:21:33Z

http://arxiv.org/abs/2603.00704v2
Robustifying Empirical Bayes
2026-03-25T20:33:24Z
Two strategies are explored for robustifying classical denoising procedures for the
Gaussian sequence model. First, the Hodges and Lehmann (1952)
restricted Bayes approach is used to reduce sensitivity to the specification
of the initial prior distribution. Second, alternatives to the Gaussian
noise assumption are explored. In both cases proposals of Huber (1964)
and Mallows (1978) play a crucial role.
2026-02-28T15:25:27Z
Roger Koenker, Jiaying Gu

http://arxiv.org/abs/2603.24792v1
Improving online FDR procedures via online analogs of e-closure and compound e-values
2026-03-25T20:06:38Z
In many scientific applications, hypotheses are generated and tested continuously in a stream. We develop a framework for improving online multiple testing procedures with false discovery rate (FDR) control under arbitrary dependence. Our approach is two-fold: we construct methods via the online e-closure principle, as well as a novel formulation of online compound e-values that is defined through donations. This yields strict power improvements over state-of-the-art e-value and p-value procedures while retaining FDR control. We further derive algorithms that compute the decision at time $t$ in $O(\log t)$ time, and we demonstrate improved empirical performance on synthetic and real data.
2026-03-25T20:06:38Z
44 pages, 9 figures
Ziyu Xu, Lasse Fischer, Aaditya Ramdas

http://arxiv.org/abs/2603.24783v1
Causal Discovery on Dependent Mixed Data with Applications to Gene Regulatory Network Inference
2026-03-25T19:57:07Z
Causal discovery aims to infer causal relationships among variables from observational data, typically represented by a directed acyclic graph (DAG). Most existing methods assume independent and identically distributed observations, an assumption often violated in modern applications. In addition, many datasets contain a mixture of continuous and discrete variables, which further complicates causal modeling when dependence across samples is present. To address these challenges, we propose a de-correlation framework for causal discovery from dependent mixed data. Our approach formulates a structural equation model with latent variables that accommodates both continuous and discrete variables while allowing correlated Gaussian errors across units.
We estimate the dependence structure among samples via a pairwise maximum likelihood estimator for the covariance matrix and develop an EM algorithm to impute latent variables underlying discrete observations. A de-correlation transformation of the recovered latent data enables the use of standard causal discovery algorithms to learn the underlying causal graph. Simulation studies demonstrate that the proposed method substantially improves causal graph recovery compared with applying standard methods directly to the original dependent data. We apply our approach to single-cell RNA sequencing data to infer gene regulatory networks governing embryonic stem cell differentiation. The inferred regulatory networks show significantly improved predictive likelihood on test data, and many high-confidence edges are supported by known regulatory interactions reported in the literature.
2026-03-25T19:57:07Z
Alex Chen, Qing Zhou

http://arxiv.org/abs/2603.24718v1
Wavelet-based estimation in aggregated functional data with positive and correlated errors
2026-03-25T18:42:11Z
We consider the statistical problem of estimating constituent curves from observations of their aggregated curves, referred to as \textit{aggregated functional data}, in models with strictly positive random errors following a Gamma distribution and correlated errors structured through AR(1) and ARFIMA processes. This problem arises in several areas of knowledge, such as chemometrics, for example, when absorbance curves of the constituents of a given substance must be estimated from its aggregated absorbance curve according to the Beer--Lambert law.
In this context, we propose Bayesian wavelet-based methods to estimate the component functions within a functional data analysis framework. This approach has the advantage of accurately estimating curves with important local features, such as discontinuities, peaks, and oscillations, due to the representation properties of functions in wavelet bases. We further evaluate the performance of the proposed method through computational simulations, as well as applications to real data.
2026-03-25T18:42:11Z
Alex Rodrigo dos Santos Sousa, João Victor Siqueira Rodrigues, Vitor Ribas Perrone, Raul Gomes Rocha

http://arxiv.org/abs/2603.24705v1
Amortized Inference for Correlated Discrete Choice Models via Equivariant Neural Networks
2026-03-25T18:30:11Z
Discrete choice models are fundamental tools in management science, economics, and marketing for understanding and predicting decision-making. Logit-based models are dominant in applied work, largely due to their convenient closed-form expressions for choice probabilities. However, these models entail restrictive assumptions on the stochastic utility component, constraining our ability to capture realistic and theoretically grounded choice behavior, most notably substitution patterns. In this work, we propose an amortized inference approach using a neural network emulator to approximate choice probabilities for general error distributions, including those with correlated errors. Our proposal includes a specialized neural network architecture and accompanying training procedures designed to respect the invariance properties of discrete choice models. We provide group-theoretic foundations for the architecture, including a proof of universal approximation given a minimal set of invariant features. Once trained, the emulator enables rapid likelihood evaluation and gradient computation.
We use Sobolev training, augmenting the likelihood loss with a gradient-matching penalty so that the emulator learns both choice probabilities and their derivatives. We show that emulator-based maximum likelihood estimators are consistent and asymptotically normal under mild approximation conditions, and we provide sandwich standard errors that remain valid even with imperfect likelihood approximation. Simulations show significant gains over the GHK simulator in accuracy and speed.
2026-03-25T18:30:11Z
Easton Huch, Michael Keane

http://arxiv.org/abs/2603.24704v1
Conformal Selective Prediction with General Risk Control
2026-03-25T18:29:23Z
In deploying artificial intelligence (AI) models, selective prediction offers the option to abstain from making a prediction when uncertain about model quality. To fulfill its promise, it is crucial to enforce strict and precise error control over cases where the model is trusted. We propose Selective Conformal Risk control with E-values (SCoRE), a new framework for deriving such decisions for any trained model and any user-defined, bounded and continuously-valued risk. SCoRE offers two types of guarantees on the risk among "positive" cases in which the system opts to trust the model. Built upon conformal inference and hypothesis testing ideas, SCoRE first constructs a class of (generalized) e-values, which are non-negative random variables whose product with the unknown risk has expectation no greater than one. Such a property is ensured by data exchangeability without requiring any modeling assumptions. Passing these e-values on to hypothesis testing procedures, we obtain binary trust decisions with finite-sample error control. SCoRE avoids the need for uniform concentration, and can be readily extended to settings with distribution shifts.
We evaluate the proposed methods with simulations and demonstrate their efficacy through applications to error management in drug discovery, health risk prediction, and large language models.
2026-03-25T18:29:23Z
Tian Bai, Ying Jin

http://arxiv.org/abs/2603.16833v3
Semiparametric Inference under Dual Positivity Boundaries: Nested Identification with Administrative Censoring and Confounded Treatment
2026-03-25T18:20:59Z
When a long-term outcome is administratively censored for a substantial fraction of a study cohort while a short-term intermediate variable remains broadly available, the target causal parameter can be identified through a nested functional that integrates the outcome regression over the conditional intermediate distribution, avoiding inverse censoring weights entirely. In observational studies where treatment is also confounded, this nested identification creates a semiparametric structure with two distinct positivity boundaries -- one from the censoring mechanism and one from the treatment assignment -- that enter the efficient influence function in fundamentally different roles. The censoring boundary is removed from the identification by the nested functional but remains in the efficient score; the treatment boundary appears in both. We develop the inference theory for this dual-boundary structure. Three results are established.
2026-03-17T17:39:24Z
Lin Li

http://arxiv.org/abs/2512.10069v2
Information Borrowing from Partially Compatible Trajectories for Estimation of Dynamic Treatment Regimes
2026-03-25T16:27:39Z
Dynamic Treatment Regimes (DTRs) provide a systematic framework for optimizing sequential decision-making in chronic disease management, where therapies must adapt to patients' evolving clinical profiles.
Inverse probability weighting (IPW) is a cornerstone methodology for estimating regime values from observational data due to its intuitive formulation and established theoretical properties, yet standard IPW estimators face significant limitations, including variance instability and data inefficiency. A fundamental but underexplored source of inefficiency lies in the strict alignment requirement between observed and target treatment trajectories, which fails to account for partial compatibility and discards substantial information from individuals with only minimal deviations from the regime. We propose two novel methodologies that relax the strict inclusion rule through flexible compatibility mechanisms. Both methods provide computationally tractable alternatives that can be easily integrated into existing IPW workflows, offering more efficient approaches to DTR estimation. Theoretical analysis demonstrates that both estimators preserve consistency while achieving superior finite-sample efficiency compared to standard IPW, and comprehensive simulation studies confirm improved stability. We illustrate the practical utility of our methods through an application to HIV treatment data from the AIDS Clinical Trials Group Study 175 (ACTG175).
2025-12-10T20:43:40Z
Chloe Si, David A. Stephens, Erica E. M. Moodie

http://arxiv.org/abs/2603.24439v1
Distributionally balanced sampling designs via minimum tactical configurations
2026-03-25T15:50:57Z
Distributionally balanced sampling designs are low-discrepancy probability designs obtained by minimizing the expected discrepancy between the auxiliary-variable distribution of a random sample and the target population distribution. Existing constructions rely on circular population sequences, which restrict the design space by forcing samples to be contiguous blocks of a sequence. We propose a new construction based on minimum tactical configurations that removes this topological constraint.
The resulting designs are fixed-size, have equal inclusion probabilities, and belong to the class with minimum feasible configuration size. We develop both a simple initialization valid for arbitrary population and sample sizes and a spatial initialization that yields a lower initial expected discrepancy, together with a simulated annealing algorithm for optimization within this class. In simulations and empirical examples, the proposed method outperforms state-of-the-art alternatives in terms of distributional fit, balance, and spatial spread.
2026-03-25T15:50:57Z
15 pages, 3 figures
Anton Grafström, Wilmer Prentius

http://arxiv.org/abs/2603.24421v1
E-values as statistical evidence: A comparison to Bayes factors, likelihoods, and p-values
2026-03-25T15:32:53Z
A recurring debate in the philosophy of statistics concerns what, exactly, should count as a measure of evidence for or against a given hypothesis. P-values, likelihood ratios, and Bayes factors all have their defenders. In this paper we add two additional candidates to this list: the e-value and its sequential analogue, the e-process. E-values enjoy several desirable properties as measures of evidence: they combine naturally across studies, handle composite hypotheses, provide long-run error rates, and admit a useful interpretation as the wealth accrued by a bettor in a game against the null distribution. E-processes additionally handle optional stopping and optional continuation.
This work examines the extent to which e-values and e-processes satisfy the evidential desiderata of different statistical traditions, concluding that they combine attractive features of p-values, likelihood ratios, and Bayes factors, and merit serious consideration as interpretable and intuitive measures of statistical evidence.
2026-03-25T15:32:53Z
34 pages
Ben Chugg, Aaditya Ramdas, Peter Grünwald

http://arxiv.org/abs/2506.03462v2
Robust domain selection for functional data via interval-wise testing and effect size mapping
2026-03-25T15:28:38Z
Among inferential problems in functional data analysis, domain selection is of practical interest: it aims to identify the sub-interval(s) of the domain where desired functional features are displayed. Motivated by applications in quantitative ultrasound signal analysis, we propose a robust domain selection method that aims to discover a subset of the domain where location parameters behave differently among groups. By extending the interval testing approach, we propose to take into account multiple aspects of functional features simultaneously to detect the practically interpretable domain. To further handle potential outliers and missing segments on collected functional trajectories, we perform interval testing with a test statistic based on functional M-estimators for the inference. In addition, we introduce the effect size heatmap, which calculates robustified effect sizes from the smallest to the largest scales over the domain to reflect dynamic functional behaviors among groups, so that clinicians can gain a comprehensive understanding and select practically meaningful sub-interval(s).
The performance of the proposed method is demonstrated through simulation studies and an application to the motivating quantitative ultrasound measurements.
2025-06-04T00:01:16Z
Journal of the Royal Statistical Society Series C: Applied Statistics (2026)
Yeonjoo Park, Aiguo Han
10.1093/jrsssc/qlag014

http://arxiv.org/abs/2012.08371v3
Limiting laws and consistent estimation criteria for fixed and diverging number of spiked eigenvalues
2026-03-25T15:10:49Z
In this paper, we study limiting laws and consistent estimation criteria for the extreme eigenvalues in a spiked covariance model of dimension $p$. Firstly, for fixed $p$, we propose a generalized estimation criterion that can consistently estimate $k$, the number of spiked eigenvalues. Compared with the existing literature, we show that consistency can be achieved under weaker conditions on the penalty term. Next, allowing both $p$ and $k$ to diverge, we derive limiting distributions of the spiked sample eigenvalues using random matrix theory techniques. Notably, our results do not require the spiked eigenvalues to be uniformly bounded from above or to tend to infinity, as has been assumed in the existing literature. Based on the above derived results, we formulate a generalized estimation criterion and show that it can consistently estimate $k$, while $k$ can be fixed or grow at an order of $k=o(n^{1/3})$. We further show that the results in our work continue to hold under a general population distribution without assuming normality. The efficacy of the proposed estimation criteria is illustrated through comparative simulation studies.
2020-12-15T15:36:03Z
Jianwei Hu, Jingfei Zhang, Jianhua Guo, Ji Zhu

http://arxiv.org/abs/2603.24392v1
Federated fairness-aware classification under differential privacy
2026-03-25T15:09:12Z
Privacy and algorithmic fairness have become two central issues in modern machine learning.
Although each has separately emerged as a rapidly growing research area, their joint effect remains comparatively under-explored. In this paper, we systematically study the joint impact of differential privacy and fairness on classification in a federated setting, where data are distributed across multiple servers. Targeting demographic disparity constrained classification under federated differential privacy, we propose a two-step algorithm, namely FDP-Fair. In the special case where there is only one server, we further propose a simple yet powerful algorithm, namely CDP-Fair, serving as a computationally-lightweight alternative. Under mild structural assumptions, theoretical guarantees on privacy, fairness and excess risk control are established. In particular, we disentangle the source of the private fairness-aware excess risk into a) intrinsic cost of classification, b) cost of private classification, c) non-private cost of fairness and d) private cost of fairness. Our theoretical findings are complemented by extensive numerical experiments on both synthetic and real datasets, highlighting the practicality of our designed algorithms.
2026-03-25T15:09:12Z
Gengyu Xue, Yi Yu

http://arxiv.org/abs/2503.13191v2
Stein's method of moment estimators for local dependency exponential random graph models
2026-03-25T14:43:36Z
Providing theoretical guarantees for parameter estimation in exponential random graph models is a largely open problem. While maximum likelihood estimation has theoretical guarantees in principle, verifying the assumptions for these guarantees to hold can be very difficult. Moreover, in complex networks, numerical maximum likelihood estimation is computer-intensive and may not converge in reasonable time. To ameliorate this issue, local dependency exponential random graph models have been introduced, which assume that the network consists of many independent exponential random graphs.
In this setting, progress towards maximum likelihood estimation has been made. However, the estimation is still computer-intensive. Instead, we propose to use so-called Stein estimators: we use the Stein characterizations to obtain new estimators for local dependency exponential random graph models.
2025-03-17T14:01:11Z
Updated version with detailed connection to MPLE
Adrian Fischer, Gesine Reinert, Wenkai Xu

http://arxiv.org/abs/2603.24333v1
Notes on Forré's Notion of Conditional Independence and Causal Calculus for Continuous Variables
2026-03-25T14:13:23Z
Recently, Forré (arXiv:2104.11547, 2021) introduced transitional conditional independence, a notion of conditional independence that provides a unified framework for both random and non-stochastic variables. The original paper establishes a strong global Markov property connecting transitional conditional independencies with suitable graphical separation criteria for directed mixed graphs with input nodes (iDMGs), together with a version of causal calculus for iDMGs in a general measure-theoretic setting. These notes aim to further illustrate the motivations behind this framework and its connections to the literature, highlight certain subtleties in the general measure-theoretic causal calculus, and extend the "one-line" formulation of the ID algorithm of Richardson et al. (Ann. Statist. 51(1):334--361, 2023) to the general measure-theoretic setting.
2026-03-25T14:13:23Z
Leihao Chen