http://arxiv.org/abs/2603.13784v2
Mixed difference integer-valued GARCH model for $\mathbb{Z}$-valued time series
Updated: 2026-03-17T03:35:15Z
In this paper, we introduce flexible observation-driven $\mathbb{Z}$-valued time series models constructed from mixtures of negative and non-negative components. Compared to models based on the standard Skellam distribution or on a difference of two integer-valued variables, our specification offers greater versatility. For example, it easily allows for skewness and bimodality. Furthermore, the observation of one component of the mixture makes interpretation and statistical analysis easier. We establish conditions for stationarity and mixing, and develop a mixed Poisson quasi-maximum likelihood estimator with proven asymptotic properties. A portmanteau test is proposed to diagnose residual serial dependence. The finite-sample performance of the methodology is assessed via simulation, and an empirical application on tick prices demonstrates its practical usefulness.
Published: 2026-03-14T06:27:03Z
Comment: 61 pages, 8 figures
Authors: Abdelhakim Aknouche, Christian Francq, Yuichi Goto

http://arxiv.org/abs/2603.16041v1
Power Analysis for Prediction-Powered Inference
Updated: 2026-03-17T00:57:11Z
Modern studies increasingly leverage outcomes predicted by machine learning and artificial intelligence (AI/ML) models, and recent work, such as prediction-powered inference (PPI), has developed valid downstream statistical inference procedures. However, classical power and sample size formulas do not readily account for these predictions. In this work, we tackle a simple yet practical question: given a new AI/ML model with high predictive power, how many labeled samples are needed to achieve a desired level of statistical power? We derive closed-form power formulas by characterizing the asymptotic variance of the PPI estimator and applying Wald test inversion to obtain the required labeled sample size.
Our results cover widely used settings including two-sample comparisons and risk measures in 2x2 tables. We find that a useful rule of thumb is that the reduction in required labeled samples relative to classical designs scales roughly with the $R^2$ between the predictions and the ground truth. Our analytical formulas are validated using Monte Carlo simulations, and we illustrate the framework in three contemporary biomedical applications spanning single-cell transcriptomics, clinical blood pressure measurement, and dermoscopy imaging. We provide our software as an R package and online calculators at https://github.com/yiqunchen/pppower.
Published: 2026-03-17T00:57:11Z
Authors: Yiqun T. Chen, Moran Guo, Shengy Li

http://arxiv.org/abs/2603.15924v1
Time Partitioning in Target Trial Emulation
Updated: 2026-03-16T21:17:21Z
In target trial emulation, time partitioning enables researchers to handle time-varying confounders and immortal time bias with appropriate methods. Based on two clinical scenarios, this study aimed to explore issues related to time partitioning and to provide guidance for trial emulation. After formalizing the research question within the framework of structural causal models, we show how a given time partitioning may be too fine or too coarse depending on the clinical context. When the partitioning is too fine, the dimensionality of the model is unnecessarily high. When the partitioning is too coarse, the resulting causal structure may hinder effect estimation. We also show that cloning-censoring-weighting may not be valid when treatment influences outcome within study periods, and we confirm this through simulations.
In conclusion, we provide practical guidance for actively specifying an appropriate time partitioning in trial emulation, rather than using the available data resolution as a default.
Published: 2026-03-16T21:17:21Z
Authors:
Harold Tankpinou Zoumenou (Sorbonne Universite, Inserm, Institut Pierre-Louis d'epidemiologie et de sante publique, Paris, France)
Simon Ferreira (Sorbonne Universite, Inserm, Institut Pierre-Louis d'epidemiologie et de sante publique, Paris, France)
Charles Assaad (Sorbonne Universite, Inserm, Institut Pierre-Louis d'epidemiologie et de sante publique, Paris, France)
Nathanael Lapidus (Sorbonne Universite, Inserm, Institut Pierre-Louis d'epidemiologie et de sante publique, Departement de Sante Publique, Hopital Saint-Antoine, APHP, Paris, France)
Daria Bystrova (Sorbonne Universite, Inserm, Institut Pierre-Louis d'epidemiologie et de sante publique, Paris, France)
Benjamin Glemain (Sorbonne Universite, Inserm, Institut Pierre-Louis d'epidemiologie et de sante publique, Departement de Sante Publique, Hopital Saint-Antoine, APHP, Paris, France; Paris Brain Institute-ICM, Inserm, Inria, Sorbonne Universite, Paris, France)

http://arxiv.org/abs/2603.15902v1
SEMMS with Random Effects: A Mixed-Model Extension for Variable Selection in Clustered and Longitudinal Data
Updated: 2026-03-16T20:46:38Z
SEMMS (Scalable Empirical-Bayes Model for Marker Selection) is a variable-selection procedure for generalized linear models that uses a three-component normal mixture prior on regression coefficients. In its original form, SEMMS assumes that all observations are independent. Many real-world datasets, however, arise from repeated-measures or clustered designs in which observations within the same subject are correlated. Ignoring this correlation inflates the apparent residual variance and can severely degrade variable-selection performance. We extend SEMMS to accommodate random intercepts, random slopes, or both, via an alternating coordinate-ascent algorithm.
After each round of fixed-effect variable selection, the subject-level best linear unbiased predictors (BLUPs) are updated with \texttt{lmer} (Gaussian) or \texttt{glmer} (non-Gaussian); the fixed-effect step then operates on the random-effect-adjusted response. We describe the algorithm, evaluate its performance in three Gaussian simulation studies spanning a range of signal strengths, random-effect magnitudes, and sample/predictor-space regimes, and present a semi-synthetic real-data example. We further extend the framework to non-Gaussian families (Poisson, binomial) via an IRLS working-response adaptation: at each outer iteration the fixed-effects step uses the RE-adjusted working response computed from the current \texttt{glmer} fitted values rather than the raw response. When the fixed-effect signal is strong relative to the random-effect variance, both the original and extended procedures perform comparably. When the random-effect variance dominates -- the scenario most likely to cause plain SEMMS to fail -- the mixed-model extension recovers the exact true predictor set in 93\% of simulated datasets (Gaussian), 61\% (Poisson), and 65\% (binomial), compared with 1\%, 45\%, and 39\% for plain SEMMS respectively.
Published: 2026-03-16T20:46:38Z
Authors: Haim Bar, Martin T. Wells

http://arxiv.org/abs/2603.15884v1
A Utility Score Framework for Dose Optimization Studies with Binary Efficacy-Safety Endpoints: Sample Size Determination and Bias Characterization
Updated: 2026-03-16T20:23:16Z
The FDA's Project Optimus initiative emphasizes patient-centered dose selection in oncology that balances efficacy and safety. We develop a framework for randomized dose optimization studies that uses clinically interpretable utility scores to integrate binary efficacy and safety endpoints and select the optimal dose for a follow-on confirmatory trial.
The framework provides: (i) a systematic method for eliciting utility scores that reflect clinical priorities; (ii) closed-form sample size formulas to achieve prespecified Probabilities of Correct Selection (PCS) under clinically relevant scenarios; and (iii) analytical expressions characterizing the propagation of selection-induced bias to confirmatory trials, including time-to-event endpoints correlated with the selection endpoint. Extensive simulations (10^6 replications per scenario) confirm that the sample size methods achieve target PCS and that the bias and Type I error formulas closely match empirical estimates. An R package DoseOptDesign and an interactive Shiny application are publicly available.
Published: 2026-03-16T20:23:16Z
Authors: Xuemin Gu, Cong Xu, Lei Xu, Ying Yu

http://arxiv.org/abs/2504.02547v3
Outlier-Robust Multi-Group Gaussian Mixture Modeling with Flexible Group Reassignment
Updated: 2026-03-16T20:00:08Z
Do expert-defined or diagnostically-labeled data groups align with clusters inferred through statistical modeling? If not, where do discrepancies between predefined labels and model-based groupings occur and why? In this work, we introduce the multi-group Gaussian mixture model (MG-GMM), the first model developed to investigate these questions. It incorporates prior group information while allowing flexibility to reassign observations to alternative groups based on data-driven evidence. We achieve this by modeling the observations of each group as arising not from a single distribution, but from a Gaussian mixture comprising all group-specific distributions. Moreover, our model offers robustness against cellwise outliers that may obscure or distort the underlying group structure. We propose a novel penalized likelihood approach, called cellMG-GMM, to jointly estimate mixture probabilities, location and scale parameters of the MG-GMM, and detect outliers through a penalty term on the number of flagged cellwise outliers in the objective function.
We show that our estimator has good breakdown properties in the presence of cellwise outliers. We develop a computationally-efficient EM-based algorithm for cellMG-GMM, and demonstrate its strong performance in identifying and diagnosing observations at the intersection of multiple groups through simulations and diverse applications in medicine and oenology.
Published: 2025-04-03T12:54:21Z
Authors: Patricia Puchhammer, Ines Wilms, Peter Filzmoser

http://arxiv.org/abs/2601.01259v3
A Novel Multiple Imputation Approach For Parameter Estimation in Observation-Driven Time Series Models With Missing Data
Updated: 2026-03-16T19:59:13Z
Handling missing data in time series is a complex problem due to the presence of temporal dependence. General-purpose imputation methods, while widely used, often distort key statistical properties of the data, such as variance and dependence structure, leading to biased estimation and misleading inference. These issues become more pronounced in models that explicitly rely on capturing serial dependence, as standard imputation techniques fail to preserve the underlying dynamics. This paper proposes a novel multiple imputation method specifically designed for parameter estimation in observation-driven models (ODM). The approach takes advantage of the iterative nature of the systematic component in ODM to propagate the dependence structure through missing data, minimizing its impact on estimation. Unlike traditional imputation techniques, the proposed method accommodates continuous, discrete, and mixed-type data while preserving key distributional and dependence properties. We evaluate its performance through Monte Carlo simulations in the context of GARMA models, considering time series with up to 70\% missing data.
An application to the proportion of stocked energy stored in South Brazil further demonstrates its practical utility.
Published: 2026-01-03T19:00:02Z
Comment: This version presents the large sample theory for the proposed method, showing its strong consistency under mild assumptions, regardless of the amount of missing data or its generating mechanism
Authors: Guilherme Pumi, Taiane Schaedler Prass, Douglas Krauthein Verdum

http://arxiv.org/abs/2603.16950v1
Kriging via variably scaled kernels
Updated: 2026-03-16T19:57:42Z
Classical Gaussian processes and Kriging models are commonly based on stationary kernels, whereby correlations between observations depend exclusively on the relative distance between scattered data. While this assumption ensures analytical tractability, it limits the ability of Gaussian processes to represent heterogeneous correlation structures. In this work, we investigate variably scaled kernels as an effective tool for constructing non-stationary Gaussian processes by explicitly modifying the correlation structure of the data. Through a scaling function, variably scaled kernels alter the correlations between data and enable the modeling of targets exhibiting abrupt changes or discontinuities. We analyse the resulting predictive uncertainty via the variably scaled kernel power function and clarify the relationship between variably scaled kernels-based constructions and classical non-stationary kernels. Numerical experiments demonstrate that variably scaled kernels-based Gaussian processes yield improved reconstruction accuracy and provide uncertainty estimates that reflect the underlying structure of the data.
Published: 2026-03-16T19:57:42Z
Authors: Gianluca Audone, Francesco Marchetti, Emma Perracchione, Milvia Rossini

http://arxiv.org/abs/2603.15845v1
Besag-Clifford e-values for unnormalized testing
Updated: 2026-03-16T19:24:51Z
Unnormalized probability distributions are frequently used in machine learning for modeling complex data generating processes.
Though Markov chain Monte Carlo (MCMC) algorithms can approximately sample from unnormalized distributions, intractability of their normalizing constants renders likelihood ratio testing infeasible. We propose to use the parallel method of Besag and Clifford to generate samples that are exchangeable with the data under the null, to then generate valid e-values for any number of iterations or algorithmic steps. We show that as the number of samples grows, these Besag-Clifford e-values constructed using the unnormalized likelihood ratio are actually log-optimal up to a multiplicative term that diminishes with the mixing time of the Markov chain. Additionally, averaging over the output of multiple chains retains validity while increasing the e-power. We extend Besag-Clifford e-values to the general problem of unnormalized test statistics, which allows application to composite hypotheses, uncertainty quantification, generative model evaluation, and sequential testing. Through simulations and an application to galaxy velocity modeling, we empirically verify our theory, explore the impact of autocorrelation and mixing, and evaluate the performance of Besag-Clifford e-values.
Published: 2026-03-16T19:24:51Z
Authors: Alexander Dombowsky, Barbara E. Engelhardt, Aaditya Ramdas

http://arxiv.org/abs/2603.15817v1
On the Equivalence between Neyman Orthogonality and Pathwise Differentiability
Updated: 2026-03-16T18:48:55Z
It has been frequently observed that Neyman orthogonality, the central device underlying double/debiased machine learning (Chernozhukov et al., 2018), and pathwise differentiability, a cornerstone concept from semiparametric theory, often lead to the same debiased estimators in practice. Despite the widespread adoption of both ideas, the precise nature of this equivalence has remained elusive, with the two concepts having been developed in largely separate traditions.
In this work, we revisit the semiparametric framework of van der Laan and Robins (2003) and identify an implicit regularity assumption on the relationship between target and nuisance parameters -- a local product structure -- that allows us to establish a formal equivalence between Neyman orthogonality and pathwise differentiability. We demonstrate that the two directions of this equivalence impose fundamentally different structural requirements, and illustrate the theory through a concrete example of estimating the average treatment effect. This helps clarify the relationship between these two foundational frameworks and provides a useful reference for practitioners working at their intersection.
Published: 2026-03-16T18:48:55Z
Authors: Yuxi Chen, Edward H. Kennedy, Sivaraman Balakrishnan

http://arxiv.org/abs/2603.15578v1
Low-Complexity and Consistent Graphon Estimation from Multiple Networks
Updated: 2026-03-16T17:41:00Z
Recovering the random graph model from an observed collection of networks is known to present significant challenges in the setting where the networks do not share a common node set and have different sizes. More specifically, the goal is the estimation of the graphon function that parametrizes the nonparametric exchangeable random graph model. Existing methods typically suffer from either limited accuracy or high computational complexity. We introduce a new histogram-based estimator with low algorithmic complexity that achieves high accuracy by jointly aligning the nodes of all graphs, in contrast to most conventional methods that order nodes graph by graph. Consistency results of the proposed graphon estimator are established. A numerical study shows that the proposed estimator outperforms existing methods in terms of accuracy, especially when the dataset comprises only small and variable-size networks. Moreover, the computing time of the new method is considerably shorter than that of other consistent methodologies.
Additionally, when applied to a graph neural network classification task, the proposed estimator enables more effective data augmentation, yielding improved performance across diverse real-world datasets.
Published: 2026-03-16T17:41:00Z
Comment: Accepted at AISTATS 2026
Authors: Roland Boniface Sogan, Tabea Rebafka

http://arxiv.org/abs/2512.05650v2
Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation
Updated: 2026-03-16T14:49:46Z
Estimating latent epidemic states and model parameters from partially observed, noisy data remains a major challenge in infectious disease modeling. State-space formulations provide a coherent probabilistic framework for such inference, yet fully Bayesian estimation is often computationally prohibitive because evaluating the observed-data likelihood requires integration over a latent trajectory. The Sequential Monte Carlo squared (SMC$^2$) algorithm offers a principled approach for joint state and parameter inference, combining an outer SMC sampler over parameters with an inner particle filter that estimates the likelihood up to the current time point. Despite its theoretical appeal, this nested particle filter imposes substantial computational cost, limiting routine use in near-real-time outbreak response. We propose Ensemble SMC$^2$ (eSMC$^2$), a computationally efficient variant that replaces the inner particle filter with an Ensemble Kalman Filter (EnKF) to approximate the incremental likelihood at each observation time. While this substitution introduces bias via a Gaussian approximation, we mitigate finite-sample effects using an unbiased Gaussian density estimator and adapt the EnKF for epidemic data through state-dependent observation variance. This makes our approach particularly suitable for overdispersed incidence data commonly encountered in infectious disease surveillance. Simulation experiments with known ground truth and an application to 2022 United States (U.S.)
monkeypox incidence data demonstrate that eSMC$^2$ achieves substantial computational gains while producing posterior estimates comparable to SMC$^2$. The method accurately recovers latent epidemic trajectories and key epidemiological parameters, providing an efficient framework for sequential Bayesian inference from imperfect surveillance data.
Published: 2025-12-05T11:51:55Z
Authors: Dhorasso Temfack, Jason Wyse

http://arxiv.org/abs/2507.00260v2
Disentangled Feature Importance
Updated: 2026-03-16T13:57:57Z
Feature importance (FI) measures are widely used to assess the contributions of predictors to an outcome, but they may target different notions of relevance. When predictors are correlated, traditional statistical FI methods are often tailored for feature selection and correlation can therefore be treated as conditional redundancy. By contrast, for model interpretation, FI is more naturally defined through marginal predictive relevance. In this context, we show that most existing approaches target identical population functionals under squared-error loss and exhibit correlation-induced bias.
To address this limitation, we introduce Disentangled Feature Importance (DFI), a nonparametric generalization of the classical $R^2$ decomposition via canonical entropic optimal transport (EOT). DFI transforms correlated features into independent latent features using an EOT coupling for general covariate laws, including mixed and discrete settings. Importance scores are computed in this disentangled space and attributed back through the transition kernel's sensitivity. Under arbitrary feature dependencies, DFI provides a principled decomposition of latent importance scores that sum to the total predictive variability for latent additive models and to interaction-weighted functional ANOVA variances more generally.
We develop semiparametric theory for DFI. Under the EOT formulation, we establish root-$n$ consistency and asymptotic normality for nondegenerate importance estimators in the latent space and the original feature space. Notably, our estimators achieve second-order estimation error, which vanishes if both regression function and EOT kernel estimation errors are $o_{\mathbb{P}}(n^{-1/4})$. By design, DFI avoids the computational burden of repeated submodel refitting and the challenges of conditional covariate distribution estimation, thereby achieving computational efficiency.
Published: 2025-06-30T20:54:48Z
Comment: 27 main and 47 supplementary pages
Authors: Jin-Hong Du, Kathryn Roeder, Larry Wasserman

http://arxiv.org/abs/2307.01111v3
A Gaussian process and linear-based framework for computing cut distributions in modular Bayesian calibration of two chained computer models
Updated: 2026-03-16T13:53:45Z
Computer models are widely used in science and engineering to simulate complex systems. However, these models are affected by several sources of uncertainty, which may limit their use for decision making in risk management. We present a Bayesian approach for quantifying parameter uncertainty in a chain of two computer models motivated by multiphysics simulations in the nuclear field. Part of the inputs of a downstream model parametrized by $θ\in \mathbb{R}^p$ come from the outputs of an upstream model parametrized by $λ\in \mathbb{R}^q$. Usually, the joint posterior distribution of $(θ, λ)$ would be obtained by applying Bayes' theorem using the experimental observations of both models. However, when the observations of the downstream model are too indirect to provide informative inference on $λ$, it may be preferable to compute a modular posterior distribution of $(θ, λ)$, referred to as the \emph{cut distribution}.
Assuming that the posterior distribution of $λ$ has been previously estimated from observations of the upstream model only, we aim to compute the posterior distribution of $θ$ conditional on $λ$ using observations from the downstream model. To this end, we propose a Gaussian-process and linear-based framework to estimate the functional dependence between $θ$ and $λ$, denoted by $θ(λ)$, where each component is modeled as a realization of a Gaussian process. As the downstream model is approximated by a linear function of $θ(λ)$, Bayesian conjugacy allows us to derive a Gaussian posterior predictive distribution of $θ(λ)$ for any realization of $λ$. The effectiveness of the method is illustrated through several synthetic examples, and we highlight how variations in $λ$ impact the predictive distribution of the chained simulation.
Published: 2023-07-03T15:35:55Z
Comment: 44 pages, 14 figures
Authors: Oumar Baldé, Guillaume Damblin, Amandine Marrel, Antoine Bouloré, Loïc Giraldi

http://arxiv.org/abs/2509.19040v2
Nonparametric efficient estimation of the longitudinal front-door functional
Updated: 2026-03-16T13:29:59Z
The front-door criterion is an identification strategy for the intervention-specific mean outcome in settings where the standard back-door criterion fails due to unmeasured exposure-outcome confounders, but an intermediate variable exists that completely mediates the effect of exposure on the outcome and is not affected by unmeasured confounding. The front-door criterion has been extended to the longitudinal setting, where exposure and mediator vary over time. However, with the exception of a simple plug-in estimator, no suitable estimation techniques have been proposed. In this work, we derive nonparametric efficient estimators of the longitudinal front-door functional. The estimators accommodate high-dimensional mediators, are multiply robust, and allow for the use of data-adaptive methods for estimating nuisance functions while still providing valid inference.
The theoretical properties of the estimators are illustrated in a simulation study, and we apply the estimators to a trial of peanut allergy in infants.
Published: 2025-09-23T14:09:50Z
Authors: Marie S. Breum, Helene C. W. Rytgaard, Torben Martinussen, Erin E. Gabriel