https://arxiv.org/api/OxlveQK1wD8g/Sw0E5eggiXcsXs
Updated: 2026-03-20T14:31:13Z

http://arxiv.org/abs/2405.19553v2
Convergence Bounds for Sequential Monte Carlo on Multimodal Distributions using Soft Decomposition
Updated: 2026-03-17T12:42:59Z
We prove bounds on the variance of a function $f$ under the empirical measure of the samples obtained by the Sequential Monte Carlo (SMC) algorithm, with time complexity depending on local rather than global Markov chain mixing dynamics. SMC is a Markov Chain Monte Carlo (MCMC) method, which starts by drawing $N$ particles from a known distribution, and then, through a sequence of distributions, re-weights and re-samples the particles, at each instance applying a Markov chain for smoothing. In principle, SMC tries to alleviate problems from multi-modality. However, most theoretical guarantees for SMC are obtained by assuming global mixing time bounds, which are only efficient in the uni-modal setting. We show that bounds can be obtained in the truly multi-modal setting, with mixing times that depend only on local MCMC dynamics.
Published: 2024-05-29T22:43:45Z
Authors: Holden Lee, Matheau Santana-Gijzen

http://arxiv.org/abs/2510.25289v2
Testing Correlation in Graphs by Counting Bounded Degree Motifs
Updated: 2026-03-17T12:38:28Z
We investigate the problem of detecting correlation between two Erdős-Rényi graphs $G(n,p)$, formulated as a hypothesis testing problem: under the null hypothesis, the two graphs are independent, while under the alternative hypothesis, they are correlated through a latent bijective mapping between their vertex sets. We develop a polynomial-time test by counting bounded degree motifs and prove its effectiveness for any constant correlation coefficient $ρ$ when the edge connecting probability satisfies $p\ge n^{-1+δ}$ for some constant $δ>0$.
In particular, our guarantee improves the constraint of motif-counting methods from $ρ\ge \sqrt{α}$ to any constant $ρ= Ω(1)$, where $α\approx 0.338$ is Otter's constant.
Published: 2025-10-29T08:45:14Z
Comment: 46 pages, 8 figures
Authors: Dong Huang, Pengkun Yang

http://arxiv.org/abs/2504.13336v2
On the minimax optimality of Flow Matching through the connection to kernel density estimation
Updated: 2026-03-17T10:54:30Z
Flow Matching has recently gained attention in generative modeling as a simple and flexible alternative to diffusion models. While existing statistical guarantees adapt tools from the analysis of diffusion models, we take a different perspective by connecting Flow Matching to kernel density estimation. We first verify that the kernel density estimator matches the optimal rate of convergence in Wasserstein distance up to logarithmic factors, improving existing bounds for the Gaussian kernel. Based on this result, we prove that for sufficiently large networks, Flow Matching achieves the optimal rate up to logarithmic factors. If the target distribution lies on a lower-dimensional manifold, we show that the kernel density estimator profits from the smaller intrinsic dimension on a small tube around the manifold. The faster rate also applies to Flow Matching, providing a theoretical foundation for its empirical success in high-dimensional settings.
Published: 2025-04-17T21:06:41Z
Authors: Lea Kunkel, Mathias Trabs

http://arxiv.org/abs/2503.13986v3
Stratified Permutational Berry--Esseen Bounds and Their Applications to Statistics
Updated: 2026-03-17T10:24:42Z
The stratified linear permutation statistic arises in various statistics problems, including stratified and post-stratified survey sampling, stratified and post-stratified experiments, conditional permutation tests, etc.
Although we can derive the Berry--Esseen bounds for the stratified linear permutation statistic based on existing bounds for the non-stratified statistics, those bounds are not sharp, and moreover, this strategy does not work in general settings with heterogeneous strata with varying sizes. We first use Stein's method to obtain a unified stratified permutational Berry--Esseen bound that can accommodate heterogeneous strata. We then apply the bound to various statistics problems, leading to stronger theoretical quantifications and thereby facilitating statistical inference in those problems.
Published: 2025-03-18T07:44:01Z
Authors: Pengfei Tian, Fan Yang, Peng Ding

http://arxiv.org/abs/2409.01983v3
The causal interpretation of acceleration factors
Updated: 2026-03-17T10:20:48Z
In studies of time-to-event outcomes with unmeasured heterogeneity, the hazard ratio for treatment is known to have a complex causal interpretation. Accelerated failure time (AFT) models, which assess the effect on the survival time ratio scale, are often suggested as a better alternative because they model a parameter with direct causal interpretation while allowing straightforward adjustment for measured confounders. In this work, we formalize the causal interpretation of the acceleration factor in AFT models using structural causal models and data under independent censoring. We prove that the acceleration factor is a valid causal effect measure, even in the presence of frailty and treatment effect heterogeneity. Through simulations, we show that the acceleration factor better captures the causal effect than the hazard ratio when both AFT and conditional proportional hazards models apply. Additionally, we extend the interpretation to systems with time-dependent acceleration factors, illustrating the impossibility of distinguishing between a time-varying homogeneous effect and unmeasured effect heterogeneity.
While the causal interpretation of acceleration factors is promising, we caution practitioners about potential challenges for the interpretation in the presence of effect heterogeneity.
Published: 2024-09-03T15:25:55Z
Authors: Mari Brathovde, Hein Putter, Morten Valberg, Richard A. J. Post

http://arxiv.org/abs/2603.16294v1
A Kernel Two-Sample Test Invariant under Group Action with Applications to Functional Data
Updated: 2026-03-17T09:31:38Z
We introduce a kernel-based two-sample test for comparing probability distributions up to group actions. Our construction yields invariant kernels for locally compact $σ$-compact groups and extends classical Haar-based approaches beyond the compact setting. The resulting invariant Maximum Mean Discrepancy (MMD) test is developed in a general framework where the sample space is assumed to be Polish. Under natural conditions, the invariant kernel induces a characteristic kernel on the quotient space, ensuring consistency of the associated MMD test. The method is well suited to functional data, where invariances such as temporal shifts arise naturally, and its effectiveness is illustrated through simulation studies.
Published: 2026-03-17T09:31:38Z
Authors: Madison Giacofci (UR2, IRMAR), Anouar Meynaoui (UR2, IRMAR), Alex Podgorny (ENSAI, CREST)

http://arxiv.org/abs/2503.13148v3
Spearman's rho for zero-inflated count data: formulation and attainable bounds
Updated: 2026-03-17T08:09:08Z
We propose an alternative formulation of Spearman's rho for zero-inflated count data. The formulation yields an estimator with explicitly attainable bounds, facilitating interpretation in settings where the standard range [-1,1] is no longer informative.
Published: 2025-03-17T13:19:22Z
Authors: Jasper Arends, Guanjie Lyu, Mhamed Mesfioui, Elisa Perrone, Julien Trufin

http://arxiv.org/abs/2603.16213v1
Equivalence testing with data-dependent and post-hoc equivalence margins
Updated: 2026-03-17T07:44:04Z
Equivalence testing compares the hypothesis that an effect $μ$ is large against the alternative that it is negligible.
Here, `large' is classically expressed as being larger than some `equivalence margin' $Δ$. A longstanding problem is that this margin must be specified but can rarely be objectively justified in practice. We lay the foundation for an alternative paradigm, arguing to instead report a data-dependent margin $\widehatΔ_α$ that bounds the true effect $μ$ with probability $1 - α$. Our key argument is that $\widehatΔ_α$ is more useful than a test outcome at a fixed margin $Δ$, as measured by the guarantees it offers to decision makers. We generalize this to a curve of margins $α\mapsto \widehatΔ_α$, uniformly valid under the post-hoc selection of the margin. These ideas rely on e-values, which we derive for models that are strictly totally positive of order 3, nesting the classical z-test and t-test settings.
Published: 2026-03-17T07:44:04Z
Authors: Stan Koobs, Nick W. Koning

http://arxiv.org/abs/2603.13784v2
Mixed difference integer-valued GARCH model for $\mathbb{Z}$-valued time series
Updated: 2026-03-17T03:35:15Z
In this paper, we introduce flexible observation-driven $\mathbb{Z}$-valued time series models constructed from mixtures of negative and non-negative components. Compared to models based on the standard Skellam distribution or on a difference of two integer-valued variables, our specification offers greater versatility. For example, it easily allows for skewness and bimodality. Furthermore, the observation of one component of the mixture makes interpretation and statistical analysis easier. We establish conditions for stationarity and mixing, and develop a mixed Poisson quasi-maximum likelihood estimator with proven asymptotic properties. A portmanteau test is proposed to diagnose residual serial dependence.
The finite-sample performance of the methodology is assessed via simulation, and an empirical application on tick prices demonstrates its practical usefulness.
Published: 2026-03-14T06:27:03Z
Comment: 61 pages, 8 figures
Authors: Abdelhakim Aknouche, Christian Francq, Yuichi Goto

http://arxiv.org/abs/2602.03999v2
Functional Stochastic Localization
Updated: 2026-03-17T02:02:28Z
Eldan's stochastic localization is a probabilistic construction that has proved instrumental to modern breakthroughs in high-dimensional geometry and the design of sampling algorithms. Motivated by sampling under non-Euclidean geometries and the mirror descent algorithm in optimization, we develop a functional generalization of Eldan's process that replaces Gaussian regularization with regularization by any positive integer multiple of a log-Laplace transform. We further give a mixing time bound on the Markov chain induced by our localization process, which holds if our target distribution satisfies a functional Poincaré inequality. Finally, we apply our framework to differentially private convex optimization in $\ell_p$ norms for $p \in [1, 2)$, where we improve state-of-the-art query complexities in a zeroth-order model.
Published: 2026-02-03T20:34:46Z
Comment: Comments welcome! v2 adds citations and fixes typos
Authors: Anming Gu, Bobby Shi, Kevin Tian

http://arxiv.org/abs/2603.16005v1
Breakdown properties of optimal transport maps: general transportation costs
Updated: 2026-03-16T23:27:32Z
Two recent works, Avella-Medina and González-Sanz (2026) and Passeggeri and Paindaveine (2026), studied the robustness of the optimal transport map through its breakdown point, i.e., the smallest fraction of contamination that can make the map take arbitrarily aberrant values. Their main finding is the following: let $P$ and $Q$ denote the target and reference measures, respectively, and let $T$ be the optimal transport map for the squared Euclidean cost. Then, the breakdown point of $T(u)$, when $P$ is perturbed and $Q$ is fixed, coincides with the Tukey depth of $u$ relative to $Q$.
In this note, we extend this result to general convex cost functions, demonstrating that the cost function does not have any impact on the breakdown point of the optimal transport map. Our contribution provides a definitive characterization of the breakdown point of the optimal transport map. In particular, it shows that for a broad class of regular cost functions, all transport-based quantiles enjoy the same high breakdown point properties.
Published: 2026-03-16T23:27:32Z
Authors: Alberto Gonzalez-Sanz, Marco Avella Medina

http://arxiv.org/abs/2601.01259v3
A Novel Multiple Imputation Approach For Parameter Estimation in Observation-Driven Time Series Models With Missing Data
Updated: 2026-03-16T19:59:13Z
Handling missing data in time series is a complex problem due to the presence of temporal dependence. General-purpose imputation methods, while widely used, often distort key statistical properties of the data, such as variance and dependence structure, leading to biased estimation and misleading inference. These issues become more pronounced in models that explicitly rely on capturing serial dependence, as standard imputation techniques fail to preserve the underlying dynamics. This paper proposes a novel multiple imputation method specifically designed for parameter estimation in observation-driven models (ODM). The approach takes advantage of the iterative nature of the systematic component in ODM to propagate the dependence structure through missing data, minimizing its impact on estimation. Unlike traditional imputation techniques, the proposed method accommodates continuous, discrete, and mixed-type data while preserving key distributional and dependence properties. We evaluate its performance through Monte Carlo simulations in the context of GARMA models, considering time series with up to 70\% missing data.
An application to the proportion of stocked energy stored in South Brazil further demonstrates its practical utility.
Published: 2026-01-03T19:00:02Z
Comment: This version presents the large sample theory for the proposed method, showing its strong consistency under mild assumptions, regardless of the amount of missing data or its generating mechanism
Authors: Guilherme Pumi, Taiane Schaedler Prass, Douglas Krauthein Verdum

http://arxiv.org/abs/2603.15817v1
On the Equivalence between Neyman Orthogonality and Pathwise Differentiability
Updated: 2026-03-16T18:48:55Z
It has been frequently observed that Neyman orthogonality, the central device underlying double/debiased machine learning (Chernozhukov et al., 2018), and pathwise differentiability, a cornerstone concept from semiparametric theory, often lead to the same debiased estimators in practice. Despite the widespread adoption of both ideas, the precise nature of this equivalence has remained elusive, with the two concepts having been developed in largely separate traditions. In this work, we revisit the semiparametric framework of van der Laan and Robins (2003) and identify an implicit regularity assumption on the relationship between target and nuisance parameters -- a local product structure -- that allows us to establish a formal equivalence between Neyman orthogonality and pathwise differentiability. We demonstrate that the two directions of this equivalence impose fundamentally different structural requirements, and illustrate the theory through a concrete example of estimating the average treatment effect. This helps clarify the relationship between these two foundational frameworks and provides a useful reference for practitioners working at their intersection.
Published: 2026-03-16T18:48:55Z
Authors: Yuxi Chen, Edward H.
Kennedy, Sivaraman Balakrishnan

http://arxiv.org/abs/2603.15785v1
On the Uniqueness of Fréchet Means for Polytope Norms
Updated: 2026-03-16T18:12:20Z
Fréchet means are a popular type of average for non-Euclidean datasets, defined as those points which minimise the average squared distance to a set of data points. We consider the behaviour of sample Fréchet means on normed spaces whose unit ball is a polytope; this setting is rarely covered by existing literature on Fréchet means, which focuses on smooth spaces or spaces with bounded curvature. We study the geometry of the set of Fréchet means over polytope normed spaces, with a focus on dimension and probabilistic conditions for uniqueness. In particular, we provide a geometric characterisation of the threshold sample size at which Fréchet means have a positive probability of being unique, and we prove that this threshold is at most one more than the dimension of our space. We are able to use this geometric characterisation to compute the unique Fréchet mean sample threshold in the case of the $\ell_\infty$ and $\ell_1$ norms.
Published: 2026-03-16T18:12:20Z
Comment: 28 pages, 1 figure
Authors: Roan Talbut, Andrew McCormack, Anthea Monod

http://arxiv.org/abs/2501.15926v3
Minimax convergence rates of a binary classification procedure for time-homogeneous SDE paths
Updated: 2026-03-16T17:36:53Z
In the context of binary classification of trajectories generated by time-homogeneous stochastic differential equations, we consider a mixture of two diffusion processes characterized by a stochastic differential equation (SDE) whose drift coefficient depends on the class and whose diffusion coefficient is independent of the class. We assume that the drift and diffusion coefficients are unknown as well as the law of the discrete random variable that models the class. In this paper, we study the minimax convergence rates for the excess risk of the resulting plug-in classifier under different sets of assumptions on the diffusion model.
As the plug-in classifier is based on nonparametric estimators of drift and diffusion coefficients, we establish rates of convergence for projection estimators of drift coefficients on the real line. We propose a new methodology for the study of the lower bound on the excess risk. The theoretical study is completed with a numerical experiment over simulated data.
Published: 2025-01-27T10:24:21Z
Comment: 39 pages
Authors: Eddy Michel Ella Mintsa