https://arxiv.org/api/QC0YygZVZfYkBW/amNe8svNEqUA 2026-03-20T12:40:31Z 27242 30 15 http://arxiv.org/abs/2601.00987v2 Tessellation Localized Transfer learning for nonparametric regression 2026-03-17T21:39:16Z Transfer learning aims to improve performance on a target task by leveraging information from related source tasks. We propose a nonparametric regression transfer learning framework that explicitly models heterogeneity in the source-target relationship. Our approach relies on a local transfer assumption: the covariate space is partitioned into finitely many cells such that, within each cell, the target regression function can be expressed as a low-complexity transformation of the source regression function. This localized structure enables effective transfer where similarity is present while limiting negative transfer elsewhere. We introduce estimators that jointly learn the local transfer functions and the target regression, together with fully data-driven procedures that adapt to unknown partition structure and transfer strength. We establish sharp minimax rates for target regression estimation, showing that local transfer can mitigate the curse of dimensionality by exploiting reduced functional complexity. Our theoretical guarantees take the form of oracle inequalities that decompose excess risk into estimation and approximation terms, ensuring robustness to model misspecification. Numerical experiments illustrate the benefits of the proposed approach. 2026-01-02T20:58:05Z 57 pages, 2 figures Hélène Halconruy Benjamin Bobbia Paul Lejamtel http://arxiv.org/abs/2603.17142v1 Identifiability and Estimation in Continuous Lyapunov Models 2026-03-17T21:14:37Z Cross-sectional observations from a dynamical system can be modeled via steady-state distributions of Markov processes. The major challenge is then to determine whether the process parameters can be identified and estimated from the steady-state distributions. 
We study this problem for continuous Lyapunov models that arise as steady-state distributions of the solution to a multivariate stochastic differential equation, whose linear drift matrix is parametrized by a directed graph. We derive equations for the cumulant tensors of any order for this distribution, which generalize the well-known covariance Lyapunov equation. Under a non-Gaussianity assumption we prove generic identifiability of the drift matrix for any connected graph using the equations for the higher-order cumulants. Based on the identifiability result, we propose a new semiparametric estimator of the drift matrix, and we derive its asymptotic distribution. A simulation study demonstrates the asymptotic validity of the estimator but shows that it is only accurate for relatively large sample sizes, illustrating the hardness of the unconstrained estimation problem. 2026-03-17T21:14:37Z 41 pages Cecilie Olesen Recke Niels Richard Hansen http://arxiv.org/abs/2509.11381v2 The Honest Truth About Causal Trees: Accuracy Limits for Heterogeneous Treatment Effect Estimation 2026-03-17T19:59:56Z Recursive decision trees are widely used to estimate heterogeneous causal treatment effects in experimental and observational studies. These methods are typically implemented using CART-type recursive partitioning and are often viewed as adaptive procedures capable of discovering treatment effect heterogeneity in high-dimensional settings. We study causal tree estimators based on adaptive recursive partitioning and establish lower bounds on their estimation accuracy. Under basic conditions, we show that causal trees constructed via standard CART-type splitting rules cannot achieve polynomial-in-$n$ convergence rates in the uniform norm (where $n$ denotes the sample size). 
The underlying mechanism is that greedy recursive partitioning selects highly imbalanced splits with non-vanishing probability, producing terminal nodes containing very few observations and leading to large estimation variance. We further show that sample splitting (``honesty'') yields at most negligible improvements in convergence rates. As a consequence, causal tree estimators may converge arbitrarily slowly and can even be inconsistent in some settings. Our results also clarify the role of balanced partition assumptions in existing theoretical guarantees for causal forests and related ensemble methods. The analysis develops new probabilistic tools for studying adaptive recursive partitioning procedures, including non-asymptotic approximations for suprema of partial sums and Gaussian processes. As a technical by-product, we also identify and correct an error in Eicker (1979). 2025-09-14T18:29:45Z Matias D. Cattaneo Jason M. Klusowski Ruiqi Rae Yu http://arxiv.org/abs/2603.10272v2 An operator-level ARCH Model 2026-03-17T18:01:54Z AutoRegressive Conditional Heteroscedasticity (ARCH) models are standard for modeling time series exhibiting volatility, with a rich literature in univariate and multivariate settings. In recent years, these models have been extended to function spaces. However, functional ARCH and generalized ARCH (GARCH) processes established in the literature have thus far been restricted to model ``pointwise'' variances. In this paper, we propose a new ARCH framework for data residing in general separable Hilbert spaces that accounts for the full evolution of the conditional covariance operator. We define a general operator-level ARCH model. For a simplified Constant Conditional Correlation version of the model, we establish conditions under which such models admit strictly and weakly stationary solutions, finite moments, and weak serial dependence. 
Additionally, we derive consistent Yule--Walker-type estimators of the infinite-dimensional model parameters. The practical relevance of the model is illustrated through simulations and a data application to high-frequency cumulative intraday returns. 2026-03-10T23:04:20Z 48 pages, 8 Figures, 2 Tables Alexander Aue Sebastian Kühnert Gregory Rice Jeremy VanderDoes http://arxiv.org/abs/2603.16833v1 Semiparametric Inference under Dual Positivity Boundaries: Nested Identification with Administrative Censoring and Confounded Treatment 2026-03-17T17:39:24Z When a long-term outcome is administratively censored for a substantial fraction of a study cohort while a short-term intermediate variable remains broadly available, the target causal parameter can be identified through a nested functional that integrates the outcome regression over the conditional intermediate distribution, avoiding inverse censoring weights entirely. In observational studies where treatment is also confounded, this nested identification creates a semiparametric structure with two distinct positivity boundaries -- one from the censoring mechanism and one from the treatment assignment -- that enter the efficient influence function in fundamentally different roles. The censoring boundary is removed from the identification by the nested functional but remains in the efficient score; the treatment boundary appears in both. We develop the inference theory for this dual-boundary structure. Three results are established. 2026-03-17T17:39:24Z RWD analysis is still pending, in that the section 7 is empty for now Lin Li http://arxiv.org/abs/2603.16829v1 Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing 2026-03-17T17:35:32Z Beyond conditional average treatment effects, treatments may impact the entire outcome distribution in covariate-dependent ways, for example, by altering the variance or tail risks for specific subpopulations. 
We propose a novel estimand to capture such conditional distributional treatment effects, and develop a doubly robust estimator that is minimax optimal in the local asymptotic sense. Using this, we develop a test for the global homogeneity of conditional potential outcome distributions that accommodates discrepancies beyond the maximum mean discrepancy (MMD), has provably valid type 1 error, and is consistent against fixed alternatives -- the first test, to our knowledge, with such guarantees in this setting. Furthermore, we derive exact closed-form expressions for two natural discrepancies (including the MMD), and provide a computationally efficient, permutation-free algorithm for our test. 2026-03-17T17:35:32Z Saksham Jain Alex Luedtke http://arxiv.org/abs/2603.16798v1 High-Dimensional Gaussian Mean Estimation under Realizable Contamination 2026-03-17T17:04:18Z We study mean estimation for a Gaussian distribution with identity covariance in $\mathbb{R}^d$ under a missing data scheme termed the realizable $ε$-contamination model. In this model an adversary can choose a function $r(x)$ between 0 and $ε$ and each sample $x$ goes missing with probability $r(x)$. Recent work (Ma et al., 2024) proposed this model as an intermediate-strength setting between Missing Completely At Random (MCAR) -- where missingness is independent of the data -- and Missing Not At Random (MNAR) -- where missingness may depend arbitrarily on the sample values and can lead to non-identifiability issues. That work established information-theoretic upper and lower bounds for mean estimation in the realizable contamination model. Their proposed estimators incur runtime exponential in the dimension, leaving open the possibility of computationally efficient algorithms in high dimensions. 
In this work, we establish an information-computation gap in the Statistical Query model (and, as a corollary, for Low-Degree Polynomials and PTF tests), showing that algorithms must either use substantially more samples than information-theoretically necessary or incur exponential runtime. We complement our SQ lower bound with an algorithm whose sample-time tradeoff nearly matches our lower bound. Together, these results qualitatively characterize the complexity of Gaussian mean estimation under $ε$-realizable contamination. 2026-03-17T17:04:18Z Ilias Diakonikolas Daniel M. Kane Thanasis Pittas http://arxiv.org/abs/2603.16785v1 Local asymptotic normality for mixed fractional Ornstein-Uhlenbeck process under high-frequency observation 2026-03-17T16:59:22Z This paper considers the local asymptotic normality (LAN) property for the mixed fractional Ornstein-Uhlenbeck process under high-frequency observation when $H > 3/4$. As in the case of mixed fractional Brownian motion, we use a projection step to obtain a non-diagonal rate matrix. 2026-03-17T16:59:22Z Chunhao Cai Yiwu Shang Cong Zhang http://arxiv.org/abs/2602.16933v2 M-estimation under Two-Phase Multiwave Sampling with Applications to Prediction-Powered Inference 2026-03-17T16:59:02Z In two-phase multiwave sampling, inexpensive measurements are collected on a large sample and expensive, more informative measurements are adaptively obtained on subsets of units across multiple waves. Adaptively collecting the expensive measurements can increase efficiency but complicates statistical inference. We give valid estimators and confidence intervals for M-estimation under adaptive two-phase multiwave sampling. We focus on the case where proxies for the expensive variables -- such as predictions from pretrained machine learning models -- are available for all units and propose a Multiwave Predict-Then-Debias estimator that combines proxy information with the expensive, higher-quality measurements to improve efficiency while removing bias. 
We establish asymptotic linearity and normality and propose asymptotically valid confidence intervals. We also develop an approximately greedy sampling strategy that improves efficiency relative to uniform sampling. Data-based simulation studies support the theoretical results and demonstrate efficiency gains. 2026-02-18T22:54:32Z Dan M. Kluger Stephen Bates http://arxiv.org/abs/2603.16712v1 High-dimensional estimation with missing data: Statistical and computational limits 2026-03-17T16:02:41Z We consider computationally-efficient estimation of population parameters when observations are subject to missing data. In particular, we consider estimation under the realizable contamination model of missing data in which an $ε$ fraction of the observations are subject to an arbitrary (and unknown) missing not at random (MNAR) mechanism. When the true data is Gaussian, we provide evidence towards statistical-computational gaps in several problems. For mean estimation in $\ell_2$ norm, we show that in order to obtain error at most $ρ$, for any constant contamination $ε\in (0, 1)$, (roughly) $n \gtrsim d e^{1/ρ^2}$ samples are necessary and that there is a computationally-inefficient algorithm which achieves this error. On the other hand, we show that any computationally-efficient method within certain popular families of algorithms requires a much larger sample complexity of (roughly) $n \gtrsim d^{1/ρ^2}$ and that there exists a polynomial time algorithm based on sum-of-squares which (nearly) achieves this lower bound. For covariance estimation in relative operator norm, we show that a parallel development holds. Finally, we turn to linear regression with missing observations and show that such a gap does not persist. Indeed, in this setting we show that minimizing a simple, strongly convex empirical risk nearly achieves the information-theoretic lower bound in polynomial time. 
2026-03-17T16:02:41Z Kabir Aladin Verchand Ankit Pensia Saminul Haque Rohith Kuditipudi http://arxiv.org/abs/2504.09347v4 Inference for Deep Neural Network Estimators in Generalized Nonparametric Models 2026-03-17T15:56:33Z While deep neural networks (DNNs) are used for prediction, inference on DNN-estimated subject-specific means for categorical or exponential family outcomes remains underexplored. We address this by proposing a DNN estimator under generalized nonparametric regression models (GNRMs) and developing a rigorous inference framework. Unlike existing approaches that assume independence between estimation errors and inputs to establish the error bound, a condition often violated in GNRMs, we allow for dependence and our theoretical analysis demonstrates the feasibility of drawing inference under GNRMs. To implement inference, we consider an Ensemble Subsampling Method (ESM) that leverages U-statistics and the Hoeffding decomposition to construct reliable confidence intervals for DNN estimates. We show that, under GNRM settings, ESM enables model-free variance estimation and accounts for heterogeneity among individuals in the population. Through simulations under nonparametric logistic, Poisson, and binomial regression models, we demonstrate the effectiveness and efficiency of our method. We further apply the method to the electronic Intensive Care Unit (eICU) dataset, a large scale collection of anonymized health records from ICU patients, to predict ICU readmission risk and offer patient-centric insights for clinical decision making. 
2025-04-12T21:32:42Z 91 pages, 14 figures, 20 tables Xuran Meng Yi Li http://arxiv.org/abs/2505.09647v2 On Unbiased Low-Rank Approximation with Minimum Distortion 2026-03-17T15:37:09Z We describe an algorithm for sampling a low-rank random matrix $Q$ that best approximates a fixed target matrix $P\in\mathbb{C}^{n\times m}$ in the following sense: $Q$ is unbiased, i.e., $\mathbb{E}[Q] = P$; $\mathsf{rank}(Q)\leq r$; and $Q$ minimizes the expected Frobenius norm error $\mathbb{E}\|P-Q\|_F^2$. Our algorithm mirrors the solution to the efficient unbiased sparsification problem for vectors, except applied to the singular components of the matrix $P$. Optimality is proven by showing that our algorithm matches the error from an existing lower bound. 2025-05-12T20:52:28Z Leighton Pate Barnes Stephen Cameron Benjamin Howard http://arxiv.org/abs/2503.14978v2 Inferring diffusivity from killed diffusion 2026-03-17T14:00:23Z We consider diffusion of independent molecules in an insulated Euclidean domain with unknown diffusivity parameter. At a random time and position, the molecules may bind and stop diffusing in dependence of a given `binding potential'. The binding process can be modeled by an additive random functional corresponding to the canonical construction of a `killed' diffusion Markov process. We study the problem of conducting inference on the infinite-dimensional diffusion parameter from a histogram plot of the `killing' positions of the process. We show first that these positions follow a Poisson point process whose intensity measure is determined by the solution of a certain Schrödinger equation. The inference problem can then be re-cast as a non-linear inverse problem for this PDE, which we show to be consistently solvable in a Bayesian way under natural conditions on the initial state of the diffusion, provided the binding potential is not too `aggressive'. 
In the course of our proofs we obtain novel posterior contraction rate results for high-dimensional Poisson count data that are of independent interest. A numerical illustration of the algorithm by standard MCMC methods is also provided. 2025-03-19T08:16:16Z 33 pages, to appear in the Annals of Statistics Richard Nickl Fanny Seizilles http://arxiv.org/abs/2509.09965v2 Confidence Intervals for Extinction Risk: Validating Population Viability Analysis with Limited Data 2026-03-17T13:08:53Z Quantitative assessment of extinction risk requires confidence intervals (CIs) that remain informative with limited data. Their usefulness has long been debated because short observation spans can make uncertainty so large that population viability analysis appears impractical. I derive new CIs for extinction probability under the drift-Wiener process, a canonical model of extinction dynamics, by introducing transformed parameters $w$ and $z$ whose maximum-likelihood estimators follow noncentral $t$ distributions. The resulting $w$-$z$ method yields CIs with coverage close to the nominal level and shows that precision depends not only on data length but also on effect size: extinction probabilities that are sufficiently low or high can often be estimated reliably even from limited time series. I also propose an observation-error-and-autocovariance-robust (OEAR) estimator for settings with additive observation error and short-run dependence. Applied to two 64-year national harvest indices for Japanese eel (Anguilla japonica), the method gives Criterion E extinction probabilities far below the IUCN threatened-category thresholds, with narrow CIs, despite the species being listed as Endangered under Criterion A. These results show that extinction-risk CIs can be both statistically rigorous and practically informative for conservation assessment under limited data. 
2025-09-12T04:51:42Z 151 pages, 32 figures, 30 tables Hiroshi Hakoyama 10.1111/2041-210X.70294 http://arxiv.org/abs/2506.23213v3 Nuisance parameters and elliptically symmetric distributions: a geometric approach to parametric and semiparametric efficiency 2026-03-17T13:03:56Z Elliptically symmetric distributions are a classic example of a semiparametric model where the location vector and the scatter matrix (or a parameterization of them) are the two finite-dimensional parameters of interest, while the density generator represents an \textit{infinite-dimensional nuisance} term. This basic representation of the elliptic model can be made more accurate, rich, and flexible by considering additional \textit{finite-dimensional nuisance} parameters. Our aim is therefore to investigate the deep and counter-intuitive links between statistical efficiency in estimating the parameters of interest in the presence of both finite and infinite-dimensional nuisance parameters. Previous seminal works have addressed this problem by leveraging a general result: if the statistical model has a specific group invariance, then the projection operator onto the semiparametric nuisance tangent space can be asymptotically expressed as a conditional expectation with respect to the maximal invariant sub-$σ$ algebra. In this article, we show that, for the statistical model of elliptical distributions, the projection operator can be explicitly computed without relying on the above-mentioned asymptotic approximation. This allows us to obtain original results also for the case in which the location vector and the scatter matrix are parameterized by a finite-dimensional vector that can be partitioned in two sub-vectors: one containing the parameters of interest and the other containing the nuisance parameters. As an example, we illustrate how the obtained results can be applied to the well-known ``low-rank'' parameterization. 
Furthermore, while the theoretical analysis will be developed for Real Elliptically Symmetric (RES) distributions, we show how to extend our results to the case of Circular and Non-Circular Complex Elliptically Symmetric (C-CES and NC-CES) distributions. 2025-06-29T12:50:21Z Stefano Fortunati Jean-Pierre Delmas Esa Ollila