https://arxiv.org/api/A9HsuRVNL2jss/CwSO8lN8DLi7Y
2026-06-11T10:14:23Z
3614651015

http://arxiv.org/abs/2605.23419v2
Generalized Stochastic Approximation of the Log-Likelihood Ratio for Robust Sequential Change-Point Detection
2026-05-27T14:10:33Z
Sequential change-point detection in non-Gaussian stochastic processes is challenging because the underlying densities are rarely known in real time. Classical parametric procedures such as CUSUM lose optimality under distributional mismatch, whereas nonparametric alternatives often react slowly. We develop a unified framework that approximates the log-likelihood ratio (LLR) on a generalized stochastic basis -- polynomial, logarithmic, or fractional-power -- using only moments up to order 3s, with no analytic form of the distribution, and thereby adapts the classical CUSUM, GRSh, and SRP procedures to non-Gaussian data. The convergence functional J(s) = K^T Y is interpreted as the projection of the Kullback-Leibler divergence onto the basis span, yielding a formal criterion for selecting the approximation order. We target the regime of small relative change-points, where the signal energy changes little but the shape of the distribution -- tail structure and modality -- does. A robust threshold follows from Kunchenko's probability-error bound (KU-PE), which controls the false-alarm rate without empirical tuning. On nine public benchmarks across four domains, the method is, to our knowledge, the only one operative on extremely heavy-tailed data (excess kurtosis gamma_4 > 20), where classical methods produce 100% false alarms, while reducing the detection delay at a guaranteed false-alarm level. The core theorems are formally verified in Lean 4.
2026-05-22T09:28:42Z
68 pages, 7 figures.
Companion code, Monte Carlo experiments, and Lean 4 formal proofs of the core theorems: https://github.com/SZabolotnii/KuYuPe-Change_Point-code-supplement
Serhii Zabolotnii

http://arxiv.org/abs/2605.28471v1
The Modified Egger Intercept Tests for Detecting Horizontal Pleiotropy in Two-Sample Summary-Data Mendelian Randomization
2026-05-27T13:36:37Z
The Egger intercept (EI) test is a widely used tool to detect horizontal pleiotropy in two-sample summary-data Mendelian randomization. A significant EI test suggests that either the average pleiotropic effect differs from zero (i.e., directional pleiotropy) or the InSIDE (Instrument Strength Independent of Direct Effect) assumption is violated (i.e., correlated pleiotropy) or both. As such, the EI test provides an assessment of the validity of the instrumental variable assumptions, with a non-zero EI indicating that the commonly used inverse-variance weighted (IVW) estimator will be biased. However, the EI test may exhibit inaccurate type one error rates due to biased estimation in Egger regression caused by the measurement error and winner's curse. In this article, we propose a modified EI (MEI) test based on a bias-corrected EI estimator under the null hypothesis of no directional or correlated pleiotropy, leveraging the recently developed rerandomized IVW estimator. We then prove the asymptotic properties of the MEI test under realistic conditions. Like the EI test, we find that the power of the MEI test is also affected by the orientation of SNPs. To enhance the robustness of power, we further combine the MEI test statistics obtained under two specific allele coding schemes.
Both simulation and real data studies show that the combined test outperforms the EI test in terms of type one error control and power.
2026-05-27T13:36:37Z
Yilei Ma, Youpeng Su, Xin Liu, Xuanye Cui, Ping Yin, Peng Wang

http://arxiv.org/abs/2404.11150v2
Automated, efficient and model-free inference for randomized clinical trials via data-driven covariate adjustment
2026-05-27T12:24:54Z
In 2023, the U.S. Food and Drug Administration issued guidance for adjustment of covariates in randomized clinical trials, emphasizing its role in enhancing precision and power through prognostic baseline variables. Despite its potential, many trials underutilize this method partly due to challenges in pre-specifying optimal baseline covariates and their functional forms.
We explore the potential of automated, data-adaptive methods -- including stepwise regression, Lasso and flexible machine learning algorithms -- for covariate adjustment, addressing the challenge of pre-specification. Our approach ensures valid and interpretable treatment effect estimates and standard errors, even when outcome models are misspecified or biased outcome predictions are used. This differs from most competing methods, which assume correctly specified models for consistent standard errors. Our estimators require cross-fitting for reliable standard error estimation, though it can be omitted when variable selection is used, provided the outcome model satisfies an ultra-sparsity assumption. As such, we arrive at simple estimators and standard errors for marginal treatment effects in randomized clinical trials (or similar studies like A/B-testing), exploiting data-adaptive predictions from prognostic baseline covariates, with little (or no) bias in finite samples even when predictions are biased.
Empirical and methodological results demonstrate promise of automated covariate adjustment for improving statistical power of trial analyses.
2024-04-17T08:01:15Z
Kelly Van Lancker, Iván Díaz, Stijn Vansteelandt

http://arxiv.org/abs/2605.28339v1
From nonstationarity to stationarity via $1/f$ noise: discrete Fourier transforms and sample mean asymptotics for testing
2026-05-27T11:43:27Z
We study the asymptotic behaviour of different statistics for time series exhibiting long memory and nonstationarity. For processes with memory parameter $d\in(-1/2,3/2)$, we derive the joint limiting distribution of discrete Fourier transforms at a fixed number of Fourier frequencies, with a unified normalization. The resulting limits are Gaussian with an explicit covariance structure. Particular attention is given to the boundary case $d=1/2$, also known as $1/f$ noise. We show that logarithmic corrections yield nondegenerate limits for sample mean and sample variance leading to explicit asymptotic distributions of $χ^2$ type. We construct a statistic that combines the sample mean, the sample variance, and low-frequency periodogram ordinates, designed so that, at the boundary case $(d=1/2)$, it admits a tractable limit distribution.
These results are applied to construct a consistent parameter-free test of nonstationarity against long memory stationarity.
2026-05-27T11:43:27Z
Mohamedou Ould Haye, Anne Philippe

http://arxiv.org/abs/2602.14862v2
The Well-Tempered Classifier: Some Elementary Properties of Temperature Scaling
2026-05-27T11:05:26Z
Temperature scaling is a simple method that allows to control the uncertainty of probabilistic models. It is mostly used in two contexts: improving the calibration of classifiers and tuning the stochasticity of large language models (LLMs). In both cases, temperature scaling is the most popular method for the job. Despite its popularity, a rigorous theoretical analysis of the properties of temperature scaling has remained elusive. We investigate here some of these properties. For classification, we show that increasing the temperature increases the uncertainty in the model in a very general sense (and in particular increases its entropy). However, for LLMs, we challenge the common claim that increasing temperature increases diversity. Furthermore, we introduce two new characterisations of temperature scaling. The first one is geometric: the tempered model is shown to be the information projection of the original model onto the set of models with a given entropy. The second characterisation clarifies the role of temperature scaling as a submodel of more general linear scalers such as matrix scaling and Dirichlet calibration: we show that temperature scaling is the only linear scaler that does not change the hard predictions of the model.
2026-02-16T15:54:52Z
Pierre-Alexandre Mattei, Bruno Loureiro

http://arxiv.org/abs/2605.28269v1
Dynamic Topic Modeling with a Higher-Order Hypergraphical Representation
2026-05-27T10:16:05Z
Dynamic topic modeling is widely used to analyze evolving trends in scientific literature, medical records, and social media.
Traditional topic models represent each topic through a single probability vector on the multinomial simplex and implicitly couple word occurrence and repetition within one probabilistic mechanism. However, this formulation restricts the dependence structure among words and overlooks informative higher-order interactions, particularly in dynamic corpora with overlapping semantics. To address these limitations, we introduce a hypergraph representation of text where each document is modeled as a hyperedge connecting all co-occurring words, with repetition intensities encoded as node weights. This representation naturally separates word occurrence from repetition and induces a novel hypergraph-based multinomial distribution with a nonlinear normalization depending on the observed word set of each document. Building on this likelihood, we develop a dynamic topic modeling framework via structured low-rank factorizations with explicit temporal regularization on topic-word profiles. Moreover, we establish local convergence guarantees and derive non-asymptotic error bounds despite the intrinsic nonconvexity induced by bilinear factorization and document-specific nonlinear normalization. Numerical experiments on synthetic data and an application to the International Conference on Learning Representations (ICLR) corpus demonstrate consistent improvements over existing multinomial-based topic models.
2026-05-27T10:16:05Z
34 pages, 4 figures
Hanjia Gao, Hanwen Ye, Qing Nie, Annie Qu

http://arxiv.org/abs/2605.28105v1
Identifying Direct Causal Effects in Latent Factor Models by Accounting for Unidentified Parents
2026-05-27T07:58:22Z
We consider linear structural equation models with explicitly modelled latent variables.
In such models, observed and latent variables solve linear equations including stochastic noise terms. The goal of our work is to identify the direct causal effects between the observed variables of interest by providing (rational) formulas in the observed covariances. Most prior identification approaches operate in the latent projection framework, where latent variables are projected away into dependent error terms. However, when the observed variables are densely confounded, even if only by a few latent variables, the projection-based approaches are unable to certify identifiability of most effects. For such problems, approaches that explicitly use the latent variables are more effective, but algorithms that were recently proposed for this purpose often remain inconclusive for denser causal graphs. We develop a new identification criterion that is able to better handle dense graphs by leveraging the key insight that recursive identification schemes can be generalized by explicitly accounting for causal parents with (yet) unidentified direct effects. Combinatorial search problems in our new criterion can be tackled with the help of network-flow computations, leading to a practical useful algorithmic tool that we also make available in software.
2026-05-27T07:58:22Z
48 pages, 4 tables, 7 figures
Tom Hochsprung, Nils Sturma, Jakob Runge, Mathias Drton, Andreas Gerhardus

http://arxiv.org/abs/2605.28099v1
A computationally-tractable measure of global sensitivity for sampling-based Bayesian inference
2026-05-27T07:53:44Z
Bayesian inference can often be sensitive to the choice of hyperparameters of the prior or likelihood, yet defining and quantifying this sensitivity in a principled and computationally feasible way remains challenging in practice. Unfortunately, existing sensitivity methods are rarely applicable in modern Bayesian workflows due to their high computational cost and poor performance in moderate to high dimensions.
To address these limitations, we introduce a new approach to global sensitivity analysis based on the Fisher divergence. Our method only requires a set of samples from a reference posterior and the ability to evaluate score functions, making it broadly computationally tractable. Under mild regularity conditions, it controls changes in the whole posterior, and provides a bound on the impact of perturbations on the first two moments. We demonstrate these strengths on challenging Bayesian inference problems which are practically out of reach of existing approaches, including generalised Bayesian inference for unnormalised models, inference in Bayesian models of time series, and neural simulation-based inference.
2026-05-27T07:53:44Z
Arina Odnoblyudova, Charita Dellaporta, François-Xavier Briol

http://arxiv.org/abs/2503.17531v3
Bayesian Latent Class Regression with Interpretable Binary Profiles
2026-05-27T07:27:59Z
High-dimensional categorical data arise in diverse scientific domains and are often accompanied by covariates. Latent class regression models are routinely used in such settings, reducing dimensionality by assuming conditional independence of the categorical variables given a single latent class that depends on covariates through a logistic regression model. However, such methods become unreliable as the dimensionality increases. To address this, we propose Bayesian latent class regression with interpretable binary profiles (BLIP), a flexible family of models that introduces a binary latent-attribute layer between the covariate-dependent latent class and the observed categorical responses. BLIP satisfies key theoretical properties, including identifiability and posterior consistency, and we establish a Bayes oracle clustering property that ensures robustness against the curse of dimensionality.
We develop efficient posterior computation methods, validate them through simulation studies, and use BLIP to infer regions of common profile in ecological data.
2025-03-21T20:25:38Z
Yuren Zhou, Yuqi Gu, David B. Dunson

http://arxiv.org/abs/2409.19712v2
Posterior Conformal Prediction
2026-05-27T06:52:51Z
Conformal prediction is a popular technique for constructing prediction intervals with distribution-free coverage guarantees. The coverage is marginal, meaning it only holds on average over the entire population but not necessarily for any specific subgroup. This article introduces posterior conformal prediction (PCP), which generates prediction intervals with both marginal and approximate conditional validity for clusters (or subgroups) naturally discovered in the data. PCP achieves these guarantees by modelling the conditional nonconformity score distribution as a mixture of cluster distributions. Compared to other methods with approximate conditional validity, this approach produces tighter intervals, particularly when the test data is drawn from clusters that are well represented in the validation data. PCP can also be applied to guarantee conditional coverage on user-specified subgroups, in which case it further ensures coverage for underrepresented individuals in each subgroup. When the response variable is categorical, PCP can adjust the coverage level based on the classifier's predictive probabilities, yielding low-cardinality prediction sets if the classifier is well calibrated. We demonstrate enhanced performance on datasets from socioeconomics, materials science, and healthcare.
2024-09-29T14:09:07Z
67 pages, 17 figures
Yao Zhang, Emmanuel J. 
Candès

http://arxiv.org/abs/2603.08276v2
A Unified Framework for Density Estimation under Right-Censored Point-Centred Quarter Sampling
2026-05-27T06:24:13Z
While the point-centred quarter method (PCQM) is widely used for density estimation, existing methods for handling right-censored data from truncated search radii rely primarily on a Poisson model assuming complete spatial randomness (CSR), leaving a critical gap for spatially aggregated populations. To address this limitation, we develop a unified likelihood- and moment-based framework for right-censored point-centred quarter sampling under both Poisson and negative binomial distribution (NBD) models. In particular, the proposed NBD-based estimators explicitly account for spatial aggregation and censoring simultaneously, extending distance-based inference beyond the CSR setting. Extensive simulations and applications to fully mapped forest plots reveal that the NBD-based MLE delivers the most robust overall performance across diverse ecological scenarios. Across more than 100 species from fully mapped forest plots, the proposed NBD-based MLE approximately reduced absolute relative bias by a median of 0.10 compared with existing censored estimators, representing a relative improvement of over 30%. Ultimately, our framework provides a rigorously validated and practically useful toolkit for analysing censored point-to-tree distance data.
2026-03-09T11:47:55Z
42 pages, 28 figures, 4 table
Wenzhe Huang, Guochun Shen, Dingliang Xing, Jiangyan Zhao

http://arxiv.org/abs/2605.27967v1
Multi-Teacher Knowledge Distillation via Teacher-Informed Mixture Priors
2026-05-27T05:03:24Z
Knowledge distillation is a powerful method for model compression, enabling the efficient deployment of complex deep learning models (teachers), including large language models. However, its underlying statistical mechanisms remain unclear, and uncertainty evaluation is often overlooked, especially in real-world scenarios requiring diverse teacher expertise.
To address these challenges, we introduce \textit{Multi-Teacher Bayesian Knowledge Distillation} (MT-BKD), where a distilled student model learns from multiple teachers within the Bayesian framework. Our approach leverages Bayesian inference to capture inherent uncertainty in the distillation process. We introduce a teacher-informed prior, integrating external knowledge from teacher models and task-specific training data, offering better generalization, robustness, and scalability. Additionally, an entropy-based weighting mechanism adaptively adjusts each teacher's influence, allowing the student to combine multiple sources of expertise effectively. MT-BKD enhances the interpretability of the student model's learning process, improves predictive accuracy, and provides uncertainty quantification. We validate MT-BKD on both synthetic and real-world tasks, including protein subcellular location prediction and image classification. Our experiments show improved performance and robust uncertainty quantification, highlighting the strengths of our MT-BKD framework.
2026-05-27T05:03:24Z
Luyang Fang, Yongkai Chen, Jiazhang Cai, Ping Ma, Wenxuan Zhong

http://arxiv.org/abs/2605.27925v1
Finite-size occupancy scaling of apparent fractal dimensions in stochastic trajectories
2026-05-27T03:56:52Z
Estimating a fractal dimension from a finite stochastic trajectory is a finite-size scaling problem: the apparent box-counting exponent is shaped by an occupancy crossover between the resolved range of scales and the finite number of sampled points, and need not equal the dimension of the limiting process. We model this crossover with a balls-in-boxes occupancy law, which predicts the box-count curve, the finite-size saturation scale, and a scaling function for the normalized local slope.
Across random-walk traces, fractional Brownian graphs, and Levy flights, the normalized local slope collapses onto a single crossover curve, while the windowed box-counting bias collapses when the regression window is positioned relative to the saturation scale. Inverting the occupancy model gives a finite-size bias correction that reduces error on controlled stochastic trajectories and transfers across held-out model classes. Comparisons with correlation dimension, detrended fluctuation analysis, the variogram, and Higuchi's method show that the dominant bias is specific to point-sampled box-counting over finite scale windows, and that local-slope stability alone is not a reliable diagnostic. A DNA-walk example illustrates the workflow on measured data, and all figures, tables, and in-text numbers are regenerated from released single-seed code.
2026-05-27T03:56:52Z
Main text: 30 pages, 5 figures; supplementary material included
Bon A. Koo (University of Pennsylvania), Edward Ju (California Institute of Technology)

http://arxiv.org/abs/2603.19745v3
Invariant quantile regression for heterogeneous environments
2026-05-27T02:25:45Z
In this paper, we propose an invariant quantile regression (IQR) framework specifically designed for multi-environment datasets, which captures the invariance across different environments. This framework is closely related to transfer learning, causal inference, and fair machine learning, and is motivated by scenarios in which the conditional probability of the response given covariates varies, while certain key variables remain invariant. This perspective differs notably from previous works that restrict attention to the conditional mean, which is often insufficient to capture the full causal relationships between covariates and the response in heterogeneous environments.
In contrast, quantile-based invariance naturally accommodates heterogeneity, and aligns more closely with structural causal models, in which variables invariant across environments at one or multiple quantile levels directly indicate potential and stable causal variables. Moreover, we show that IQR may yield a larger set of endogenous variables compared to the conditional mean framework, which in turn promotes more effective exclusion of spurious (non-causal) variables. To achieve this, we introduce a Kernel-Smoothed Invariant Quantile Regression (KS-IQR) estimator, which leverages the underlying invariance structure and heterogeneity among environments, ensuring stable estimation across multiple environments. We establish the causal discovery properties of our method, demonstrate its ability to overcome the ``curse of endogeneity'', and derive an $\ell_2$ error bound for our estimator, all in a non-asymptotic framework. We apply our method to real data for causal discovery and obtain biologically meaningful relationships, recovering known signaling pathways and revealing additional quantile-specific effects.
2026-03-20T08:29:51Z
25 pages, 4 figures
Bo Fu, Dandan Jiang

http://arxiv.org/abs/2605.27844v1
A Parameterization-Invariant DIC
2026-05-27T02:01:26Z
The classic Deviance Information Criterion (DIC) is not invariant to reparameterization and can have a negative and unstable effective number of parameters. The reason for the effective number of parameters being negative is actually that the plug-in deviance becomes excessively large when the posterior means of the model parameter differ dramatically from the maximum likelihood estimates. In latent variable models, the cause can be identifiability issues that lead to meaningless and unstable plug-in estimates. Specifically, nonidentifiability means that distinct parameter points can have the same likelihood and switching between such points within or between MCMC chains produces unstable and meaningless posterior means.
To address this issue, we propose a plug-in-free, parameterization-invariant version of the DIC, denoted DIC$_i$, and show that it is asymptotically equivalent to the Watanabe-Akaike Information Criterion (WAIC). Simulations demonstrate that DIC$_i$ aligns with WAIC in factor analysis and growth mixture models where the classic DIC breaks down. These results suggest that DIC$_i$ is a useful, computationally efficient alternative to the DIC when WAIC is not applicable or not available.
2026-05-27T02:01:26Z
Xingyao Xiao (Stanford University), Sophia Rabe-Hesketh (University of California, Berkeley)