http://arxiv.org/abs/2403.17609v2 Estimation Method under Three-Parameter Generalized Exponential Model: Consistency, Uniqueness and its Applications 2026-02-28T14:11:48Z

In numerous instances, the generalized exponential distribution can be used as an alternative to the most widely used non-regular families of distributions (the three-parameter Weibull, gamma, and lognormal) when analyzing lifetime or any skewed continuous data. A non-regular family is a class of probability distributions that do not satisfy the regularity conditions typically assumed in classical statistical inference. Some key features of such a family of distributions are: the support of its probability density function depends on one of its parameters; its likelihood function may be unbounded over a certain range of the parameter space, in which case maximum likelihood estimators do not exist; and the likelihood function may not even be differentiable or integrable as needed, so the Fisher information may not exist or may be infinite. Moreover, standard results such as MLE existence, consistency, and asymptotic normality may fail. Therefore, specialized or robust inferential techniques are needed. This article offers a consistent method for estimating the parameters of a three-parameter generalized exponential distribution that sidesteps the issue of an unbounded likelihood function. The method hinges on maximum likelihood estimation of the shape and scale parameters using a location-invariant statistic. Important estimator properties, such as uniqueness and consistency, are demonstrated for the first time under this approach. In addition, quantile estimates for the assumed distribution are provided. We present a Monte Carlo simulation study along with comparisons to a number of well-known estimation techniques in terms of bias and root mean square error. For illustrative purposes, a real dataset from reliability engineering is analyzed, and the goodness of fit and bootstrap confidence intervals are compared with those of existing traditional methods.
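As an illustration of why the location parameter makes this family non-regular, the following minimal sketch assumes the common parameterization $F(x) = (1 - e^{-(x-\mu)/\sigma})^{\alpha}$ for $x > \mu$, with shape $\alpha$, scale $\sigma$, and location $\mu$. The function names are hypothetical, and the code is not the estimation method of the paper; it only shows the CDF, the closed-form quantile, and the density that blows up near $\mu$ when $\alpha < 1$.

```python
import math

def ge_cdf(x, alpha, sigma, mu):
    """CDF under the assumed parameterization; the support is x > mu."""
    if x <= mu:
        return 0.0
    return (1.0 - math.exp(-(x - mu) / sigma)) ** alpha

def ge_pdf(x, alpha, sigma, mu):
    """Density under the same assumed parameterization."""
    if x <= mu:
        return 0.0
    z = math.exp(-(x - mu) / sigma)
    return (alpha / sigma) * z * (1.0 - z) ** (alpha - 1.0)

def ge_quantile(p, alpha, sigma, mu):
    """Inverse of ge_cdf for 0 < p < 1."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return mu - sigma * math.log(1.0 - p ** (1.0 / alpha))

# With shape alpha < 1 the density grows without bound as x approaches mu,
# which is how a likelihood can become unbounded when mu approaches the
# smallest observation.
near = ge_pdf(4.0 + 1e-6, alpha=0.5, sigma=1.5, mu=4.0)
far = ge_pdf(4.0 + 1e-3, alpha=0.5, sigma=1.5, mu=4.0)
```

Here `near` exceeds `far`, and moving the evaluation point closer to `mu` increases the density further.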

2024-03-26T11:41:09Z Accepted for publication in the Japanese Journal of Statistics and Data Science Kiran Prajapat Sharmishtha Mitra Debasis Kundu http://arxiv.org/abs/2410.21603v5 Approximate Bayesian Computation with Statistical Distances for Model Selection 2026-02-28T05:36:17Z

Model selection in the presence of intractable likelihoods remains a central challenge in Bayesian inference. Approximate Bayesian computation (ABC) provides a flexible likelihood-free framework, but its use for model choice is known to be sensitive to the choice of summary statistics, often leading to poorly calibrated posterior model probabilities. Recent ABC variants based on statistical distances allow comparisons to be performed directly on empirical distributions, avoiding data reduction and offering improved theoretical guarantees under suitable conditions. This paper provides a systematic evaluation of discrepancy-based ABC methods for Bayesian model selection, focusing on their empirical behavior across a range of simulation settings and levels of model complexity. We compare full data ABC approaches based on Wasserstein, Cramér-von Mises, and maximum mean discrepancy metrics with summary-statistic-based ABC and neural network classifiers. The results highlight settings in which full data ABC yields stable and well-calibrated posterior model probabilities, as well as scenarios where performance degrades due to model overlap or dependence. An application to toad movement models illustrates the practical implications of these findings. Overall, the study clarifies the strengths and limitations of discrepancy-based ABC for likelihood-free model choice and provides guidance for its use in realistic inferential settings.
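The idea of full data ABC for model choice can be sketched with a rejection sampler that compares empirical distributions directly. The example below is a minimal illustration under stated assumptions, not the paper's experimental setup: the two candidate models (normal and Laplace with an unknown location), the uniform model prior, the 5% acceptance quantile, and all function names are hypothetical choices. It uses the 1-Wasserstein distance, which for two equal-size one-dimensional samples reduces to the mean absolute difference of the sorted values.

```python
import numpy as np

def wasserstein_1d(x, y):
    """1-Wasserstein distance between two equal-size 1D samples."""
    return float(np.mean(np.abs(np.sort(x) - np.sort(y))))

def abc_model_choice(observed, simulators, n_draws=2000, quantile=0.05, seed=0):
    """Rejection ABC: keep the draws closest to the data, then count models."""
    rng = np.random.default_rng(seed)
    n = len(observed)
    # Uniform prior over model indices (an assumption of this sketch).
    models = rng.integers(len(simulators), size=n_draws)
    dists = np.array(
        [wasserstein_1d(observed, simulators[m](rng, n)) for m in models]
    )
    eps = np.quantile(dists, quantile)
    accepted = models[dists <= eps]
    # Relative frequencies approximate the posterior model probabilities.
    return np.bincount(accepted, minlength=len(simulators)) / len(accepted)

def sim_normal(rng, n):
    mu = rng.normal(0.0, 2.0)  # hypothetical prior on the location
    return rng.normal(mu, 1.0, size=n)

def sim_laplace(rng, n):
    mu = rng.normal(0.0, 2.0)  # same hypothetical prior
    return rng.laplace(mu, 1.0, size=n)

observed = np.random.default_rng(1).normal(0.5, 1.0, size=200)
probs = abc_model_choice(observed, [sim_normal, sim_laplace])
```

The returned vector `probs` holds one approximate posterior probability per candidate model. Swapping `wasserstein_1d` for another discrepancy changes only the distance function, which is the design point of discrepancy-based ABC.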

2024-10-28T23:12:31Z Clara Grazian http://arxiv.org/abs/2511.10967v2 Autocovariance and Optimal Design for Random Walk Metropolis-Hastings Algorithm 2026-02-28T03:06:54Z

The Metropolis-Hastings algorithm has been extensively studied in the estimation and simulation literature, with most prior work focusing on convergence behavior and asymptotic theory. However, its covariance structure, an important statistical property for both theory and implementation, remains less understood. In this work, we provide new theoretical insights into the scalar case, focusing primarily on symmetric unimodal target distributions with symmetric random walk proposals, where we also establish an optimal proposal design. In addition, we derive some more general results beyond this setting. For the high-dimensional case, we relate the covariance matrix to the classical 0.23 average acceptance rate tuning criterion.
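To make the quantities in this abstract concrete, the sketch below runs a scalar random walk Metropolis-Hastings chain on a symmetric unimodal target and computes its empirical acceptance rate and autocovariance. It is an illustrative simulation only, not the paper's theoretical derivation; the standard normal target, the step size of 2.4, and the function names are assumptions made for the example.

```python
import numpy as np

def rwm_chain(log_target, x0, step, n_iter, seed=0):
    """Scalar random walk Metropolis-Hastings with Gaussian proposals."""
    rng = np.random.default_rng(seed)
    x = x0
    chain = np.empty(n_iter)
    accepted = 0
    for t in range(n_iter):
        proposal = x + step * rng.normal()
        # Symmetric proposal, so the acceptance ratio is the target ratio.
        if np.log(rng.uniform()) < log_target(proposal) - log_target(x):
            x = proposal
            accepted += 1
        chain[t] = x
    return chain, accepted / n_iter

def autocovariance(chain, lag):
    """Empirical autocovariance of the chain at the given lag."""
    centered = chain - chain.mean()
    n = len(centered)
    return float(np.dot(centered[: n - lag], centered[lag:]) / n)

def log_std_normal(x):
    return -0.5 * x * x

chain, acc_rate = rwm_chain(log_std_normal, x0=0.0, step=2.4, n_iter=20000)
lag1_autocorr = autocovariance(chain, 1) / autocovariance(chain, 0)
```

Rerunning with different values of `step` shows how the proposal scale trades acceptance rate against lag-1 autocorrelation, which is the kind of design question the abstract refers to.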

2025-11-14T05:24:15Z Jingyi Zhang James C. Spall http://arxiv.org/abs/2507.07815v2 Vecchia approximated Bayesian heteroskedastic Gaussian processes 2026-02-27T21:23:06Z

Many computer simulations are stochastic and exhibit input-dependent noise. In such situations, heteroskedastic Gaussian processes (hetGPs) make ideal surrogates as they estimate a latent, non-constant variance. However, existing hetGP implementations are unable to deal with large simulation campaigns and use point estimates for all unknown quantities, including latent variances. This limits applicability to small experiments and undercuts uncertainty quantification. We propose a Bayesian hetGP using elliptical slice sampling (ESS) for posterior variance integration, and the Vecchia approximation to circumvent computational bottlenecks. We show good performance for our upgraded hetGP capability, compared to alternatives, on a benchmark example and a motivating corpus of more than 9 million lake temperature simulations. An open source implementation is provided as bhetGP on CRAN.

2025-07-10T14:45:33Z 33 pages, 14 figures Parul V. Patil Robert B. Gramacy Cayelan C. Carey R. Quinn Thomas http://arxiv.org/abs/2603.00277v1 CliPS -- How to identify cluster distributions in Bayesian mixture models 2026-02-27T19:50:45Z

We propose the CliPS procedure for fitting Bayesian mixture models in the context of model-based clustering, to identify the cluster distributions while simultaneously assessing the suitability of a cluster solution and validating the cluster structure. The procedure relies on the point process representation of a mixture model and is based on the assumption that a suitable cluster solution requires the clusters to be distinguishable with respect to a low-dimensional functional of the component-specific parameters of the mixture. CliPS maps the component-specific MCMC draws to the point process representation and identifies clusters there, exploiting the fact that, while data distributions usually overlap, the posteriors of these functionals become increasingly separated as the sample size grows. We outline the procedure and illustrate its use on several model-based clustering examples.

2026-02-27T19:50:45Z Gertraud Malsiner-Walli Sylvia Frühwirth-Schnatter Bettina Grün http://arxiv.org/abs/2512.11012v2 On a class of constrained Bayesian filters and their numerical implementation in high-dimensional state-space Markov models 2026-02-27T18:38:57Z

Bayesian filtering is a key tool in many problems that involve the online processing of data, including data assimilation, optimal control, nonlinear tracking and others. Unfortunately, the implementation of filters for nonlinear, possibly high-dimensional, dynamical systems is far from straightforward, as computational methods have to meet a delicate trade-off involving stability, accuracy and computational cost. In this paper we investigate the design, and theoretical features, of constrained Bayesian filters for state space models. The constraint on the filter is given by a sequence of compact subsets of the state space that determines the sources and targets of the Markov transition kernels in the dynamical model. Subject to such constraints, we provide sufficient conditions for filter stability and approximation error rates with respect to the original (unconstrained) Bayesian filter. Then, we look specifically into the implementation of constrained filters in a continuous-discrete setting where the state of the system is a continuous-time stochastic Itô process but data are collected sequentially over a time grid. We propose an implementation of the constraint that relies on a data-driven modification of the drift of the Itô process using barrier functions, and discuss the relation of this scheme with methods based on the Doob $h$-transform. Finally, we illustrate the theoretical results and the performance of the proposed methods in computer experiments for a partially-observed stochastic Lorenz 96 model.

2025-12-11T16:44:14Z Utku Erdogan Gabriel J. Lord Joaquin Miguez http://arxiv.org/abs/2508.15978v2 A nonstationary spatial model of PM2.5 with localized transfer learning from numerical model output 2026-02-27T16:47:02Z

Ambient air pollution measurements from regulatory monitoring networks are routinely used to support epidemiologic studies and environmental policy decision making. However, regulatory monitors are spatially sparse and preferentially located in areas with large populations. Numerical air pollution model output can be combined with measurements from monitors to improve inference and prediction of air pollution. Nonstationary covariance functions allow the model to adapt to spatial surfaces whose variability changes with location, as is the case for air pollution data. In this paper, we knit localized covariance parameters learned from the numerical model output together into a global nonstationary covariance, which we incorporate into a fully Bayesian model. We model the nonstationary structure in a computationally efficient way to make the Bayesian model scalable.

2025-08-21T21:43:25Z Environ Ecol Stat (2026) Wenlong Gong Brian J. Reich Joseph Guinness 10.1007/s10651-026-00710-z http://arxiv.org/abs/2602.23911v1 Online Bootstrap Inference for the Trend of Nonstationary Time Series 2026-02-27T11:00:47Z

This article proposes an online bootstrap scheme for nonparametric level estimation in nonstationary time series. Our approach applies to a broad class of level estimators expressible as weighted sample averages over time windows, including exponential smoothing methods and moving averages. The bootstrap procedure is motivated by asymptotic arguments and provides well-calibrated uniform-in-time coverage, enabling scalable uncertainty quantification in streaming or large-scale time-series settings. This makes the method suitable for tasks such as adaptive anomaly detection, online monitoring, or streaming A/B testing. Simulation studies demonstrate good finite-sample performance of our method across a range of nonstationary scenarios. In summary, this offers a practical resampling framework that complements online trend estimation with reliable statistical inference.
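The class of level estimators mentioned above, weighted sample averages over time windows, can be illustrated with exponential smoothing: the usual online recursion is the same estimator as an explicit weighted average of all past observations. The sketch below shows only this equivalence and is not the bootstrap scheme of the paper; the initialization at the first observation, the smoothing constant `a`, and the function names are assumptions of the example.

```python
def exp_smooth_online(xs, a):
    """Online recursion: level_t = a * x_t + (1 - a) * level_{t-1}."""
    level = None
    levels = []
    for x in xs:
        level = x if level is None else a * x + (1.0 - a) * level
        levels.append(level)
    return levels

def exp_smooth_weights(t, a):
    """Weights on x_0..x_t that reproduce the recursion's level at time t."""
    first = (1.0 - a) ** t  # weight carried by the initial observation
    rest = [a * (1.0 - a) ** (t - s) for s in range(1, t + 1)]
    return [first] + rest

xs = [2.0, 3.5, 1.0, 4.0, 2.5]
a = 0.3
online_level = exp_smooth_online(xs, a)[-1]
weights = exp_smooth_weights(len(xs) - 1, a)
weighted_level = sum(w * x for w, x in zip(weights, xs))
```

Both `online_level` and `weighted_level` are the same number up to floating-point error, and the weights sum to one, so the recursion fits the weighted-average form while needing only constant memory per update.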

2026-02-27T11:00:47Z Thomas Nagler Tobias Brock Nicolai Palm http://arxiv.org/abs/2602.23909v1 Automated selection of r for stationary and nonstationary models for r largest order statistics 2026-02-27T10:58:33Z

In the generalized extreme value model for the r largest order statistics, denoted by rGEV, the selection of r is critical. The existing entropy difference test for selecting r is applicable to large samples. Another existing method (the score test with parametric bootstrap) is applicable to small samples, but is computationally demanding. To address this problem for small samples, we propose a new method using a sequence of goodness-of-fit tests based on the conditional cumulative distribution function (CCDF). The proposed CCDF test is easy to implement and computationally fast. The Cramér-von Mises test is employed for the goodness-of-fit purpose. The proposed method is compared via Monte Carlo simulations with existing methods including the spacings, the score, and the entropy difference tests. The proposed CCDF test turns out to perform well for both small and large samples, comparable to the spacings and entropy difference tests. The utility of the proposed method is illustrated by an application to the r largest daily rainfall data in Korea. Additionally, we extend the existing methods and the CCDF test to a nonstationary rGEV model. The wide applicability of the proposed method is discussed.
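The Cramér-von Mises statistic used for the goodness-of-fit step has a simple closed form once the data have been transformed to values that should be uniform on (0, 1) under the null hypothesis. The sketch below computes only that statistic, assuming the conditional CDF values play the role of such uniform values (as the probability integral transform implies for a correctly specified continuous model). It does not reproduce the sequential selection of r or the critical values used in the paper, and the function name is hypothetical.

```python
import numpy as np

def cramer_von_mises_stat(u):
    """W^2 = 1/(12n) + sum_i (u_(i) - (2i - 1)/(2n))^2 for sorted u in [0, 1]."""
    u = np.sort(np.asarray(u, dtype=float))
    n = len(u)
    i = np.arange(1, n + 1)
    return float(1.0 / (12.0 * n) + np.sum((u - (2.0 * i - 1.0) / (2.0 * n)) ** 2))

n = 50
# Perfectly evenly spaced values attain the minimum possible statistic.
ideal = (2.0 * np.arange(1, n + 1) - 1.0) / (2.0 * n)
w2_ideal = cramer_von_mises_stat(ideal)
# Values piled up near zero are far from uniform and give a larger statistic.
w2_skewed = cramer_von_mises_stat(ideal ** 4)
```

Large values of the statistic indicate departure from uniformity, so in a sequential scheme one would compare it with a critical value at each candidate r.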

2026-02-27T10:58:33Z Yire Shin Jihong Park Jeong-Soo Park http://arxiv.org/abs/2602.23892v1 Towards Tsallis Fully Probabilistic Design 2026-02-27T10:39:41Z

In this paper we present the foundations of Fully Probabilistic Design for the case when the Kullback-Leibler divergence is replaced by the Tsallis divergence. Because the standard chain rule is replaced by subadditivity, immediate backwards recursion is not available. However, by forming a fixed point iteration, we can establish a constructive proof of the existence of a solution to this problem, which also constitutes an algorithmic scheme that iteratively converges to this solution. This development can provide greater versatility in Bayesian decision making by adding flexibility to the problem formulation.
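For readers unfamiliar with the Tsallis divergence, the sketch below assumes the common discrete form $D_q(p\|r) = \frac{1}{q-1}\left(\sum_i p_i^q r_i^{1-q} - 1\right)$, which may differ from the paper's exact convention. It checks numerically that this form approaches the Kullback-Leibler divergence as $q \to 1$, and that for independent products the divergence of the joint is not the plain sum of the marginals' divergences but satisfies $D_q^{(12)} = D_q^{(1)} + D_q^{(2)} + (q-1) D_q^{(1)} D_q^{(2)}$. This is one elementary way to see that the additive chain rule behind backward recursion is lost; the example distributions are arbitrary.

```python
import numpy as np

def kl_divergence(p, r):
    """Kullback-Leibler divergence between discrete distributions."""
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return float(np.sum(p * np.log(p / r)))

def tsallis_divergence(p, r, q):
    """Tsallis relative entropy of order q != 1 (assumed convention)."""
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return float((np.sum(p ** q * r ** (1.0 - q)) - 1.0) / (q - 1.0))

p1, r1 = np.array([0.2, 0.8]), np.array([0.5, 0.5])
p2, r2 = np.array([0.3, 0.7]), np.array([0.6, 0.4])

# Limit q -> 1 recovers the KL divergence.
near_kl = tsallis_divergence(p1, r1, 1.0 + 1e-6)

# Independent product distributions: additivity holds only up to a cross term.
q = 2.0
d1 = tsallis_divergence(p1, r1, q)
d2 = tsallis_divergence(p2, r2, q)
d12 = tsallis_divergence(np.outer(p1, p2).ravel(), np.outer(r1, r2).ravel(), q)
cross_term = (q - 1.0) * d1 * d2
```

The cross term vanishes only at $q = 1$, where the familiar additive decomposition of the KL divergence is recovered.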

2026-02-27T10:39:41Z Vyacheslav Kungurtsev Giovanni Russo http://arxiv.org/abs/2602.23815v1 Efficient Tests for Testing in Two-way ANOVA under Heteroscedasticity 2026-02-27T08:55:05Z

New tests are developed for two-way ANOVA models with heterogeneous error variances. The testing problems considered are those of testing for significant interaction effects, simple effects, and treatment effects. The likelihood ratio tests (LRTs) and simultaneous comparison tests are derived for all three problems. Hill climbing algorithms are proposed to compute the maximum likelihood estimators (MLEs) of the parameters under the restrictions of the null and alternative hypotheses. It is proved that the proposed algorithms converge to the MLEs. A parametric bootstrap algorithm is provided for the computation of the critical points. The simulated power values of the proposed tests are compared with those of two existing tests. For testing main effects in the additive ANOVA model, the LRT appears to gain about $30\%$ to $50\%$ in power over the available tests. Also, the proposed tests for the interaction and simple effects are seen to have power and size performance comparable to the existing tests. The behavior of the proposed tests under non-normal error distributions is also discussed. Four real data sets are used to demonstrate the application of the proposed tests. A software package has been developed in R to make it simple to apply the tests to experimental data sets.

2026-02-27T08:55:05Z Anjana Mondal Somesh Kumar http://arxiv.org/abs/2602.23561v1 VaSST: Variational Inference for Symbolic Regression using Soft Symbolic Trees 2026-02-27T00:07:31Z

Symbolic regression has recently gained traction in AI-driven scientific discovery, aiming to recover explicit closed-form expressions from data that reveal underlying physical laws. Despite recent advances, existing methods remain dominated by heuristic search algorithms or data-intensive approaches that assume low-noise regimes and lack principled uncertainty quantification. Fully probabilistic formulations are scarce, and existing Markov chain Monte Carlo-based Bayesian methods often struggle to efficiently explore the highly multimodal combinatorial space of symbolic expressions. We introduce VaSST, a scalable probabilistic framework for symbolic regression based on variational inference. VaSST employs a continuous relaxation of symbolic expression trees, termed soft symbolic trees, where discrete operator and feature assignments are replaced by soft distributions over allowable components. This relaxation transforms the combinatorial search over an astronomically large symbolic space into an efficient gradient-based optimization problem while preserving a coherent probabilistic interpretation. The learned soft representations induce posterior distributions over symbolic structures, enabling principled uncertainty quantification. Across simulated experiments and the Feynman Symbolic Regression Database within SRBench, VaSST achieves superior performance in both structural recovery and predictive accuracy compared to state-of-the-art symbolic regression methods.

2026-02-27T00:07:31Z 38 pages, 5 figures, 35 tables, Submitted Somjit Roy Pritam Dey Bani K. Mallick http://arxiv.org/abs/2602.23528v1 Neural Operators Can Discover Functional Clusters 2026-02-26T22:20:34Z

Operator learning is reshaping scientific computing by amortizing inference across infinite families of problems. While neural operators (NOs) are increasingly well understood for regression, far less is known for classification and its unsupervised analogue: clustering. We prove that sample-based neural operators (SNOs) can learn any finite collection of classes in an infinite-dimensional reproducing kernel Hilbert space, even when the classes are neither convex nor connected, under mild kernel sampling assumptions. Our universal clustering theorem shows that any $K$ closed classes can be approximated to arbitrary precision by NO-parameterized classes in the upper Kuratowski topology on closed sets, a notion that can be interpreted as disallowing false-positive misclassifications. Building on this, we develop an NO-powered clustering pipeline for functional data and apply it to unlabeled families of ordinary differential equation (ODE) trajectories. Discretized trajectories are lifted by a fixed pre-trained encoder into a continuous feature map and mapped to soft assignments by a lightweight trainable head. Experiments on diverse synthetic ODE benchmarks show that the resulting practical SNO recovers latent dynamical structure in regimes where classical methods fail, providing evidence consistent with our universal clustering theory.

2026-02-26T22:20:34Z Yicen Li Jose Antonio Lara Benitez Ruiyang Hong Anastasis Kratsios Paul David McNicholas Maarten Valentijn de Hoop http://arxiv.org/abs/2208.14537v2 Bayesian Multinomial Logistic Regression for Numerous Categories 2026-02-26T16:52:15Z

Bayesian multinomial logistic regression provides a principled, interpretable approach to multiclass classification, but posterior sampling becomes increasingly expensive as the model dimension grows. Prior work has studied scalability in the number of subjects and covariates; in contrast, this paper focuses on how computation changes as the number of outcome categories increases. To improve scalability in settings with numerous categories, we adapt a gamma-augmentation strategy to decouple category-specific coefficient updates, so that each category's coefficients can be updated conditional on a single auxiliary variable per subject, rather than on the full set of other categories' coefficients. Because the resulting coefficient conditionals are non-conjugate, we couple this augmentation with either adaptive Metropolis-Hastings or elliptical slice sampling. Through simulation and a real-data example, we compare effective sample size and effective sampling rate across several standard competitors. We find that the best-performing sampler depends on the dimension and imbalance regime, and that the proposed augmentation provides substantial speedups in scenarios with numerous categories.
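Elliptical slice sampling, one of the two updates the abstract pairs with the augmentation, applies whenever a parameter block has a zero-mean Gaussian prior and a likelihood that can be evaluated pointwise. The sketch below is a generic single update following the standard algorithm of Murray, Adams, and MacKay, not the gamma-augmented sampler of the paper. The toy Gaussian likelihood, the identity prior covariance, and the function names are assumptions chosen so that the exact posterior mean is known.

```python
import numpy as np

def elliptical_slice_step(f, log_lik, prior_chol, rng):
    """One elliptical slice sampling update for a N(0, L L^T) prior."""
    nu = prior_chol @ rng.normal(size=f.shape)   # auxiliary prior draw
    log_y = log_lik(f) + np.log(rng.uniform())   # slice height
    theta = rng.uniform(0.0, 2.0 * np.pi)
    lo, hi = theta - 2.0 * np.pi, theta
    while True:
        proposal = f * np.cos(theta) + nu * np.sin(theta)
        if log_lik(proposal) > log_y:
            return proposal
        # Shrink the angle bracket towards theta = 0 (the current state).
        if theta < 0.0:
            lo = theta
        else:
            hi = theta
        theta = rng.uniform(lo, hi)

# Toy check: prior N(0, I) and likelihood y ~ N(f, I) give posterior N(y/2, I/2).
y = np.array([1.0, -2.0])

def log_lik(f):
    return -0.5 * float(np.sum((y - f) ** 2))

rng = np.random.default_rng(0)
f = np.zeros(2)
samples = np.empty((4000, 2))
for t in range(4000):
    f = elliptical_slice_step(f, log_lik, np.eye(2), rng)
    samples[t] = f
posterior_mean = samples[500:].mean(axis=0)
```

The update has no step size to tune and always returns a new state, which is part of its appeal for non-conjugate conditionals; for a logistic-type model one would replace `log_lik` with the relevant conditional log-likelihood.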

2022-08-30T20:56:29Z 14 pages, 2 figures. R package available at https://github.com/kylemcevoy/BayesMultiLogit Jared D. Fisher Kyle R. McEvoy http://arxiv.org/abs/2602.22965v1 A note on the area under the likelihood and the fake evidence for model selection 2026-02-26T13:01:50Z

Improper priors are not allowed for the computation of the Bayesian evidence $Z=p({\bf y})$ (a.k.a. marginal likelihood), since in this case $Z$ is not completely specified due to an arbitrary constant involved in the computation. In this work, however, we remark that they can be employed in a specific type of model selection problem: when we have several (possibly infinitely many) models belonging to the same parametric family (i.e., for tuning parameters of a parametric model). The quantities involved in this type of selection cannot, however, be considered Bayesian evidences: we suggest using the name "fake evidences" (or "areas under the likelihood" in the case of uniform improper priors). We also show that, in this model selection scenario, using a diffuse prior and letting its scale parameter grow to infinity does not recover the value of the area under the likelihood obtained with a uniform improper prior. We first discuss this from a general point of view. Then we provide, as an illustrative example, all the details for Bayesian regression models with nonlinear bases, considering two cases: the use of a uniform improper prior and the use of a Gaussian prior. A numerical experiment confirming the previous statements is also provided.
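The statement about diffuse priors can be seen in a short worked calculation. The setting below is a simplifying assumption for illustration and not the paper's general treatment: a scalar parameter $\theta$, an integrable likelihood $\ell(\theta) = p({\bf y}|\theta)$ with area $A$, and a zero-mean Gaussian prior with scale $\sigma$. The symbols $\ell$, $A$, and $Z_\sigma$ are introduced here.

```latex
\begin{align}
A &= \int \ell(\theta)\, d\theta
  && \text{(area under the likelihood, uniform improper prior)} \\
Z_\sigma &= \int \ell(\theta)\, \frac{1}{\sqrt{2\pi}\,\sigma}
  \exp\!\left(-\frac{\theta^2}{2\sigma^2}\right) d\theta
  && \text{(evidence under the } \mathcal{N}(0,\sigma^2) \text{ prior)} \\
\sqrt{2\pi}\,\sigma\, Z_\sigma &= \int \ell(\theta)
  \exp\!\left(-\frac{\theta^2}{2\sigma^2}\right) d\theta
  \;\longrightarrow\; A
  && \text{as } \sigma \to \infty \\
Z_\sigma &\approx \frac{A}{\sqrt{2\pi}\,\sigma} \;\longrightarrow\; 0
  && \text{as } \sigma \to \infty
\end{align}
```

Under these assumptions the evidence itself vanishes as the prior becomes more diffuse, and only the rescaled quantity $\sqrt{2\pi}\,\sigma Z_\sigma$ approaches $A$, which is consistent with the abstract's remark that the limit of a diffuse prior does not recover the area under the likelihood.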

2026-02-26T13:01:50Z Computational Statistics, Volume 40, pages 4799-4824, year 2025 L. Martino F. Llorente 10.1007/s00180-025-01641-2