http://arxiv.org/abs/2408.03415v2 Gradient-Based Approximate Bayesian Inference with Entropy-Optimized Summary Statistics for Compartmental Models 2026-03-17T09:21:22Z

Recent pandemics have highlighted the critical role of infectious disease models in guiding public health decision-making, driving demand for realistic models that can provide timely answers under uncertainty. Compartmental models are widely used to capture disease dynamics, and advances in data availability, computational resources, and epidemiological understanding have allowed the development of models that incorporate detailed representations of population structure, disease progression, and intervention effects. While these advances improve model fidelity, they also increase model complexity, leading to high-dimensional parameter spaces, intractable likelihoods, and computational challenges for fitting models to limited surveillance data in real time. Existing likelihood-free methods, such as Approximate Bayesian Computation (ABC) and Bayesian Synthetic Likelihood (BSL), have developed largely independently, each with distinct strengths and limitations. We propose an integrated three-stage framework that synthesizes advances from both likelihood-based and likelihood-free methods: (1) ABC-based entropy minimization to identify low-dimensional, approximately orthogonal summary statistics; (2) BSL inference using these optimized summaries to construct tractable Gaussian approximations; and (3) Hamiltonian Monte Carlo sampling for efficient posterior exploration. Through an SEIR simulation study and an application to the 1978 British boarding school influenza outbreak, we demonstrate that our framework achieves reliable parameter estimation and uncertainty quantification while maintaining computational efficiency.
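
As a rough illustration of stage (2) only, the sketch below evaluates a Gaussian synthetic log-likelihood for a toy problem. It is not the authors' implementation: the simulator (i.i.d. normal data), the two summary statistics, and the function names `simulate`, `summaries`, and `synthetic_loglik` are all assumptions made for this example.

```python
import numpy as np

def summaries(data):
    # Hypothetical summary statistics: sample mean and log sample variance
    return np.array([data.mean(), np.log(data.var(ddof=1))])

def simulate(theta, rng, n_obs=50):
    # Toy simulator (assumption): i.i.d. N(theta, 1) observations
    return rng.normal(theta, 1.0, size=n_obs)

def synthetic_loglik(theta, s_obs, rng, n_sim=200):
    """Gaussian synthetic log-likelihood of the observed summaries at theta."""
    # Simulate summaries at theta and fit a Gaussian to them
    sims = np.array([summaries(simulate(theta, rng)) for _ in range(n_sim)])
    mu = sims.mean(axis=0)
    cov = np.cov(sims, rowvar=False)
    # Evaluate the fitted Gaussian log-density at the observed summaries
    diff = s_obs - mu
    _, logdet = np.linalg.slogdet(cov)
    quad = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (len(s_obs) * np.log(2.0 * np.pi) + logdet + quad)

rng = np.random.default_rng(0)
s_obs = summaries(simulate(0.0, rng))          # "observed" data generated at theta = 0
ll_near = synthetic_loglik(0.0, s_obs, rng)    # parameter close to the truth
ll_far = synthetic_loglik(3.0, s_obs, rng)     # parameter far from the truth
```

In a full workflow this noisy log-likelihood would be combined with a prior and passed to a sampler; the gradient-based sampling of stage (3) is not shown here.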

2024-08-06T19:29:34Z Xiahui Li Fergus J. Chadwick Ben Swallow http://arxiv.org/abs/2602.17922v2 Data-driven configuration tuning of glmnet for balancing accuracy and computational efficiency 2026-03-17T07:46:50Z

The glmnet package in R is widely used for lasso estimation because of its computational efficiency. Despite its popularity, glmnet occasionally yields solutions that deviate substantially from the true ones because of the inappropriate default configuration of the algorithm. The accuracy of the obtained solutions can be improved by appropriately tuning the configuration. However, such improvements typically increase computational time, resulting in a tradeoff between accuracy and computational efficiency. Therefore, a systematic approach is required to determine the appropriate configuration. To address this need, we propose a unified data-driven framework specifically designed to optimize the configuration by balancing solution path accuracy and computational cost. Specifically, we generate a large-scale training dataset by measuring the accuracy and computation time of glmnet. Using this dataset, we construct neural networks to predict accuracy and computation time from data characteristics and configuration. For a new dataset, the proposed framework uses the trained networks to explore the configuration space and derive a Pareto front that represents the tradeoff between accuracy and computational cost. This front enables automatic selection of the configuration that maximizes accuracy under a user-specified time constraint. The proposed method is implemented in the R package glmnetconf, available at https://github.com/Shuhei-Muroya/glmnetconf.git.

2026-02-20T00:58:59Z 23 pages, 9 figures. Title changed. Revised for linguistic clarity and stylistic improvements; no changes to the main results Shuhei Muroya Kei Hirose http://arxiv.org/abs/2603.16014v1 Fast Multitask Gaussian Process Regression 2026-03-16T23:46:41Z

Gaussian process (GP) regression is a powerful probabilistic modeling technique with built-in uncertainty quantification. When one has access to multiple correlated simulations (tasks), it is common to fit a multitask GP (MTGP) surrogate which is capable of capturing both inter-task and intra-task correlations. However, with a total of $N$ evaluations across all tasks, fitting an MTGP is often infeasible due to the $\mathcal{O}(N^2)$ storage and $\mathcal{O}(N^3)$ computations required to store, solve a linear system in, and compute the determinant of the $N \times N$ Gram matrix of pairwise kernel evaluations. In the single-task setting, one may reduce the required storage to $\mathcal{O}(N)$ and computations to $\mathcal{O}(N \log N)$ by fitting "fast GPs", which pair low-discrepancy design points from quasi-Monte Carlo with special kernel forms that yield nicely structured Gram matrices, e.g., circulant matrices. This article generalizes fast GPs to fast MTGPs, which pair low-discrepancy design points for each task with special product kernel forms that yield nicely structured block Gram matrices, e.g., circulant block matrices. An algorithm is presented to efficiently store, invert, and compute the determinant of such Gram matrices with optionally different sampling nodes and different sample sizes for each task. Derivations for fast MTGP Bayesian cubature are also provided. A GPU-compatible, open-source Python implementation is made available in the FastGPs package (https://alegresor.github.io/fastgps/). We validate the efficiency of our algorithm and implementation compared to standard techniques on a range of problems with low numbers of tasks and large sample sizes.
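
To make the single-task cost reduction concrete, here is a minimal NumPy sketch of why circulant structure helps. It does not use the FastGPs API; the function names `circulant_solve` and `circulant_logdet` and the example vector are assumptions for illustration. A circulant matrix is diagonalized by the discrete Fourier transform, so only its first column needs to be stored, and solves and log-determinants reduce to FFTs.

```python
import numpy as np

def circulant_solve(c, y):
    """Solve C x = y, where C is the circulant matrix with first column c."""
    eig = np.fft.fft(c)                       # eigenvalues of C
    return np.real(np.fft.ifft(np.fft.fft(y) / eig))

def circulant_logdet(c):
    """Return log|det C| for the circulant matrix with first column c."""
    eig = np.fft.fft(c)
    return float(np.sum(np.log(np.abs(eig))))

# Symmetric first column (c[1] == c[3]) giving a positive definite matrix
c = np.array([4.0, 1.0, 0.5, 1.0])
y = np.array([1.0, 2.0, 3.0, 4.0])
x = circulant_solve(c, y)
logdet = circulant_logdet(c)
```

Both routines cost $\mathcal{O}(N \log N)$ time and $\mathcal{O}(N)$ storage, compared with the cubic time and quadratic storage of a dense factorization.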

2026-03-16T23:46:41Z Aleksei G. Sorokin Pieterjan Robbe Fred J. Hickernell http://arxiv.org/abs/2603.15902v1 SEMMS with Random Effects: A Mixed-Model Extension for Variable Selection in Clustered and Longitudinal Data 2026-03-16T20:46:38Z

SEMMS (Scalable Empirical-Bayes Model for Marker Selection) is a variable-selection procedure for generalized linear models that uses a three-component normal mixture prior on regression coefficients. In its original form, SEMMS assumes that all observations are independent. Many real-world datasets, however, arise from repeated-measures or clustered designs in which observations within the same subject are correlated. Ignoring this correlation inflates the apparent residual variance and can severely degrade variable-selection performance. We extend SEMMS to accommodate random intercepts, random slopes, or both, via an alternating coordinate-ascent algorithm. After each round of fixed-effect variable selection, the subject-level best linear unbiased predictors (BLUPs) are updated with \texttt{lmer} (Gaussian) or \texttt{glmer} (non-Gaussian); the fixed-effect step then operates on the random-effect-adjusted response. We describe the algorithm, evaluate its performance in three Gaussian simulation studies spanning a range of signal strengths, random-effect magnitudes, and sample/predictor-space regimes, and present a semi-synthetic real-data example. We further extend the framework to non-Gaussian families (Poisson, binomial) via an IRLS working-response adaptation: at each outer iteration the fixed-effects step uses the RE-adjusted working response computed from the current \texttt{glmer} fitted values rather than the raw response. When the fixed-effect signal is strong relative to the random-effect variance, both the original and extended procedures perform comparably. When the random-effect variance dominates -- the scenario most likely to cause plain SEMMS to fail -- the mixed-model extension recovers the exact true predictor set in 93\% of simulated datasets (Gaussian), 61\% (Poisson), and 65\% (binomial), compared with 1\%, 45\%, and 39\% for plain SEMMS respectively.

2026-03-16T20:46:38Z Haim Bar Martin T. Wells http://arxiv.org/abs/2512.05650v2 Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation 2026-03-16T14:49:46Z

Estimating latent epidemic states and model parameters from partially observed, noisy data remains a major challenge in infectious disease modeling. State-space formulations provide a coherent probabilistic framework for such inference, yet fully Bayesian estimation is often computationally prohibitive because evaluating the observed-data likelihood requires integration over a latent trajectory. The Sequential Monte Carlo squared (SMC$^2$) algorithm offers a principled approach for joint state and parameter inference, combining an outer SMC sampler over parameters with an inner particle filter that estimates the likelihood up to the current time point. Despite its theoretical appeal, this nested particle filter imposes substantial computational cost, limiting routine use in near-real-time outbreak response. We propose Ensemble SMC$^2$ (eSMC$^2$), a computationally efficient variant that replaces the inner particle filter with an Ensemble Kalman Filter (EnKF) to approximate the incremental likelihood at each observation time. While this substitution introduces bias via a Gaussian approximation, we mitigate finite-sample effects using an unbiased Gaussian density estimator and adapt the EnKF for epidemic data through state-dependent observation variance. This makes our approach particularly suitable for overdispersed incidence data commonly encountered in infectious disease surveillance. Simulation experiments with known ground truth and an application to 2022 United States (U.S.) monkeypox incidence data demonstrate that eSMC$^2$ achieves substantial computational gains while producing posterior estimates comparable to SMC$^2$. The method accurately recovers latent epidemic trajectories and key epidemiological parameters, providing an efficient framework for sequential Bayesian inference from imperfect surveillance data.

2025-12-05T11:51:55Z Dhorasso Temfack Jason Wyse http://arxiv.org/abs/2501.10229v3 Amortized Bayesian Mixture Models 2026-03-16T14:12:03Z

Finite mixtures are a broad class of models useful in scenarios where observed data is generated by multiple distinct processes but without explicit information about the responsible process for each data point. Estimating Bayesian mixture models is computationally challenging due to issues such as high-dimensional posterior inference and label switching. Furthermore, traditional methods such as MCMC are applicable only if the likelihoods for each mixture component are analytically tractable. Amortized Bayesian Inference (ABI) is a simulation-based framework for estimating Bayesian models using generative neural networks. This allows the fitting of models without explicit likelihoods, and provides fast inference. ABI is therefore an attractive framework for estimating mixture models. This paper introduces a novel extension of ABI tailored to mixture models. We factorize the posterior into a distribution of the parameters and a distribution of (categorical) mixture indicators, which allows us to use a combination of generative neural networks for parameter inference, and classification networks for mixture membership identification. The proposed framework accommodates both independent and dependent mixture models, enabling filtering and smoothing. We validate and demonstrate our approach through synthetic and real-world datasets.

2025-01-17T14:51:03Z 34 pages, 17 figures Šimon Kucharský Paul Christian Bürkner http://arxiv.org/abs/2307.01111v3 A Gaussian process and linear-based framework for computing cut distributions in modular Bayesian calibration of two chained computer models 2026-03-16T13:53:45Z

Computer models are widely used in science and engineering to simulate complex systems. However, these models are affected by several sources of uncertainty, which may limit their use for decision making in risk management. We present a Bayesian approach for quantifying parameter uncertainty in a chain of two computer models motivated by multiphysics simulations in the nuclear field. Part of the inputs of a downstream model parametrized by $θ\in \mathbb{R}^p$ come from the outputs of an upstream model parametrized by $λ\in \mathbb{R}^q$. Usually, the joint posterior distribution of $(θ, λ)$ would be obtained by applying Bayes' theorem using the experimental observations of both models. However, when the observations of the downstream model are too indirect to provide informative inference on $λ$, it may be preferable to compute a modular posterior distribution of $(θ, λ)$, referred to as the \emph{cut distribution}. Assuming that the posterior distribution of $λ$ has been previously estimated from observations of the upstream model only, we aim to compute the posterior distribution of $θ$ conditional on $λ$ using observations from the downstream model. To this end, we propose a Gaussian-process and linear-based framework to estimate the functional dependence between $θ$ and $λ$, denoted by $θ(λ)$, where each component is modeled as a realization of a Gaussian process. As the downstream model is approximated by a linear function of $θ(λ)$, Bayesian conjugacy allows us to derive a Gaussian posterior predictive distribution of $θ(λ)$ for any realization of $λ$. The effectiveness of the method is illustrated through several synthetic examples, and we highlight how variations in $λ$ impact the predictive distribution of the chained simulation.

2023-07-03T15:35:55Z 44 pages, 14 figures Oumar Baldé Guillaume Damblin Amandine Marrel Antoine Bouloré Loïc Giraldi http://arxiv.org/abs/2512.19398v2 A Reduced Basis Decomposition Approach to Efficient Data Collection in Pairwise Comparison Studies 2026-03-16T10:18:28Z

Comparative judgement studies elicit quality assessments through pairwise comparisons, typically analysed using the Bradley-Terry model. A challenge in these studies is experimental design, specifically, determining the optimal pairs to compare to maximize statistical efficiency. Constructing static experimental designs for these studies requires spectral decomposition of a covariance matrix over pairs of pairs, which becomes computationally infeasible for studies with more than approximately 150 objects. We propose a scalable method based on reduced basis decomposition that bypasses explicit construction of this matrix, achieving computational savings of two to three orders of magnitude. We establish eigenvalue bounds guaranteeing approximation quality and characterise the rank structure of the design matrix. Simulations demonstrate speedup factors exceeding 100 for studies with 64 or more objects, with negligible approximation error. We apply the method to construct designs for a 452-region spatial study in under 7 minutes and enable real-time design updates for classroom peer assessment, reducing computation time from 15 minutes to 15 seconds.
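
For readers unfamiliar with the Bradley-Terry model mentioned above, the sketch below shows its standard form: the probability that object $i$ is preferred to object $j$ is a logistic function of the difference in their log-worth parameters. The function names and the example values are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def bt_win_prob(lam_i, lam_j):
    """P(object i is preferred to object j) for log-worths lam_i and lam_j."""
    return 1.0 / (1.0 + np.exp(-(lam_i - lam_j)))

def simulate_comparisons(lam, pairs, rng):
    """Return 1 where the first object of each pair wins, else 0."""
    lam = np.asarray(lam, dtype=float)
    pairs = np.asarray(pairs)
    p = bt_win_prob(lam[pairs[:, 0]], lam[pairs[:, 1]])
    return (rng.random(len(pairs)) < p).astype(int)

rng = np.random.default_rng(1)
lam = [0.0, 0.5, 2.0]                 # hypothetical log-worths for three objects
pairs = [(0, 1), (0, 2), (1, 2)]      # one possible design: every pair once
outcomes = simulate_comparisons(lam, pairs, rng)
```

The design question studied in the abstract is which entries of `pairs` to include, and how often, when comparing every pair is too expensive.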

2025-12-22T13:48:53Z Author Accepted Manuscript Jiahua Jiang Joseph Marsh Rowland G Seymour http://arxiv.org/abs/2309.05435v2 Parallel Selected Inversion for Space-Time Gaussian Markov Random Fields 2026-03-16T08:48:35Z

Performing Bayesian inference on large spatio-temporal models requires extracting inverse elements of large sparse precision matrices for marginal variances, as well as estimating model hyperparameters. Although direct matrix factorizations can be used for the inversion, such methods fail to scale well for distributed problems when run on large computing clusters. In contrast, Krylov subspace methods for selected inversion have been gaining traction. We propose a parallel hybrid approach based on domain decomposition, which extends the Rao-Blackwellized Monte Carlo estimator for distributed precision matrices. Our approach exploits the strength of Krylov subspace methods as global solvers and the efficiency of direct factorizations as base case solvers to compute the marginal variances and the derivatives required for hyperparameter estimation using a divide-and-conquer strategy. By introducing subdomain overlaps, one can achieve greater accuracy at an increased computational cost with little to no additional communication. We demonstrate the speed improvements and efficient hyperparameter inference on both simulated models and a massive US daily temperature dataset.

2023-09-11T13:15:01Z Published in Statistics and Computing (2025). Expanded version with additional results, discussion, and references Statistics and Computing 35, 211 (2025) Abylay Zhumekenov Elias T. Krainski Håvard Rue 10.1007/s11222-025-10747-y http://arxiv.org/abs/2510.04582v3 Constrained Dikin-Langevin diffusion for polyhedra 2026-03-15T19:19:46Z

We propose a reflection-free Langevin framework for sampling and optimization on compact polyhedra. The method is based on the inverse Hessian of the logarithmic barrier, which defines a Dikin--Langevin diffusion whose drift and noise adapt to the local interior-point geometry. We show that trajectories started in the interior remain feasible for all finite times almost surely, so the constrained domain is preserved without reflections or projections. For computation, we discretize the diffusion using the Euler--Maruyama scheme and apply a Metropolis--Hastings correction, yielding a sampler that targets the exact constrained distribution. We also propose an annealed interacting variant for nonconvex optimization. Numerically, the Metropolis-adjusted method outperforms both the Dikin random walk and standard MALA on anisotropic box-constrained Gaussians, and the interacting optimizer escapes suboptimal basins more reliably than the non-interacting method.

2025-10-06T08:34:55Z James Chok Domenic Petzinna http://arxiv.org/abs/2507.00923v2 ForLion: An R Package for Finding Optimal Experimental Designs with Mixed Factors 2026-03-15T16:05:04Z

Optimal design is crucial for experimenters to maximize the information collected from experiments and estimate the model parameters most accurately. ForLion algorithms have been proposed to find D-optimal designs for experiments with mixed types of factors. In this paper, we introduce the ForLion package which implements the ForLion algorithm to construct locally D-optimal designs and the Expected Weighted (EW) ForLion algorithm to generate robust EW D-optimal designs, which maximize the determinant of the expected Fisher information matrix under parameter uncertainty. The package supports experiments under linear models (LM), generalized linear models (GLM), and multinomial logistic models (MLM) with continuous, discrete, or mixed-type factors. It provides both optimal approximate designs and an efficient function converting approximate designs into exact designs with integer-valued allocations of experimental units. Tutorials are included to show the package's usage across different scenarios.

2025-07-01T16:28:37Z 33 pages, 5 figures, 5 tables Siting Lin Yifei Huang Jie Yang http://arxiv.org/abs/2110.10801v2 Efficient Sampling for Ising and Potts Models using Auxiliary Gaussian Variables 2026-03-15T01:47:09Z

Ising and Potts models are an important class of discrete probability distributions which originated from statistical physics and have since found applications in several disciplines. Simulation from these models is a well-known challenging problem. In this paper, we study a class of Markov chain Monte Carlo algorithms, in which we introduce an auxiliary Gaussian variable such that, conditional on this variable, the discrete states are independent. This approach is broadly applicable to Ising and Potts models, including ones in which the coupling matrix admits negative entries, as in spin glass and Hopfield models. We focus on a block Gibbs sampler version of this algorithm, which alternates between sampling the auxiliary Gaussian and the discrete states, and derive mixing time bounds for a wide class of Ising/Potts models at both high and low temperatures, yielding results analogous to those derived for the Heat Bath and Swendsen-Wang algorithms. We present novel choices of auxiliary Gaussian variables which scale well with the number of states in the Potts model, and which can take advantage of the low rank structure of the coupling matrix, if any. Finally, we numerically compare the performance of the auxiliary Gaussian Gibbs sampler against several competing algorithms, across a range of examples.
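
The block Gibbs idea can be sketched for a small Ising model with target $p(x) \propto \exp(\tfrac{1}{2} x^\top J x)$, $x \in \{-1, 1\}^n$. The construction below is one textbook choice of auxiliary variable, $z \mid x \sim N(Ax, A)$ with $A = J + cI$ positive definite, and is not necessarily one of the choices proposed in the paper; the function names and the coupling matrix are assumptions. Under this choice the joint density is proportional to $\exp(-\tfrac{1}{2} z^\top A^{-1} z + z^\top x)$, so the spins are independent given $z$ with $P(x_i = 1 \mid z) = 1 / (1 + e^{-2 z_i})$.

```python
import itertools
import numpy as np

def aux_gaussian_gibbs(J, n_iter, rng):
    """Block Gibbs sampler for p(x) proportional to exp(0.5 x^T J x), x in {-1, 1}^n."""
    n = J.shape[0]
    # Shift the diagonal so A is positive definite (assumes zero-diagonal J);
    # since x_i^2 = 1 this adds only a constant to the exponent of the target.
    shift = np.abs(J).sum(axis=1).max() + 0.1
    A = J + shift * np.eye(n)
    L = np.linalg.cholesky(A)
    x = rng.choice([-1.0, 1.0], size=n)
    draws = np.empty((n_iter, n))
    for t in range(n_iter):
        z = A @ x + L @ rng.standard_normal(n)   # z | x ~ N(A x, A)
        p_up = 1.0 / (1.0 + np.exp(-2.0 * z))    # P(x_i = +1 | z), independent over i
        x = np.where(rng.random(n) < p_up, 1.0, -1.0)
        draws[t] = x
    return draws

def exact_pair_moment(J, i, j):
    """E[x_i x_j] by brute-force enumeration (feasible only for small n)."""
    states = np.array(list(itertools.product([-1.0, 1.0], repeat=J.shape[0])))
    w = np.exp(0.5 * np.einsum("si,ij,sj->s", states, J, states))
    return float(np.sum(w * states[:, i] * states[:, j]) / np.sum(w))

# Hypothetical coupling matrix with a negative entry
J = np.array([[0.0, 0.3, -0.2],
              [0.3, 0.0, 0.1],
              [-0.2, 0.1, 0.0]])
rng = np.random.default_rng(0)
draws = aux_gaussian_gibbs(J, n_iter=20000, rng=rng)
mc_moment = float(np.mean(draws[:, 0] * draws[:, 1]))
```

Comparing `mc_moment` with `exact_pair_moment(J, 0, 1)` gives a quick sanity check on a model small enough to enumerate.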

2021-10-20T22:05:12Z Charles C. Margossian Chenyang Zhong Sumit Mukherjee http://arxiv.org/abs/2603.14094v1 Maximin Robust Bayesian Experimental Design 2026-03-14T19:40:39Z

We address the brittleness of Bayesian experimental design under model misspecification by formulating the problem as a max--min game between the experimenter and an adversarial nature subject to information-theoretic constraints. We demonstrate that this approach yields a robust objective governed by Sibson's $α$-mutual information~(MI), which identifies the $α$-tilted posterior as the robust belief update and establishes the Rényi divergence as the appropriate measure of conditional information gain. To mitigate the bias and variance of nested Monte Carlo estimators needed to estimate Sibson's $α$-MI, we adopt a PAC-Bayes framework to search over stochastic design policies, yielding rigorous high-probability lower bounds on the robust expected information gain that explicitly control finite-sample error.

2026-03-14T19:40:39Z 11 pages + 15 in appendix, 5 figures Hany Abdulsamad Sahel Iqbal Christian A. Naesseth Takuo Matsubara Adrien Corenflos http://arxiv.org/abs/2509.01437v2 Sampling as Bandits: Evaluation-Efficient Design for Black-Box Densities 2026-03-14T01:44:24Z

We propose bandit importance sampling (BIS), a powerful importance sampling framework tailored for settings in which evaluating the target density is computationally expensive. BIS facilitates accurate sampling while minimizing the required number of target-density evaluations. In contrast to adaptive importance sampling, which optimizes a proposal distribution, BIS directly optimizes the set of samples through a sequential selection process driven by multi-armed bandits. BIS serves as a general framework that accommodates user-defined bandit strategies. Theoretically, the weak convergence of the weighted samples, and thus the consistency of the Monte Carlo estimator, is established regardless of the specific strategy employed. In this paper, we present a practical strategy that leverages Gaussian process surrogates to guide sample selection, adapting the principles of Bayesian optimization for sampling. Comprehensive numerical studies demonstrate the superior performance of BIS across multimodal, heavy-tailed distributions, and real-world Bayesian inference tasks involving Markov random fields.

2025-09-01T12:47:32Z Takuo Matsubara Andrew Duncan Simon Cotter Konstantinos Zygalakis http://arxiv.org/abs/2603.13622v1 The Continuous Rank Probability Score of a Generalized Beta-Prime Distribution and Some Special Cases 2026-03-13T22:00:42Z

This working paper describes new results in derivations of the Continuous Ranked Probability Score of a generalized beta-prime distribution and several special cases, such as the Dagum distribution and Singh-Maddala distribution. Comparison with Monte Carlo estimates is also presented.
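
A Monte Carlo comparison of the kind mentioned above can be sketched using the standard identity $\mathrm{CRPS}(F, y) = \mathbb{E}|X - y| - \tfrac{1}{2}\mathbb{E}|X - X'|$, where $X$ and $X'$ are independent draws from $F$. The example below uses the standard beta-prime distribution, sampled as a ratio of independent gamma variables, rather than the generalized form treated in the paper; the function name, shape parameters, and evaluation point are assumptions.

```python
import numpy as np

def crps_monte_carlo(samples, y):
    """Estimate CRPS(F, y) = E|X - y| - 0.5 E|X - X'| from draws of F."""
    x = np.sort(np.asarray(samples, dtype=float))
    n = x.size
    term1 = np.mean(np.abs(x - y))
    # Mean absolute difference over all ordered pairs, computed from the
    # sorted sample in O(n log n) instead of forming the n-by-n matrix.
    i = np.arange(1, n + 1)
    term2 = 2.0 * np.sum((2 * i - n - 1) * x) / n**2
    return term1 - 0.5 * term2

rng = np.random.default_rng(0)
a, b = 2.0, 4.0                       # hypothetical shapes; b > 1 so the mean is finite
n = 100_000
# If G_a ~ Gamma(a) and G_b ~ Gamma(b) are independent, G_a / G_b is beta-prime(a, b)
samples = rng.gamma(a, size=n) / rng.gamma(b, size=n)
crps_estimate = crps_monte_carlo(samples, y=0.5)
```

An estimate of this kind can then be set against a closed-form expression to check the derivation numerically.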

2026-03-13T22:00:42Z 9 pages, no figures. Work in progress Matthew LeDuc