https://arxiv.org/api/Q7Ua4lM55UnRMrZTTKi05SdWUZo 2026-07-17T19:57:36Z 28366 0 15 http://arxiv.org/abs/2510.20742v3 Bayesian Prediction under Moment Conditioning 2026-07-16T17:13:12Z

How should prediction proceed when information is expressed through moment restrictions rather than a complete likelihood? Let $Q$ be a baseline law and $P^*$ its Kullback-Leibler projection onto the moment-constrained class. We study the law of a selected block from an exchangeable ensemble conditioned on the corresponding empirical moment event. Finite partitions provide coordinates for the conditional type law. On a fixed chart, exactly feasible types admit a Gaussian localization on the tangent space at the discrete projection, with precision given by the reduced Hessian; a separate window bound covers generic real-valued constraints. These results yield quantitative convergence of the block law to the product law generated by the projection. A fixed-sample Le Cam comparison and a projected-law diagonal relate the finite chart to the ambient projection under refinement. The projected family also motivates a generalized conditional likelihood and a local comparison with generalized method of moments. Simulations illustrate the geometry under correct specification and misspecification.

2025-10-23T17:03:17Z 42 pages, 6 figures. Substantial revision: sharper statements of the collapse results, a chart-efficiency result for the induced conditional-likelihood estimator, and a corrected account of the companion paper arXiv:2509.13283 Nicholas G. Polson Daniel Zantedeschi http://arxiv.org/abs/2607.15179v1 A Complete-Data Likelihood for Epidemic Processes on Partially Observed Dynamic Networks 2026-07-16T16:34:39Z

Inference for infectious disease transmission on dynamic contact networks is complicated by latent infection times, partially observed network evolution, measurement error in contact data, and infection originating from outside the observed population. Existing likelihood-based approaches typically address these challenges separately and often rely on restrictive assumptions such as fully observed networks, closed populations, or symptom onset as a surrogate for infection time. We develop a unified complete-data likelihood framework for epidemic processes evolving on partially observed dynamic networks. The proposed formulation represents disease progression, network evolution, and observation mechanisms as interacting continuous-time stochastic processes within a common probabilistic framework. Specifically, we couple a susceptible-exposed-infectious-removed (SEIR) epidemic process with a status-dependent dynamic contact network and explicit observation models for symptoms and contacts. The resulting framework accommodates latent incubation periods, intermittent network observation, contact measurement error, and external infection pressure while preserving a coherent likelihood structure. Our principal contribution is the derivation of a complete-data event-history likelihood for the joint epidemic-network process under partial observation. The likelihood provides a rigorous foundation for likelihood-based and Bayesian inference through data augmentation, clarifies how information from disease progression and contact dynamics jointly determines parameter estimability, and reveals a broad class of existing epidemic network models as special cases. More generally, the framework contributes to statistical inference for partially observed interacting stochastic systems on evolving networks and establishes a foundation for uncertainty-aware analysis of complex transmission processes.

2026-07-16T16:34:39Z 42 pages, 20 figures. For associated R files, see https://github.com/asadmdaz/infectious-disease-modelling Md Asaduzzaman http://arxiv.org/abs/2403.00916v4 Characterizing Signalling: Connections between Causal Inference and Space-time Geometry 2026-07-16T15:47:49Z

Causality is pivotal to our understanding of the world, presenting itself in different forms: information-theoretic and relativistic, the former linked to the flow of information, the latter to the structure of space-time. Leveraging a framework introduced in PRA, 106, 032204 (2022), which formally connects these two notions in general physical theories, we study their interplay. Here, information-theoretic causality is defined through a causal modelling approach. First, we improve the characterization of information-theoretic signalling as defined through so-called affects relations. Specifically, we provide conditions for identifying redundancies in different parts of such a relation, introducing techniques for causal inference in unfaithful causal models (where the observable data does not "faithfully" reflect the causal dependences). In particular, this demonstrates the possibility of causal inference using the absence of signalling between certain nodes. Second, we define an order-theoretic property called conicality, showing that it is satisfied for light cones in Minkowski space-times with $d>1$ spatial dimensions but violated for $d=1$. Finally, we study the embedding of information-theoretic causal models in space-time without violating relativistic principles such as no superluminal signalling (NSS). In general, we observe that constraints imposed by NSS in a space-time and those imposed by purely information-theoretic causal inference behave differently. We then prove a correspondence between conical space-times and faithful causal models: in both cases, there emerges a parallel between these two types of constraints. This indicates a connection between informational and geometric notions of causality, and offers new insights for studying the relations between the principles of NSS and no causal loops in different space-time geometries and theories of information processing.

2024-03-01T19:00:45Z 31 + 25 pages, 12 figures. This work includes significantly improved versions of initial results presented in MG's master's thesis arXiv:2211.03593. v4 is close to the published version, and contains clarifications and some minor corrections Class. Quantum Grav. 43 (2026), 105008 Maarten Grothus V. Vilasini 10.1088/1361-6382/ae5d1d http://arxiv.org/abs/2405.19903v3 A new family of Gaussian processes for modeling animal movement: application to bat telemetry data 2026-07-16T15:27:10Z

Modeling animal movement is essential for addressing various ecological and biological questions. However, developing an effective predictive model for animal movement is a challenging task. In this paper, we introduce a new family of Gaussian processes, derived from the limiting fluctuations of the rescaled occupation-time process of certain branching particle systems, and study its applicability to real animal movement data. We examine two subfamilies and show that these processes exhibit long-range dependence and covariance functions with logarithmic asymptotic growth. For the exponential subfamily used in the applied analysis, the process is also non-stationary and not intrinsically stationary on compact time intervals. These properties are relevant when dealing with animal trajectories that exhibit strong memory. Finally, we illustrate the practical applicability of the proposed model by analyzing bat movement data.

2024-05-30T10:08:17Z 73 pages Jose Hermenegildo Ramirez Gonzalez Antonio Murillo Salas Ying Sun http://arxiv.org/abs/2604.22453v2 Adapted Wasserstein Barycenters of Gaussian Processes 2026-07-16T15:12:43Z

We study barycenters of filtered Gaussian processes in adapted Wasserstein space. The adapted Wasserstein distance refines classical optimal transport by requiring transport plans to respect the temporal flow of information, making it the natural metric for stochastic systems with filtration constraints, as in stochastic control, mathematical finance, and sequential decision problems. We prove that the \emph{unrestricted} barycenter problem for weighted Fréchet means of filtered Gaussian inputs admits a solution with Gaussian underlying law, representable as an enlarged filtered Gaussian process but not necessarily as an ordinary one. The problem decomposes into finitely many classical Bures--Wasserstein barycenter problems for the covariance contributions of the successive innovations. We then treat the \emph{restricted} problem, in which the barycenter is required to be an ordinary filtered Gaussian process, giving a rank and common-noise criterion for when the two problems agree, sufficient conditions for uniqueness, and first order optimality and regularity results. Under a martingale constraint we obtain an explicit solution via martingale projection and Bures--Wasserstein barycenters of the Gaussian increments. Beyond their intrinsic theoretical interest, our results provide a principled way to build representative models from collections of Gaussian stochastic systems, with applications to stochastic optimization, robust finance, and sequential statistical analysis.

2026-04-24T11:14:42Z Comments very welcome! Madhu Gunasingam Francesco Mattesini Johannes Wiesel Ting-Kam Leonard Wong http://arxiv.org/abs/2509.13283v2 De Finetti + Sanov = Bayes: Exchangeable Prediction under Moment Constraints 2026-07-16T13:42:32Z

We study exchangeable prediction when an empirical-moment constraint is primitive rather than a one-time completed-sample event. For each active finite horizon $N$, the relevant law is the de Finetti mixture conditioned on $E_N = \{\hat{P}_N \in E_{\varepsilon_N}\}$. Since the underlying law and the constraint are permutation invariant, the prediction target may be any fixed block of $m$ coordinates contained in the active horizon, including coordinates interpreted as future relative to an arbitrary finite cut. Conditionally on the directing measure $\mu$, the Gibbs-conditioning principle sends the law of such a block to the $m$-fold product of the I-projection $P^*_\mu = \arg\min_{Q \in E} D(Q \,\|\, \mu)$. On a finite alphabet we give an elementary master inequality for general polyhedral moment windows. After mixing over the constraint posterior $\Pi_{N,E}$, and under weak convergence plus posterior-averaged component control, the finite-dimensional marginals converge to a consistent exchangeable law whose random directing measure is the I-projection $P^*_\mu$, with $\mu$ drawn from the weak limit $\Pi_E$. Sequential prediction under this limiting law is therefore Bayesian prediction from a mixture of componentwise I-projections. What survives is a dichotomy on the subfamily minimizing the constraint rate: a reachable constraint leaves the projection asymptotically inactive, an unreachable one leaves genuine projections but drives the prior onto that subfamily. In each of our examples one mechanism or the other is gone in the limit, while both act at every finite $N$. The master bound also reads as an equivalence of ensembles. We reserve "maximum entropy" for a uniform or flat baseline and use "minimum relative entropy" or "I-projection" in general.
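
The I-projection in this abstract can be made concrete on a small finite alphabet. The sketch below is illustrative only and is not taken from the paper: it assumes a single equality moment constraint (the mean of a hypothetical statistic `f` equals `target`), a baseline `mu` with full support, and a target strictly between the smallest and largest values of `f`. Under those assumptions the minimizer of the relative entropy is an exponential tilt of `mu`, and the tilt parameter can be found by bisection. All function names are hypothetical.

```python
import math

def tilt(mu, f, lam):
    """Exponentially tilted law q(x) proportional to mu(x) * exp(lam * f(x))."""
    shift = max(lam * fx for fx in f)  # subtract the largest exponent for stability
    weights = [p * math.exp(lam * fx - shift) for p, fx in zip(mu, f)]
    total = sum(weights)
    return [w / total for w in weights]

def tilted_mean(mu, f, lam):
    """Mean of f under the tilted law; nondecreasing in lam."""
    return sum(q * fx for q, fx in zip(tilt(mu, f, lam), f))

def i_projection(mu, f, target, lo=-50.0, hi=50.0, iters=200):
    """Bisection on lam so that the tilted mean of f matches target.

    Assumes the solution lies in [lo, hi] (an illustrative bracket).
    """
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if tilted_mean(mu, f, mid) < target:
            lo = mid
        else:
            hi = mid
    return tilt(mu, f, 0.5 * (lo + hi))

def kl(q, mu):
    """Relative entropy D(q || mu) on a finite alphabet."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, mu) if qi > 0)

if __name__ == "__main__":
    mu = [1.0 / 6.0] * 6          # uniform baseline on die faces (assumed example)
    f = [1, 2, 3, 4, 5, 6]        # statistic: the face value
    q = i_projection(mu, f, 4.5)  # constrain the mean face value to 4.5
    print([round(x, 4) for x in q], round(kl(q, mu), 4))
```

Any other law with the same mean, such as one placing mass 1/2 on each of the faces 4 and 5, has a larger relative entropy to `mu` than the tilted law returned here.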

2025-09-16T17:36:41Z v2: substantially revised and reorganized; results and terminology updated Nicholas G. Polson Daniel Zantedeschi http://arxiv.org/abs/2607.14965v1 Statistical Inference for Scenario-Based Dynamic Optimization under Uncertainty 2026-07-16T13:16:03Z

Motivated by batch and semi-batch process operation, we study finite-horizon open-loop dynamic optimization problems with uncertain parameters. A common computational approach replaces the expected performance criterion by an average over finitely many sampled parameter realizations. We develop a statistical theory for the resulting sample-based optimal value as an estimator of the population optimal value. The analysis is based on a stability estimate showing that terminal losses depend Lipschitz continuously on the time-integrated control, which records the cumulative input delivered up to each time. This estimate yields a functional central limit theorem for the sample-based objective and a statistical limit theorem for the corresponding optimal value error. As a consequence, we obtain confidence intervals for the population optimal value. When the population optimizer is unique, the limit is Gaussian and leads to a plug-in confidence interval. When multiple optimal policies may exist, we use a subsampling confidence interval that does not require uniqueness. The methodology is illustrated on two fed-batch case studies in which feed-rate profiles are optimized under parametric uncertainty.

2026-07-16T13:16:03Z Aurya Javeed Johannes Milz http://arxiv.org/abs/2607.14948v1 Graph alignment in sparse inhomogeneous models via self-overlap 2026-07-16T12:57:28Z

We develop a general framework for understanding when graph alignment is information-theoretically feasible in sparse inhomogeneous random graph models, by studying the set of vertices on which the underlying matching can be recovered. Our main theorem gives a general lower bound on this set by leveraging the balanced load function introduced by Hajek (1990). The corresponding obstruction is captured by a new graph parameter, the self-overlap, which measures the extent to which a graph can imitate itself under a non-trivial relabelling. We then show that this criterion is sharp in a broad class of sparse inhomogeneous models, recovering known Erdős--Rényi phenomena and yielding sharp thresholds for Chung--Lu graphs and stochastic block models.

2026-07-16T12:57:28Z 31 pages, 1 figure Louis Vassaux http://arxiv.org/abs/2607.14930v1 Testing for correct model specification in copula regression models 2026-07-16T12:45:35Z

We propose a goodness-of-fit test for semiparametric copula regression models. Such models express the regression function in terms of marginal distribution functions and copula densities and therefore provide a flexible way to avoid fully nonparametric estimation in high-dimensional regression problems. Their performance, however, depends crucially on the specification of the parametric copula family. Instead of testing the copula model itself, we assess misspecification directly at the level of the induced regression function. To this end, we introduce a weighted $L^2$-distance between the true regression function and its best approximation within the postulated copula regression model. A kernel-based estimator of this distance is proposed and shown to be consistent and asymptotically normal under both the null hypothesis of correct specification and fixed alternatives. We derive a classical specification test and, using a self-normalized sequential statistic, construct pivotal confidence intervals and tests for relevant deviations from the model. Finite-sample simulations demonstrate accurate level approximation and good power properties of the proposed procedures.

2026-07-16T12:45:35Z Holger Dette Philip Dörr http://arxiv.org/abs/2607.14880v1 Measuring Spatial Clustering via Metropolis-Hastings Diffusion Distance 2026-07-16T11:55:50Z

We propose a novel measure of the discrepancy between two probability distributions $f$ and $g$ on a graph - which we call the diffusion distance - that measures the rate of convergence of $f$ to $g$ under a graph-constrained Markov chain with stationary distribution $g$. As a default choice for this Markov chain, we use the Metropolis-Hastings transition matrix targeting $g$ with proposals given by a random walk on the graph. Our primary case of interest is when the second distribution $g$ is uniform, in which case the diffusion distance becomes a measure of spatial clustering in $f$. Used in this way, (Metropolis-Hastings) diffusion distance to uniformity extends Moran's $I$-type measures of spatial autocorrelation by incorporating global graph geometry rather than just local patterns. Indeed, Moran's $I$, the most well-known measure of spatial autocorrelation, can be viewed as a one-step heuristic for diffusion distance, so long as specific spatial weights are used. We establish theoretical bounds and a stability result for our measure, connecting it to graph spectra and optimal transport. We then turn our attention to outlining a statistical test for spatial clustering using diffusion distance. Under permutation null models, we derive high-probability bounds on diffusion distance underpinned by exact spectral formulas for convergence of distributions, enabling an efficient statistical test for spatial clustering on large datasets. We empirically compare diffusion distance to Moran's $I$ both as a numerical measure and as a statistical test. We show that diffusion distance exhibits higher power on synthetic data using a stochastic block model. Empirical analysis of Black population distributions for 100 U.S. cities shows that diffusion distance detects subtle differences in urban segregation patterns that Moran's $I$ does not.
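
As a rough illustration of the Markov chain described in this abstract, the sketch below builds a Metropolis-Hastings transition matrix that targets a distribution `g` using simple random-walk proposals on a graph, then tracks how a starting distribution `f` approaches `g`. It is a sketch under stated assumptions: the graph is given as adjacency lists without self-loops, `g` is strictly positive, and total variation is used only as a convenient progress measure. The paper's exact definition of diffusion distance is not reproduced here, and all function names are hypothetical.

```python
def mh_matrix(adj, g):
    """Metropolis-Hastings transition matrix with random-walk proposals.

    From vertex i, a neighbor j is proposed with probability 1/deg(i) and
    accepted with probability min(1, g[j] * deg(i) / (g[i] * deg(j))).
    """
    n = len(adj)
    P = [[0.0] * n for _ in range(n)]
    for i in range(n):
        deg_i = len(adj[i])
        for j in adj[i]:
            deg_j = len(adj[j])
            accept = min(1.0, (g[j] * deg_i) / (g[i] * deg_j))
            P[i][j] = accept / deg_i
        # Rejected proposals stay at i (diagonal is still 0.0 when summed).
        P[i][i] = 1.0 - sum(P[i])
    return P

def step(f, P):
    """One step of the chain applied to a row distribution f."""
    n = len(f)
    return [sum(f[i] * P[i][j] for i in range(n)) for j in range(n)]

def total_variation(f, g):
    return 0.5 * sum(abs(a - b) for a, b in zip(f, g))

if __name__ == "__main__":
    adj = [[1], [0, 2], [1, 3], [2]]   # path graph on 4 vertices (assumed example)
    g = [0.25, 0.25, 0.25, 0.25]       # uniform target
    f = [1.0, 0.0, 0.0, 0.0]           # all mass clustered on one vertex
    P = mh_matrix(adj, g)
    for t in range(11):
        if t % 5 == 0:
            print(t, total_variation(f, g))
        f = step(f, P)
```

A distribution that starts concentrated on a few vertices takes longer to mix toward the uniform target than a spread-out one, which is the intuition behind using convergence speed as a clustering measure.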

2026-07-16T11:55:50Z Thomas Weighill Chidinma Williams http://arxiv.org/abs/2607.14814v1 Post Hoc Inference for Component Attribution in Multivariate Change-Point Detection 2026-07-16T10:30:52Z

We consider the post-detection analysis of change-points for multivariate time series, with the goal of identifying which coordinates are responsible for a detected change. After a change-point has been located by an offline detection algorithm, we propose post hoc statistical procedures to determine whether the change occurs in either of two predefined blocks of coordinates or in both. Our methods rely on two-sample testing procedures with a particular focus on nonparametric tests; we provide theoretical guarantees for Type I error control. Simulations and a real-data experiment demonstrate the strong performance of the proposed procedures.

2026-07-16T10:30:52Z 44 pages, 18 figures Dhia-Elhaq Ouerfelli Sylvain Arlot Kevin Bleakley Patrick Pamphile http://arxiv.org/abs/2607.14812v1 No Universal Multiplicative FDR Bound for the Benjamini-Hochberg Procedure with Correlated Two-Sided Gaussian Tests 2026-07-16T10:30:15Z

We study the worst-case false discovery rate of the Benjamini-Hochberg procedure applied to two-sided Gaussian p-values when the correlation matrix is otherwise unrestricted. Dobriban [2026] shows that BH does not always control the FDR at its nominal level. An analogous folklore conjecture is that BH controls the FDR up to a universal multiplicative constant. We prove that this conjecture is false. In particular, we construct Gaussian models for which the inflation factor $\mathrm{FDR}(\mathrm{BH}_q)/q$ diverges as $q$ tends to 0. More precisely, for all sufficiently small $q$, the supremum of $\mathrm{FDR}(\mathrm{BH}_q)$ over the number of hypotheses, mean vector, and correlation matrix is at least $cq\sqrt{\log(1/q)}$ for a universal constant $c > 0$. Finally, for a broad class of common-factor Gaussian models with arbitrary means and loadings, we prove the matching-order upper bound $\mathrm{FDR}(\mathrm{BH}_q) = O(q\sqrt{\log(1/q)})$, and hence the lower bound is sharp in order for this class.
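
For readers unfamiliar with the procedure under study, the following is a minimal sketch of the standard Benjamini-Hochberg step-up rule applied to two-sided Gaussian p-values. It does not reproduce the paper's counterexample construction; the function names and the example z-scores are hypothetical.

```python
import math

def two_sided_p(z):
    """Two-sided p-value 2 * (1 - Phi(|z|)) for a standard normal z-score."""
    return math.erfc(abs(z) / math.sqrt(2.0))

def benjamini_hochberg(pvals, q):
    """Indices rejected by the BH step-up rule at nominal level q.

    Sort the p-values, find the largest rank k with p_(k) <= q * k / m,
    and reject the k hypotheses with the smallest p-values.
    """
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    k = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= q * rank / m:
            k = rank  # keep the largest rank that passes (step-up)
    return sorted(order[:k])

if __name__ == "__main__":
    z_scores = [3.5, -2.9, 0.4, 1.1, -0.2, 2.2]  # illustrative values only
    pvals = [two_sided_p(z) for z in z_scores]
    print(benjamini_hochberg(pvals, q=0.1))
```

Because the rule is step-up, a hypothesis can be rejected even when its own p-value exceeds the threshold at a lower rank, provided some larger rank passes.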

2026-07-16T10:30:15Z 16 pages Lihua Lei http://arxiv.org/abs/2607.14680v1 Operator-Split Bayesian Learning for Elliptic PDEs with Unequal Interior and Boundary Data 2026-07-16T07:43:15Z

We propose an operator-split Bayesian learning framework for second-order uniformly elliptic Dirichlet problems with unequal numbers of interior and boundary observations. The data consist of noisy measurements of the source in the domain and noisy measurements of the boundary values. Independent Bayesian neural-network (BNN) priors are assigned to these two quantities, and the resulting product posterior is pushed forward through the elliptic solution operator. We prove that the posterior induced by this construction contracts around the true solution. The contraction radius separates a domain contribution, governed by the second-order elliptic operator, from a boundary contribution, governed by the intrinsic dimension of the boundary. Together with the minimax lower bound of \cite{ZhaoLu2026}, this yields a near-minimax upper bound up to logarithmic factors. Our numerical experiments illustrate the propagation of source and boundary uncertainty and the effects of unequal sampling budgets on the posterior reconstruction.

2026-07-16T07:43:15Z 28 pages, 7 figures, 3 tables Emmanuel E. Oguadimma http://arxiv.org/abs/2607.14460v1 Precise sample covariance spectral norm error -- an RDT view 2026-07-16T01:16:31Z

We study the sample covariance error of centered Gaussians. A remarkable breakthrough [66] established the correct error scaling order and explicitly revealed the critical role of both the effective rank and the true covariance spectrum. In this work, we move beyond scaling characterizations and determine the precise limiting value of the error's spectral norm. To do so, we develop a generic framework based on Random Duality Theory (RDT). Within this framework, we first determine closed-form, explicit RDT-based upper bounds. We then establish complementary lower bounds by introducing a novel bilinear-quadratic RDT lower-bounding mechanism. By combining this mechanism with a two-replica systems bounding strategy, we show that our lower and upper bounds match in large-dimensional contexts. Our theoretical results are supplemented with numerical evaluations and simulations, demonstrating an excellent agreement already for problem sizes on the order of thousands.

2026-07-16T01:16:31Z Mihailo Stojnic http://arxiv.org/abs/2604.25202v2 Geometry of tail allocation in conformal prediction intervals 2026-07-15T21:41:39Z

Lower and upper errors of a two-sided conformal prediction interval can have different scientific consequences. The division of target miscoverage between the two endpoints determines the corresponding tail-specific guarantees and can alter interval length at first order when tail scales differ. We characterize this allocation-length relation after separate one-sided split calibration, which preserves the tail-specific guarantees and marginal coverage whenever the allocation is selected independently of the calibration sample. Tail-quantile response to proportional rescaling determines the resulting length geometry. For regularly varying tails, normalized length converges to $g_\gamma(c)=c^{-\xi}+\gamma(1-c)^{-\xi}$, where $c$ is the upper-tail allocation fraction, $\xi$ is the tail index, and $\gamma$ is the lower-to-upper tail-scale ratio. A dominant tail produces a boundary optimum and makes the equal-tail interval asymptotically $2^\xi$ times as long as the optimum. Comparable tails produce an interior optimum, with equal-tail allocation optimal only at matching scales. An empirical allocation rule attains the corresponding optimum without estimating tail parameters. In the de Haan class the effect moves to an additive scale. Calibration resolution determines whether ordinary ranks can realize these allocations. When calibration tail counts remain bounded, two-sided rank feasibility also constrains the allocation. Tail homogeneity transfers the length relation over covariates, while opposite dominant tails preclude one globally efficient allocation.
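
The limiting length function quoted in this abstract can be explored numerically. The sketch below is a calculation from the stated formula only, assuming a tail index and scale ratio that are both strictly positive; it is not the paper's empirical allocation rule. Setting the derivative of the length function to zero gives the interior minimizer c* = 1 / (1 + r) with r = gamma^(1 / (1 + xi)), and the minimum value (1 + r)^(1 + xi). The names `xi` and `gamma` stand for the abstract's tail index and scale ratio; the function names are hypothetical.

```python
def length(c, xi, gamma):
    """Normalized limiting length c^(-xi) + gamma * (1 - c)^(-xi), for 0 < c < 1."""
    return c ** (-xi) + gamma * (1.0 - c) ** (-xi)

def optimal_fraction(xi, gamma):
    """Interior minimizer from the first-order condition (requires gamma > 0)."""
    r = gamma ** (1.0 / (1.0 + xi))
    return 1.0 / (1.0 + r)

def optimal_length(xi, gamma):
    """Closed-form minimum value (1 + r)^(1 + xi), with r = gamma^(1 / (1 + xi))."""
    r = gamma ** (1.0 / (1.0 + xi))
    return (1.0 + r) ** (1.0 + xi)

def equal_tail_ratio(xi, gamma):
    """Length of the equal-tail allocation (c = 1/2) relative to the optimum."""
    return length(0.5, xi, gamma) / optimal_length(xi, gamma)

if __name__ == "__main__":
    xi = 0.5  # illustrative tail index
    for gamma in (1.0, 0.25, 1e-3, 1e-9):
        print(gamma, optimal_fraction(xi, gamma), equal_tail_ratio(xi, gamma))
```

With matching scales (`gamma = 1`) the minimizer is 1/2 and the ratio is 1, while as `gamma` shrinks toward zero the minimizer moves toward the boundary and the ratio approaches 2 raised to the power `xi`, consistent with the statements in the abstract.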

2026-04-28T04:14:27Z Tianying Wang