https://arxiv.org/api/YzyJns5rPpcuUlPJ6EiW4nV78Mk2026-03-20T15:54:19Z646446015http://arxiv.org/abs/2603.16700v1Nonlinear Information Theory: Characterizing Distributional Uncertainty in Communication Models with Sublinear Expectation2026-03-17T15:53:19ZA mathematical framework for information-theoretic analysis is established, with a new viewpoint of describing transmitted messages and communication channels by the nonlinear expectation theory, beyond the framework of classical probability theory. The major motivation of this research is to emphasize the probabilistic distribution uncertainty within the ever increasingly complex communication networks, where random phenomena are often nonstationary, heterogeneous, and cannot be characterized by a single probability distribution. Based on the nonlinear expectation theory, in this paper we first explicitly define several fundamental concepts, such as nonlinear information entropy, nonlinear joint entropy, nonlinear conditional entropy and nonlinear mutual information, and establish their basic properties. Secondly, by using the strong law of large numbers under sublinear expectations, we propose a nonlinear source coding theorem, which shows that the nonlinear information entropy is the upper bound of the achievable coding rate of sources whose distributions are uncertain under the maximum error probability criterion, and determines a cluster point of the coding rate of such sources under the minimum error probability criterion. Thirdly, we propose a nonlinear channel coding theorem, which gives the explicit expression of the upper bound under the maximum error probability criterion and a cluster point under the minimum error probability criterion, respectively, for the achievable coding rate of communication channels whose distributions are uncertain. Additionally, we propose a nonlinear rate-distortion source coding theorem, proving that the rate distortion function based on the nonlinear mutual information is a cluster point of the lossy compression performance of uncertain-distribution sources under the minimum expected distortion criterion.2026-03-17T15:53:19Z48 pages,8 figuresWen-Xuan LangShaoshi YangJianhua ZhangZhiming Mahttp://arxiv.org/abs/2603.16678v1Sharp Threshold for the Convergence of Nonstationary Averaging2026-03-17T15:42:04ZWe study non-stationary averaging processes, where each term of a sequence is a weighted average of previous terms, namely $a_{n+1} = \sum_{j=1}^n p_n(j) a_j$. Our results extend classical theory in two distinct regimes. First, we prove a sharp threshold for convergence in the regime where the weights are bounded between two envelopes $(\log n)^{-α} \le np_n(\cdot) \leq (\log n)^β$. We show that the sequence necessarily converges when $α+ β/ 2 \leq 1$, while $α+ β/ 2 > 1$ the convergence can fail. Second, we study complementary fixed shape regime, when $p_n$ is obtained by a fixed limiting density on $(0,1)$. We show that under mild regularity assumptions, the sequence converges.2026-03-17T15:42:04Z29 pagesSaba LepsveridzeElchanan Mosselhttp://arxiv.org/abs/2603.16675v1Relation between Hitting Times and Probabilities for Imprecise Markov Chains2026-03-17T15:39:05ZIn the present paper, we investigate the relationship between hitting times and hitting probabilities in discrete-time imprecise Markov chains (IMCs). We define lower and upper hitting times and probabilities for IMCs whose set of transition matrices $\T$ is compact, convex, and has separately specified rows. Building on reachability-based partitions of the state space, we prove two key implications: (i) finiteness of the upper expected hitting time entails the lower hitting probability equals one, and (ii) finiteness of the lower expected hitting time entails the upper hitting probability equals one. We further show an equivalence: the upper expected hitting time is finite if and only if the lower hitting probability is one. Finally, by presenting a counterexample, we show that the converse of the second implication can fail.2026-03-17T15:39:05ZUnder submission for International Conference on Soft Methods in Probability and Statistics (SMPS)Marco SangalliErik QuaeghebeurThomas Krakhttp://arxiv.org/abs/2505.09647v2On Unbiased Low-Rank Approximation with Minimum Distortion2026-03-17T15:37:09ZWe describe an algorithm for sampling a low-rank random matrix $Q$ that best approximates a fixed target matrix $P\in\mathbb{C}^{n\times m}$ in the following sense: $Q$ is unbiased, i.e., $\mathbb{E}[Q] = P$; $\mathsf{rank}(Q)\leq r$; and $Q$ minimizes the expected Frobenius norm error $\mathbb{E}\|P-Q\|_F^2$. Our algorithm mirrors the solution to the efficient unbiased sparsification problem for vectors, except applied to the singular components of the matrix $P$. Optimality is proven by showing that our algorithm matches the error from an existing lower bound.2025-05-12T20:52:28ZLeighton Pate BarnesStephen CameronBenjamin Howardhttp://arxiv.org/abs/2603.16665v1Lower and Upper Expected Hitting Times for Weighted Imprecise Markov Chains2026-03-17T15:31:53ZIn this paper, we extend hitting times for imprecise Markov chains to the framework of weighted imprecise Markov chains (WIMCs), in which each transition is associated with a strictly positive weight encoded by a matrix $W$. Given a convex set $\mathcal{T}$ of admissible transition matrices, we define lower and upper expected hitting times for WIMCs as the infimum and supremum of the (weighted) expected hitting times over $\mathcal{T}$, and we characterise these quantities as the unique solutions of nonlinear fixed-point equations. We show that any weighted hitting time problem can be transformed into an unweighted hitting time problem on an augmented state space, enabling the reuse of existing IMC theory and algorithms. In particular, we are able to adapt known iterative methods for the numerical computation of expected hitting times for WIMCs.2026-03-17T15:31:53ZUnder submission for International Conference on Soft Methods in Probability and Statistics (SMPS)Marco SangalliThomas Krakhttp://arxiv.org/abs/2512.16696v2Computing Lower and Upper Hitting Probabilities for Imprecise Markov Chains2026-03-17T15:20:22ZWe study the computation of lower and upper probabilities of hitting a target set of states for imprecise Markov chains, where transition uncertainty is modelled by a convex set of transition matrices. In the precise case, hitting probabilities are the minimal nonnegative solution of a linear system and admit a closed-form expression. We investigate the notion of reachability in the imprecise setting. The literature review highlights several different definitions of lower reachability; thus, we explore the relations among them and present examples to clarify their logical implications. Using this revised definition of reachability for imprecise Markov chains, we partition the state space into classes of states whose hitting probabilities are trivially zero or one, and those which require further computation. For these nontrivial states, we show that the lower hitting probability is the unique solution of a nonlinear fixed-point equation, while the same does not hold for upper hitting probabilities. For the practical computation of lower and upper hitting probabilities, we propose iterative algorithms that alternate between solving a linear system and choosing an extreme point from the set of transition matrices. Numerical experiments demonstrate that, in practice, these algorithms converge in substantially fewer iterations than the theoretically established worst-case bound.2025-12-18T16:00:06ZPreprint for International Journal of Approximate Inference (IJAR)Marco SangalliErik QuaeghebeurThomas Krakhttp://arxiv.org/abs/2601.09950v2Recursive Packing Bounds for Supercritical Disconnection in Bernoulli Site Percolation2026-03-17T14:42:55ZFor Bernoulli site percolation on an infinite, connected, locally finite graph $G=(V,E)$, we obtain quantitative upper bounds on the supercritical disconnection probability \[ \mathbb{P}_p(S\nleftrightarrow\infty) \] for arbitrary finite or infinite sets $S\subset V$ and all $p>p^{\mathrm{site}}_c(G)$.
The key quantity is a recursive packing number $\mathbf{PK}_{p,\eps,c}(S)$. It is the maximal number of vertices that can be extracted from $S$ so that, after deleting witness balls around the previously chosen vertices, each selected vertex still connects to infinity with probability at least $c$, while its failure to connect to infinity is already detected, up to a factor $1+\eps$, by failure to reach the inner boundary of its witness ball. Thus $\mathbf{PK}_{p,\eps,c}(S)$ counts essentially independent local witnesses for the global event $\{S\nleftrightarrow\infty\}$.
We prove the structural estimate \[ \mathbb{P}_p(S\nleftrightarrow\infty) \le \frac{\eps(1-c)}{c} +(1-c)^{\mathbf{PK}_{p,\eps,c}(S)}. \] Combining this bound with the local functional characterization of $p^{\mathrm{site}}_c(G)$ from \cite{ZL24} yields an explicit supercritical estimate valid on every infinite, connected, locally finite graph.
We also illustrate the packing number on ray-homogeneous trees. In particular, sparse finite subsets of a distinguished ray have packing number equal to their cardinality, both for regular trees and for a non-regular decorated spine. This shows that the packing number is explicit on concrete graph families.2026-01-15T00:21:22ZZhongyang Lihttp://arxiv.org/abs/2503.14978v2Inferring diffusivity from killed diffusion2026-03-17T14:00:23ZWe consider diffusion of independent molecules in an insulated Euclidean domain with unknown diffusivity parameter. At a random time and position, the molecules may bind and stop diffusing in dependence of a given `binding potential'. The binding process can be modeled by an additive random functional corresponding to the canonical construction of a `killed' diffusion Markov process. We study the problem of conducting inference on the infinite-dimensional diffusion parameter from a histogram plot of the `killing' positions of the process. We show first that these positions follow a Poisson point process whose intensity measure is determined by the solution of a certain Schrödinger equation. The inference problem can then be re-cast as a non-linear inverse problem for this PDE, which we show to be consistently solvable in a Bayesian way under natural conditions on the initial state of the diffusion, provided the binding potential is not too `aggressive'. In the course of our proofs we obtain novel posterior contraction rate results for high-dimensional Poisson count data that are of independent interest. A numerical illustration of the algorithm by standard MCMC methods is also provided.2025-03-19T08:16:16Z33 pages, to appear in the Annals of StatisticsRichard NicklFanny Seizilleshttp://arxiv.org/abs/2603.02982v2Well-posedness, mean attractors and invariant measures of stochastic discrete long-wave-short-wave resonance equations driven by locally Lipschitz nonlinear noise2026-03-17T13:45:07ZThis paper is devoted to investigating the random dynamics of stochastic discrete long-wave-short-wave resonance equations, which are characterized by the following features: $(1)$ the equations contain locally Lipschitz nonlinear coupling terms $u_mv_m$ and $(B(|u(t)|^2))_m$ for $m\in \mathbb{Z}$; $(2)$ the nonlinear coefficients of noises satisfy local Lipschitz conditions; and $(3)$ the system couples real and complex equations and is infinite-dimensional. These inherent structural properties prevent the analysis from being carried out in a standard Bochner product space of the same order and make it difficult to directly verify the tightness of the distribution family of solutions. To address these challenges, we adopt a higher-order Bochner product space $L^4(Ω,\ell_c^2)\times L^2(Ω,\ell^2)$ as the phase space and employ the technique of uniform tail-end estimates. The main results include: establishing the global well-posedness of the nonautonomous stochastic discrete long-wave-short-wave resonance equations driven by nonlinear noise in $L^4(Ω,\ell_c^2)\times L^2(Ω,\ell^2)$; based on this, defining the mean random dynamical system and proving the existence and uniqueness of weak $\mathscr{D}$-pullback mean random attractors. When the external forcing terms are independent of time and sample, we investigate the existence of invariant measures for the corresponding autonomous system and examine the limiting behavior of the invariant measure as the noise intensity tends to zero.2026-03-03T13:35:19ZXia PanJianhua HuangJuntao WuJiangwei Zhanghttp://arxiv.org/abs/2411.01983v3Real-world models for multiple term structures: a unifying HJM semimartingale framework2026-03-17T12:53:48ZWe develop a unified framework for modeling multiple term structures arising in financial, insurance, and energy markets, adopting an extended Heath-Jarrow-Morton (HJM) approach under the real-world probability. We study market viability and characterize the set of local martingale deflators. We conduct an analysis of the associated stochastic partial differential equation (SPDE), addressing existence and uniqueness of solutions, invariance properties and existence of affine realizations.2024-11-04T11:10:20Z47 pagesClaudio FontanaEckhard PlatenStefan Tappehttp://arxiv.org/abs/2405.19553v2Convergence Bounds for Sequential Monte Carlo on Multimodal Distributions using Soft Decomposition2026-03-17T12:42:59ZWe prove bounds on the variance of a function $f$ under the empirical measure of the samples obtained by the Sequential Monte Carlo (SMC) algorithm, with time complexity depending on local rather than global Markov chain mixing dynamics. SMC is a Markov Chain Monte Carlo (MCMC) method, which starts by drawing $N$ particles from a known distribution, and then, through a sequence of distributions, re-weights and re-samples the particles, at each instance applying a Markov chain for smoothing. In principle, SMC tries to alleviate problems from multi-modality. However, most theoretical guarantees for SMC are obtained by assuming global mixing time bounds, which are only efficient in the uni-modal setting. We show that bounds can be obtained in the truly multi-modal setting, with mixing times that depend only on local MCMC dynamics.2024-05-29T22:43:45ZHolden LeeMatheau Santana-Gijzenhttp://arxiv.org/abs/2603.16454v1The largest $K_r$-free set of vertices in a random graph2026-03-17T12:36:01ZFor $r \ge 2$ and a graph $G$, let $α_{r}(G)$ be the maximum number of vertices in a $K_r$-free subgraph of $G$. We investigate the value $α_{r}(G)$ when $G$ is the random graph $G \sim G_{n, 1/2}$ and discover the following phenomenon: with high probability, $α_r(G)$ lies in an interval of constant length that varies in a non-monotonic fashion from $1$ to $\lfloor r/2\rfloor+1$ depending on the value of $n$. The special case $r=2$ corresponds to the independence number of random graphs which is well-known to have two-point concentration; our results therefore extend and generalize this basic fact in random graph theory, showing more complicated behavior when $r>2$. We also prove similar results where $K_r$ is replaced by any color critical graph like $C_5$.2026-03-17T12:36:01Z30 pages, 2 figuresTom BohmanMarcus MichelenDhruv Mubayihttp://arxiv.org/abs/2603.16431v1On central limit theorems for Ewens--Pitman model2026-03-17T12:09:13ZWe establish a quenched functional central limit theorem for the total number of components of random partitions induced by Chinese restaurant process with parameters $(α,θ), α\in(0,1), θ>-α$. With $P_j$ denoting the asymptotic frequency of $j$-th table, it is well-known that the component count has the same law as the occupancy count of an infinite urn scheme with sampling frequencies being $(P_j)_{j\in\mathbb N}$. Our analysis follows this approach and is based on earlier results of Karlin (1967) and Durieu and Wang (2016). In words, our result reveals that the fluctuations of component count consist of two parts, one due to the sampling effect given the asymptotic frequencies $(P_j)_{j\in\mathbb N}$, the other due to the fluctuations of the random asymptotic frequencies, and in the limit the fluctuations of two parts are conditionally independent given the $α$-diversity. Our result strengthens a recent central limit theorem obtained by Bercu and Favaro (2024) via a different method.2026-03-17T12:09:13Z20 pagesYizao Wanghttp://arxiv.org/abs/2602.08581v2Random Polyhedral Cones I: Distributional Results via Gale Duality2026-03-17T12:03:33ZLet $U_1,\ldots,U_n$ be independent random vectors uniformly distributed on the unit sphere $\mathbb S^{d-1}\subseteq\mathbb R^d$, where $n\ge d$, and consider the random polyhedral cone \[ \mathcal W_{n,d}:=\mathop{\mathrm{pos}} (U_1,\ldots,U_n) = \{λ_1 U_1+ \ldots + λ_n U_n: λ_1\geq 0, \ldots, λ_n \geq 0\}. \] We establish several distributional results for $\mathcal W_{n,d}$ and the associated spherical polytope $\mathcal W_{n,d}\cap\mathbb S^{d-1}$. Our main contributions include:
(i) Let $α_d$ denote the solid angle of $\mathcal W_{d,d}$ and write $m(d,k):=\mathbb E[α_d^k]$ for its $k$-th moment. We prove the symmetry $m(d,k)=m(k,d)$. As an application, we compute $\mathop{\mathrm{Var}}[α_d]=2^{-d}(d+1)^{-1}-4^{-d}$ and derive a closed formula for the third moment.
(ii) For $n=d+1,d+2,d+3$ we determine the probability that $\mathcal W_{n,d}\cap\mathbb S^{d-1}$ is a spherical simplex, a spherical analogue of the classical Sylvester problem. In the case $n=d+2$ we also determine the distribution of the number of vertices of $\mathcal W_{d+2,d}\cap\mathbb S^{d-1}$.
(iii) Let $f_\ell(\mathcal W_{n,d})$ denote the number of $\ell$-dimensional faces of $\mathcal W_{n,d}$. We prove a distributional limit theorem for $f_\ell(\mathcal W_{n,d})$ in the regime $n=d+k$ and $\ell=d-q$, where $k,q\in\mathbb N$ are fixed and $d\to\infty$. The limit law is a weighted sum of independent chi squared variables, with weights given by explicit eigenvalues of a convolution operator on the sphere.
A unifying ingredient is an explicit coupling producing i.i.d. uniform vectors $U_1,\ldots,U_n\in\mathbb S^{d-1}$ together with i.i.d. uniform vectors $V_1,\ldots,V_n\in\mathbb S^{n-d-1}$ whose associated oriented matroids are Gale dual.2026-02-09T12:20:11Z36 pagesZakhar Kabluchkohttp://arxiv.org/abs/2506.01324v3Near-Optimal Clustering in Mixture of Markov Chains2026-03-17T12:00:13ZWe study the problem of clustering $T$ trajectories of length $H$, each generated by one of K unknown ergodic Markov chains over a finite state space of size $S$. We derive an instance-dependent, high-probability lower bound on the clustering error rate, governed by the stationary-weighted KL divergence between transition kernels. We then propose a two-stage algorithm: Stage I applies spectral clustering via a new injective Euclidean embedding for ergodic Markov chains, a contribution of independent interest enabling sharp concentration results; Stage II refines clusters with a single likelihood-based reassignment step. We prove that our algorithm achieves near-optimal clustering error with high probability under reasonable requirements on $T$ and $H$. Preliminary experiments support our approach, and we conclude with discussions of its limitations and extensions.2025-06-02T05:10:40ZAISTATS 2026 (50 pages, 6 figures) (ver3: camera-ready version, major revisions)Junghyun LeeYassir JedraAlexandre ProutièreSe-Young Yun