https://arxiv.org/api/ntx3ucekf7t8J9TUnHEzUK/tgIA2026-06-21T20:27:48Z5484121015http://arxiv.org/abs/2606.10871v1A New Invariant for Prime Alternating Knots From Error-Correcting Codes2026-06-09T13:47:53ZThis paper shows that the Alexander-Briggs code of a knot gives rise to a new invariant that distinguishes prime alternating knots. The restriction to prime alternating knots precisely follows from the fact that our approach relies on Tait s flyping theorem. We also provide examples where the new invariant succeeds in separating knots that the well known invariants, such as some knot polynomials, fail.2026-06-09T13:47:53ZAltan B. KilicRuud PellikaanAlberto Ravagnanhttp://arxiv.org/abs/2606.11280v1Designed-Source Reductions and a Dual-Purpose Feasibility Band for Semantic Rate-Distortion2026-06-09T13:46:47ZThe joint rate-distortion framework of Stavrou and Kountouris (IEEE Transactions on Communications 2023) characterises dual-fidelity tradeoffs for semantic communication on stochastic semantic sources. Many task-oriented communication systems instead use designed sources, where the semantic object is a deterministic oracle allocation $φ^(t)$ rather than a stochastic quantity given by nature. We isolate the subclass of designed sources under smooth concave utility with assumptions A1, A2 and Euclidean allocation codomain, and restrict the encoder class to deterministic common-category mappings. Within this subclass the SK exponential-tilting decoder and generalised Blahut--Arimoto iteration specialise to conditional-mean decoding and Lloyd--Max stationarity on $φ^(t)$. When the second fidelity is a monotone single-letter distortion, the joint problem stays inside the SK admissible class; the common-category SK rate is lower-bounded by the max of the corresponding Shannon rate-distortion functions, with equality only when the common-category reconstruction is compatible and RDF-optimal. When the second fidelity is aggregate verification, the joint problem leaves the SK single-letter class and admits a constrained-design feasibility band $R_{\min}(\varepsilon^) \leq R \leq R_{\max}(β^)$ of width $\log_2(K_{\max}/K_{\min})$ bits in partition cardinality. The reduction and the band are scope statements on the SK apparatus, not modifications to it. A smart-grid economic-dispatch example with a non-technical-loss-detection contrast illustrates the band.2026-06-09T13:46:47ZJoss Armstronghttp://arxiv.org/abs/2606.10780v1Secure Aggregation with Top-K Sparsification in Decentralized Federated Learning2026-06-09T12:33:53ZSecure aggregation is a vital component for mitigating gradient leakage in federated learning, but its communication cost conventionally scales with the gradient dimension. This becomes prohibitive for large models and even more pronounced in decentralized federated learning with limited bandwidth and unreliable nodes. Top-K gradient sparsification is an effective approach to reduce communication by transmitting only a few entries of the full gradient, while maintaining competitive model accuracy. Nevertheless, the top-K entries selected by each user are unpredictable and vary across users, which poses a challenge for efficient sparse secure aggregation. This paper studies information-theoretic secure aggregation with top-K sparsification in decentralized federated learning under user dropouts and user collusion. We propose a communication-efficient sparse secure aggregation scheme that offloads dimension-dependent overhead to an offline phase and protects private gradients using random masks and permutations. Experimental results demonstrate that our scheme preserves accuracy comparable to full-gradient aggregation even with only 1% gradient sparsification, while substantially reducing the communication cost.2026-06-09T12:33:53Z6 pages, 1 figure, accepted to IEEE ISIT 2026Hengxuan TangJinbao ZhuXiaohu Tanghttp://arxiv.org/abs/2606.03942v2Stability Analysis for Autoregressive Sampling Sets2026-06-09T11:13:04ZMotivated by recent developments in stochastic modeling of clock jitter in Analog-to-Digital Converters (ADCs) as autoregressive processes of order one (AR(1)), we study the density and stability properties of AR(1)-jittered sampling sets for Paley-Wiener signals. We show that, despite having the correct asymptotic density both on average and almost surely, such sets almost surely fail to be stable sampling sets. We complement this negative result with a finite-dimensional analysis, showing that the corresponding jittered sinc matrices are nonetheless well-conditioned with high probability.2026-06-02T17:30:47ZComments are welcome! v2: we clarified some notation in the proof of Theorem 2Daniele GerosaThomas Erikssonhttp://arxiv.org/abs/2606.10633v1On concatenation of matrices for reversible linear codes over a finite commutative ring and applications to DNA codes2026-06-09T09:38:07ZIn this paper, we develop a generalized framework for constructing reversible linear, reversible self dual and reversible DNA codes using a matrix-theoretic approach based on involutory matrices. The proposed concatenation scheme gives a large class of generator matrices and yields codes with good parameters. The construction is carried out at the level of linear codes and then extended to DNA codes. Using a matrix product approach, we provide a unified method for analysis and proof. Further, we resolve an open problem raised by Oztas et.al. and also we correct and improve some results of them.2026-06-09T09:38:07ZAvanish Kumar ChaturvediSatyadeep Pandeyhttp://arxiv.org/abs/2509.25854v2Delay-Doppler Domain Channel Measurements and Modeling in High-Speed Railways2026-06-09T08:02:55ZAs next-generation wireless communication systems need to be able to operate in high-frequency bands and high-mobility scenarios, delay-Doppler (DD) domain multicarrier (DDMC) modulation schemes, such as orthogonal time frequency space (OTFS), demonstrate superior reliability over orthogonal frequency division multiplexing (OFDM). Accurate DD domain channel modeling is essential for DDMC system design. However, since traditional channel modeling approaches are mainly confined to time, frequency, and space domains, the principles of DD domain channel modeling remain poorly studied. To address this issue, we propose a systematic DD domain channel measurement and modeling methodology in high-speed railway (HSR) scenarios. First, we design a DD domain channel measurement method based on the long-term evolution for railway (LTE-R) system. Second, for DD domain channel modeling, we investigate quasi-stationary interval, statistical power modeling of multipath components, and particularly, the quasi-invariant intervals of DD domain channel fading coefficients. Third, via LTE-R measurements at 371 km/h, taking the quasi-stationary interval as the decision criterion, we establish DD domain channel models under different channel time-varying conditions in HSR scenarios. Fourth, the accuracy of proposed DD domain channel models is validated via bit error rate comparison of OTFS transmission. In addition, simulation verifies that in HSR scenario, the quasi-invariant interval of DD domain channel fading coefficient is on millisecond (ms) order of magnitude, which is much smaller than the quasi-stationary interval length on 100 ms order of magnitude. This study could provide theoretical guidance for DD domain modeling in high-mobility environments, supporting future DDMC and integrated sensing and communication designs for 6G and beyond.2025-09-30T06:48:01Z15 pages, 10 figuresin IEEE Transactions on Wireless Communications, vol. 25, pp. 15725-15740, 2026Hao ZhouYiyan MaDan FeiWeirong LiuZhengyu ZhangMi YangGuoyu MaYunlong LuRuisi HeGuoyu WangCheng LiZhaohui SongBo Ai10.1109/TWC.2026.3684101http://arxiv.org/abs/2411.02817v2Conditional Vendi Score: Prompt-Aware Diversity Evaluation for Generative AI Models and LLMs2026-06-09T06:28:35ZGenerative models guided by text prompts are widely evaluated for fidelity and prompt alignment, yet their ability to produce outputs remains underexplored. Existing diversity metrics such as Vendi and RKE, which are based on the von Neumann and Rényi entropies of kernel matrices, were developed for unconditional models and cannot distinguish prompt-induced from model-induced variability. We address this gap by introducing \textit{Conditional-Vendi} and \textit{Conditional-RKE}, diversity measures derived from the conditional entropy of positive semidefinite matrices. These scores isolate model-induced diversity in prompt-guided generation, with Conditional-RKE enjoying an $O(1/\sqrt{n})$ convergence rate. For Conditional-Vendi, we introduce a truncated-spectrum approximation that yields scalable and consistent estimates. Experiments on text-to-image, image-captioning, and LLM tasks show that the conditional scores recover ground-truth diversity orderings and can also guide diffusion models toward more diverse samples. The codebase is available at https://github.com/mjalali/conditional-vendi.2024-11-05T05:30:39ZMohammad JalaliAzim OspanovAmin GohariFarzan Farniahttp://arxiv.org/abs/2512.16772v3Thermodynamics a la Souriau on Kähler Non Compact Symmetric Spaces for Cartan Neural Networks2026-06-09T06:08:53ZIn this paper, we clarify several issues concerning the abstract geometrical formulation of thermodynamics on non compact symmetric spaces $\mathrm{U/H}$ that are the mathematical model of hidden layers in the new paradigm of Cartan Neural Networks. We introduce a distinction between the generalized thermodynamics associated with Dynamical Systems and the challenging proposal of Gibbs probability distributions on $\mathrm{U/H}$ provided by generalized thermodynamics {à} la Souriau. Main result is the proof that $\mathrm{U/H}$.s supporting Gibbs distributions are only the Kähler ones. For the latter, we solve the problem of determining the space of temperatures, namely of Lie algebra elements for which the partition function converges. The space of generalized temperatures is the orbit under the adjoint action of $\mathrm{U}$ of a positivity domain in the Cartan subalgebra $C_c\subset\mathbb{H}$ of the maximal compact subalgebra $\mathbb{H}\subset\mathbb{U}$. We illustrate how our explicit constructions for the Poincaré and Siegel planes might be extended to the whole class of Calabi-Vesentini manifolds utilizing Paint Group symmetry. Furthermore we claim that Rao's, Chentsov's, Amari's Information Geometry and the thermodynamical geometry of Ruppeiner and Lychagin are the very same thing. The most important property of the Gibbs probability distributions provided by the here introduced setup is their covariance with respect to the action of the full group of symmetries $\mathrm{U}$. The partition function is invariant against $\mathrm{U}$ transformations and the set of its arguments, namely the generalized temperatures, can be always reduced to a minimal set whose cardinality is equal to the rank of the compact denominator group $\mathrm{H}\subset \mathrm{U}$.2025-12-18T17:04:43Z108 pages, 8 figures, Corrected missing referencesEntropy 2026, 28, 365Pietro G. FréAlexander S. SorinMario Trigiante10.3390/e28040365http://arxiv.org/abs/2606.10458v1Minimum Distortion Quantization with Specified Output Distribution2026-06-09T06:06:41ZWe derive the optimal quantizer of a real-valued random variable $W$ with distribution $P_W$ such that 1) the distribution of the quantization output $X$ that can take $k$ values follows any specified distribution $P_X$ over $\{1,\ldots,k\}$, and 2) the minimum mean squared error (MMSE) of estimating $W$ from $X$ is minimized. It is shown that the optimal quantizer takes the form $X=σ\big(F_{σ^{-1}(X)}^{-1}(F_W(W))\big)$, where $σ$ is the optimal permutation of $\{1,\ldots,k\}$ among all permutations to minimize the MMSE, and $F$ is the cumulative distribution function. When $P_W$ is uniform over an interval or $P_X$ is uniform over $\{1,\ldots,k\}$, the quantizer takes a simple form $X=F_{X}^{-1}(F_W(W))$. The concept of majorization plays a key role in the optimality proof. Specifying the output distribution is useful for designing quantizers with explicitly controlled output entropy, maximized mutual information between input and output, tailored output distribution to match channel input requirements for communication, and data anonymization.2026-06-09T06:06:41ZAolin Xuhttp://arxiv.org/abs/2605.17111v2Symmetry-Aware Convex Shrinkage for High-Dimensional Covariance Estimation2026-06-09T05:43:11ZWe develop a class of data-adaptive shrinkage estimators for high-dimensional covariance estimation in which the shrinkage target is a Reynolds projection of the sample covariance under a finite symmetry group selected from a candidate library by held-out predictive performance. The class generalizes the convex shrinkage estimator of Ledoit and Wolf by replacing the scalar-identity target with a structured target derived from a symmetry group when one is available, and generalizes the group-symmetric maximum-likelihood estimator of Shah and Chandrasekaran by combining structural targeting with adaptive convex shrinkage and by selecting the group from data rather than treating it as prespecified. A two-tier procedure performs the group selection: a universal per-candidate evaluation based on held-out negative log-likelihood, optionally preceded by a domain-specific step that constructs the candidate library from structural priors. We establish a finite-sample regret bound for the held-out calibration of the convex combination weight, an oracle inequality for the data-driven group selection, and a quantitative sufficient-match condition under which the proposed estimator dominates Ledoit-Wolf shrinkage in Frobenius mean-squared error. The procedure is illustrated on six real-data problems spanning finance (S&P~500 daily returns), climate (NOAA OISST sea-surface temperature anomalies), genomics (TCGA-BRCA gene expression), radio signal processing (RadioML 2018.A), astronomical imaging (Galaxy10 DECaLS), and natural image patches (CIFAR-10 with a CIFAR-10.1 distribution-shift companion). An empirical comparison is also made against the Bayesian permutation-symmetry estimator of Chojecki and colleagues. Outside the few-shot regime, where structural priors carry the most information per observation, Ledoit-Wolf shrinkage remains the appropriate baseline.2026-05-16T18:31:23Zv1: 99 pp, 20 fig, 22 theorems, 6 datasets; v2: clarified comparison to gipsMitchell A. Thorntonhttp://arxiv.org/abs/2510.21668v2Privacy Guarantee for Nash Equilibrium Computation of Aggregative Games Based on Pointwise Maximal Leakage2026-06-09T05:09:58ZPrivacy preservation has served as a key metric in designing Nash equilibrium (NE) computation algorithms. Although differential privacy (DP) has been widely employed for privacy guarantees, it does not exploit prior distributional knowledge of datasets and is ineffective in assessing information leakage for correlated datasets. To address these concerns, we establish a pointwise maximal leakage (PML) framework when computing NE in aggregative games. By incorporating prior knowledge of players' cost function datasets, we obtain a precise and computable upper bound of privacy leakage with PML guarantees. In the entire view, we show PML refines DP by offering a tighter privacy guarantee, enabling flexibility in designing NE computation with prior knowledge. Also, in the individual view, we reveal that the lower bound of PML can exceed the upper bound of DP by constructing specific correlated datasets. The results emphasize that PML is a more proper privacy measure than DP since the latter fails to adequately capture privacy leakage in correlated datasets. Moreover, we conduct experiments with adversaries who attempt to infer players' private information to illustrate the effectiveness.2025-10-24T17:24:24ZZhaoyang ChengGuanpu ChenTobias J. OechteringMikael Skoglundhttp://arxiv.org/abs/2606.10374v1Equation Asymmetry: An Algebraic Framework for Unifying Secrecy and Covertness in Information-Theoretic Security2026-06-09T03:35:33ZThis paper studies the algebraic structure underlying a broad class of information-theoretic security problems. We define the equation asymmetry degree (EAD) as $Φ= (n - r)/n$, where $n$ is the signal embedding dimension and $r$ is the effective rank of the adversary's observation matrix. This single parameter is shown to simultaneously govern both secrecy (measured by equivocation $H(M|Y_E)$) and covertness (measured by detection error probability $P_e$). On finite fields $\mathbb{F}_q$, we establish the equivocation lower bound $H(M|Y_E) = \min(k, n - r_E) \log q$ with exact probabilistic conditions (Theorem~1), the secrecy capacity $C_s = (n - r_E) \log q$ with complete achievability and converse proofs (Theorem~2), and a strong converse (Theorem~8). In the continuous Gaussian regime, we derive a differential-entropy equivocation bound (Lemma~1), the high-SNR secrecy capacity asymptotics (Lemma~2), and a 2-Wasserstein distance covertness condition $W_2 \approx \sqrt{r_W} \cdot P / (2Nσ) \to 0$ (Theorem~5'). The EAD-SDoF equivalence $d_s = n \cdot Φ$ is established (Theorem~7). Both $η_s$ and $η_c$ are shown to be monotone functions of $Φ$ (Theorem~6), with a Pearson correlation of $0.997$ in continuous-domain experiments. Seven existing security schemes -- matrix embedding, MIMO wiretap, secure network coding, FRFT multi-angle transmission, traffic steganography, group-key secure summation, and MDS secure summation -- are unified under the common form $C_s = (n - r) \log q$. Post-quantum security follows from the information-theoretic hardness of underdetermined linear systems (Theorem~9). All numerical experiments are reproducible with open-source code.2026-06-09T03:35:33ZWang HaoZhang Kuanghttp://arxiv.org/abs/2605.17189v2Sample-efficient inductive matrix completion with noise and inexact side-information2026-06-09T02:46:33ZInductive matrix completion (IMC) is a variant of low-rank matrix completion that incorporates row and column side-information. In principle, it can reduce the effective dimension of the recovery problem from the ambient matrix size to the dimension of the side-information features. Existing theory, however, does not fully realize this advantage in the noisy setting: sample-efficient guarantees only apply to noiseless recovery, while noisy guarantees require sample sizes comparable to ordinary matrix completion. This paper closes this gap for noisy IMC. We analyze a nonconvex projected gradient descent algorithm with spectral initialization and prove that, under exact side-information, it achieves linear convergence and stable recovery at a sample complexity governed by the effective side-information dimension rather than the ambient matrix dimension. The key technical ingredient is a local regularity condition for the IMC loss that holds at this reduced sample size, despite the mismatch between the observation pattern and the side-information subspaces.
We further extend the analysis to inexact side-information, showing that the same reduced sample complexity is preserved and that the estimation error degrades optimally with the level of subspace misspecification. Motivated by this trade-off, we also propose a penalized interpolation between IMC and ordinary matrix completion that balances sample efficiency against robustness to imperfect side-information. Simulations and experiments on the MovieLens dataset support the theoretical findings and illustrate the practical benefits of exploiting side-information in low-sample regimes.2026-05-16T23:10:10ZYuepeng YangCong Mahttp://arxiv.org/abs/2501.07561v5Design and Analysis of a Concatenated Code for Intersymbol Interference Wiretap Channels2026-06-08T21:09:00ZWe propose a two-stage concatenated coding scheme for reliable and secure communication over intersymbol interference wiretap channels. We first establish the secrecy capacity. Then, motivated by the theoretical codes that achieve the secrecy capacity, our scheme integrates low-density parity-check (LDPC) codes in the outer stage, forming a nested structure of wiretap codes, with trellis codes in the inner stage to improve achievable secure rates. The trellis code is specifically designed to transform the uniformly distributed codewords produced by the LDPC code stage into a Markov process, achieving tight lower bounds on the secrecy capacity. We further estimate the information leakage rate of the proposed scheme using an upper bound. To meet the weak secrecy criterion, we optimize degree distributions of the irregular LDPC codes at the outer stage, essentially driving the estimated upper bound on the information leakage rate to zero.2025-01-13T18:51:46ZAria NouriReza AsvadiJun Chenhttp://arxiv.org/abs/2512.14539v2The Performance of Compression-Based Denoisers2026-06-08T20:18:59ZWe consider a denoiser that reconstructs a stationary ergodic source by lossily compressing samples of the source observed through a memoryless noisy channel. Prior work on compression-based denoising has been limited to additive noise channels. We extend this framework to general discrete memoryless channels by deliberately choosing the distortion measure for the lossy compressor to match the channel conditional distribution. By bounding the deviation of the empirical joint distribution of the source, observation, and denoiser outputs from satisfying a Markov property, we give an exact characterization of the loss achieved by such a denoiser. Consequences of these results are explicitly demonstrated in special cases, including for MSE and Hamming loss. A comparison is made to an indirect rate-distortion perspective on the problem.2025-12-16T16:15:25ZAdded experiments in Section VI, minor revisions, 26 pages, 6 figuresDan SongAyfer ÖzgürTsachy Weissman