http://arxiv.org/abs/2507.23707v6
Cellular, Cell-less, and Everything in Between: A Unified Framework for Utility Region Analysis in Wireless Networks
2026-05-06T11:42:26Z
We introduce a unified framework for analyzing utility regions of wireless networks, with a focus on signal-to-interference-plus-noise-ratio (SINR) and achievable rate regions. The framework provides valuable insights into interference patterns of modern network architectures, including extremely large MIMO and cell-less networks. A central contribution is a simple characterization of feasible utility regions using the concept of spectral radius of nonlinear mappings. This characterization provides a powerful mathematical tool for wireless system design and analysis. For example, it allows us to generalize existing characterizations of the weak Pareto boundary using compact notation. It also allows us to derive tractable sufficient conditions for the identification of convex utility regions. This property is particularly important because, on the weak Pareto boundary, it guarantees that time sharing (or user grouping) cannot simultaneously improve the utilities of all users. Beyond geometrical insights, these sufficient conditions have two key implications. First, they identify a family of (weighted) sum-rate maximization problems that are inherently convex, thus paving the way for the development of efficient, provably optimal solvers for this family. Second, they provide justification for formulating sum-rate maximization problems directly in terms of achievable rates, rather than SINR levels.
Our theoretical insights also motivate an alternative to the concept of favorable propagation in the massive MIMO literature -- one that explicitly accounts for self-interference and the beamforming strategy.
2025-07-31T16:27:45Z
IEEE Transactions on Signal Processing, 2026
Renato Luis Garrido Cavalcante, Tomasz Piotrowski, Slawomir Stanczak

http://arxiv.org/abs/2605.04703v1
Entropy and Distributed Source Coding of Connected Soft Random Geometric Graphs
2026-05-06T09:55:06Z
We consider the distributed compression of Soft Random Geometric Graphs (SRGGs) above the connectivity threshold. We establish the Slepian-Wolf rate region for the SRGG in the setting where there are a finite number of encoders compressing sections of the graph independently. To do so, we prove novel limit theorems and asymptotic equipartition properties for the SRGG and its entropy, which allow us to use random binning techniques for distributed compression.
2026-05-06T09:55:06Z
Oliver Baker, Carl P. Dettmann

http://arxiv.org/abs/2605.08211v1
Learning the Channel Gain from Anywhere to Anywhere via Cross-environment Transformer Estimators
2026-05-06T09:17:19Z
Channel-gain maps provide the channel gain between any two locations in a geographical region. They find numerous applications, from resource allocation and interference control to path planning for autonomous vehicles. Channel-gain map estimation (CGME) is considerably more challenging than conventional radio map estimation (RME) because channel-gain maps are functions over a 6-dimensional input space. This calls for specialized methods, which currently rely on the (inaccurate) radio tomographic model or require a prohibitively large number of measurements since they do not exploit any spatial structure. This paper overcomes this issue by leveraging spatial patterns that channel-gain maps exhibit across environments, as dictated by the laws of physics and typical environmental characteristics (e.g. building materials and layouts).
Adopting a metalearning perspective, a transformer-based estimator is proposed to implicitly learn this common structure from measurements collected in multiple environments. This enables CGME in new environments from significantly fewer measurements (five times fewer in our experiments). To maximize learning efficiency, the transformer is composed with a feature map that enforces the invariances of CGME, such as those following from reciprocity. Numerical experiments corroborate the merits of the proposed estimator relative to existing methods.
2026-05-06T09:17:19Z
Prasenjit Dhara, Daniel Romero

http://arxiv.org/abs/2605.04618v1
Constructions of locally repairable codes via concatenated codes
2026-05-06T08:07:17Z
In recent years, locally repairable codes (LRCs) have attracted considerable attention owing to their pivotal role in distributed storage systems. Since binary linear locally repairable codes can significantly reduce the complexity of both encoding and decoding processes, the construction of binary LRCs has attracted extensive research interest. In this paper, we construct locally repairable codes via concatenated codes and present a systematic approach to select outer codes to obtain optimal binary LRCs, where the outer codes are linear codes over $\mathbb{F}_4$. The weight distributions of the resulting LRCs are determined by the weight distributions of the selected linear codes over $\mathbb{F}_4$. Furthermore, several classes of optimal binary locally repairable codes are constructed, including binary LRCs meeting the Griesmer-like bound, and binary perfect LRCs.
Meanwhile, for the locality $r=2$, we improve the Johnson-like bound for binary LRCs with disjoint local repair groups established by Ma and Ge, and construct explicit LRCs that attain this new bound.
2026-05-06T08:07:17Z
Hengfeng Jin, Fang-Wei Fu

http://arxiv.org/abs/2601.07389v2
On the Non-decoupling of Supervised Fine-tuning and Reinforcement Learning in Post-training
2026-05-06T07:45:21Z
Post-training of large language models routinely interleaves supervised fine-tuning (SFT) with reinforcement learning (RL). These two methods have different objectives: SFT minimizes the cross-entropy loss between model outputs and expert responses, while RL maximizes reward signals derived from human preferences or rule-based verifiers. Modern reasoning models have widely adopted the practice of alternating SFT and RL training. However, there is no theoretical account of whether they can be decoupled. We prove that decoupling is impossible in either order: (1) SFT-then-RL coupling: RL increases SFT loss under both distributional (KL-based) and landscape (PL-based) analyses; and (2) RL-then-SFT coupling: SFT lowers the reward achieved by RL under analogous conditions. Under the PL condition, we further derive the optimal RL duration that balances reward improvement against SFT degradation, identify the non-decoupling threshold governing when RL can improve SFT, and bound the gradient misalignment via spectral concentration. Experiments on Qwen3-0.6B confirm the predicted degradation, verifying that SFT and RL cannot be separated without loss of prior performance in the post-training pipeline.
2026-01-12T10:14:09Z
Xueyan Niu, Bo Bai, Wei Han, Weixi Zhang

http://arxiv.org/abs/2605.04545v1
Z-Opt: A Near-Optimal Reduced-Complexity Two-Dimensional Grassmannian Constellation
2026-05-06T06:46:03Z
Grassmannian constellations are known to achieve the capacity of noncoherent communications over Rayleigh fading channels in the high-SNR regime, yet their efficient construction remains challenging.
In this paper, we propose two construction methods for Grassmannian constellations of one-dimensional subspaces in a two-dimensional space, termed S-Opt and Z-Opt, along with two low-complexity detectors. Both the construction and detection procedures are performed on the unit sphere, known as the Bloch sphere in quantum computing. We show that the chordal distance on the Grassmann manifold is proportional to the Euclidean distance on the Bloch sphere and derive a corresponding theoretical upper bound based on the Fejes--Tóth bound on the minimum chordal distance. The S-Opt constellation is constructed from sphere-packing solutions and attains the derived upper bound for the optimal Bloch-sphere packings considered. The S-Opt detector can be applied to arbitrary Grassmannian constellations on $\mathcal{G}(2,1)$, and its time complexity scales linearly with the number of receive antennas and logarithmically with the constellation size, while yielding the same detection performance as the GLRT detector. Furthermore, based on the insight obtained through the S-Opt construction, the Z-Opt constellation is constructed by stacking regular polygons on the Bloch sphere, and its minimum chordal distance approaches the derived upper bound over the evaluated constellation sizes. The Z-Opt detector's time complexity scales linearly with the number of receive antennas, while yielding the same detection performance as the GLRT detector for Z-Opt.
2026-05-06T06:46:03Z
12 pages, 11 figures
Kotaro Shigenaga, Hiroki Iimori, Yuto Hama, Chandan Pradhan, Szabolcs Malomsoky, Naoki Ishikawa

http://arxiv.org/abs/2605.04533v1
Online Riemannian Gradient Descent for Quantum State Tomography with Matrix Product Operators
2026-05-06T06:19:46Z
Matrix product operators (MPOs) provide a scalable approach for quantum state tomography (QST) by offering a compact representation of many-body mixed states with limited entanglement, using only a number of parameters that scales polynomially with the system size.
In this paper, we study QST for quantum density matrices that can be represented by MPOs. We first derive an equivalent characterization of Hermiticity in terms of the MPO core tensors and show that the coefficient tensor of an MPO under the Pauli or generalized Gell-Mann basis admits a real-valued low tensor-train (TT) rank structure. This establishes an explicit connection between MPO-based QST and noisy low-rank tensor completion. Motivated by this formulation, we develop an online Riemannian gradient descent (oRGD) algorithm that sequentially incorporates measurement data during the reconstruction process. With a proper initialization, we prove that oRGD converges linearly to the target MPO and succeeds with a number of distinct measurement settings that scales quadratically with the system size. As a byproduct, our analysis also yields a significantly improved sample complexity bound for the low TT rank tensor completion task. Furthermore, we propose a tailored spectral initialization method and establish its theoretical guarantee. Numerical experiments on several classes of quantum states validate the effectiveness and scalability of the proposed method.
2026-05-06T06:19:46Z
Jian-Feng Cai, Jingyang Li, Xiaoqun Zhang, Yuanwei Zhang

http://arxiv.org/abs/2605.04342v1
Adaptive Diagonal Loading for Norm Constrained Beamforming
2026-05-05T23:00:06Z
Reliable adaptive beamforming is critical for large microphone arrays operating in highly dynamic acoustic environments. In scenarios characterized by fast-moving talkers and interferers, the available sample support for estimating the spatial correlation matrix is often snapshot-deficient. This deficiency, coupled with array imperfections, degrades the White Noise Gain (WNG), leading to severe target signal cancellation. To ensure stable and robust beamforming, we propose a novel adaptive diagonal loading method that guarantees the WNG remains strictly within specified bounds.
By leveraging the Kantorovich inequality, we map the desired WNG to a strict upper bound on the condition number of the correlation matrix. Furthermore, we present three estimation techniques for the adaptive loading level, ranging from trace-based bounding to exact eigenvalue decomposition, offering scalable computational complexities of $\mathcal{O}(M)$, $\mathcal{O}(M^2)$, and $\mathcal{O}(M^3)$. Our approach demonstrates highly stable beamforming under fast-changing interference.
2026-05-05T23:00:06Z
5 pages, 5 figures
Manan Mittal, Ryan M. Corey, John R. Buck, Andrew C. Singer

http://arxiv.org/abs/2501.01556v3
The Geometry of Statistical Data and Information: A Large Deviation Perspective
2026-05-05T19:40:54Z
The manifold of empirical mean values of statistical data ad infinitum has a geometric shape that depends on the probability measure that governs the generating model. Large deviation theory produces entropy functions that depend on both the probability measure and the statistical data; we use entropy to study the geometry of the data space rather than that of the space of probability distributions. It is well known, since Rao's work, that the Fisher-Rao metric makes the probability simplex into a sphere. From our perspective, that result translates to the space of empirical singleton counting frequencies under an i.i.d. assumption. Following our ideas and going beyond i.i.d., the choice of measure curves the space. When we study the pairwise statistics, the spherical geometry breaks down entirely. We show that the information projection, defined in information geometry as divergence minimization, coincides with the information projection in Kolmogorov's probability theory. This identification holds under both i.i.d.
and Markovian assumptions and connects information geometry to the foundations of probability theory.
2025-01-02T22:23:28Z
Viswa Virinchi Muppirala, Hong Qian

http://arxiv.org/abs/2603.29895v3
A Rational Account of Categorization Based on Information Theory
2026-05-05T18:49:59Z
We present a new theory of categorization based on an information-theoretic rational analysis. To evaluate this theory, we investigate how well it can account for key findings from classic categorization experiments conducted by Hayes-Roth and Hayes-Roth (1977), Medin and Schaffer (1978), and Smith and Minda (1998). We find that it explains the human categorization behavior as well as (or better than) the independent cue and context models (Medin & Schaffer, 1978), the rational model of categorization (Anderson, 1991), and a hierarchical Dirichlet process model (Griffiths et al., 2007).
2026-02-07T22:21:32Z
6 pages, 5 figures, 2 tables; Published at CogSci 2026 Conference
Christopher J. MacLellan, Karthik Singaravadivelan, Xin Lian, Zekun Wang, Pat Langley

http://arxiv.org/abs/2605.03991v1
Joint Design of Piggyback and Conjugate Transformation Functions for Repair Bandwidth Reduction in Piggybacking Codes
2026-05-05T17:11:55Z
Efficient node repair is a central requirement in distributed storage systems, particularly in high-rate erasure-coded deployments where repair traffic directly affects network overhead and recovery cost. Piggybacking codes reduce the repair bandwidth of MDS array codes while keeping the sub-packetization level small. However, existing piggybacking constructions often rely on restrictive piggyback-function designs to preserve the MDS property over small fields, which limits their repair-bandwidth reduction. We propose conjugate-piggybacking codes, a new class of MDS array codes that jointly design piggyback functions and conjugate transformations under small sub-packetization.
The proposed construction improves repair efficiency while preserving the MDS property over moderate field sizes. In particular, it enables some parity nodes to achieve optimal repair bandwidth and reduces the overall repair bandwidth compared with existing piggybacking-based designs. We analyze the MDS property and repair bandwidth of the proposed codes and evaluate them against existing piggybacking codes under high-code-rate settings over $\mathbb{F}_{2^8}$. We further conduct a repair-traffic simulation under uniform single-node failures to quantify the expected traffic reduction in storage-oriented settings. The results show that our construction consistently achieves lower repair bandwidth than related piggybacking codes and reduces expected repair traffic compared with conventional RS repair. These gains are obtained at the cost of a slightly larger field size, revealing a practical trade-off between repair efficiency and field-size overhead for high-rate distributed storage.
2026-05-05T17:11:55Z
Hao Shi, Zhengyi Jiang, Gefeng Deng, Zhongyi Huang, Hanxu Hou

http://arxiv.org/abs/2605.03935v1
Deterministic Sparse FFT via Keyed Multi-View Gating with $O(\sqrt{N} \log k)$ Expected Time
2026-05-05T16:25:48Z
We introduce a deterministic sparse Fourier transform framework based on a keyed multi-view gating mechanism that leverages 2-of-3 Chinese Remainder Theorem (CRT) agreement to reduce candidate frequency pairs from $O(k^2)$ to $Θ(k)$ under sparse-regime assumptions. Unlike prior approaches that rely on randomized bucketization for candidate formation, the proposed method provides deterministic structure with probabilistic guarantees arising only from assumptions on frequency placement and independence of affine hashing across views. The algorithm is realized through a peeling-based recovery procedure that extracts frequencies directly from singleton bins without explicit pair enumeration.
A recursive self-reduction eliminates the $O(\sqrt{N} \log N)$ preprocessing floor, yielding $O(\sqrt{N} \log k)$ expected identification time while maintaining an $O(N \log N)$ worst-case bound via deterministic dense-FFT fallback. A multi-view verification framework combining Parseval energy consistency and bin-wise residual checks ensures bounded failure probability and no false negatives under correct verification. This establishes a framework combining deterministic candidate reduction, sublinear expected complexity, and worst-case safety guarantees within a CRT-based sparse FFT architecture.
2026-05-05T16:25:48Z
19 pages, 6 figures. Includes theoretical analysis, algorithm specification, and complexity proofs. Companion works establish deterministic lower bounds and hybrid safety-certified extensions
Aaron R. Flouro, Shawn P. Chadwick

http://arxiv.org/abs/2605.01849v2
Optimal Communication Rate of Secure Aggregation over Ring Networks with Pairwise Keys
2026-05-05T14:59:44Z
Information-theoretic topological secure aggregation (TSA)\cite{zhang2026information_regular} enables distributed users to compute neighborhood sums over arbitrary networks without revealing individual inputs, while remaining communication-efficient. It has broad applications, including secure model aggregation in decentralized federated learning (FL). Existing TSA formulations rely on arbitrarily correlated keys generated by a trusted key server, which introduces a single point of failure. In this paper, we instead study TSA with pairwise secret keys, where each user pair $(i,j)$ shares an independent key $S_{i,j}$. Such keys can be established through inter-user communication, eliminating the need for a key server and improving robustness.
Focusing on a ring topology with $K$ users, we characterize the minimum per-user communication rate: to securely compute one bit of the desired input sum, each user must send at least $1$ bit to its neighbors when $K=3,4$, and at least $2$ bits for all $K\ge 5$. The higher rate in larger networks arises because each user must simultaneously satisfy two independent key-alignment constraints from its two neighborhoods, which cannot be resolved within a single broadcast symbol under pairwise key independence. We propose a linear pairwise-masking scheme that achieves these rates and prove its optimality via tight entropic converse bounds that exploit the dependency structure of the keys. Notably, for all $K\ge 4$, only a subset of the $\binom{K}{2}$ pairwise keys -- specifically, those between users at ring distance $2$ -- is sufficient to achieve optimality, revealing a nontrivial role of topological sparsity in secure aggregation.
2026-05-03T12:37:48Z
Xiang Zhang, Han Yu, Zhou Li, Yizhou Zhao, Giuseppe Caire

http://arxiv.org/abs/2511.22747v2
On minimal codes arising from projective embeddings of point-line geometries
2026-05-05T14:52:07Z
Let ${\mathcal C}(Ω)$ be the linear code arising from a projective system $Ω$ of $\mathrm{PG}(V).$ Consider the point-line geometry $Γ=({\mathcal P},{\mathcal L})$ and a projective embedding $\varepsilon\colon Γ\rightarrow \mathrm{PG}(V)$ of $Γ.$ We show that the projective code obtained by taking as projective system $Ω:=\varepsilon(\mathcal{P})$ is minimal if the graph induced on the set $Γ\setminus\varepsilon^{-1}(H)$ by the collinearity graph of $Γ$ is connected for any hyperplane $H$ of $\mathrm{PG}(V)$.
As an application, Grassmann codes, Segre codes, polar Grassmann codes of orthogonal, symplectic, hermitian type and codes arising from the point-hyperplane geometry of a projective space are minimal codes.
2025-11-27T20:33:10Z
20 pages
Ilaria Cardinali, Luca Giuzzi

http://arxiv.org/abs/2508.04313v2
Is Lattice Reduction Necessary for Vector Perturbation Precoding?
2026-05-05T12:56:01Z
Vector perturbation (VP) precoding is an effective nonlinear precoding technique in the downlink (DL) with modulo channels, providing an approximation of dirty paper coding (DPC), which is capacity-achieving. Especially when combined with lattice reduction (LR), low-complexity algorithms achieve very promising performance, outperforming other popular nonlinear precoding techniques like Tomlinson-Harashima precoding (THP). However, these results are based on the symbol error rate (SER) or bit error rate (BER). When shifting the focus to the mutual information as the figure of merit, we show that this is different and that the underlying lattice problem has a unique structural property. For lattice problems with this special structure, we show for a whole class of algorithms that LR does not have any impact on the solution vector. At the same time, algorithms are identified which benefit from LR, even if this lattice structure arises. The provided structural analysis has strong implications for the performance evaluation of VP. In particular, we re-evaluate popular Lenstra-Lenstra-Lovász (LLL)-aided methods like the LLL-aided nearest plane (NP) algorithm and show that they do not outperform conventional THP, highlighting the effectiveness of the THP method. This is in contrast to the existing results based on SER and BER, where these methods clearly outperform THP.
2025-08-06T11:01:09Z
Dominik Semmler, Wolfgang Utschick, Michael Joham