https://arxiv.org/api/e4rHERfp2N6GgEbc/eKLyR4/cmY2026-06-24T07:19:52Z5490564515http://arxiv.org/abs/2602.02131v3Two-Stage Coded-Sliding Beam Training and QoS-Constrained Sum-Rate Maximization for SIM-Assisted Wireless Communications2026-05-23T01:38:33ZStacked intelligent metasurfaces (SIM) provide a cost-effective and scalable solution for large-scale antenna communications.However, efficient channel state information acquisition and phase shift optimization remain critical challenges. In this paper, we develop a unified framework of low-complexity algorithms for SIM-assisted communication systems to address these issues. Specifically, we propose a generalized two-step codebook construction (TSCC) method that leverages two-dimensional angular-domain decoupling to transform planar array beamformer design into two independent one-dimensional linear array beamformer design problems, efficiently solved via the Gerchberg-Saxton algorithm and our proposed majorization-minimization-based proximal distance (PDMM) algorithm. We further develop a two-stage coded-sliding beam training (TSCSBT) method for low-overhead and high-accuracy beam training, where error-correcting codes are embedded in the first-stage training to enhance robustness against noise, and sliding sampling is subsequently performed around the matched angular samples to improve angular resolution. The proposed framework is further extended to multi-path user channels. Finally, a variable decoupling-based block successive upper bound minimization (VD-BSUM) algorithm is proposed to directly solve the QoS-constrained sum-rate maximization problem through closed-form iterative updates with substantially reduced computational complexity. Simulation results demonstrate the effectiveness of the proposed methods in achieving precise beam pattern realization, improved beam training accuracy and angular resolution, and enhanced sum-rate performance.2026-02-02T14:15:19ZIEEE Transactions on Wireless Communications, vol. 25, pp. 12162-12179, 2026Qian ZhangJu LiuYao GeYufei ZhaoWali Ullah KhanZheng DongYong Liang GuanChau Yuen10.1109/TWC.2026.3661858http://arxiv.org/abs/2605.24314v1On Permutation Groups of Cyclic Codes over Finite Fields2026-05-23T00:43:24ZThe permutation groups of cyclic codes are widely applicable in determining the weight distribution of codes, decoding theory and various other areas. In this paper, by employing two distinct matrix representations, we can relate cyclic codes with very long lengths and special generator polynomials to those with prime lengths. Consequently, we mainly determine the permutation groups of certain cyclic codes over $\mathbb{F}_{r^α}$ with lengths $hp$, $r^mp^n$ and $pq$ and special generator polynomials where $h$ is a positive integer and $p$, $q$ and $r$ are distinct prime numbers. For length $pq$, we manage to provide the permutation groups of cyclic codes with generator polynomials $Q_{pq}(x)$(the $pq$-th cyclotomic polynomial) or others, which seems to be the first work about permutation groups of cyclic codes with generator polynomials that are factors of $x^{pq}-1$ but not factors of $x^p-1(\text{or }x^q-1)$.2026-05-23T00:43:24ZJunjie HuangJicheng MaChang-An Zhaohttp://arxiv.org/abs/2605.15236v2Learning Selective Merge Policies for Deadline-Constrained Coded Caching via Deep Reinforcement Learning2026-05-22T22:27:15ZIn the coded caching, the server uses the cached information at the users to serve multiple users in parallel with a single coded multi-casting message or packet, that is, a merged packet, and thus mitigates the peak network congestion. In order to deliver the timely messages to the users in the deadline-driven applications like the video streaming, we must determine online the messages to be merged for the delivery, as there is a time limit for each request. It is important to note that while the merging aids the current coded multi-casting packet, it could harm the future deliveries. Our solution employs the deep reinforcement learning to view the coded multi-casting delivery as a masked action-discrete state control problem, and our policy network, trained via the proximal policy optimization, performs better than SACM++. On the uniform-demand benchmark, our policy network reduces the broadcast-packet expiration ratio $ρ$ by $40.9\%$ ($0.208$ vs.\ $0.352$) with respect to the best coded multi-casting baseline (SACM++), while also attaining the best broadcast-efficiency score $σ$ across the Track~A battery among the coded multi-casting methods. One noteworthy phenomenon here is that, for the applications with stricter deadlines, the merging becomes selective instead of aggressive, since the policy network selectively merges at approximately $31.8\%$ of the chances, even though the same observation holds across the variations within the same simulator family. The focus of our design is on the efficient pairwise XOR merging, where the higher-order ($K{\ge}3$) coding can be considered as a natural generalization left for future work.2026-05-13T22:18:30ZAmirhossein Yousefiramandihttp://arxiv.org/abs/2605.07107v2Sub-Gaussian Concentration and Entropic Normality of the Maximum Likelihood Estimator2026-05-22T21:39:27ZIt is well known that, under standard regularity conditions, the maximum likelihood estimator (MLE) satisfies a central limit theorem and converges in distribution to a Gaussian random variable as the sample size grows. This paper strengthens this classical result by developing several stronger forms of asymptotic normality for the normalized MLE. With additional assumptions on the score, we first establish sub-Gaussian tail bounds and convergence of all moments for the normalized estimation error. We then prove an entropic central limit theorem for a smoothed version of the estimator, showing convergence in relative entropy to the limiting Gaussian law. When the Fisher information of the normalized estimate is bounded, or its density has bounded first derivative, we further show that the smoothing can be removed, yielding entropic normality of the MLE itself. The proofs develop auxiliary tools that may be of independent interest, including exponential consistency bounds, high-moment estimates, and entropy-control arguments for the estimator.2026-05-08T01:34:03ZLeighton P. BarnesAlex Dytsohttp://arxiv.org/abs/2605.24177v1Towards Scalable Quaternary Message-Passing Decoding for Quantum Error Correction2026-05-22T19:57:24ZThe scalability and interpretability of message-passing (MP) decoding, such as (quaternary) Belief Propagation, remain open challenges in quantum error correction. Even for surface codes, arguably the first testbed for decoding methods, studies of improved MP decoders have mostly been restricted to small distances ($d \lesssim 19$). Moreover, the mismatch with established message-passing theory limits the decoder's interpretability, making it unclear whether MP decoding can sustain its effectiveness at large system sizes. This work takes a step toward a more principled and interpretable MP decoding framework, with the goal of making MP-based decoding more reliable and bridging theory and practice. We introduce a dilution method, which allows a quaternary Min-Sum (MS) decoder to exhibit an apparent depolarizing threshold of $16\%$ up to distance $20$, outperforming Minimum-Weight Perfect Matching in finite-length regimes. Notably, for $X$-noise, the standard MS decoder under dilution has worst-case complexity $O(N \log^2 d)$ and outperforms BP-OSD at $d=65$. The observed $\sim 9\%$ threshold may correspond to a true asymptotic threshold. Finally, we give a graph-dilution argument that interprets the success of the dilution method and offers insight into when MP algorithms can genuinely scale. Taken together, these results provide encouraging progress toward scalable and interpretable MP decoding in quantum error correction.2026-05-22T19:57:24ZBoqing ZhangHenry D. PfisterHanwen YaoSiyuan Niuhttp://arxiv.org/abs/2605.23901v1LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws2026-05-22T17:59:38ZExisting scaling laws for Large Language Models (LLMs), predominantly monotonic power laws, fail to explain emerging non-monotonic phenomena such as catastrophic overtraining and quantization-induced degradation, where performance deteriorates despite increased compute.
We propose the Shannon Scaling Law, a unified theoretical framework that models LLM training as information transmission over a noisy channel, grounded in the Shannon-Hartley theorem. By mapping model parameters to channel bandwidth and training tokens to signal power, our formulation explicitly captures the interaction between learning signal and intrinsic noise. This perspective reveals a fundamental Shannon capacity for LLMs: scaling model size or data without preserving a sufficient signal-to-noise ratio (SNR) inevitably amplifies noise, inducing a transition from monotonic improvement to U-shaped performance degradation.
We validate our theory through experiments on Pythia and OLMo2 under perturbations, including Gaussian noise, quantization and supervised fine-tuning on math, QA and code tasks. The Shannon Scaling Law consistently outperforms classical scaling laws and recent perturbation-aware laws, achieving strong $R^2$ scores and accurately capturing loss basins missed by prior approaches. It also extrapolates: fitted on $\leq$6.9B Pythia models with $\leq$180B tokens, it predicts the unseen 12B model up to 307B tokens at pooled $R^2{=}0.847$, while monotonic baselines collapse.2026-05-22T17:59:38ZAccepted by ICML 2026Xu OuyangDeyi LiuYuhang CaiJing LiuYuan YangChen ZhengThomas HartvigsenYiyuan Mahttp://arxiv.org/abs/2605.23894v1A Two-Branch Finite-Field Construction for Regular CSS LDPC Bases2026-05-22T17:56:26ZThis paper develops a two-branch multiplicative-coset construction for regular Calderbank-Shor-Steane (CSS) quantum low-density parity-check base matrices. For a target column weight \(J\) and an even row weight \(L\), the method reduces regularity, CSS orthogonality, and same-type 4-cycle exclusion to explicit quotient-coset conditions over a finite field. A normalized exhaustive search for these conditions produces base matrices for several \((J,L)\) pairs, so the construction is not tied to a single degree distribution. The construction separates the finite-length design into two stages: the base matrix fixes the degree distribution and the first girth constraints, and a cyclic lift randomizes edge connections subject to exact algebraic checks. As a detailed example, we carry one \((3,10)\)-regular base through the lift and decoding stages. For this example, the selected 64-fold lift gives a code whose same-type Tanner graphs have girth at least eight, and it also excludes a specified weight-16 nondegenerate logical-support orbit. The resulting instance is a \([[10240,4108,\,10\le d\le32]]\) CSS code. For decoding, we use joint log-domain belief propagation together with low-complexity deterministic post-processing rules for small residual syndromes, including repairs for residual patterns with two unsatisfied checks. The frame error rate (FER) measurements provide finite-length decoding data for this detailed example; at depolarizing probability \(p=0.058\), the post-processing FER is \(1.0\times10^{-7}\).2026-05-22T17:56:26ZKoki OkadaKenta Kasaihttp://arxiv.org/abs/2605.11138v2Field Theory of Data: Anomaly Detection via the Functional Renormalization Group. The 2D Ising Model as a Benchmark2026-05-22T16:26:23ZWe establish a correspondence between anomaly detection in high-noise regimes and the renormalization group flow of non-equilibrium field theories. We provide a physical grounding for this framework by proving that the detection of phase transitions in interacting non-equilibrium systems maps to the study of an effective equilibrium field theory near its Gaussian fixed point, which we identify with the universal Marchenko-Pastur distribution. Applying the Functional Renormalization Group to the two-dimensional Model A, we demonstrate that the noise-to-signal ratio acts as a physical temperature, where the signal emerges as ordered domains within a thermalized background of fluctuations. Using the exact Onsager solution as a benchmark, we show that this approach identifies critical thresholds with an error below 4%, significantly outperforming standard information-theoretic metrics such as the Kullback-Leibler divergence. Our results provide a universal strategy for resolving structures in complex datasets near criticality, bridging the gap between statistical mechanics and statistical inference.2026-05-11T18:43:14Z15 pages, 2 appendixes; correction of typos and captions, improved clarityRiccardo FinotelloVincent LahocheParham RadpayDine Ousmane Samaryhttp://arxiv.org/abs/2605.09655v2Geometry of Rényi Entropy on the Majorization Lattice2026-05-22T14:38:40ZMajorization is a stochastic ordering relation that compares the relative diversity of probability distributions with numerous applications in econometrics, spectral theory, and ecology. It is well-known that the majorization partial order forms a complete lattice on the set of ordered probability distributions. In this work, we study the properties of Rényi entropy on the majorization lattice. We establish a fundamental relation between the comonotone coupling and the independent coupling associated with a collection of marginal distributions. Consequently, we show that, for every order $α\in [0,\infty]$, the Rényi entropy is subadditive on the majorization lattice. We further characterize the supermodular regime, showing that Rényi entropy is supermodular on the majorization lattice for $α\in \{0\} \,\cup \, [1,\infty]$. For the Tsallis entropy, we show that it also satisfies subadditivity on the majorization lattice, for every order $α\in [0,\infty)$. Finally, we show that, unlike the Rényi entropy, the Tsallis entropy is supermodular on the majorization lattice for every $α\in [0,\infty)$.2026-05-10T16:58:44Z20 pages, 2 figuresAnuj Kumar YadavYanina Y. Shkelhttp://arxiv.org/abs/2605.23683v1Multi-User MIMO with Rotatable Antennas and IRS: Joint Antenna Boresight and IRS Orientation Design2026-05-22T14:32:47ZIn this paper, we investigate an intelligent reflecting surface (IRS)-assisted multi-user system, where the base station (BS) employs rotatable antennas (RAs) and the IRS can adjust the panel orientation.To alleviate the severe multiplicative path loss of the cascaded channel, the IRS is deployed near the BS, while the user-BS and user-IRS links remain in the far field. We formulate a sum-rate maximization problem by jointly optimizing the receive beamforming, IRS phase shifts, BS antenna boresights, and IRS panel orientation. To tackle the resulting highly coupled and non-convex problem, we first study a single-user case to reveal the structure of the dual-rotation gain, which is shown to be multiplicatively separable in the far field but coupled in the near field. For the general multi-user case, we develop an alternating optimization algorithm, where the receive beamforming is updated in closed form, the IRS phase shifts are optimized by an FP-assisted Riemannian conjugate gradient method, and the BS antenna boresights and IRS panel orientation are updated via projected gradient methods. Simulation results demonstrate the significant sum-rate gains achieved by the proposed coordinated rotation design over fixed-orientation and single-rotation benchmark schemes, and provide useful insights into near-field dual-rotation design.2026-05-22T14:32:47ZGuoying ZhangQingqing WuZiyuan ZhengQiaoyan PengAiling ZhengYanze ZhuYing GaoWen Chenhttp://arxiv.org/abs/2605.23638v1List Reconstruction Problem with List Size Two2026-05-22T13:52:06ZThe problem of computing the cardinality of the intersection of multiple balls in the Hamming space has attracted a lot of attention recently due to their applications in the list reconstruction problem and information retrieval in Associative Memories. In previous work, most of the results are for the cases where the radii of each ball, $r$ and the distance between the centers of these balls, $k$ are fixed when the length $n$ of each codeword tend to infinity. In this work, we focus on the case where $r = αn$ and $k=βn$ for some constants $α$ and $β$ and compute the maximum asymptotic rate of the cardinality of the intersection of three balls. We provide the maximum asymptotic rate as a function of two parameters $α$ and $β$. We also provide numerical results and compare these results with the intersection of two balls.2026-05-22T13:52:06Z6 pages, 1 figure, submitted to ISITA 2026Binh VuVinUniversity, Hanoi, VietnamShuche WangNational University of Singapore, SingaporeVan Khu VuVinUniversity, Hanoi, Vietnamhttp://arxiv.org/abs/2605.23502v1Distributed Two-Phase Processing for Modular XL-MIMO with Wireless Fronthaul under Hardware Impairments2026-05-22T11:05:45ZModular extremely large-scale MIMO (XL-MIMO) architectures combined with wireless fronthaul provide a scalable alternative to monolithic arrays, but their performance is sensitive to hardware impairments and resource allocation strategies. In this paper, we consider a distributed two-phase processing framework for modular XL-MIMO systems employing amplify-and-forward wireless fronthaul under practical hardware constraints. We jointly model access-side and fronthaul-side distortions and formulate a weighted minimum mean-square error (WMMSE)-based optimization problem that maximizes the uplink sum spectral efficiency (SE) by jointly adjusting UE transmit powers and fronthaul amplification levels. The resulting algorithm alternates between distortion-aware receiver design and convex power-control updates. Numerical results demonstrate that the proposed joint optimization significantly improves spectral efficiency compared to fixed transmission strategies, particularly when the CPU has a moderate number of antennas, while also quantifying the relative impact of access and fronthaul impairments.2026-05-22T11:05:45Z5 pages, 2 figures, accepted to be presented at EUSIPCO 2026Özlem Tuğfe Demirhttp://arxiv.org/abs/2605.23498v1Constant-Envelope Quantized Precoding with Power Control for Cell-Free Massive MIMO-OFDM2026-05-22T11:02:18ZCell-free massive MIMO has matured into a key candidate technology for 6G and beyond, owing to its ability to provide nearly uniform service quality to many user equipments (UEs) over the same time-frequency resources. Unlike conventional cellular massive MIMO, the core idea is to distribute a large number of low-cost access points (APs) across the network and enable joint coherent transmission and reception. While early works largely assumed ideal hardware, hardware impairments become inevitable when APs are implemented with low-cost components. In this context, this paper investigates the adverse impact of low-resolution digital-to-analog converters (DACs) on the downlink performance of cell-free massive MIMO-OFDM systems. In contrast to prior studies that mainly quantify spectral-efficiency degradation under low-resolution DACs, we consider the design of quantized constant-envelope (CE) precoding, which additionally enables the use of highly power-efficient amplifiers. To the best of our knowledge, this is the first work on quantized CE precoding for cell-free massive MIMO-OFDM. Beyond adapting the classical maximum-antenna-power method, we propose a novel power-control strategy across APs that mitigates the detrimental effects of severely quantized transmitters by reducing the contribution of harmful APs. Simulation results demonstrate that the proposed power-control mechanism significantly improves the uncoded bit error rate performance.2026-05-22T11:02:18Z5 pages, 2 figures, accepted to be presented at EUSIPCO 2026Özlem Tuğfe DemirSalih Gümüşbuğahttp://arxiv.org/abs/2605.06958v2Hybrid Multiport Receivers for Slow Fluid Antenna Multiple Access2026-05-22T10:48:26ZWe propose a novel receiver architecture that preserves the performance benefits of multiport selection in fluid-antenna systems while requiring only a very small number of radio-frequency (RF) chains. The resulting fluid-antenna hybrid multiport (FAHM) receiver effectively decouples port selection from signal combining by integrating a low-complexity analog combining network similar to those used in conventional hybrid multiantenna designs. We develop a stopping criterion to determine the number of selected ports, which limits the performance loss associated with port selection, and then design the hybrid combiner for a given RF-chain budget. The FAHM architecture is evaluated in a multiuser set-up operating under slow fluid-antenna multiple access (FAMA). In this scenario, a FAHM implementation with only 2 RF chains showcases a performance comparable to a fully-digital conventional multiport scheme with a much larger number of RF chains. Additionally, the proposed receiver architecture attains over 60% reduction in computational burden when integrated with a novel efficient implementation of the state-of-the-art generalized-eigenvector port-selection method.2026-05-07T21:23:23Z12 pages, 8 figures, 1 table. This work has been submitted to the IEEE for publicationJosé P. González-ComaJosé David Vega-SánchezF. Javier López-Martínezhttp://arxiv.org/abs/2605.23460v1Self-Orthogonal Twisted Generalized Reed-Solomon Codes and Their Application to Quantum Error-Correcting Codes2026-05-22T10:20:20ZIn this paper, two classes of twisted generalized Reed-Solomon (TGRS) codes with multi-twists are studied. Firstly, some sufficient and necessary conditions for these codes to be self-orthogonal and self-dual are established. Then several explicit constructions of self-orthogonal and self-dual codes are presented, from which quantum stabilizer codes are further derived. Finally, some corresponding examples are given, especially that some of these codes are MDS, AMDS or NMDS and that some of the resulting quantum stabilizer codes are optimal, achieving the quantum Singleton bound.2026-05-22T10:20:20ZYanxin ChenYanli WangTongjiang Yan