https://arxiv.org/api/yvALCfzIA8SkghWf4clv5/Se1RE 2026-06-23T01:43:11Z 54858 585 15 http://arxiv.org/abs/2605.23502v1 Distributed Two-Phase Processing for Modular XL-MIMO with Wireless Fronthaul under Hardware Impairments 2026-05-22T11:05:45Z

Modular extremely large-scale MIMO (XL-MIMO) architectures combined with wireless fronthaul provide a scalable alternative to monolithic arrays, but their performance is sensitive to hardware impairments and resource allocation strategies. In this paper, we consider a distributed two-phase processing framework for modular XL-MIMO systems employing amplify-and-forward wireless fronthaul under practical hardware constraints. We jointly model access-side and fronthaul-side distortions and formulate a weighted minimum mean-square error (WMMSE)-based optimization problem that maximizes the uplink sum spectral efficiency (SE) by jointly adjusting UE transmit powers and fronthaul amplification levels. The resulting algorithm alternates between distortion-aware receiver design and convex power-control updates. Numerical results demonstrate that the proposed joint optimization significantly improves spectral efficiency compared to fixed transmission strategies, particularly when the CPU has a moderate number of antennas, while also quantifying the relative impact of access and fronthaul impairments.

2026-05-22T11:05:45Z 5 pages, 2 figures, accepted to be presented at EUSIPCO 2026 Özlem Tuğfe Demir http://arxiv.org/abs/2605.23498v1 Constant-Envelope Quantized Precoding with Power Control for Cell-Free Massive MIMO-OFDM 2026-05-22T11:02:18Z

Cell-free massive MIMO has matured into a key candidate technology for 6G and beyond, owing to its ability to provide nearly uniform service quality to many user equipments (UEs) over the same time-frequency resources. Unlike conventional cellular massive MIMO, the core idea is to distribute a large number of low-cost access points (APs) across the network and enable joint coherent transmission and reception. While early works largely assumed ideal hardware, hardware impairments become inevitable when APs are implemented with low-cost components. In this context, this paper investigates the adverse impact of low-resolution digital-to-analog converters (DACs) on the downlink performance of cell-free massive MIMO-OFDM systems. In contrast to prior studies that mainly quantify spectral-efficiency degradation under low-resolution DACs, we consider the design of quantized constant-envelope (CE) precoding, which additionally enables the use of highly power-efficient amplifiers. To the best of our knowledge, this is the first work on quantized CE precoding for cell-free massive MIMO-OFDM. Beyond adapting the classical maximum-antenna-power method, we propose a novel power-control strategy across APs that mitigates the detrimental effects of severely quantized transmitters by reducing the contribution of harmful APs. Simulation results demonstrate that the proposed power-control mechanism significantly improves the uncoded bit error rate performance.

2026-05-22T11:02:18Z 5 pages, 2 figures, accepted to be presented at EUSIPCO 2026 Özlem Tuğfe Demir Salih Gümüşbuğa http://arxiv.org/abs/2605.06958v2 Hybrid Multiport Receivers for Slow Fluid Antenna Multiple Access 2026-05-22T10:48:26Z

We propose a novel receiver architecture that preserves the performance benefits of multiport selection in fluid-antenna systems while requiring only a very small number of radio-frequency (RF) chains. The resulting fluid-antenna hybrid multiport (FAHM) receiver effectively decouples port selection from signal combining by integrating a low-complexity analog combining network similar to those used in conventional hybrid multiantenna designs. We develop a stopping criterion to determine the number of selected ports, which limits the performance loss associated with port selection, and then design the hybrid combiner for a given RF-chain budget. The FAHM architecture is evaluated in a multiuser set-up operating under slow fluid-antenna multiple access (FAMA). In this scenario, a FAHM implementation with only 2 RF chains showcases a performance comparable to a fully-digital conventional multiport scheme with a much larger number of RF chains. Additionally, the proposed receiver architecture attains over 60% reduction in computational burden when integrated with a novel efficient implementation of the state-of-the-art generalized-eigenvector port-selection method.

2026-05-07T21:23:23Z 12 pages, 8 figures, 1 table. This work has been submitted to the IEEE for publication José P. González-Coma José David Vega-Sánchez F. Javier López-Martínez http://arxiv.org/abs/2605.23460v1 Self-Orthogonal Twisted Generalized Reed-Solomon Codes and Their Application to Quantum Error-Correcting Codes 2026-05-22T10:20:20Z

In this paper, two classes of twisted generalized Reed-Solomon (TGRS) codes with multi-twists are studied. Firstly, some sufficient and necessary conditions for these codes to be self-orthogonal and self-dual are established. Then several explicit constructions of self-orthogonal and self-dual codes are presented, from which quantum stabilizer codes are further derived. Finally, some corresponding examples are given, especially that some of these codes are MDS, AMDS or NMDS and that some of the resulting quantum stabilizer codes are optimal, achieving the quantum Singleton bound.

2026-05-22T10:20:20Z Yanxin Chen Yanli Wang Tongjiang Yan http://arxiv.org/abs/2605.23424v1 Sparse In-Network Learning via Shortest-Path Backpropagation and Finite-Rate Gating 2026-05-22T09:35:05Z

In-network learning (INL) trains distributed neural modules by exchanging latent activations and backpropagated errors over a communication graph. This letter proposes Dijkstra-pruned INL (D-INL), which removes non-tree links by retaining a capacity-aware shortest-path tree rooted at the fusion node. To balance sparsity and predictive information, local routing (or aggregation) is modeled as a finite-rate stochastic gate with rate $R_g=I(Z; T)$. We derive a rate-distortion-generalization bound and validate the method on a reproducible distributed-classification experiment, where D-INL reduces training exchange by $70.4\%$ while preserving accuracy within the standard deviation of dense INL. Adding finite-rate regularization further reduces the estimated latent rate by $45.7\%$ relative to unregularized Dijkstra INL.

2026-05-22T09:35:05Z Mohammad Reza Deylam Salehi http://arxiv.org/abs/2605.23421v1 Stochastic Generalized Sampling 2026-05-22T09:32:47Z

Reconstructing an infinite-dimensional signal from a finite set of measurements is a fundamental problem in approximation theory and signal processing. While the generalized sampling (GS) framework provides a robust methodology for recovering elements in arbitrary separable Hilbert spaces, deterministic approaches suffer from severe basis-dependent dimensionality constraints, often requiring a quadratic sample complexity $m \gtrsim n^2$ to avoid numerical instability. In this paper, we introduce a fully stochastic framework for GS that natively overcomes these deterministic barriers. By drawing measurements according to an optimal leverage-score probability distribution, we prove that stable recovery is guaranteed with high probability at a near-linear sample complexity of $m \gtrsim n\log n$. Crucially, this optimal rate is universal-independent of the specific choice of measurement and reconstruction bases-and holds even when the sensing system is a highly redundant frame. To establish these guarantees, we derive a novel matrix Bernstein inequality for random rectangular operators, allowing us to rigorously control the aliasing error governed by the empirical cross-term. Finally, we demonstrate the practical efficacy of our approach on the classical problem of recovering analytic functions from continuous Fourier measurements via Legendre polynomials, where our randomized method achieve near-exponential convergence rates.

2026-05-22T09:32:47Z Luca Finotti Matteo Santacesaria http://arxiv.org/abs/2605.23390v1 Layered construction of Message-Wise Unequal Error Protection Codes 2026-05-22T09:00:49Z

Conventional communication systems are mainly designed to reduce error rates and increase transmission rates, and therefore usually provide uniform protection to all transmitted messages. However, in intent-oriented applications, different messages may have different semantic meanings and importance levels, requiring different levels of reliability. This paper proposes a layered construction of message-level unequal error protection (UEP) codes for short-blocklength communication. Instead of appending an explicit protection tag to each codeword, the proposed method embeds the protection structure directly into the Hamming-distance structure of the codebook. By assigning larger minimum intra-level distances to higher-importance message groups and imposing suitable inter-level distance constraints, the proposed codebook provides differentiated error-correction capabilities while enabling reliable importance-level classification at the receiver. Theoretical conditions for correct group classification are derived, and simulations over AWGN and VLC-ISI channels show that the proposed scheme improves BER performance and group classification accuracy compared with a tag-based ECC baseline.

2026-05-22T09:00:49Z 6pages,5 figures Qiming Lu Shan Lu Takaya Yamazato http://arxiv.org/abs/2605.23362v1 Instance-Optimal Estimation with Multiple LLM Judges on a Budget 2026-05-22T08:26:08Z

Evaluating large language models increasingly relies on LLM-as-a-judge protocols, but such evaluations remain costly: different judges have different prices and reliabilities, and the difficulty of each prompt-response pair can vary substantially. This raises a basic allocation question: under a fixed budget, how should one distribute evaluation queries across heterogeneous judges and instances to obtain the most accurate score estimates? We formalize this question as *budgeted heteroskedastic multi-judge estimation*. Given $K$ prompt-response pairs, $J$ judges with known costs, and unknown query-judge variances, the goal is to estimate a bounded score vector while minimizing an $\ell_p$-error. Our first contribution is to analyze the inverse-variance weighted estimator (IVWE) and to derive the oracle allocation that minimizes its error rate. Since this allocation depends on the unknown variances, we then address the practical unknown-variance setting by proposing EST-IVWE, an adaptive algorithm that constructs and leverages *optimistically biased* variance estimates to stabilize the empirical allocation. We prove that EST-IVWE matches the oracle IVWE rate up to lower-order terms in the budget. Our second and central theoretical contribution is a matching *local* minimax lower bound, which establishes the instance-optimality of the proposed algorithms. A key technical insight is that Fano-type high-probability arguments are too coarse for this problem: their packing construction loses the local variance structure that governs the optimal allocation. We instead use an Assouad-type in-expectation argument, based on local perturbations, which preserves this structure and yields the sharp allocation-dependent lower bound. Finally, we numerically validate the superiority of our approach over naïve uniform allocation on synthetic and HelpSteer2 datasets.

2026-05-22T08:26:08Z 53 pages, 4 figures; the first two authors contributed equally Junghyun Lee Sanghwa Kim Yassir Jedra Alexandre Proutière Se-Young Yun http://arxiv.org/abs/2602.07235v2 ArcMark: Distortion-Free Multi-Byte LLM Watermark via Optimal Transport 2026-05-22T08:22:26Z

Watermarking is an important tool for promoting the responsible use of large language models (LLMs). Existing watermarks insert a signal into generated tokens that either flags LLM-generated text (zero-bit watermarking) or encodes more complex messages (multi-bit watermarking). Though a number of recent approaches insert multiple bits into text without perturbing average next-token predictions, they largely extend design principles from the zero-bit setting, such as encoding a single bit per token. In contrast, a watermarker capable of embedding multiple bytes into the text would dramatically increase the potential applications, by embedding information such as the ID of the user who submitted the prompt, the precise model version that was used, or even the prompt itself. We address this problem by introducing ArcMark: a new watermark construction based on coding and information-theoretic principles that is capable of reliably embedding multiple bytes of information into just a few hundred tokens, without any distortion of the underlying LLM next-token distribution. We derive ArcMark by formulating the distortion-free watermarking problem as a channel coding problem, and deriving an information-theoretic channel capacity that establishes the fundamental limit of embedding information in LLM output in a distortion-free manner. This capacity formulation informs the design of ArcMark. In practice, ArcMark outperforms competing multi-bit distortion-free watermarks in terms of reconstruction accuracy, including in the face of attacks that alter a subset of the LLM text. ArcMark output is also shown to be indistinguishable from unwatermarked text in terms of perplexity, and in downstream task quality.

2026-02-06T22:28:03Z Atefeh Gilani Sajani Vithana Carol Xuan Long Oliver Kosut Lalitha Sankar Flavio P. Calmon http://arxiv.org/abs/2605.23329v1 MDS and NMDS Codes from the Extended Twisted Generalized Reed-Solomon Codes 2026-05-22T07:44:33Z

This paper contributes to maximum distance separable (MDS) and near MDS (NMDS) properties of the extended generalized twisted Reed-Solomon (TGRS) codes. Firstly, a family of extended TGRS (ETGRS) are constructed by appending three columns to the generator matrix of original TGRS codes. Secondly, the necessary and sufficient conditions for these codes to be MDS or almost MDS (AMDS) codes are derived. Then, by analyzing the AMDS properties of their dual codes, the necessary and sufffcient conditions for them to be NMDS codes are established. Furthermore, some examples are given to verify the main results. Finally, we determine the non-generalized Reed-Solomon (non-GRS) characteristics of them via the Schur product method.

2026-05-22T07:44:33Z Yanli Wang Yanxin Chen Tongjiang Yan http://arxiv.org/abs/2504.09388v2 The Rate-Immediacy Barrier in Explicit Tree Code Constructions 2026-05-22T07:03:01Z

Since the introduction of tree codes by Schulman (STOC 1993), explicit construction of asymptotically good tree codes has remained a notorious challenge. A work by Cohen, Haeupler and Schulman (STOC 2018), as well as the state-of-the-art construction by Ben Yaacov, Cohen, and Yankovitz (STOC 2022) have achieved codes with rate $Ω(1/\log\log n)$, exponentially improving upon the original rate $Ω(1/\log n)$ construction of Evans, Klugerman and Schulman from 1994. All of these constructions rely, at least in part, on increasingly sophisticated methods of combining (block) error-correcting codes. In this work, we identify a fundamental barrier to constructing tree codes using known techniques. We introduce a key property which we call immediacy, that, while not required by the original definition of tree codes, is shared by all known constructions and inherently arises in recursive combinations of error-correcting codes. Our main technical contribution is the proof of a rate-immediacy trade-off, which, in particular, implies that any tree code with constant distance and non-trivial immediacy must necessarily have vanishing rate. By applying our rate-immediacy trade-off to existing constructions, we establish that their known rate analyses are essentially optimal given their actual error-correction properties. More broadly, our work highlights the need for fundamentally new ideas -- beyond the recursive use of error-correcting codes -- to achieve substantial progress in explicitly constructing asymptotically good tree codes.

2025-04-13T00:47:48Z Added further discussion and examples. To appear in proceedings of CCC 2026 Gil Cohen Leonard J. Schulman Piyush Srivastava http://arxiv.org/abs/2605.23260v1 MISO Downlink with Fluid Antenna Multiple Access 2026-05-22T06:01:07Z

Fluid antenna multiple access (FAMA) enables each user to rapidly switch among several closely spaced ports and select the strongest received signal. Although this mechanism offers micro-scale spatial diversity, its behavior in multiuser downlink systems with spatial correlation and linear precoding is not well understood. This paper develops a unified analytical framework for the multiple-input single-output (MISO) downlink with FAMA users served via maximum ratio transmission (MRT) or zero-forcing (ZF). We show that the per-port signal-to-interference ratio (SIR) follows a Beta-prime distribution with parameters $(M_{\mathrm{eff}},L)$, where $M_{\mathrm{eff}}=M$ under MRT and $M_{\mathrm{eff}}=M-U+1$ under ZF, and derive closed-form finite-sum cumulative distribution functions (CDFs) for both cases. We further provide the first analytical characterization of cross-port SIR correlation. \textcolor{black}{Furthermore, we derive rigorous outage probability bounds that tightly bracket the exact performance and become exact in the limiting cases of fully correlated and independent ports.} Asymptotic analyses reveal the fundamental diversity orders and tail behavior for each precoder. Numerical results confirm the accuracy of the SIR distributions, correlation model, and outage bounds, and show that MRT achieves weaker port correlation and larger selection gains than ZF when the base station (BS) has ample spatial degrees of freedom. The framework offers explicit guidelines for port configuration and precoder selection in practical FAMA systems.

2026-05-22T06:01:07Z accepted in IEEE Transactions on Wireless Communications Anastasios Papazafeiropoulos http://arxiv.org/abs/2604.07796v2 Order-Optimal Sequential 1-Bit Mean Estimation in General Tail Regimes 2026-05-22T05:47:06Z

In this paper, we study the problem of mean estimation under 1-bit communication constraints. We propose a novel adaptive mean estimator based solely on randomized threshold queries, where each 1-bit outcome indicates whether a given sample exceeds a sequentially chosen threshold. Our estimator is $(ε, δ)$-PAC for any distribution with a bounded mean $μ\in [-λ, λ]$ and a bounded $k$-th central moment $\mathbb{E}[|X-μ|^k] \le σ^k$ for any fixed $k > 1$. Moreover, our sample complexity is order-optimal in all such tail regimes, i.e., for every such $k$ value. For $k \neq 2$, our estimator's sample complexity matches the unquantized minimax lower bounds plus an unavoidable $O(\log(λ/σ))$ localization cost. For the finite-variance case ($k=2$), our estimator's sample complexity has an extra multiplicative $O(\log(σ/ε))$ penalty, and we establish a novel information-theoretic lower bound showing that this penalty is a fundamental limit of 1-bit quantization. We also establish a significant adaptivity gap: for both threshold queries and more general interval queries, the sample complexity of any non-adaptive estimator must scale linearly with the search space parameter $λ/σ$, rendering it vastly less sample efficient than our adaptive approach. Finally, we present algorithmic variants that (i) handle an unknown sampling budget, (ii) adapt to an unknown scale parameter $σ$ given (possibly loose) bounds, (iii) require only two stages of adaptivity to achieve order-optimal sample complexity at the expense of more general 1-bit queries, and (iv) leverage multiple local samples per 1-bit query to proportionally reduce communication costs.

2026-04-09T04:49:21Z This article substantially extends the AISTATS version, arXiv:2509.21940 Ivan Lau Jonathan Scarlett http://arxiv.org/abs/2605.23236v1 A Posterior MWPM Decoding Boosts the XYZ Planar Code 2026-05-22T05:08:57Z

The minimum-weight perfect matching (MWPM) decoder is a standard decoding strategy for surface codes, but its performance degrades considerably under biased noise. In this paper, a modified surface code, termed the XYZ planar code, is introduced, and the MWPM decoder is extended to posterior MWPM (pMWPM) with almost no increase in decoding complexity. The XYZ planar code exhibits higher and more stable thresholds than the planar code under almost all bias conditions, while also achieving significantly lower logical error rates. Specifically, in the infinite-bias case, the threshold of the XYZ planar code is improved by about $36\%$ compared to that of the surface code, and it maintains comparable or higher thresholds under other biases -- for example, the threshold reaches approximately $15.5\%$ at bias $η= 1$ and $14.2\%$ at $η= 100$. Furthermore, pMWPM can be adapted to a wide range of modified surface codes, and the results presented in this work also indicate its excellent potential in other scenarios, such as configurations in which $Y$ operators involve a larger number of data qubits.

2026-05-22T05:08:57Z Zhiwei Wang Liqi Wang http://arxiv.org/abs/2605.23225v1 Entropy Equivalence Testing 2026-05-22T04:35:04Z

We introduce the problem of \emph{entropy equivalence testing} for probability distributions, a relaxation of the well-studied closeness testing problem, where the distribution testing algorithm is now only required to distinguish, given samples from two unknown distributions $p,q$ and a parameter $\varepsilon \in(0,1/2]$, between $p=q$ and $|H(p)-H(q)| \geq \varepsilon$ (where $H$ denotes the Shannon entropy). We provide a time- and sample-efficient algorithm for this task, showing that the optimal sample complexity for this task can be significantly lower than that of closeness testing. As an application, we leverage this result to provide the first non-trivial testing algorithm for (standard) closeness of low-degree \emph{Bayesian networks}, which significantly improves on either the sample or time complexity of a baseline based on full learning.

2026-05-22T04:35:04Z Clément L. Canonne Yash Pote Jonathan Scarlett Joy Qiping Yang