https://arxiv.org/api/15ljKGDYYNCJ443BFvRLKeNJJP4 2026-06-22T13:11:53Z 54841 420 15 http://arxiv.org/abs/2605.31059v1 CRB-Optimal Arrays and Waveforms in Active Sensing: Role of Redundancy and Spatial Covariance of Array Geometry 2026-05-29T09:29:33Z

This paper characterizes the performance limits of optimal array designs using orthogonal and coherent waveforms for both linear and planar arrays. For orthogonal waveforms, we show that the single-target Cramér-Rao Bound (CRB) depends on the sum of the so-called spatial variances of the transmit (Tx) and receive (Rx) arrays, or equivalently, the spatial variance of the sum co-array weighted by the multiplicities of the virtual sensors. This reveals that CRB-optimal geometries are inherently redundant, highlighting a fundamental trade-off between mean squared error (MSE) and identifiability in parameter estimation. Moreover, we derive optimal Tx-Rx sensor allocations given a total sensor budget and show that unequal allocation (favoring the Rx) is optimal even for nonredundant arrays, questioning conventional designs. We extend our results to planar arrays, providing a new general condition that the spatial covariances of the Tx and Rx arrays should satisfy for the optimal waveforms to direct power in the target direction. Additionally, we establish a connection between Diophantine equations and array geometries with equal CRB, along with a constructive method for designing such arrays. Our work provides new guidelines for and insights into optimal array and waveform design with relevance in emerging active sensing multiple-input multiple-output systems.

2026-05-29T09:29:33Z Accepted for publication in IEEE Transactions on Signal Processing Ids van der Werf Robin Rajamäki Geert Leus http://arxiv.org/abs/2605.30976v1 Batched Stochastic Linear Bandits with 1-Bit Communication Constraints 2026-05-29T08:17:31Z

We study stochastic linear bandits under a natural combination of batching and communication constraints: the time horizon is partitioned into batches of equal size $B$, and during each batch the learner sends $B$ requested arm pulls to an agent, who then observes the corresponding $B$ rewards and responds with a single bit of feedback to the learner. For each batch, the learner specifies the 1-bit quantization rule the agent uses, which may depend on all previously received bits but not on any past rewards directly. This setting addresses a significant yet unexplored ``middle ground'' between previous models having per-round quantization only or total bit budgets only. We establish a minimax lower bound showing that $Ω(B\min\{d,\log\lvert \mathcal{A} \rvert\})$ regret is unavoidable due to the 1-bit communication bottleneck, even in the absence of noise. Combined with standard statistical limits, this yields a general lower bound of $\widetildeΩ(B\min\{d,\log\lvert \mathcal{A} \rvert\} + \sqrt{dT \min\{d,\log\lvert \mathcal{A} \rvert\}})$. We develop two phased-elimination algorithms based on $G$-optimal designs and 1-bit mean estimation. The first achieves $\widetilde{O}(dB + d\sqrt{T})$ regret, matching the lower bound up to logarithmic factors when $\lvert \mathcal{A} \rvert = \exp(Ω(d))$, and the second incorporates a safe-arm identification and warm-start procedure to obtain $\widetilde{O}(B\log\lvert \mathcal{A} \rvert + d^{3/2}\sqrt{B} + \sqrt{dT\log\lvert \mathcal{A} \rvert})$ regret, which is near-optimal in broad scaling regimes of $(\lvert \mathcal{A} \rvert, B, d, T)$. Together, our results demonstrate that a single bit of feedback per batch suffices to nearly match the minimax regret of unconstrained linear bandits in broad scaling regimes, even for batch sizes as large as $Θ(\sqrt{T})$.

2026-05-29T08:17:31Z Ivan Lau Daniel McMorrow Kevin Jamieson Jonathan Scarlett http://arxiv.org/abs/2604.19038v2 Explicit Factorization of $x^{p+1}-1$ over $\mathbb{Z}_{p^e}$: A Structural Approach via Dickson Polynomials 2026-05-29T07:17:46Z

Let $p$ be an odd prime. The factorization of the polynomial $x^{p+1}-1$ over the integer residue ring $\mathbb{Z}_{p^e}$ is pivotal for constructing cyclic codes with Hermitian symmetry, a critical resource for Linear Complementary Dual (LCD) codes and Entanglement-Assisted Quantum Error-Correcting Codes (EAQECC). Traditionally, lifting factorizations relies on the generic Hensel's Lemma, masking the underlying algebraic structure. In this paper, we establish a structural isomorphism between the lifting process and the roots of a special auxiliary polynomial $V(x)$, unveiling a deterministic link to Dickson polynomials. Based on this theory, we develop \texttt{Dickson-Engine}, a linear-time algorithm ($O(ep)$) that outperforms standard libraries by orders of magnitude. Applying this engine to $\mathbb{Z}_{169}$, we explicitly construct a family of classical LCD codes of length $n=182$ via the isometric Gray map. Our search reveals codes with parameters (e.g., $[182, 1, 168]_{13}$ and $[182, 2, 144]_{13}$) that are \textbf{near-optimal} with respect to the theoretical Griesmer Bound. Notably, we discover a ``robustness plateau'' starting from non-trivial dimensions ($k=4$), where the minimum distance remains stable ($d=120$) even as the dimension triples ($k=4 \rightarrow 12$). These codes provide exceptional resources for post-quantum cryptography and quantum error correction without entanglement consumption ($c=0$).

2026-04-21T03:44:12Z Full and extended version of the ISIT 2026 accepted paper. Updates in this version: Algorithm 1 has been refined to reflect the optimized 'single-seed lifting' implementation; the discussion on the 'robustness plateau' of LCD codes has been updated to clarify it as an intrinsic property of cyclic codes over local rings Yongchao Wang Yang Ding Jiansheng Yang Zhiqiu Huang http://arxiv.org/abs/2605.30885v1 Beyond 1$\to$N Decoding: Capacity-Aware Rateless Polar Codes for IR-HARQ 2026-05-29T06:19:33Z

This paper introduces a novel framework for polar codes, designed for flexible Incremental Redundancy Hybrid Automatic Repeat Request (IR-HARQ). By generalizing the decoding order beyond the standard 1$\to$N sequence, we enable a capacity-aware scheduling strategy that prioritizes the decoding of reliable subblocks. The framework integrates nested parity-check polar construction and reverse bit-mapping to support continuous and arbitrary transmission lengths $E \in [N_{\min}, N_{\max}]$. Simulation results show that the proposed rateless codes match the coding gain of independently optimized fixed-rate codes across the entire range of rates and lengths. With a validated hardware implementation, this work provides a practical solution for next-generation wireless data channels.

2026-05-29T06:19:33Z Huazi Zhang Xianbin Wang Jiajie Tong Jun Wang Wen Tong http://arxiv.org/abs/2605.28952v2 Optimal Rates for Differentially Private Hypothesis Testing with E-values 2026-05-29T03:39:02Z

E-values have attracted considerable interest in recent years as flexible tools for enabling anytime-valid and adaptive data analysis. Hypothesis testing is at the core of many of these applications, which can often involve private or sensitive data. In this work, we answer a simple but important question: given two distributions $\mathbb{P}$ and $\mathbb{Q}$, what is the maximum achievable e-power when testing $X\sim \mathbb{P}^n$ against $X\sim\mathbb{Q}^n$ with e-values that satisfy $\varepsilon$-differential privacy? We characterize the optimal rate for this problem and provide an algorithm which matches it exactly. In the sequential setting, when observations arrive one-by-one and the analyst chooses when to halt, we give matching upper and lower bounds on the stopping times of any private e-process. Numerical experiments confirm the practicality of our algorithms, which require less data than the recently proposed DP-SPRT across a range of sequential testing problems and privacy levels.

2026-05-27T18:00:13Z Corrected typos; updated references; generalized proposition 3.1 Ben Jacobsen Tomas Gonzalez Gavin Brown Kassem Fawaz Aaditya Ramdas http://arxiv.org/abs/2511.16864v2 Functional uniqueness and stability of Gaussian priors in optimal L1 estimation 2026-05-29T02:13:22Z

We study when optimal Bayesian estimators under Gaussian noise are approximately linear, and what this implies about the underlying prior distribution. Consider the classical model $Y = X + Z$, where $Z$ is Gaussian and independent of $X$. It is well known that under squared-error loss, the conditional mean $\mathbb{E}[X|Y]$ is a linear function of $Y$ if and only if the prior is Gaussian. Much less is understood under absolute-error loss, where the optimal estimator is the conditional median and standard orthogonality-based tools no longer apply. Recent work has established that, in the Gaussian noise model, the Gaussian prior is also the unique distribution that induces an exactly linear conditional median. In this paper, we move beyond exact characterizations and develop a quantitative stability theory: if the optimal estimator is approximately linear, must the prior be close to Gaussian? For the $L_2$ setting, we derive explicit rates showing that near-linearity of the conditional mean forces the prior to be close to Gaussian in the Levy metric. For the $L_1$ setting, we develop a functional-analytic framework based on Hermite expansions and adjoint operators, establishing that approximate linearity of the conditional median implies proximity to the Gaussian family.

2025-11-21T00:21:25Z Leighton Barnes Alex Dytso http://arxiv.org/abs/2605.30636v1 Free Energy Universality in Tensor Estimation via Generic Chaining 2026-05-28T22:38:19Z

We study high-dimensional inference problems with tensor-structured data and establish conditions under which their free energy can be approximated by that of a Gaussian comparison model. Our framework applies to models with independent observations and mismatch between the data-generating distribution and the statistical model. The results extend prior work beyond matrix settings and accommodate scaling regimes where the model parameters depend on the dimension. A key technical contribution is the use of generic chaining to control remainder terms arising from likelihood expansions over tensor-structured parameter spaces. As an application, we establish free energy universality for binary hypergraph models under the minimal assumption of diverging average degree, showing that their asymptotic behavior coincides with that of a Gaussian tensor model, even under model mismatch.

2026-05-28T22:38:19Z Wenxuan Zou Galen Reeves http://arxiv.org/abs/2605.30600v1 The Fast Mixing Mechanism for Differential Privacy 2026-05-28T21:48:37Z

Randomized sketching is a central tool for compressing large-scale optimization problems while preserving accuracy. In particular, sketches that are based on structured matrices, such as the Hadamard matrix, can be applied efficiently and often yield solutions that approximate those of the original problem at much lower computational cost. In differential privacy (DP), Gaussian sketching has been used to solve DP linear regression, beginning with \citet{sheffet2017differentially, sheffet2019old} and later refined by \citet{lev2025gaussianmix, lev2026near}. However, although these methods achieve strong utility guarantees, they usually do not improve runtime over classical DP approaches. In this work, we introduce a new DP sketching mechanism based on fast transforms, which, in certain cases, matches the runtime of classical fast sketching methods. We prove state-of-the-art privacy guarantees for this mechanism and show that, in favorable regimes, they match those of the Gaussian sketch up to a constant factor. As an application, we combine this mechanism with recent sketch-based methods for DP linear regression to obtain a new algorithm with strong utility and improved runtime. We establish privacy and accuracy guarantees for this algorithm, yielding, to the best of our knowledge, the first fast method for DP ordinary least squares.

2026-05-28T21:48:37Z Omri Lev Moshe Shenfeld Vishwak Srinivasan Katrina Ligett Ashia C. Wilson http://arxiv.org/abs/2605.30553v1 Destruction is a General Strategy to Learn Generation; Diffusion's Strength is to Take it Seriously; Exploration is the Future 2026-05-28T20:35:16Z

I present diffusion models as part of a family of machine learning techniques that withhold information from a model's input and train it to guess the withheld information. I argue that diffusion's destroying approach to withholding is more flexible than typical hand-crafted information withholding techniques, providing a rich training playground that could be advantageous in some settings, notably data-scarce ones. I then address subtle issues that may arise when porting reinforcement learning techniques to the diffusion context, and wonder how such exploration problems could be addressed in more diffusion-native ways. I do not have definitive answers, but I do point my fingers in directions I deem interesting. A tutorial follows this thesis, expanding on the destroy-then-generate perspective. A novel kind of probabilistic graphical models is introduced to facilitate the tutorial's exposition.

2026-05-28T20:35:16Z Published April 27th, 2026 as an ICLR blogpost https://iclr-blogposts.github.io/2026/blog/2026/destruction/ Noël, Piere-André. "Destruction is a General Strategy to Learn Generation; Diffusion's Strength is to Take it Seriously; Exploration is the Future", ICLR Blogposts, 2026 Pierre-André Noël http://arxiv.org/abs/2605.30476v1 Local Differential Privacy with Correlated Noise Achieves Central-DP Optimal Cost 2026-05-28T18:47:37Z

We study privately estimating the sum of $n$ user-held values in the presence of an honest-but-curious server. This motivates requiring privacy not only at data release but also throughout server-side computation. We therefore adopt the local (pure) differential privacy model, in which each user transmits a noise-perturbed value. It is well known that independent local noise typically incurs a substantial utility loss compared to the centralized model, where noise is added only after aggregation. We show that this gap is not fundamental. By carefully designing correlations among the locally added noise variables, we construct $\varepsilon$-DP mechanisms whose estimation cost matches the optimal cost achievable in the centralized setting, up to an arbitrarily small error.

2026-05-28T18:47:37Z Madhura Pathegama Srikanth Avasarala Viveck R. Cadambe Juba Ziani http://arxiv.org/abs/2606.00127v1 One Adaptive Trailing Head Can Outperform Many Oblivious Trailing Heads 2026-05-28T17:58:26Z

In the setting of multi-head finite-state dimensions, trailing heads lag behind a leading head, accessing past data to aid a finite-state gambler placing bets on successive bits read by the leading head. Cruz, Glashausser, Li, and Lutz (2026) proved that, for any fixed number of trailing heads, adaptive (data-dependent) movement rules can strictly outperform oblivious (data-independent) movement schedules. In this paper we strengthen that separation by proving that a single trailing head with adaptive movements can outperform, by a large and uniform margin, arbitrarily many trailing heads with oblivious movements. Formally, our main theorem states that there is a binary sequence whose adaptive two-head finite-state strong dimension is less than its oblivious multi-head finite-state dimension, and that the gap is greater than 0.3.

2026-05-28T17:58:26Z Julianne Cruz Sho Glashausser Neil Lutz http://arxiv.org/abs/2605.30331v1 Majorization precursors to supermodularity and subadditivity on the majorization lattice 2026-05-28T17:57:58Z

We establish two structural majorization relations, which we call precursors, underlying the properties of supermodularity and subadditivity on the lattice induced by majorization. These are precursors in that they immediately imply that all sums of concave functions, which we dub sum-concave functions, are supermodular and subadditive on the majorization lattice. Using these majorization relations, we then show the supermodularity and subadditivity (in the lattice-theoretic sense) of Tsallis entropies (for all $α$) and Rényi entropies (for all $α> 1$), also recovering these properties for the Shannon entropy in the process. We further strengthen these inequalities, showing that: (i) all these entropic functionals are strictly subadditive on the majorization lattice; (ii) Tsallis entropies (and therefore the Shannon entropy as well) are strictly supermodular on the majorization lattice.

2026-05-28T17:57:58Z 9 pages, 1 figure Alexander Stévins Michael G. Jabbour Serge Deside Nicolas J. Cerf http://arxiv.org/abs/2605.04493v4 The unique, universal entropy for complex systems 2026-05-28T17:37:25Z

An axiomatic foundation regarding the entropy for complex systems is established. Missing from decades of research was the requirement that entropy must measure the uncertainty at the informational scale of the maximizing distribution, where the log-log slope equals $-1$. Additionally, entropy must be extensive across the full universality scaling classes defined by Hanel-Thurner. The coupled entropy, maximized by the coupled stretched exponential distributions, is proven to be the unique, universal entropy that satisfies these requirements. The non-additivity of the entropy is equal to the long-range dependence or nonlinear statistical coupling. The entropy-matched extensivity is a function of the coupling, stretching parameter, and dimensions. Evidence is provided that the Tsallis $q$-statistics creates misalignment in the physical modeling of complex systems. Information thermodynamic applications are reviewed, including measuring complexity, a zeroth law of temperature, the thermodynamic consistency of the coupled free energy, and a model of intelligence in non-equilibrium.

2026-05-06T04:47:03Z 35 pages, 6 figures, 3 tables. v4 improves the c and d scaling proof. arXiv admin note: substantial text overlap with arXiv:2511.17684 Kenric P. Nelson http://arxiv.org/abs/2605.20560v2 Reconfigurable Coupler Antenna for Wireless Networks 2026-05-28T17:18:22Z

The reconfigurable coupler antenna (RCA), also called the flexible coupler antenna (FCA), is a new technique that aims to improve the performance of wireless communication networks by reconfiguring the positions and rotations of low-cost couplers around fixed-position active antennas to harness mutual coupling. Specifically, different couplers can independently adjust their positions and/or rotations at the transceiver to reshape the induced currents on the couplers for radiation, thereby collaboratively achieving mechanical beamforming for directional signal enhancement or nulling. The position and/or rotation reconfiguration of passive couplers provides a new and cost-effective means of enhancing wireless communication performance, while significantly reducing the antenna and radio-frequency (RF) chain costs of conventional active arrays. The compact and low form-factor structure of the RCA makes it particularly appealing for devices with stringent size, weight, and power (SWAP) constraints. In this article, we provide an overview of RCA to reveal its promising capabilities in wireless networks, including its system modeling, practical implementation, and competitive advantages over existing techniques. We present a variety of RCA-enabled performance enhancements in terms of mechanical beamforming gain, path-loss reduction, fading mitigation, spatial multiplexing gain, interference suppression, and geometric gain. Furthermore, we elaborate on the design challenges of RCA as well as promising solutions, and discuss the key applications of RCA in wireless networks. Finally, numerical results are presented to verify the substantial capacity gains enabled by RCA-aided transmission in wireless networks.

2026-05-19T23:26:03Z 7 pages Xiaodan Shao Chuangye Shan Weihua Zhuang Xuemin Shen http://arxiv.org/abs/2605.30153v1 Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions 2026-05-28T16:13:44Z

Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of their statistical efficiency remains limited. Existing theories typically rely on strong regularity assumptions, such as uniformly bounded densities or globally smooth score functions, which fail to capture such intrinsic structures. In this work, we study the sample complexity of diffusion models for learning distributions supported on a union of low-dimensional subspaces. Assuming that the data distribution within each subspace is subgaussian, we show that diffusion models require at most $\widetilde{O}(\varepsilon^{-k \vee 2})$ samples to achieve $\varepsilon$ error in 1-Wasserstein distance, where $k$ is the intrinsic dimension. This near-optimal convergence rate depends only on the intrinsic dimension and significantly improves upon prior theoretical guarantees that suffer from the curse of dimensionality. Notably, our analysis applies to a broad collection of distributions without imposing smoothness, bounded-density, or log-concavity assumptions. Overall, our results show that diffusion models can statistically adapt to intrinsic low-dimensional structure while naturally accommodating multi-modal data, offering a rigorous theoretical justification for their success in complex high-dimensional learning tasks.

2026-05-28T16:13:44Z accepted to ICML 2026 Jingda Wu Changxiao Cai