https://arxiv.org/api/gIbC47i/QoMyWSpOFTl9sEV+/2I 2026-06-27T09:41:14Z 54938 960 15 http://arxiv.org/abs/2605.11356v1 RankGuardPolar Private Public Finite Length Polar Codes with Rank-Certified Leakage 2026-05-12T00:23:14Z

We introduce \textbf{RankGuard-Polar}, a framework for safely publishing a subset of polar codeword coordinates over shared public resources. We assume a strong eavesdropper who has access to the channel input, i.e., the transmitted codeword coordinates published on a public resource access model. Working over $\mathbb F_2$ and focusing on time-shared public/private BEC uses, we show that leakage from a published index set $\mathbf{P}$ admits an exact algebraic characterization comes from an information-theoretic viewpoint, and we construct an explicit linear extractor ($R$) that identifies the leaked linear combinations. Building on this identity, we (i) give efficient procedures to compute and certify leakage for any $\mathbf{P}$, (ii) propose a practical fast algorithm with provable efficiency.

2026-05-12T00:23:14Z This paper has been accepted for presentation at the 2026 IEEE International Symposium on Information Theory (ISIT 2026) Hassan Tavakoli Thinh Nguyen Bella Bose http://arxiv.org/abs/2605.11352v1 Parameter Estimation of Mutual Information Maximized Channels 2026-05-12T00:15:36Z

We study the problem of estimating a parametric discrete memoryless channel $ p(y \mid x; \boldsymbolθ) $ when the transmitter selects its input distribution $ π$ to maximize mutual information under the true parameter $ \boldsymbolθ^* $. Using only i.i.d.\ observations of the channel output, we aim to jointly estimate the capacity-achieving input distribution $ \boldsymbolπ^* $ and the true channel parameter $ \boldsymbolθ^* $. In general, recovery of $ \boldsymbolπ^* $ and $ \boldsymbolθ^* $ can be challenging. To that end, we propose two efficient algorithms based on the Blahut--Arimoto (BA) optimality conditions: (i) a bilevel fixed-point method and (ii) an augmented Lagrangian method. Empirical results demonstrate that both proposed algorithms successfully recover the true $ \boldsymbolθ^* $ and $ \boldsymbolπ^* $, whereas a naive maximum-likelihood approach that ignores the mutual-information maximization constraint fails to do so.

2026-05-12T00:15:36Z This paper has been accepted for presentation at the 2026 IEEE International Symposium on Information Theory (ISIT 2026) Hassan Tavakoli Thinh Nguyen Bella Bose http://arxiv.org/abs/2505.11517v4 Information-Theoretic Grid Topology Reconstruction using Low-Precision Smart Meter Data 2026-05-11T21:55:35Z

Accurate knowledge of power grid topology is a prerequisite for effective state estimation and grid stability. While data-driven methods for topology reconstruction exist, the minimum requirements for measurement quality, specifically regarding quantization, precision, and sampling frequency, remain under-explored. This study investigates the data fidelity required to reconstruct distribution grid topologies using voltage magnitude measurements. Adopting an information-theoretic approach, we utilize the Chow-Liu algorithm to generate maximum spanning trees based on mutual information. Rather than proposing a new reconstruction algorithm, our primary contribution is a comprehensive sensitivity analysis of the measurement data itself. We systematically evaluate the impact of data bit-depth, significant digit truncation, time-window length, and different mutual information estimators on reconstruction accuracy. We validate this approach using IEEE test cases (via MATPOWER) and time-series data from GridLAB-D. Our results demonstrate that grid topology can be successfully recovered even with highly quantized 8-bit data or millivolt-level precision. However, performance degrades significantly when downsampling intervals exceed 20 minutes or when data availability is limited to short durations. These findings establish an optimistic theoretical lower bound, suggesting that costly high-precision instrumentation may not be strictly necessary for structural inference under ideal conditions. This rigorous baseline provides a foundation for future evaluations of noisy real world smart meter data and hybrid approaches that incorporate existing engineering priors.

2025-05-07T20:09:04Z Daniel T. Speckhard http://arxiv.org/abs/2508.14780v2 Context Steering: A New Paradigm for Compression-based Embeddings by Synthesizing Relevant Information Features 2026-05-11T21:30:00Z

Compression-based dissimilarities (CD) offer a flexible and domain-agnostic means of measuring similarity by identifying implicit information through redundancies between data objects. However, as similarity features are derived from the data, rather than defined as an input, it often proves difficult to align with the task at hand, particularly in complex clustering or classification settings. To address this issue, we introduce "context steering", a novel methodology that actively guides the feature-shaping process. Instead of passively accepting the emergent data structure (typically a hierarchy derived from clustering CDs), our approach "steers" the process by systematically analyzing how each object influences the relational context within a clustering framework. This process generates a custom-tailored embedding that isolates and amplifies class-distinctive information. We validate this supervised context-steering strategy using Normalized Compression Distance (NCD) and Relative Compression Distance (NRC) combined with hierarchical clustering, and evaluate the learned embeddings through both classification performance and cluster-quality metrics. Experiments on heterogeneous datasets-from text to real-world audio-show that the proposed approach yields robust task-oriented embeddings from compression dissimilarities, moving from traditional transductive uses of distance matrices to an inductive representation that can be applied to unseen data.

2025-08-20T15:26:52Z Guillermo Sarasa Ana Granados Francisco de Borja Rodríguez http://arxiv.org/abs/2605.11120v1 Sensor Design for Accuracy-Bounded Estimation via Maximum-Entropy Likelihood Synthesis 2026-05-11T18:28:45Z

Designing the sensing architecture for large-scale spatio-temporal systems is hard when accuracy requirements are specified but sensor models are uncertain or unavailable. Classical design treats sensor placement and estimation sequentially, requiring valid forward models for each sensing modality. This paper inverts the design flow: given an error budget, synthesize the measurement likelihood that enforces it while injecting minimal information beyond the dynamical prior. The likelihood is constructed by constrained optimization: among all posteriors satisfying a prescribed accuracy bound relative to a target, select the one minimizing Kullback-Leibler divergence from the prior. The solution is a maximum-entropy posterior in relative-entropy form, and the induced likelihood is the Radon-Nikodym derivative. The framework accommodates arbitrary discrepancies and is instantiated for Wasserstein distance, maximum mean discrepancy, $f$-divergences, moment constraints, and hybrid metrics. For each, we derive the discrete particle-level problem, analyze its convex or convex-relaxed structure, and present solvers with complexity scaling. A closed-form solution exists for the symmetric exponential-tilt case, and a distillation procedure converts nonparametric likelihood samples into parametric forms. A two-layer sensor design architecture embeds the synthesized likelihood in the recursive predict-update loop, connecting accuracy budgets to physical sensor placement, precision, and configuration. Numerical experiments comparing four metrics on unimodal and multimodal scenarios confirm the accuracy constraints are reliably enforced and reveal how metric choice determines the amount and spatial distribution of injected information.

2026-05-11T18:28:45Z Raktim Bhattacharya http://arxiv.org/abs/2605.10943v1 A passive self-correcting quantum memory in three dimensions 2026-05-11T17:59:56Z

We construct a 3D Pauli stabilizer Hamiltonian whose ground state space can encode a qubit for exponential time when coupled to a bath at non-zero temperature. Our construction recursively applies a sequence of transformations to a seed Hamiltonian that increases the memory lifetime of the encoded qubit while maintaining geometric locality in $\mathbb{R}^3$.

2026-05-11T17:59:56Z 102 pages Shankar Balasubramanian Margarita Davydova Ting-Chun Lin http://arxiv.org/abs/2605.10879v1 Private Information Retrieval With Arbitrary Privacy Requirements for Graph-Based Storage 2026-05-11T17:27:33Z

We reformulate the definition of privacy in the private information retrieval (PIR) problem to accommodate flexible privacy requirements. We focus on graph-replicated PIR, with a generalized privacy requirement, instead of requiring all messages to be private from all servers, during retrieval. Towards this, we define a privacy requirement set for each server, which can be an arbitrary subset of all message indices, as long as the stored message indices are in their privacy requirement set. Since both the storage and privacy requirement sets have many possibilities, we focus on two specific storage settings, namely the path and cyclic graphs. We consider several privacy settings for each of them, which are not necessarily the same, to give different examples for privacy sets. Of particular interest are the privacy sets that comprise the indices of messages stored at servers within a neighborhood range. The neighborhood range parameter allows a transition from the recently introduced local PIR [1] to the standard graph-replicated PIR. In these cases, we derive bounds on the capacity or find the exact capacity.

2026-05-11T17:27:33Z Mohamed Nomeir Shreya Meel Sennur Ulukus http://arxiv.org/abs/2605.10878v1 Neural Weight Norm = Kolmogorov Complexity 2026-05-11T17:27:31Z

Why does weight decay work? We prove that, in any fixed-precision regime, the smallest weight norm of a looped neural network outputting a binary string equals the Kolmogorov complexity of that string, up to a logarithmic factor. This implies that weight decay induces a prior matching Solomonoff's universal prior, the optimal prior over computable functions, up to a polynomial factor. The result is norm-agnostic: in fixed precision, every weight norm collapses to the non-zero parameter count up to constants, so the same sandwich bound holds for any norm used as a regulariser. The proof has two short reductions: any program for a universal Turing machine can be encoded into neural weights at unit cost per program bit, and any fixed-precision network can be described by enumerating its non-zero parameters with logarithmic addressing overhead. Both bounds are tight up to constants, with the logarithmic factor realised by permutation encodings: a network whose parameters encode a permutation produces a string whose Kolmogorov complexity is the non-zero parameter count times its logarithm. The fixed-precision assumption is essential: with infinite precision, neural networks can encode non-computable functions and the weight norm loses its relevance.

2026-05-11T17:27:31Z Tiberiu Musat http://arxiv.org/abs/2605.10872v1 Local Private Information Retrieval: A New Privacy Perspective for Graph-Based Replicated Systems 2026-05-11T17:25:10Z

We rethink the definition of privacy in multi-server, graph-replicated private information retrieval (PIR) systems, and introduce a novel setting where the user's privacy is governed by the servers' storage structure. In particular, while retrieving a message from a server, the user is concerned with hiding their desired message index from the server, only if the server stores the corresponding message. We coin this privacy requirement as local user privacy and the resulting PIR problem as local PIR on the graph. Our goal is to measure the gain in communication efficiency of local PIR, compared to that of canonical PIR, by establishing its capacity, i.e., the maximum number of message symbols retrieved, per downloaded symbol. To this end, we observe a remarkable gain in the local PIR capacity of graphs, that are disjoint union of distinct graphs, which is multiplicative, compared to the PIR capacity, when the individual graphs are identical. For connected graphs, we propose schemes to establish capacity lower bounds for edge-transitive and bipartite graphs, which are greater than the best-known PIR capacity bounds. Finally, we derive the exact local PIR capacity for the cyclic graph, and the path graph with an odd number of vertices.

2026-05-11T17:25:10Z Shreya Meel Mohamed Nomeir Sennur Ulukus http://arxiv.org/abs/2603.19379v2 Wireless Broadcast Gossip for Decentralized Drone Swarms: Success Probability, Contraction, and Optimal Aloha 2026-05-11T16:09:20Z

We study a tractable baseline for average-preserving broadcast gossip in decentralized drone swarms under a quasi-static planar Poisson model and a matching-based abstraction. With slotted Aloha, Rayleigh fading, and threshold decoding, we derive: 1) a closed-form SIR success law; 2) a mean-square contraction bound that separates ideal mixing from wireless successful updates via a conservative lower bound; and 3) a closed-form proxy access rule with interpretable density scaling. Explicit-interference simulations, together with robustness checks for receiver selection, noise, fading, and spatial regularity, confirm a stable intermediate operating region for the Aloha probability.

2026-03-19T18:13:49Z Ali Khalesi http://arxiv.org/abs/2605.10724v1 Selective Placement of Hollow-Core Fibers for QKD and Classical Communication Coexistence 2026-05-11T15:32:27Z

We investigate the benefits of partially upgrading optical networks with hollow-core fibers for QKD-classical communication coexistence. Results show that upgrading 40% of links in a metro topology can reduce the number of quantum modules by up to 49%.

2026-05-11T15:32:27Z Giovanni Simone Sticca Alessandro Gagliano Memedhe Ibrahimi Alberto Gatto Francesco Musumeci Massimo Tornatore http://arxiv.org/abs/2605.10713v1 Price of Quality: Sufficient Conditions for Sparse Recovery using Mixed-Quality Data 2026-05-11T15:24:27Z

We study sparse recovery when observations come from mixed-quality sources: a small collection of high-quality measurements with small noise variance and a larger collection of lower-quality measurements with higher variance. For this heterogeneous-noise setting, we establish sample-size conditions for information-theoretic and algorithmic recovery. On the information-theoretic side, we show that it is sufficient for $(n_1, n_2)$ to satisfy a linear trade-off defining the Price of Quality: the number of low-quality samples needed to replace one high-quality sample. In the agnostic setting, where the decoder is completely agnostic to the quality of the data, it is uniformly bounded, and in particular one high-quality sample is never worth more than two low-quality samples for this sufficient condition to hold. In the informed setting, where the decoder is informed of per-sample variances, the price of quality can grow arbitrarily large. On the algorithmic side, we analyze the LASSO in the agnostic setting and show that the recovery threshold matches the homogeneous-noise case and only depends on the average noise level, revealing a striking robustness of computational recovery to data heterogeneity. Together, these results give the first conditions for sparse recovery with mixed-quality data and expose a fundamental difference between how the information-theoretic and algorithmic thresholds adapt to changes in data quality.

2026-05-11T15:24:27Z Published as a conference paper at ICLR 2026 Youssef Chaabouni David Gamarnik http://arxiv.org/abs/2605.10681v1 Scalable Mamba-Based Message-Passing Neural Decoder for Error-Correcting Codes 2026-05-11T14:57:41Z

Forward error correction is essential for reliable communication over noisy channels. Attention-based model-free neural decoders have shown strong performance for short codes, but their scalability to longer codes is limited by the quadratic memory and computational cost of attention. In this paper, we introduce the Mamba message-passing decoder (MMPD), an attention-free syndrome-based neural decoder for binary linear codes. MMPD retains the Tanner-graph structure of a message-passing decoder by performing local pairwise aggregation along variable-check edges. To enable efficient long-range information propagation, these local updates are combined with bidirectional Mamba state-space blocks. By avoiding dense attention matrices, MMPD scales more favorably for long codes in both memory and computation. Experiments on the (1056, 880) LDPC code show that MMPD achieves a 0.45 dB gain over the state-of-the-art CrossMPT decoder at a specified target bit error rate, while reducing memory consumption by a factor of 1.5. This reduction factor increases substantially for longer codes, demonstrating the applicability of MMPD to scalable neural decoding of practical long codes.

2026-05-11T14:57:41Z This work has been submitted to the IEEE for possible publication Rostislav Gusev Nikita Aleksandrov Artem Solomkin Dmitry Artemasov http://arxiv.org/abs/2510.08117v2 Near-optimal Rank Adaptive Inference of High Dimensional Matrices 2026-05-11T14:19:10Z

We address the problem of estimating a high-dimensional matrix from linear measurements, with a focus on designing optimal rank-adaptive algorithms. These algorithms infer the matrix by estimating its singular values and the corresponding singular vectors up to an effective rank, adaptively determined based on the data. We establish instance-specific lower bounds for the sample complexity of such algorithms, uncovering fundamental trade-offs in selecting the effective rank: balancing the precision of estimating a subset of singular values against the approximation cost incurred for the remaining ones. Our analysis identifies how the optimal effective rank depends on the matrix being estimated, the sample size, and the noise level. We propose an algorithm that combines a Least-Squares estimator with a universal singular value thresholding procedure. We provide finite-sample error bounds for this algorithm and demonstrate that its performance nearly matches the derived fundamental limits. Our results rely on an enhanced analysis of matrix denoising methods based on singular value thresholding. We validate our findings with applications to multivariate regression and linear dynamical system identification.

2025-10-09T12:01:46Z AISTATS 2026 Frédéric Zheng Yassir Jedra Alexandre Proutiere http://arxiv.org/abs/2605.10626v1 Sparse Signal Recovery using Log-Sum Regularization and Adaptive Smoothing 2026-05-11T14:17:53Z

We study sparse signal recovery from noisy linear observations using nonconvex log-sum regularization. The log-sum penalty reduces the shrinkage bias of $\ell_1$ regularization and more closely approximates the $\ell_0$ regularization, but its nonconvexity can make reconstruction algorithms unstable. To mitigate this instability, we use an adaptive smoothing strategy that determines the smoothing parameter so that the scalar proximal operator remains continuous. Using this proximal operator, we formulate the approximate message passing (AMP) algorithm and derive the corresponding state evolution (SE) recursion. The fixed point of the SE recursion predicts the final mean squared error (MSE) and, in the noiseless limit, the exact-recovery phase transition. To further investigate finite-dimensional reconstruction behavior, we implement an alternating direction method of multipliers (ADMM) algorithm. In the noiseless setting, we find that the empirical success boundary of ADMM closely agrees with the SE-predicted phase transition. In the noisy setting, we observe that AMP closely follows the SE prediction, whereas ADMM qualitatively reproduces the SE-predicted dependence of the final MSE on the regularization parameter. A comparison with $\ell_1$ regularization shows that log-sum regularization is beneficial in low-density or high-measurement-rate regimes, whereas $\ell_1$ regularization remains preferable at higher densities and lower measurement rates.

2026-05-11T14:17:53Z 6 pages, 4 figures Keisuke Morita Masayuki Ohzeki