http://arxiv.org/abs/2605.00953v4 Information Accessibility Limits in Structured NP Search 2026-05-21T00:48:46Z

We study the problem of locating violating principal minors in matrix families lying near the boundary of P-matrices. Rather than viewing this search problem purely through computational complexity, we analyze it from an information-accessibility perspective. We show that, despite strong underlying algebraic structure, the location of a violating subset may remain difficult to infer through local queries. In the sparse-violation regime, local observations typically provide only weak eliminative power, and polynomially many queries accumulate only vanishing mutual information about the hidden witness under the induced oracle model. Using mutual information and Fano's inequality, we characterize the resulting limitation on information acquisition. The analysis highlights a conceptual distinction between structure and accessibility: a problem may possess rich underlying structure while the information required to identify a hidden witness remains weakly inferable from observable responses.
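
As a rough illustration of how Fano's inequality turns a mutual-information budget into an error bound, the sketch below assumes a witness drawn uniformly from `num_candidates` possibilities and uses the standard weakened form $P_e \ge 1 - (I + 1)/\log_2 M$ with information measured in bits. The function name and the example numbers are hypothetical and are not taken from the paper.

```python
import math


def fano_error_lower_bound(mutual_info_bits: float, num_candidates: int) -> float:
    """Lower-bound the error probability of any estimator of a uniform witness.

    Uses the weakened Fano inequality P_e >= 1 - (I + 1) / log2(M),
    where I is the mutual information (in bits) between the witness and
    the observations, and M is the number of equally likely candidates.
    """
    if num_candidates < 2:
        raise ValueError("need at least two candidates")
    bound = 1.0 - (mutual_info_bits + 1.0) / math.log2(num_candidates)
    # The inequality can be vacuous (negative), so clip to a valid probability.
    return max(0.0, min(1.0, bound))


if __name__ == "__main__":
    # Hypothetical numbers: 2**20 candidate subsets, 1 bit of accumulated information.
    print(fano_error_lower_bound(1.0, 2**20))
```

When the accumulated mutual information stays small relative to $\log_2 M$, the bound stays close to 1, which is the sense in which weakly informative queries cannot locate the witness reliably.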

2026-05-01T11:43:51Z 24 pages, 1 figure. Includes appendices with explicit constructions and numerical examples Jing-Yuan Wei http://arxiv.org/abs/2503.06115v2 On Statistical Estimation of Edge-Reinforced Random Walks 2026-05-21T00:31:37Z

Reinforced random walks (RRWs), including vertex-reinforced random walks (VRRWs) and edge-reinforced random walks (ERRWs), model random walks where the transition probabilities evolve based on prior visitation history~\cite{mgr, fmk, tarres, volkov}. These models have found applications in various areas, such as network representation learning~\cite{xzzs}, reinforced PageRank~\cite{gly}, and modeling animal behaviors~\cite{smouse}. However, statistical estimation of the parameters governing RRWs remains underexplored. This work focuses on estimating the initial edge weights of ERRWs using observed trajectory data. Leveraging the connections between an ERRW and a random walk in a random environment (RWRE)~\cite{mr, mr2}, as given by the so-called ``magic formula'', we propose an estimator based on the generalized method of moments. To analyze the sample complexity of our estimator, we exploit the hyperbolic Gaussian structure embedded in the random environment to bound the fluctuations of the underlying random edge conductances.

2025-03-08T07:57:50Z This is the full version of the conference paper in submission to ISIT 2025 Qinghua Devon Ding Venkat Anantharam http://arxiv.org/abs/2503.12859v2 The Density Formula Approach for Non-reversible Isomorphism Theorems, with Applications 2026-05-21T00:11:40Z

The classical isomorphism theorems for reversible Markov chains have played an important role in studying the properties of local time processes of strongly symmetric Markov processes~\cite{mr06}, bounding the cover time of a graph by a random walk~\cite{dlp11}, and in topics related to physics, such as random walk loop soups and Brownian loop soups~\cite{lt07}. Non-reversible versions of these theorems have been discovered by Le Jan, Eisenbaum, and Kaspi~\cite{lejan08, ek09, eisenbaum13}. Here, we give a density-formula-based proof for all these non-reversible isomorphism theorems, extending the results in \cite{bhs21}. Moreover, we use this method to generalize the comparison inequalities derived in \cite{eisenbaum13} for permanental processes and derive an upper bound for the cover time of non-reversible Markov chains.

2025-03-17T06:41:13Z This is the full version of the conference paper in submission to ISIT 2025 Qinghua Devon Ding Venkat Anantharam http://arxiv.org/abs/2605.21784v1 Constructions of Rank-Metric Codes of Small Tensor Rank 2026-05-20T22:23:06Z

Rank-metric codes are subspaces of matrices over finite fields endowed with the rank metric and admit a natural tensorial representation. The tensor rank provides a measure of the minimal size of a decomposition of a code into rank-one tensors. Kruskal showed that the tensor rank of a rank-metric code of dimension $k$ and minimum rank distance $d$ is at least $k + d - 1$, and codes meeting this bound with equality are called minimal tensor rank (MTR) codes. It is known from algebraic complexity theory that the existence of an MTR code implies the existence of a maximum distance separable (MDS) code. In this work, we establish new results relating the tensor rank of a rank-metric code to the parameters of associated linear codes in the Hamming metric and introduce the notion of tensor rank defect. We then develop new constructions of rank-metric codes with small tensor rank defect using algebraic geometry (AG) codes.

2026-05-20T22:23:06Z Matteo Bonini Eimear Byrne Giuseppe Cotardo http://arxiv.org/abs/2605.21742v1 Correcting Class Imbalance in Prior-Data Fitted Networks for Tabular Classification 2026-05-20T21:10:54Z

Prior-data fitted networks (PFNs) have achieved exceptional performance on tabular classification tasks. However, like other classifiers, their performance can suffer under the effect of class imbalance, resulting in poor performance for rare classes. Several techniques exist that attempt to mitigate the deleterious effect of class imbalance on classification performance, but the in-context learning (ICL) dynamic of PFNs means that loss-based strategies are impossible, and other techniques are unproven. We have adapted several classical techniques addressing class imbalance and analyzed their performance on PFN classification. We observe that thresholding performs exceptionally well because of the calibration characteristics of PFNs, and downsampling performs comparably because of PFNs' exceptional limited-data performance, with the additional benefit of reduced computation cost for inference.

2026-05-20T21:10:54Z 5 pages, 6 figures, Information Theory Workshop (ITW) Samuel McDowell Nathan Stromberg Lalitha Sankar http://arxiv.org/abs/2508.16498v2 Enhanced Successive Cancellation List Decoder for Long Polar Codes Targeting Air Interface 2026-05-20T20:29:09Z

Polar codes are the first codes with a proven capacity-achieving capability, but their decoding faces several challenges, especially under long code lengths. In this paper, we target algorithmic improvements and analyses to enable the implementation of long polar codes (e.g., length 8K bits) by addressing key challenges in memory usage and computational complexity presented by successive cancellation list (SCL) polar decoding. Perturbation-enhanced (PE) SCL decoders with a list size of $L$ reach the decoding performance of the SCL decoder with a list size of $2L$. The proposed bias-enhanced (BE) SCL decoders, which simplify the PE SCL decoder based on insights gained by an ablation study, achieve decoding performance similar to that of PE SCL decoders. In addition, the proposed BE generalized partitioned SCL (GPSCL) decoders with a list size of $8$ reduce memory usage by $67\%$ while delivering decoding performance similar to that of SCL decoders with a list size of $16$. This demonstrates that an accurate bias can be generated from a reduced number of codewords in the list, and it reduces the overhead from $\left(L-1\right)n$ XOR gates plus $n$ priority encoders to $n$ XOR gates, where $n$ is the code length. Furthermore, input-distribution-aware (IDA) decoding is applied to BE GPSCL decoders, which shows how an accurate bias is generated under a low-complexity decoder. Up to $5.4\times$ reduction in the computational complexity is achieved compared to SCL decoders with a list size of $16$, and negligible latency overhead is added to the decoding process. The decoding performance loss is at most $0.05\text{ dB}$ compared to BE GPSCL decoders without IDA decoding. Lastly, we theoretically prove that the bias in the BE SCL decoder moves the received soft information toward valid polar codewords with a high likelihood, and explain the decoding performance gain.

2025-08-22T16:24:46Z Jiajie Li Sihui Shen Warren J. Gross http://arxiv.org/abs/2605.21328v1 SAOITHE: Sustainable Age-of-Information-Based Timely Status Updating for Hardware-constrained Edge networks 2026-05-20T15:54:43Z

In future large-scale deployments of 6G and beyond networks, collecting timely information, as measured by the Age of Information (AoI) metric, is becoming increasingly important. At the same time, the environmental impact, often characterized by the resulting Carbon Footprint (CF), depends on both the amount of consumed energy and the Carbon Intensity (CI), i.e., the amount of CO$_2$-equivalent emissions produced per unit of consumed energy. Since CI varies over time, minimizing energy is not equivalent to minimizing CF, as a status update with the same energy demand may result in a different carbon cost depending on when it is transmitted. This makes timely status updating a nontrivial scheduling problem. To address this challenge, we formulate carbon-aware status updating as a constrained Markov Decision Process (MDP) that minimizes AoI subject to CF budget, transmission duty-cycle, and channel-capacity constraints. We then propose Sustainable Age-of-Information-Based Timely Status Updating for Hardware-constrained Edge networks (SAOITHE), a Whittle-index-based scheduling solution that enables scalable real-time scheduling. Using real-world CI traces across low-, medium-, and high-CI regions, the results show that SAOITHE remains within the allocated CF budget while achieving lower AoI than baseline policies. Moreover, the gains are around 25% and 20% in low- and medium-CI regions, respectively, and up to 75% in high-CI settings, while preserving scalability.
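
To make the point that equal energy does not imply equal carbon cost concrete, here is a minimal sketch that multiplies per-slot energy by per-slot carbon intensity. The function name, units (kWh and gCO2e/kWh), and the two-slot trace are illustrative assumptions, not data or code from the paper.

```python
from typing import Sequence


def carbon_footprint(energy_kwh: Sequence[float], carbon_intensity: Sequence[float]) -> float:
    """Total carbon footprint as the sum over slots of energy times carbon intensity.

    energy_kwh[t] is the energy consumed in slot t (kWh, assumed unit);
    carbon_intensity[t] is the CI in slot t (gCO2e per kWh, assumed unit).
    """
    if len(energy_kwh) != len(carbon_intensity):
        raise ValueError("traces must have the same length")
    return sum(e * c for e, c in zip(energy_kwh, carbon_intensity))


if __name__ == "__main__":
    ci_trace = [100.0, 400.0]        # hypothetical CI values for two slots
    send_early = [0.5, 0.0]          # one update sent in the low-CI slot
    send_late = [0.0, 0.5]           # same energy, sent in the high-CI slot
    print(carbon_footprint(send_early, ci_trace))
    print(carbon_footprint(send_late, ci_trace))
```

Both schedules spend the same energy, yet the second incurs four times the carbon cost under this hypothetical trace, which is why the timing of an update enters the scheduling problem.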

2026-05-20T15:54:43Z 11 pages, 7 figures, Under review at IEEE Shih-Kai Chou Maice Costa Mihael Mohorčič Jernej Hribar http://arxiv.org/abs/2504.16726v2 Partial orders and contraction for BISO channels 2026-05-20T15:07:04Z

A fundamental question in information theory is to quantify the loss of information under a noisy channel. Partial orders and contraction coefficients are typical tools to that end; however, they are often challenging to evaluate. For the special class of binary input symmetric output (BISO) channels, Geng et al. showed that among channels with the same capacity, the binary symmetric channel (BSC) and binary erasure channel (BEC) are extremal with respect to the more capable order. Here, we show two main results. First, for channels with the same KL contraction coefficient, the same holds with respect to the less noisy order. Second, for channels with the same Dobrushin coefficient, or equivalently maximum leakage or Doeblin coefficient, the same holds with respect to the degradability order. In the process, we provide a closed-form expression for the contraction coefficients of BISO channels. We also discuss the comparability of BISO channels and extensions to binary channels in general.
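
For readers who want to experiment with the quantities involved, the following sketch computes the Dobrushin coefficient of a discrete channel directly from its definition, as the largest total variation distance between two rows of the transition matrix. The helper names `bsc` and `bec` and the parameter values are illustrative assumptions. The closed-form expressions mentioned in the abstract are not reproduced here.

```python
from typing import List

Channel = List[List[float]]  # channel[x][y] = P(Y = y | X = x)


def dobrushin_coefficient(channel: Channel) -> float:
    """Maximum total variation distance between the output laws of two inputs."""
    best = 0.0
    for row_a in channel:
        for row_b in channel:
            tv = 0.5 * sum(abs(p - q) for p, q in zip(row_a, row_b))
            best = max(best, tv)
    return best


def bsc(delta: float) -> Channel:
    """Binary symmetric channel with crossover probability delta."""
    return [[1.0 - delta, delta], [delta, 1.0 - delta]]


def bec(eps: float) -> Channel:
    """Binary erasure channel with erasure probability eps (middle output = erasure)."""
    return [[1.0 - eps, eps, 0.0], [0.0, eps, 1.0 - eps]]


if __name__ == "__main__":
    # BSC(0.1) and BEC(0.2) are chosen so that both coefficients coincide.
    print(dobrushin_coefficient(bsc(0.1)), dobrushin_coefficient(bec(0.2)))
```

Pairing a BSC and a BEC with matching coefficients in this way gives a simple numerical starting point for comparing the two extremal channels.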

2025-04-23T14:00:36Z 6 pages, accepted at ISIT 2025 Christoph Hirche Oxana Shaya http://arxiv.org/abs/2605.21185v1 Information Leakage Envelopes 2026-05-20T13:50:26Z

We study privacy guarantees in the framework of pointwise maximal leakage (PML) that satisfy two requirements: they are robust under post-processing and upper bound the failure probability, i.e., the probability that the information leakage exceeds a given threshold. We first examine two candidate definitions inspired by (approximate) differential privacy and show that neither one satisfies both requirements simultaneously. We then introduce the notion of the PML envelope, which quantifies the largest amount of information leakage about a secret after arbitrary post-processing of a mechanism's output. By construction, the PML envelope satisfies both requirements. We discuss basic structural properties of the envelope, such as monotonicity, and derive general upper and lower bounds. We further analyze the envelope for two widely used privacy mechanisms: the PML-extremal mechanisms in the high-privacy regime and randomized response. Overall, this work establishes the PML envelope as a natural and operationally meaningful definition for providing privacy guarantees that are preserved under arbitrary downstream transformations.

2026-05-20T13:50:26Z Accepted to CSF2026 Sara Saeidian KTH Royal Institute of Technology Inria Saclay Carlos Pinzón Inria Saclay École Polytechnique Catuscia Palamidessi Inria Saclay École Polytechnique http://arxiv.org/abs/2605.21181v1 On the Identifiability of Semi-Blind Estimation in Cell-Free Massive MIMO Networks 2026-05-20T13:47:56Z

Semi-blind joint channel estimation and data detection (JCD) is a promising approach to mitigate pilot contamination in cell-free massive multiple-input multiple-output (CF-MaMIMO) networks. The effectiveness of such methods fundamentally depends on identifiability, i.e., the ability to unambiguously recover the unknown channel coefficients and transmitted data signals from the received uplink observations. In this work, we investigate the identifiability of semi-blind JCD from a large-scale system design perspective. We consider a CF-MaMIMO network in which access points (APs) and user equipments (UEs) are spatially distributed according to Poisson point processes (PPPs). The resulting network topology is modeled as a bipartite random geometric graph (BRGG) that captures local connectivity induced by wireless propagation. To enable a tractable analysis, the spatially dependent graph model is approximated by a surrogate independent-edge random graph with matched degree distributions. Building on this model, we develop a recursive probabilistic analysis that characterizes the conditions under which semi-blind recovery succeeds with high probability. The proposed analysis reveals an identifiability region as a function of key system parameters, including AP and UE densities and the connectivity radius beyond which channel coefficients are assumed negligible. Monte Carlo simulations validate the predicted identifiability region and assess the accuracy of the proposed graph approximation. The proposed framework provides system-level insights into how network density and connectivity affect identifiability in large-scale CF-MaMIMO systems and offers guidelines for selecting deployment parameters and pilot sequence lengths that enable reliable semi-blind recovery.

2026-05-20T13:47:56Z 6 pages, 4 figures, submitted for possible conference publication Christian Forsch Laura Cottatellucci http://arxiv.org/abs/2505.00894v3 Non-Adaptive Cryptanalytic Time-Space Lower Bounds via a Shearer-like Inequality for Permutations 2026-05-20T12:35:50Z

The power of adaptivity in algorithms has been intensively studied in diverse areas of theoretical computer science. In this paper, we obtain a number of sharp lower bound results which show that adaptivity provides significant extra power in cryptanalytic time-space tradeoffs with (possibly unlimited) preprocessing time. Most notably, we consider the discrete logarithm (DLOG) problem in a generic group of $N$ elements. The classical `baby-step giant-step' algorithm for the problem has time complexity $T=O(\sqrt{N})$, uses $O(\sqrt{N})$ bits of space (up to logarithmic factors in $N$) and achieves constant success probability. We examine a generalized setting where an algorithm obtains an advice string of $S$ bits and is allowed to make $T$ arbitrary non-adaptive queries that depend on the advice string (but not on the challenge group element). We show that in this setting, the $T=O(\sqrt{N})$ online time complexity of the baby-step giant-step algorithm cannot be improved, unless the advice string is more than $\Omega(\sqrt{N})$ bits long. This lies in stark contrast with the classical adaptive Pollard's rho algorithm for DLOG, which can exploit preprocessing to obtain the tradeoff curve $ST^2=O(N)$. We obtain similar sharp lower bounds for several other cryptanalytic problems. To obtain our results, we present a new model that allows analyzing non-adaptive preprocessing algorithms for a wide array of search and decision problems in a unified way. Since previous proof techniques inherently cannot distinguish between adaptive and non-adaptive algorithms for the problems in our model, they cannot be used to obtain our results. Consequently, our proof uses a variant of Shearer's lemma for this setting, due to Barthe, Cordero-Erausquin, Ledoux, and Maurey (2011). This seems to be the first time a variant of Shearer's lemma for permutations is used in an algorithmic context.

2025-05-01T22:17:11Z Minor editorial changes. A shorter version was published at STOC 2026 Itai Dinur Nathan Keller Avichai Marmor http://arxiv.org/abs/2605.21553v1 TONIC: Token-Centric Semantic Communication for Task-Oriented Wireless Systems 2026-05-20T11:49:11Z

Tokens are becoming the basic units through which foundation models represent and process information for understanding and inference. However, traditional wireless communication, centered on bit-level fidelity, faces a mismatch between what is transmitted reliably and what downstream models actually consume. This mismatch calls for a communication design that directly accounts for token-level task relevance and downstream model requirements, rather than treating all transmitted bits as equally important. In this paper, we propose TONIC, a token-centric semantic communication framework for task-oriented wireless systems. The transmitter converts each source sample into a sequence of tokens, estimates token-level task relevance, and allocates protection through utility-aware unequal error protection under a fixed channel-use budget. At the receiver, token-level confidence is used to gate unreliable decisions, turning harmful substitutions into recoverable erasures before a Transformer-based completion model restores the masked tokens for final task inference. Our framework combines transmitter-side semantic-aware protection with receiver-side confidence-aware gating in a modular and interpretable architecture, rather than relying solely on fully black-box end-to-end learning. We further establish a utility-aware Bayes-risk interpretation for the receiver-side gating rule and study its interaction with unequal protection and completion. Experimental results on image classification show that TONIC consistently outperforms separation-based schemes, the pixel-domain DeepJSCC baseline, and token-domain baselines under matched communication budgets over AWGN, Rayleigh, and Rician channels.

2026-05-20T11:49:11Z 15 pages, 10 figures Sige Liu Kezhi Wang http://arxiv.org/abs/2605.21056v1 On Unified and Sharpened CMI Bounds for Generalization Errors 2026-05-20T11:42:35Z

We present a new family of information-theoretic generalization bounds within the framework of conditional mutual information (CMI). Most of our results are established based on the leave-$m$-out (L$m$O) cross-validation error, with $m$ denoting the number of hold-out supersamples. Under this setting, we propose a unified CMI-based bound that envelops and reproduces many known CMI-based bounds and also bridges the gap between the MI- and CMI-based bounds when $m$ tends to infinity. The proposed framework not only provides a unified description of the existing bounds but also yields new, sharper bounds. We show the benefits of the proposed bounds through several simple examples, where the existing results are either inapplicable or looser. Moreover, under the premise that the loss function is bounded, we tighten the CMI quantities involved in the proposed bounds by reducing the number of conditional terms, thereby enhancing the proposed framework. We show empirically that the resulting new bounds improve upon the previously known ones.

2026-05-20T11:42:35Z This work is an extended version of the preliminary work presented at the ISIT2025 conference Yang Lu Matthias Frey Margreta Kuijper Jingge Zhu http://arxiv.org/abs/2605.21020v1 Microwave Linear Analog Computer (MiLAC)-Aided MIMO Radar Sensing: Transmit Beamforming Design and DoA Estimation 2026-05-20T10:53:37Z

Multiple-input multiple-output (MIMO) radar has waveform diversity and large spatial degrees of freedom (DoFs), making it attractive for high-resolution sensing. Scaling MIMO radar to massive arrays can further improve sensing performance, but it also increases hardware cost, power consumption, and digital processing complexity. The microwave linear analog computer (MiLAC) can tackle these challenges by moving linear operations from the digital domain to the analog domain. MiLAC has shown promising benefits for communications in recent studies, and this paper identifies its potential for radar sensing. Specifically, we consider both MiLAC-aided transmit beamforming and receiver-side two-dimensional discrete Fourier transform (2D-DFT)-based direction-of-arrival (DoA) estimation. For transmit beamforming, we formulate a weighted Cramér-Rao bound (CRB) minimization problem under lossless and reciprocal MiLAC constraints and propose a penalty dual decomposition (PDD)-based iterative algorithm to address the non-convex problem. We further prove that MiLAC-aided and fully-digital beamforming achieve the same CRB. For receiver processing, we show that the 2D DFT can be implemented by a lossless reciprocal MiLAC, which enables analog-domain DoA estimation without digital optimization. Numerical results confirm the theoretical finding and show that the MiLAC-aided approach achieves the same CRB and DoA estimation performance as the fully-digital benchmark. Meanwhile, hardware cost and power consumption are reduced because only low-resolution DACs are required at the transmitter, while RF chains and ADCs are eliminated at the receiver. Moreover, performing the 2D DFT in the analog domain eliminates all digital DFT operations for DoA estimation.

2026-05-20T10:53:37Z Submitted to IEEE journal Ziang Liu Zheyu Wu Bruno Clerckx http://arxiv.org/abs/2605.21016v1 Partially Observable Restless Bandits for Age-Optimal Scheduling over Markov Channels 2026-05-20T10:50:29Z

The proliferation of Internet of Things (IoT) applications has created a surging need for fresh information. To characterize the information freshness perceived by the destination, the age of information (AoI) has been proposed. In this paper, we consider an IoT system with multiple devices sending status update packets to a central controller through time-correlated Markov channels and assume that the instantaneous channel states are not available to the central controller before making scheduling decisions. To ensure information freshness, we investigate a timely scheduling problem that minimizes the total expected time-average AoI under a strict communications bandwidth constraint. We formulate this problem as a partially observable restless multi-armed bandit problem. Using Lagrangian relaxation, we decouple the relaxed problem into multiple sub-problems and prove the threshold structure of their optimal policies. Armed with this property, we establish the indexability of the decoupled problem and design an algorithm to compute the Whittle index. To reduce implementation complexity, we further derive a Whittle-like index in closed form for low-complexity scheduling. Simulation results show that the proposed index-based policies outperform the baselines, remain close to the optimal policy or relaxed lower bound, and are especially effective when scheduling resources are limited or the network size is large.
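
As a small illustration of the AoI metric itself, the sketch below tracks the age of one device in discrete time under a common convention that is assumed here rather than taken from the paper: the age grows by one each slot and resets to one whenever an update is delivered. The function name and the example delivery pattern are hypothetical.

```python
from typing import Sequence


def time_average_aoi(deliveries: Sequence[bool], initial_age: int = 1) -> float:
    """Time-average age of information for a single device.

    deliveries[t] is True when a fresh update is successfully received in slot t.
    Assumed convention: age resets to 1 on delivery, otherwise increases by 1.
    """
    if not deliveries:
        raise ValueError("need at least one slot")
    age = initial_age
    total = 0
    for delivered in deliveries:
        age = 1 if delivered else age + 1
        total += age
    return total / len(deliveries)


if __name__ == "__main__":
    # Hypothetical pattern: delivered, two failed or unscheduled slots, delivered.
    print(time_average_aoi([True, False, False, True]))
```

In a scheduling problem of the kind described above, a delivery in a slot would require both that the device is scheduled and that its channel is in a good state. The sketch takes the resulting success pattern as given.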

2026-05-20T10:50:29Z This work has been submitted to the IEEE for possible publication Xijun Wang Shuying Gan Yanzhi Huang Xiaoyu Zhao Chao Xu Xiang Chen