https://arxiv.org/api/8twq+b4X0a6PoVlt5asV3SO9BeM2026-06-21T19:16:36Z5484119515http://arxiv.org/abs/2606.11500v1FlexiBrain: Resolution-Agnostic Voxel-Level Encoding for Native fMRI2026-06-09T22:45:45ZThe success of large-scale deep learning models in neuroscience is fundamentally constrained by severe data heterogeneity. Native fMRI data aggregated from diverse sources exhibit substantial variation in both spatial and temporal resolutions. Consequently, most existing frameworks rely on lengthy, rigid preprocessing pipelines that enforce uniformity across datasets. This practice introduces two critical limitations: (1) potential degradation of subject-specific anatomical information; (2) significant computational overhead, often requiring hours of processing per subject. Here, we propose FlexiBrain, a resolution-agnostic voxel-level encoding framework for native fMRI based on Mamba-JEPA. FlexiBrain defines patch sizes in real-world physical units and employs a dynamic patch resizing, thereby bypassing destructive spatial standardization while enabling direct ingestion of data in native space. We instantiate the framework using an efficient Mamba-JEPA backbone to model high-dimensional 4D fMRI signals. Across five diverse downstream neuroscience tasks, FlexiBrain consistently outperforms recent state-of-the-art methods, achieving gains of up to 12 percentage points without external data augmentation. Importantly, FlexiBrain functions as a seamless plug-in module, substantially reducing preprocessing costs and accelerating the development of robust voxel-level fMRI foundation models. Code is available at https://github.com/OneMore1/FlexiBrain.2026-06-09T22:45:45ZMo WangWenhao YeJunfeng XiaMinghao XuHongkai WenQuanying Liuhttp://arxiv.org/abs/2606.11484v1Handbook of Error-Correcting Codes2026-06-09T22:18:56ZBarcode scans, clear phone calls, reliable data storage, satellite communication, and large-scale quantum computation are all made possible by error correction. We present a handbook version of The Error Correction Zoo, a curated reference of methods for protecting classical or quantum information from errors during storage and transmission. The handbook includes descriptions of these error-correcting codes and a classification according to the symbols they use. It also catalogues relations among codes and related objects such as sphere packings, lattices, designs, groups, and classical and quantum phases of matter. The collection is intended both as a rigorous reference and as a practical aid for tracing the web of code relationships and uncovering new connections.2026-06-09T22:18:56Z440 classical codes, 619 quantum codes, 15 c-q codes. Online zoo at https://errorcorrectionzoo.org/. Notify zookeeper of errors or issue a pull request at https://github.com/errorcorrectionzoo/eczoo_dataVictor V. AlbertPhilippe Faisthttp://arxiv.org/abs/2606.11468v1Optimizing Encoder Circuits of Entanglement-Assisted Quantum LDPC Codes via Beam Search2026-06-09T21:57:07ZEntanglement-assisted (EA) quantum QC-LDPC codes offer strong error-correction capabilities with structured parity-check matrices, but their practical use depends on efficient encoder circuits and the availability of pre-shared Bell pairs (ebits). In all encoder implementations based on the stabilizer formalism, the dominant contribution to this complexity comes from the use of controlled gates. In this paper, we adopt the Sharma-Kumar-Garani (SKG) encoder construction. We formulate the encoder optimization as a search over GF(2) row operations that decompose the binary matrix derived from its CNOT sub-sequence. We solve this problem using a beam search algorithm guided by a Hamming-distance heuristic. For the tested EA quantum QC-LDPC code families, the proposed method achieves CNOT-count reductions of 7.3-34.0% relative to the SKG baseline encoder. The optimized circuits also yield lower CNOT counts than Patel-Markov-Hayes synthesis on all tested instances and are verified by stabilizer-tableau simulation. These results show that substantial encoder simplification is possible for structured EA QC-LDPC codes.2026-06-09T21:57:07ZAditya SodhaniUniversity of Minnesota, Minneapolis, USAPavan KumarIndian Institute of Science, Bengaluru, IndiaShayan Srinivasa GaraniIndian Institute of Science, Bengaluru, IndiaKeshab K. ParhiUniversity of Minnesota, Minneapolis, USAhttp://arxiv.org/abs/2606.11454v1Lifted Gabidulin Construction for LDPC Representations of Finite Geometry Codes2026-06-09T21:11:55ZFinite geometry (FG) codes combine the algebraic properties of classical block codes with the iterative belief propagation (BP) decoding ability of low-density parity-check~(LDPC) codes. However, exploiting both advantages in practice is hindered by the fact that the standard incidence matrix between $(μ+1)$-flats and points is dense and contains many short cycles for any flat dimension $μ\geq 1$. In this work, we propose to sparsify the decoding matrix based on pencil selection, formulated as a constant-dimension subspace packing problem and solved explicitly using lifted Gabidulin codes. For both affine and projective geometries, sparse parity-check matrices are constructed and verified for FG codes of lengths up to $1024$. Simulations on four FG codes show no visible error floor and around $0.5$~dB gain over corresponding 5G LDPC codes at a block error rate of $10^{-7}$.2026-06-09T21:11:55ZYifei ShenAndreas Burghttp://arxiv.org/abs/2606.11448v1A Unified Lower Bound on the Noisy Query Complexity of Boolean Functions2026-06-09T21:00:16ZWe study the query complexity of Boolean functions $f: \{0, 1\}^n \rightarrow \{0, 1\}$ in the noisy query model introduced by Feige, Raghavan, Peleg and Upfal [SICOMP 1994]. In this model, an algorithm can adaptively query the bits of an input vector, but each query result is independently flipped with constant probability $p \in (0, 1/2)$; repeated queries are allowed. The noisy query complexity $\mathsf{N}_p(f)$ of a function $f$ is defined as the minimum expected number of queries needed to compute $f(x)$ with error probability at most $1/3$, for the worst case input $x$.
We prove a general lower bound on $\mathsf{N}_p(f)$ based on degree statistics of certain subgraphs of the Boolean hypercube. This is the first general lower bound beyond those implied by the simple observation that $\mathsf{N}_p(f)$ is lower bounded by the randomized query complexity. We show that this recovers (up to a constant factor) most previously known lower bounds on the noisy query complexity of Boolean functions, providing a unified framework for understanding these results and simplifying the proofs in several cases. Furthermore, this resolves in the affirmative an open problem of Gu, Li and Xu [COLT 2025] that $\mathsf{N}_p(f) = Ω(\mathsf{I}(f) \log \mathsf{I}(f))$, where $\mathsf{I}(f)$ denotes the total influence of $f$. We also apply our general lower bound to obtain tight bounds on the noisy query complexity for several new functions.2026-06-09T21:00:16ZCOLT 2026Yuzhou GuXin LiYinzhan Xuhttp://arxiv.org/abs/2606.11432v1Additive Noise, Shift Recovery, and Signed Signals in the Cumulative Distribution Transform2026-06-09T20:36:29ZThe cumulative distribution transform (CDT) is a quantile-based transport representation that exactly linearizes one-dimensional translations of positive densities. We study how this structure behaves under additive perturbations and how it can be exploited for shift recovery. Under a local nondegeneracy condition, we derive a first-order expansion showing that additive noise in physical space induces a nonlocal perturbation in CDT space through the primitive of the noise, weighted by the reciprocal density. This yields an explicit description of transform-domain sensitivity and shows, in particular, that perturbations are amplified in low-density regions. When the physical-space perturbation is modeled as a centered Gaussian random field, the induced first-order CDT perturbation is again Gaussian, with an explicit covariance kernel.
We then use this structure to study recovery in CDT coordinates. In the known-template setting, the transport shift is obtained by projection onto the constant mode, giving an explicit estimator together with exactness in the noiseless case and a stability bound under perturbations. In the unknown-template setting, multiple observations permit joint recovery of the shifts and a common template up to the natural constant-mode gauge, leading to a simple de-shift--and--average procedure. We also consider a signed-signal analogue based on the signed cumulative distribution transform (SCDT), where shifts are estimated numerically by feature matching and unknown templates are recovered by alternating alignment and averaging. Numerical experiments validate the perturbation analysis and illustrate effective recovery for both density-valued and signed signals.2026-06-09T20:36:29ZHarbir AntilRatna KhatriAryan Saxenahttp://arxiv.org/abs/2503.07804v6Simultaneous Decoding of Classical Coset Codes over 3-User Quantum Interference Channel : New Achievable Rate Regions2026-06-09T20:18:17ZWe undertake a Shannon theoretic study of the problem of communicating bit streams over a 3-user classical-quantum interference channel (3-CQIC) and focus on characterizing inner bounds. We design a new coding strategy based on (i) coset codes possessing algebraic closure properties and (ii) decoding POVMs to decode bi-variate interference efficiently. Needing to perform simultaneous decoding, we enhance Sen's powerful technique of tilting, smoothing, and augmentation - originally designed only for IID codes - to decode into `functions of codebooks'. Developing analysis techniques to combine all of these elements, we derive a new inner bound to the capacity region of a 3-CQIC. The derived inner bound subsumes all currently known inner bounds and is analytically proven to be strictly larger for identified examples, including non-commutative `additive' and `non-additive' ones.2025-03-10T19:36:53ZFatma GouiaaArun Padakandlahttp://arxiv.org/abs/2606.11401v1Maximum Coverage Chase Decoder for Optical Interconnects2026-06-09T19:45:20ZWe propose a low-complexity Chase decoder for optical interconnects that formulates test pattern selection as a generalized maximum coverage problem. For concatenated RS-BCH and oFEC codes, our decoder achieves the standard Chase decoding performance with 25% and 61.3% fewer test patterns, respectively.2026-06-09T19:45:20ZAlessandro CardinaleWenqing SongBin ChenAlex AlvaradoAndreas BurgYifei Shenhttp://arxiv.org/abs/2602.12220v2Breaking Symmetry in D2D Coded Caching: Optimal Communication with Low Subpacketization2026-06-09T19:34:41ZFinite-length design is essential for making coded caching practical, as the optimal communication gains of existing schemes often require prohibitively large subpacketization. This paper studies rate-optimal device-to-device (D2D) coded caching with reduced subpacketization. We propose a packet type-based (PT) framework that exploits the geometric structure induced by user grouping. Under this structure, subfiles, packets, and multicast groups are classified into types, allowing the originally symmetric Ji-Caire-Molisch (JCM) design~\cite{ji2016fundamental} to be systematically relaxed without sacrificing the optimal D2D communication rate. The key feature of the PT framework is that subpacketization reduction is achieved through two complementary mechanisms: \emph{subfile saving}, by excluding redundant subfile types, and \emph{further-splitting saving}, by assigning type-dependent further-splitting factors to subfiles through transmitter selection. The type-dependent splitting factors are then coordinated across multicast group types to produce a globally consistent file-splitting structure. Based on this framework, we construct several classes of rate-optimal D2D coded caching schemes that strictly improve upon the JCM subpacketization. The proposed schemes achieve either order-wise reductions in the number of users or constant-factor reductions over broad memory regimes, while preserving the optimal rate. These results reveal a structural distinction between D2D and shared-link coded caching: unlike in the shared-link setting, full symmetric subpacketization is not necessary for rate-optimal D2D caching.2026-02-12T17:58:37ZSubmitted to IEEE Transactions on Information TheoryXiang ZhangGiuseppe CaireMingyue Jihttp://arxiv.org/abs/2606.11365v1Color-Rule-Function Encoding for Combinatorial Memory2026-06-09T18:47:07ZCombinatorial memory is a class of memory in which information is encoded in the set of paths through a structured mesh. In this work, we introduce a systematic encoding framework, referred to as the Color-Rule-Function (CRF) approach, for representing information in combinatorial memory. The method consists of four key steps: selecting a sequence of paths in the mesh, assigning values (e.g., colors) to each cell, defining a set of rules based on the values encountered along each path, and constructing a Boolean function that determines the state of each path. . The coding procedure is illustrated by several examples. The design space scales of the CRF scale fundamentally faster compared to conventional memory. This apparent advantage arises from the use of rule-based and functional representations but is accompanied by increased hardware complexity. A possible hardware realization of the CRF framework is discussed. Importantly, the hardware overhead can be substantially reduced through the use of customized modules. The examples of the customized design are described in the text. The combination of CRF coding with customized module design may lead to a practical advantage in data storage density. According to the estimates, the data storage density may exceed Exabit per centimeter squared. A key problem that requires further investigation is related to the minimum Hamming distance between an arbitrary target bit sequence and the closest sequence realizable within the CRF framework under fixed hardware constraints.2026-06-09T18:47:07ZAlexander Khitunhttp://arxiv.org/abs/2606.11353v1An Information-Theoretic Analysis of Threshold Group Testing2026-06-09T18:30:03ZWe study the Threshold Group Testing (TGT) problem in the noiseless and non-adaptive setting, where the objective is to exactly recover a sparse binary vector from pooled tests, using as few tests as possible. In TGT, each test applied to a subset of items returns a positive outcome if the number of 1's (defective items) in that subset meets or exceeds a specified threshold, and has a negative outcome otherwise. We investigate how the complexity of TGT compares to that of Classical Group Testing (CGT), corresponding to the special case of the threshold equal to one, and analyse the impact of increasing the threshold on the required number of tests.
Our main contribution is the derivation of a sharp information-theoretic phase transition at $c_{\mathrm{inf}}^{\mathrm{TGT}}k\log(n/k)$ (non-adaptive) tests for TGT within the constant-column test design. The threshold constant $c_{\mathrm{inf}}^{\mathrm{TGT}}$ is expressed as a function of the prevalence of defectives and the threshold value. Our upper bound is derived under an analytic assumption, and we verify that this assumption is satisfied for a threshold value of 2.
The value of $c_{\mathrm{inf}}^{\mathrm{TGT}}$ reveals that TGT on the constant-column design has the same information-theoretic behaviour as CGT in the low-prevalence regime. Yet, strikingly, at higher prevalences, the threshold leads to a significant reduction in the number of tests.
On the other hand, we provide evidence that when the asymptotic proportion of defective items is positive, TGT actually becomes strictly harder than CGT (excluding trivial reductions).2026-06-09T18:30:03ZRemco van der HofstadNoela MüllerConnor Riddlesdenhttp://arxiv.org/abs/2606.11351v1MJSAC: McCormick Relaxation-based Waveform Design for Joint Sensing and Communication2026-06-09T18:28:36ZIn the upcoming 5G Advanced and 6G technologies, joint sensing and communication (JSAC) will play a pivotal role in enabling the simultaneous utilization of hardware and spectrum resources for communication and sensing tasks. While current algorithms primarily focus on designing beampattern invariant covariance matrices for transmitting various symbols for communication, they often overlook the distances among these symbols. While these covariance matrices effectively facilitate ranging operations, they have adverse effects on communication performance. Designing beampattern invariance covariance matrices with maximal distances among themselves poses a challenging non-convex problem. In this paper, we introduce a novel waveform design method based on McCormick relaxation called McCormick-based JSAC (MJSAC). MJSAC sequentially solves an optimization problem to generate a set of covariance matrices by maximizing the distances (Frobenius norm) among themselves while ensuring a consistent beam pattern. Also, MJSAC eliminates the requirement for channel information to generate the covariance matrices. Through simulations, we demonstrate that MJSAC outperforms conventional algorithms, even those utilizing channel information at the transmitter.2026-06-09T18:28:36ZBodhibrata MukhopadhyaySajid AhmedMohamed-Slim Alouinihttp://arxiv.org/abs/2606.11288v1An Entropy-based Framework for Hybrid Coalitions in Game Theory. Part I: Human Arbitration2026-06-09T17:32:55ZClassical Game Theory underpins much of AI and multiagent research, but hybrid Human AI systems require a framework in which execution authority can alternate within a digital environment. We introduce NeoGame Theory, an extension of classical Game Theory for hybrid Human AI coalitions operating under Virtual Nature, the algorithmic analogue of classical (physical) Nature. The framework combines a lexicographic coalition utility with a delegation rule based on the Jensen-Shannon divergence between Human and AI policies. Two thresholds define agreement, contextual, and disagreement regions. In the contextual region, execution follows a scenario specific rule. Apart from the theory, in this paper we develop the first regime, Human arbitration, in which the AI learns by observation and frequency matching while the Human retains final execution authority. We establish the axiomatic basis of the framework and characterize a frequency convergence equilibrium, providing the foundation for later extensions and computational validation.2026-06-09T17:32:55Z29 pages, 2 figures (the second with four panels)Entropy 28 (2026) 473Salome A. Sepulveda-FontaineJose M. Amigo10.3390/e28040473http://arxiv.org/abs/2606.11003v1Weighing Timed Regular Languages: The Final Step (long version)2026-06-09T15:38:27ZThe bandwidth of a timed language characterizes the quantity of information per time unit (with a finite observation precision $\varepsilon$). The asymptotic behavior of the bandwidth as $\varepsilon \to 0$ classifies timed regular languages in three classes: meager, normal, and obese. Normal timed automata have a bounded frequency of events and some non-punctual transitions, and, up to now, were the only class of timed automata for which no algorithm was available for computing their bandwidth. In this article, we compute the bandwidth of any such automaton in the form $\approxα\log{1/\varepsilon}$. Our approach reduces this problem to computing the best reward-to-cost ratio in a weighted finite graph constructed from the given timed automaton.2026-06-09T15:38:27Z40 pages, 4 figures, accepted to QEST + FORMATS 2026 conference; a short (18 pages) version will be published by Springer Nature in the Proceedings of QEST + FORMATS 2026Eugene AsarinAldric DegorreCatalin DimaBernardo Jacobo Inclánhttp://arxiv.org/abs/2601.06688v5The Sample Complexity of Lossless Data Compression2026-06-09T14:33:11ZA new framework is introduced for examining and evaluating the fundamental limits of lossless data compression, that emphasizes genuinely non-asymptotic results. The {\em sample complexity} of compressing a given source is defined as the smallest blocklength at which it is possible to compress that source at a specifically constrained rate and to within a specified excess-rate probability. This formulation parallels corresponding developments in statistics and computer science, and it facilitates the use of existing results on the sample complexity of various hypothesis testing problems. For arbitrary sources, the sample complexity of general variable-length compressors is shown to be tightly coupled with the sample complexity of prefix-free codes and fixed-length codes. For memoryless sources, it is shown that the sample complexity is characterized not by the source entropy, but by its Rényi entropy of order~$1/2$. Nonasymptotic bounds on the sample complexity are obtained, with explicit constants. Generalizations to Markov sources are established, showing that the sample complexity is determined by the source's Rényi entropy rate of order~$1/2$. Finally, bounds on the sample complexity of universal data compression are developed for families of memoryless sources. There, the sample complexity is characterized by the minimum Rényi divergence of order~$1/2$ between elements of the family and the uniform distribution. The connection of this problem with identity testing and with the associated separation rates is explored and discussed.2026-01-10T21:24:43ZSeveral minor revisions and reviewer comments taken into account, additional content on the "actual compression rate" and asymmetric formulation for general target ratesTerence ViaudIoannis Kontoyiannis