http://arxiv.org/api/GShQuKHHZfAuSBTcU4Apgiugxx4 2025-04-22T00:00:00-04:00 49781 60 15 http://arxiv.org/abs/2504.13235v1 2025-04-17T16:02:48Z 2025-04-17T16:02:48Z Bayesian Rao test for distributed target detection in interference and noise with limited training data

This paper studies the problem of detecting a range-spread target in interference and noise when the amount of training data is limited. The interference lies within a certain subspace with unknown coordinates, while the noise follows a Gaussian distribution with an unknown covariance matrix. We concentrate on scenarios where the training data are limited and employ a Bayesian framework to find a solution. Specifically, the covariance matrix is assumed to follow an inverse Wishart distribution. Then, we introduce the Bayesian detector according to the Rao test, which, as demonstrated by both simulation experiments and real data, has superior detection performance to the existing detectors in certain situations.

Daipeng Xiao Weijian Liu Jun Liu Yuntao Wu Qinglei Du Xiaoqiang Hua 14 pages,18 figures http://arxiv.org/abs/2504.13031v1 2025-04-17T15:41:28Z 2025-04-17T15:41:28Z Degrees of Freedom of Holographic MIMO -- Fundamental Theory and Analytical Methods

Holographic multiple-input multiple-output (MIMO) is envisioned as one of the most promising technology enablers for future sixth-generation (6G) networks. The use of electrically large holographic surface (HoloS) antennas has the potential to significantly boost the spatial multiplexing gain by increasing the number of degrees of freedom (DoF), even in line-of-sight (LoS) channels. In this context, the research community has shown a growing interest in characterizing the fundamental limits of this technology. In this paper, we compare the two analytical methods commonly utilized in the literature for this purpose: the cut-set integral and the self-adjoint operator. We provide a detailed description of both methods and discuss their advantages and limitations.

Juan Carlos Ruiz-Sicilia Marco Di Renzo Placido Mursia Vincenzo Sciancalepore Merouane Debbah Presented at EUCAP 2025 http://arxiv.org/abs/2504.12989v1 2025-04-17T14:54:00Z 2025-04-17T14:54:00Z Query Complexity of Classical and Quantum Channel Discrimination

Quantum channel discrimination has been studied from an information-theoretic perspective, wherein one is interested in the optimal decay rate of error probabilities as a function of the number of unknown channel accesses. In this paper, we study the query complexity of quantum channel discrimination, wherein the goal is to determine the minimum number of channel uses needed to reach a desired error probability. To this end, we show that the query complexity of binary channel discrimination depends logarithmically on the inverse error probability and inversely on the negative logarithm of the (geometric and Holevo) channel fidelity. As a special case of these findings, we precisely characterize the query complexity of discriminating between two classical channels. We also provide lower and upper bounds on the query complexity of binary asymmetric channel discrimination and multiple quantum channel discrimination. For the former, the query complexity depends on the geometric R\'enyi and Petz R\'enyi channel divergences, while for the latter, it depends on the negative logarithm of (geometric and Uhlmann) channel fidelity. For multiple channel discrimination, the upper bound scales as the logarithm of the number of channels.
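The dependence stated in this abstract for binary channel discrimination can be written schematically as below. The symbols $n^{*}$ (query complexity), $\varepsilon$ (target error probability), and $F$ (a channel fidelity, geometric or Holevo depending on which bound is meant) are notation introduced here for illustration. Constants and prior-dependent terms are omitted, so this is a sketch of the stated scaling rather than the paper's theorem.

```latex
n^{*}(\varepsilon) \;\sim\; \frac{\ln\!\left(1/\varepsilon\right)}{-\ln F},
\qquad 0 < F < 1 .
```

Read this way, halving the target error adds only a constant number of channel uses, while channels whose fidelity approaches 1 (nearly indistinguishable channels) drive the required number of uses up without bound.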

Theshani Nuradha Mark M. Wilde 22 pages; see also the independent work "Sampling complexity of quantum channel discrimination" DOI 10.1088/1572-9494/adcb9e http://arxiv.org/abs/2503.13379v3 2025-04-17T13:32:39Z 2025-03-17T17:06:27Z Error bounds for composite quantum hypothesis testing and a new characterization of the weighted Kubo-Ando geometric means

The optimal error exponents of binary composite i.i.d. state discrimination are trivially bounded by the worst-case pairwise exponents of discriminating individual elements of the sets representing the two hypotheses, and in the finite-dimensional classical case, these bounds in fact give exact single-copy expressions for the error exponents. In contrast, in the non-commutative case, the optimal exponents are only known to be expressible in terms of regularized divergences, resulting in formulas that, while conceptually relevant, are practically not very useful. In this paper, we develop further an approach initiated in [Mosonyi, Szil\'agyi, Weiner, IEEE Trans. Inf. Th. 68(2):1032--1067, 2022] to give improved single-copy bounds on the error exponents by comparing not only individual states from the two hypotheses, but also various unnormalized positive semi-definite operators associated to them. Here, we show a number of equivalent characterizations of such operators giving valid bounds, and show that in the commutative case, considering weighted geometric means of the states, and in the case of two states per hypothesis, considering weighted Kubo-Ando geometric means, are optimal for this approach. As a result, we give a new characterization of the weighted Kubo-Ando geometric means as the only $2$-variable operator geometric means that are block additive, tensor multiplicative, and satisfy the arithmetic-geometric mean inequality. We also extend our results to composite quantum channel discrimination, and show an analogous optimality property of the weighted Kubo-Ando geometric means of two quantum channels, a notion that seems to be new. We extend this concept to define the notion of a superoperator perspective function and establish some of its basic properties, which may be of independent interest.

Péter E. Frenkel Milán Mosonyi Péter Vrana Mihály Weiner 36 pages. v3: Added explicit example with strict improvement in the strong converse exponent using geometric means http://arxiv.org/abs/2504.12885v1 2025-04-17T12:20:46Z 2025-04-17T12:20:46Z Optimizing Movable Antennas in Wideband Multi-User MIMO With Hardware Impairments

Movable antennas represent an emerging field in telecommunication research and a potential approach to achieving higher data rates in multiple-input multiple-output (MIMO) communications when the total number of antennas is limited. Most solutions and analyses to date have been limited to \emph{narrowband} setups. This work complements the prior studies by quantifying the benefit of using movable antennas in \emph{wideband} MIMO communication systems. First, we derive a novel uplink wideband system model that also accounts for distortion from transceiver hardware impairments. We then formulate and solve an optimization task to maximize the average sum rate by adjusting the antenna positions using particle swarm optimization. Finally, the performance with movable antennas is compared with fixed uniform arrays and the derived theoretical upper bound. The numerical study concludes that the data rate improvement from movable antennas over other arrays heavily depends on the level of hardware impairments, the richness of the multi-path environments, and the number of subcarriers. The present study provides vital insights into the most suitable use cases for movable antennas in future wideband systems.

Amna Irshad Emil Björnson Alva Kosasih Vitaly Petrov 5 pages, 6 figures http://arxiv.org/abs/2504.12604v1 2025-04-17T03:07:48Z 2025-04-17T03:07:48Z Codes over Finite Ring $\mathbb{Z}_k$, MacWilliams Identity and Theta Function

In this paper, we study linear codes over $\mathbb{Z}_k$ based on lattices and theta functions. We obtain the MacWilliams identities for the complete weight enumerators and for the symmetrized weight enumerators based on the theory of theta functions. We extend the main work by Bannai, Dougherty, Harada and Oura to the finite ring $\mathbb{Z}_k$ for any positive integer $k$ and present the MacWilliams identity for the complete weight enumerators in genus $g$. When $k=p$ is a prime number, we establish the relationship between the theta function of associated lattices over a cyclotomic field and the complete weight enumerators with Hamming weight of codes, which is an analogue of the results by G. Van der Geer and F. Hirzebruch, who showed the identity with the Lee weight enumerators.

Zhiyong Zheng Fengxia Liu Kun Tian http://arxiv.org/abs/2504.12594v1 2025-04-17T02:41:22Z 2025-04-17T02:41:22Z Meta-Dependence in Conditional Independence Testing

Constraint-based causal discovery algorithms utilize many statistical tests for conditional independence to uncover networks of causal dependencies. These approaches to causal discovery rely on an assumed correspondence between the graphical properties of a causal structure and the conditional independence properties of observed variables, known as the causal Markov condition and faithfulness. Finite data yields an empirical distribution that is "close" to the actual distribution. Across these many possible empirical distributions, the correspondence to the graphical properties can break down for different conditional independencies, and multiple violations can occur at the same time. We study this "meta-dependence" between conditional independence properties using the following geometric intuition: each conditional independence property constrains the space of possible joint distributions to a manifold. The "meta-dependence" between conditional independences is informed by the position of these manifolds relative to the true probability distribution. We provide a simple-to-compute measure of this meta-dependence using information projections and consolidate our findings empirically using both synthetic and real-world data.

Bijan Mazaheri Jiaqi Zhang Caroline Uhler http://arxiv.org/abs/2412.09839v2 2025-04-16T21:18:03Z 2024-12-13T04:13:40Z AI and Deep Learning for THz Ultra-Massive MIMO: From Model-Driven Approaches to Foundation Models

In this paper, we explore the potential of artificial intelligence (AI) to address challenges in terahertz ultra-massive multiple-input multiple-output (THz UM-MIMO) systems. We identify three key challenges for transceiver design: "hard to compute," "hard to model," and "hard to measure," and argue that AI can provide promising solutions. We propose three research roadmaps for AI algorithms tailored to THz UM-MIMO systems. The first, model-driven deep learning (DL), emphasizes leveraging domain knowledge and using AI to enhance bottleneck modules in established signal processing or optimization frameworks. We discuss four steps: algorithmic frameworks, basis algorithms, loss function design, and neural architecture design. The second roadmap presents channel state information (CSI) foundation models to unify transceiver module design by focusing on the wireless channel. We propose a compact foundation model to estimate wireless channel score functions, serving as a prior for designing transceiver modules. We outline four steps: general frameworks, conditioning, site-specific adaptation, and joint design of CSI models and model-driven DL. The third roadmap explores applying pre-trained large language models (LLMs) to THz UM-MIMO systems, with applications in estimation, optimization, searching, network management, and protocol understanding. Finally, we discuss open problems and future research directions.

Wentao Yu Hengtao He Shenghui Song Jun Zhang Linglong Dai Lizhong Zheng Khaled B. Letaief 25 pages, 8 figures, 1 table. Model-driven deep learning, CSI foundation models, and applications of LLMs are presented as three systematic research roadmaps for AI-enabled THz ultra-massive MIMO systems http://arxiv.org/abs/2504.12274v1 2025-04-16T17:34:58Z 2025-04-16T17:34:58Z Kernels for Storage Capacity and Dual Index Coding

The storage capacity of a graph measures the maximum amount of information that can be stored across its vertices, such that the information at any vertex can be recovered from the information stored at its neighborhood. The study of this graph quantity is motivated by applications in distributed storage and by its intimate relations to the index coding problem from the area of network information theory. In the latter, one wishes to minimize the amount of information that has to be transmitted to a collection of receivers, in a way that enables each of them to discover its required data using some prior side information. In this paper, we initiate the study of the Storage Capacity and Index Coding problems from the perspective of parameterized complexity. We prove that the Storage Capacity problem parameterized by the solution size admits a kernelization algorithm producing kernels of linear size. We also provide such a result for the Index Coding problem, in the linear and non-linear settings, where it is parameterized by the dual value of the solution, i.e., the length of the transmission that can be saved using the side information. A key ingredient in the proofs is the crown decomposition technique due to Chor, Fellows, and Juedes (WG 2003, WG 2004). As an application, we significantly extend an algorithmic result of Dau, Skachek, and Chee (IEEE Trans. Inform. Theory, 2014).

Ishay Haviv 15 pages http://arxiv.org/abs/2310.03311v3 2025-04-16T15:58:58Z 2023-10-05T04:59:58Z Deep Variational Multivariate Information Bottleneck -- A Framework for Variational Losses

Variational dimensionality reduction methods are widely used for their accuracy, generative capabilities, and robustness. We introduce a unifying framework that generalizes both traditional and state-of-the-art methods. The framework is based on an interpretation of the multivariate information bottleneck, trading off the information preserved in an encoder graph (defining what to compress) against that in a decoder graph (defining a generative model for data). Using this approach, we rederive existing methods, including the deep variational information bottleneck, variational autoencoders, and deep multiview information bottleneck. We naturally extend the deep variational CCA (DVCCA) family to beta-DVCCA and introduce a new method, the deep variational symmetric information bottleneck (DVSIB). DSIB, the deterministic limit of DVSIB, connects to modern contrastive learning approaches such as Barlow Twins, among others. We evaluate these methods on Noisy MNIST and Noisy CIFAR-100, showing that algorithms better matched to the structure of the problem, like DVSIB and beta-DVCCA, produce better latent spaces as measured by classification accuracy, dimensionality of the latent variables, and sample efficiency, and that they consistently outperform other approaches under comparable conditions. Additionally, we benchmark against state-of-the-art models, achieving superior or competitive accuracy. Our results demonstrate that this framework can seamlessly incorporate diverse multi-view representation learning algorithms, providing a foundation for designing novel, problem-specific loss functions.

Eslam Abdelaleem Ilya Nemenman K. Michael Martini http://arxiv.org/abs/2504.12194v1 2025-04-16T15:47:38Z 2025-04-16T15:47:38Z The Optimal Condition Number for ReLU Function

ReLU is a widely used activation function in deep neural networks. This paper explores the stability properties of the ReLU map. For any weight matrix $\boldsymbol{A} \in \mathbb{R}^{m \times n}$ and bias vector $\boldsymbol{b} \in \mathbb{R}^{m}$ at a given layer, we define the condition number $\beta_{\boldsymbol{A},\boldsymbol{b}}$ as $\beta_{\boldsymbol{A},\boldsymbol{b}} = \frac{\mathcal{U}_{\boldsymbol{A},\boldsymbol{b}}}{\mathcal{L}_{\boldsymbol{A},\boldsymbol{b}}}$, where $\mathcal{U}_{\boldsymbol{A},\boldsymbol{b}}$ and $\mathcal{L}_{\boldsymbol{A},\boldsymbol{b}}$ are the upper and lower Lipschitz constants, respectively. We first demonstrate that for any given $\boldsymbol{A}$ and $\boldsymbol{b}$, the condition number satisfies $\beta_{\boldsymbol{A},\boldsymbol{b}} \geq \sqrt{2}$. Moreover, when the weights of the network at a given layer are initialized as random i.i.d. Gaussian variables and the bias term is set to zero, the condition number asymptotically approaches this lower bound. This theoretical finding suggests that Gaussian weight initialization is optimal for preserving distances in the context of random deep neural network weights.
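The $\sqrt{2}$ figure can be probed numerically with a small sketch. Everything in it is an illustrative assumption rather than the paper's method: the function names, the layer sizes, and the choice of test pairs. It compares the distance ratio for a pair of nearby inputs with the ratio for an antipodal pair under i.i.d. Gaussian weights and zero bias. Because only two pairs are tested, the result is a lower estimate of $\beta_{\boldsymbol{A},\boldsymbol{b}}$, not its exact value.

```python
import math
import random

def relu_layer(A, x):
    """Apply x -> ReLU(A x) with zero bias; A is a list of rows."""
    return [max(0.0, sum(a * xi for a, xi in zip(row, x))) for row in A]

def distance_ratio(A, x, y):
    """Return ||ReLU(Ax) - ReLU(Ay)|| / ||x - y|| for one pair (x, y)."""
    fx, fy = relu_layer(A, x), relu_layer(A, y)
    num = math.sqrt(sum((u - v) ** 2 for u, v in zip(fx, fy)))
    den = math.sqrt(sum((u - v) ** 2 for u, v in zip(x, y)))
    return num / den

def probe_condition_number(m=2000, n=4, seed=0):
    """Lower estimate of the condition number from two hand-picked pairs.

    Assumption (not from the paper): a nearby pair is used as a proxy for
    the largest stretch and the antipodal pair (x, -x) as a proxy for the
    smallest stretch of the map x -> ReLU(A x).
    """
    rng = random.Random(seed)
    # i.i.d. standard Gaussian weights, bias fixed to zero
    A = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(m)]
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    d = [rng.gauss(0.0, 1.0) for _ in range(n)]
    near = [xi + 1e-6 * di for xi, di in zip(x, d)]  # tiny perturbation of x
    far = [-xi for xi in x]                          # antipodal point
    stretch = distance_ratio(A, x, near)
    shrink = distance_ratio(A, x, far)
    return stretch / shrink

if __name__ == "__main__":
    print(probe_condition_number())
```

For wide layers (large `m` relative to `n`), the printed ratio should land near $\sqrt{2} \approx 1.414$, with fluctuations of a few percent from the finite sample of weights.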

Yu Xia Haoyu Zhou 29 pages http://arxiv.org/abs/2504.12181v1 2025-04-16T15:38:38Z 2025-04-16T15:38:38Z Battery-aware Cyclic Scheduling in Energy-harvesting Federated Learning

Federated Learning (FL) has emerged as a promising framework for distributed learning, but its growing complexity has led to significant energy consumption, particularly from computations on the client side. This challenge is especially critical in energy-harvesting FL (EHFL) systems, where device availability fluctuates due to limited and time-varying energy resources. We propose FedBacys, a battery-aware FL framework that introduces cyclic client participation based on users' battery levels to cope with these issues. FedBacys enables clients to save energy and strategically perform local training just before their designated transmission time by clustering clients and scheduling their involvement sequentially. This design minimizes redundant computation, reduces system-wide energy usage, and improves learning stability. Our experiments demonstrate that FedBacys outperforms existing approaches in terms of energy efficiency and performance consistency, exhibiting robustness even under non-i.i.d. training data distributions and with very infrequent battery charging. This work presents the first comprehensive evaluation of cyclic client participation in EHFL, incorporating both communication and computation costs into a unified, resource-aware scheduling strategy.

Eunjeong Jeong Nikolaos Pappas This paper is currently under review for presentation at a peer-reviewed conference http://arxiv.org/abs/2501.04285v3 2025-04-16T15:24:31Z 2025-01-08T05:17:09Z Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking

Along with the proliferating research interest in Semantic Communication (SemCom), Joint Source Channel Coding (JSCC) has dominated the attention due to its widely assumed efficiency in delivering information semantics. Nevertheless, this paper challenges the conventional JSCC paradigm and advocates the adoption of Separate Source Channel Coding (SSCC) to exploit its additional degrees of freedom for optimization. We demonstrate that SSCC, after leveraging the strengths of a Large Language Model (LLM) for source coding and an Error Correction Code Transformer (ECCT) for channel decoding, offers superior performance over JSCC. Our proposed framework also effectively highlights the compatibility challenges between SemCom approaches and digital communication systems, particularly concerning the resource costs associated with the transmission of high precision floating point numbers. Through comprehensive evaluations, we establish that, empowered by LLM-based compression and ECCT-enhanced error correction, SSCC remains a viable and effective solution for modern communication systems. In other words, separate source and channel coding is still what we need!

Tianqi Ren Rongpeng Li Ming-min Zhao Xianfu Chen Guangyi Liu Yang Yang Zhifeng Zhao Honggang Zhang http://arxiv.org/abs/2410.15563v3 2025-04-16T14:51:57Z 2024-10-21T01:13:49Z Variants of Solovay reducibility

Outside of the left-c.e. reals, Solovay reducibility is considered to behave badly [10.1007/978-0-387-68441-3]. Proposals for variants of Solovay reducibility that are better suited for the investigation of arbitrary, not necessarily left-c.e. reals were made by Rettinger and Zheng [10.1007/978-3-540-27798-9_39], and, recently, by Titov [10.11588/heidok.00034250] and by Kumabe and co-authors [10.4115/jla.2020.12.2; 10.3233/COM-230486]. These variants all coincide with the original version of Solovay reducibility on the left-c.e. reals. Furthermore, they are all defined in terms of translation functions. The latter translate between computable approximations in the case of Rettinger and Zheng, are monotone in the case of Titov, and are functions between reals in the case of Kumabe et al. In what follows, we derive new results on the mentioned variants and their relation to each other. In particular, we obtain that Solovay reducibility defined in terms of translation functions on rationals implies Solovay reducibility defined in terms of translation functions on reals, and we show that the original version of Solovay reducibility is strictly weaker than its monotone variant. Solovay reducibility and its variants mentioned so far have tight connections to Martin-L\"of randomness, the strongest and most central notion of a random sequence. For the investigation of Schnorr randomness, total variants of Solovay reducibility have been introduced by Merkle and Titov [10.48550/arXiv.2407.14869] in 2022 and, independently, by Kumabe et al. [10.3233/COM-230486] in 2024, the latter again via real-valued translation functions. In what follows, we show that total Solovay reducibility defined in terms of rational functions implies total Solovay reducibility defined in terms of real functions.

Ivan Titov to be published in the Proceedings of CiE2025 http://arxiv.org/abs/2504.12116v1 2025-04-16T14:26:51Z 2025-04-16T14:26:51Z Improvement of the square-root low bounds on the minimum distances of BCH codes and Matrix-product codes

The task of constructing infinite families of self-dual codes with unbounded lengths and minimum distances exhibiting square-root lower bounds is extremely challenging, especially when it comes to cyclic codes. Recently, the first infinite family of Euclidean self-dual binary and nonbinary cyclic codes of unbounded length whose minimum distances have a square-root lower bound, or a lower bound better than the square-root lower bound, was constructed in \cite{Chen23}. Let $q$ be a power of a prime number and $Q=q^2$. In this paper, we first improve the lower bounds on the minimum distances of Euclidean and Hermitian duals of BCH codes with length $\frac{q^m-1}{q^s-1}$ over $\mathbb{F}_q$ and $\frac{Q^m-1}{Q-1}$ over $\mathbb{F}_Q$ in \cite{Fan23,GDL21,Wang24} for the designed distances in some ranges, respectively, where $\frac{m}{s}\geq 3$. Then, based on the matrix-product construction and some lower bounds on the minimum distances of BCH codes and their duals, we obtain several classes of Euclidean and Hermitian self-dual codes whose minimum distances have square-root lower bounds or square-root-like lower bounds. Our lower bounds on the minimum distances of Euclidean and Hermitian self-dual cyclic codes improve many results in \cite{Chen23}. In addition, our lower bounds on the minimum distances of the duals of BCH codes are almost $q^s-1$ or $q$ times the existing lower bounds.

Xiaoqiang Wang Liuyi Li Yansheng Wu Dabin Zheng Shuxian Lu 29 pages, submitted to IEEE