http://arxiv.org/api/GShQuKHHZfAuSBTcU4Apgiugxx4
2025-04-22T00:00:00-04:00
497816015

http://arxiv.org/abs/2504.13235v1
2025-04-17T16:02:48Z
2025-04-17T16:02:48Z
Bayesian Rao test for distributed target detection in interference and
noise with limited training data
This paper has studied the problem of detecting a range-spread target in
interference and noise when the number of training data is limited. The
interference is located within a certain subspace with an unknown coordinate,
while the noise follows a Gaussian distribution with an unknown covariance
matrix. We concentrate on the scenarios where the training data are limited and
employ a Bayesian framework to find a solution. Specifically, the covariance
matrix is assumed to follow an inverse Wishart distribution. Then, we introduce
the Bayesian detector according to the Rao test, which, as demonstrated by both
simulation experiments and real data, outperforms the
existing detectors in certain situations.
Daipeng Xiao, Weijian Liu, Jun Liu, Yuntao Wu, Qinglei Du, Xiaoqiang Hua
14 pages, 18 figures

http://arxiv.org/abs/2504.13031v1
2025-04-17T15:41:28Z
2025-04-17T15:41:28Z
Degrees of Freedom of Holographic MIMO -- Fundamental Theory and
Analytical Methods
Holographic multiple-input multiple-output (MIMO) is envisioned as one of the
most promising technology enablers for future sixth-generation (6G) networks.
The use of electrically large holographic surface (HoloS) antennas has the
potential to significantly boost the spatial multiplexing gain by increasing
the number of degrees of freedom (DoF), even in line-of-sight (LoS) channels.
In this context, the research community has shown a growing interest in
characterizing the fundamental limits of this technology. In this paper, we
compare the two analytical methods commonly utilized in the literature for this
purpose: the cut-set integral and the self-adjoint operator. We provide a
detailed description of both methods and discuss their advantages and
limitations.
Juan Carlos Ruiz-Sicilia, Marco Di Renzo, Placido Mursia, Vincenzo Sciancalepore, Merouane Debbah
Presented at EUCAP 2025

http://arxiv.org/abs/2504.12989v1
2025-04-17T14:54:00Z
2025-04-17T14:54:00Z
Query Complexity of Classical and Quantum Channel Discrimination
Quantum channel discrimination has been studied from an information-theoretic
perspective, wherein one is interested in the optimal decay rate of error
probabilities as a function of the number of unknown channel accesses. In this
paper, we study the query complexity of quantum channel discrimination, wherein
the goal is to determine the minimum number of channel uses needed to reach a
desired error probability. To this end, we show that the query complexity of
binary channel discrimination depends logarithmically on the inverse error
probability and inversely on the negative logarithm of the (geometric and
Holevo) channel fidelity. As a special case of these findings, we precisely
characterize the query complexity of discriminating between two classical
channels. We also provide lower and upper bounds on the query complexity of
binary asymmetric channel discrimination and multiple quantum channel
discrimination. For the former, the query complexity depends on the geometric
Rényi and Petz Rényi channel divergences, while for the latter, it depends
on the negative logarithm of (geometric and Uhlmann) channel fidelity. For
multiple channel discrimination, the upper bound scales as the logarithm of the
number of channels.
Theshani Nuradha, Mark M. Wilde
22 pages; see also the independent work "Sampling complexity of
quantum channel discrimination" DOI 10.1088/1572-9494/adcb9e

http://arxiv.org/abs/2503.13379v3
2025-04-17T13:32:39Z
2025-03-17T17:06:27Z
Error bounds for composite quantum hypothesis testing and a new
characterization of the weighted Kubo-Ando geometric means
The optimal error exponents of binary composite i.i.d. state discrimination
are trivially bounded by the worst-case pairwise exponents of discriminating
individual elements of the sets representing the two hypotheses, and in the
finite-dimensional classical case, these bounds in fact give exact single-copy
expressions for the error exponents. In contrast, in the non-commutative case,
the optimal exponents are only known to be expressible in terms of regularized
divergences, resulting in formulas that, while conceptually relevant, are
practically not very useful. In this paper, we develop further an approach
initiated in [Mosonyi, Szilágyi, Weiner, IEEE Trans. Inf. Th.
68(2):1032--1067, 2022] to give improved single-copy bounds on the error
exponents by comparing not only individual states from the two hypotheses, but
also various unnormalized positive semi-definite operators associated to them.
Here, we show a number of equivalent characterizations of such operators giving
valid bounds, and show that in the commutative case, considering weighted
geometric means of the states, and in the case of two states per hypothesis,
considering weighted Kubo-Ando geometric means, are optimal for this approach.
As a result, we give a new characterization of the weighted Kubo-Ando geometric
means as the only $2$-variable operator geometric means that are block
additive, tensor multiplicative, and satisfy the arithmetic-geometric mean
inequality. We also extend our results to composite quantum channel
discrimination, and show an analogous optimality property of the weighted
Kubo-Ando geometric means of two quantum channels, a notion that seems to be
new. We extend this concept to defining the notion of superoperator perspective
function and establish some of its basic properties, which may be of
independent interest.
Péter E. Frenkel, Milán Mosonyi, Péter Vrana, Mihály Weiner
36 pages. v3: Added explicit example with strict improvement in the
strong converse exponent using geometric means

http://arxiv.org/abs/2504.12885v1
2025-04-17T12:20:46Z
2025-04-17T12:20:46Z
Optimizing Movable Antennas in Wideband Multi-User MIMO With Hardware
Impairments
Movable antennas represent an emerging field in telecommunication research
and a potential approach to achieving higher data rates in multiple-input
multiple-output (MIMO) communications when the total number of antennas is
limited. Most solutions and analyses to date have been limited to
\emph{narrowband} setups. This work complements the prior studies by
quantifying the benefit of using movable antennas in \emph{wideband} MIMO
communication systems. First, we derive a novel uplink wideband system model
that also accounts for distortion from transceiver hardware impairments. We
then formulate and solve an optimization task to maximize the average sum rate
by adjusting the antenna positions using particle swarm optimization. Finally,
the performance with movable antennas is compared with fixed uniform arrays and
the derived theoretical upper bound. The numerical study concludes that the
data rate improvement from movable antennas over other arrays heavily depends
on the level of hardware impairments, the richness of the multi-path
environments, and the number of subcarriers. The present study provides vital
insights into the most suitable use cases for movable antennas in future
wideband systems.
Amna Irshad, Emil Björnson, Alva Kosasih, Vitaly Petrov
5 pages, 6 figures

http://arxiv.org/abs/2504.12604v1
2025-04-17T03:07:48Z
2025-04-17T03:07:48Z
Codes over Finite Ring $\mathbb{Z}_k$, MacWilliams Identity and Theta
Function
In this paper, we study linear codes over $\mathbb{Z}_k$ based on lattices
and theta functions. We obtain the complete weight enumerators MacWilliams
identity and the symmetrized weight enumerators MacWilliams identity based on
the theory of theta function. We extend the main work by Bannai, Dougherty,
Harada and Oura to the finite ring $\mathbb{Z}_k$ for any positive integer $k$
and present the complete weight enumerators MacWilliams identity in genus $g$.
When $k=p$ is a prime number, we establish the relationship between the theta
function of associated lattices over a cyclotomic field and the complete weight
enumerators with Hamming weight of codes, which is an analogue of the results of
G. Van der Geer and F. Hirzebruch, who showed the identity with the Lee
weight enumerators.
Zhiyong Zheng, Fengxia Liu, Kun Tian

http://arxiv.org/abs/2504.12594v1
2025-04-17T02:41:22Z
2025-04-17T02:41:22Z
Meta-Dependence in Conditional Independence Testing
Constraint-based causal discovery algorithms utilize many statistical tests
for conditional independence to uncover networks of causal dependencies. These
approaches to causal discovery rely on an assumed correspondence between the
graphical properties of a causal structure and the conditional independence
properties of observed variables, known as the causal Markov condition and
faithfulness. Finite data yields an empirical distribution that is "close" to
the actual distribution. Across these many possible empirical distributions,
the correspondence to the graphical properties can break down for different
conditional independencies, and multiple violations can occur at the same time.
We study this "meta-dependence" between conditional independence properties
using the following geometric intuition: each conditional independence property
constrains the space of possible joint distributions to a manifold. The
"meta-dependence" between conditional independences is informed by the position
of these manifolds relative to the true probability distribution. We provide a
simple-to-compute measure of this meta-dependence using information projections
and consolidate our findings empirically using both synthetic and real-world
data.
Bijan Mazaheri, Jiaqi Zhang, Caroline Uhler

http://arxiv.org/abs/2412.09839v2
2025-04-16T21:18:03Z
2024-12-13T04:13:40Z
AI and Deep Learning for THz Ultra-Massive MIMO: From Model-Driven
Approaches to Foundation Models
In this paper, we explore the potential of artificial intelligence (AI) to
address challenges in terahertz ultra-massive multiple-input multiple-output
(THz UM-MIMO) systems. We identify three key challenges for transceiver design:
"hard to compute," "hard to model," and "hard to measure," and argue that AI
can provide promising solutions. We propose three research roadmaps for AI
algorithms tailored to THz UM-MIMO systems. The first, model-driven deep
learning (DL), emphasizes leveraging domain knowledge and using AI to enhance
bottleneck modules in established signal processing or optimization frameworks.
We discuss four steps: algorithmic frameworks, basis algorithms, loss function
design, and neural architecture design. The second roadmap presents channel
state information (CSI) foundation models to unify transceiver module design
by focusing on the wireless channel. We propose a compact foundation model to
estimate wireless channel score functions, serving as a prior for designing
transceiver modules. We outline four steps: general frameworks, conditioning,
site-specific adaptation, and joint design of CSI models and model-driven DL.
The third roadmap explores applying pre-trained large language models (LLMs) to
THz UM-MIMO systems, with applications in estimation, optimization, searching,
network management, and protocol understanding. Finally, we discuss open
problems and future research directions.
Wentao Yu, Hengtao He, Shenghui Song, Jun Zhang, Linglong Dai, Lizhong Zheng, Khaled B. Letaief
25 pages, 8 figures, 1 table. Model-driven deep learning, CSI
foundation models, and applications of LLMs are presented as three systematic
research roadmaps for AI-enabled THz ultra-massive MIMO systems

http://arxiv.org/abs/2504.12274v1
2025-04-16T17:34:58Z
2025-04-16T17:34:58Z
Kernels for Storage Capacity and Dual Index Coding
The storage capacity of a graph measures the maximum amount of information
that can be stored across its vertices, such that the information at any vertex
can be recovered from the information stored at its neighborhood. The study of
this graph quantity is motivated by applications in distributed storage and by
its intimate relations to the index coding problem from the area of network
information theory. In the latter, one wishes to minimize the amount of
information that has to be transmitted to a collection of receivers, in a way
that enables each of them to discover its required data using some prior side
information.
In this paper, we initiate the study of the Storage Capacity and Index Coding
problems from the perspective of parameterized complexity. We prove that the
Storage Capacity problem parameterized by the solution size admits a
kernelization algorithm producing kernels of linear size. We also provide such
a result for the Index Coding problem, in the linear and non-linear settings,
where it is parameterized by the dual value of the solution, i.e., the length
of the transmission that can be saved using the side information. A key
ingredient in the proofs is the crown decomposition technique due to Chor,
Fellows, and Juedes (WG 2003, WG 2004). As an application, we significantly
extend an algorithmic result of Dau, Skachek, and Chee (IEEE Trans. Inform.
Theory, 2014).
Ishay Haviv
15 pages

http://arxiv.org/abs/2310.03311v3
2025-04-16T15:58:58Z
2023-10-05T04:59:58Z
Deep Variational Multivariate Information Bottleneck -- A Framework for
Variational Losses
Variational dimensionality reduction methods are widely used for their
accuracy, generative capabilities, and robustness. We introduce a unifying
framework that generalizes both traditional and state-of-the-art
methods. The framework is based on an interpretation of the multivariate
information bottleneck, trading off the information preserved in an encoder
graph (defining what to compress) against that in a decoder graph (defining a
generative model for data). Using this approach, we rederive existing methods,
including the deep variational information bottleneck, variational
autoencoders, and deep multiview information bottleneck. We naturally extend
the deep variational CCA (DVCCA) family to beta-DVCCA and introduce a new
method, the deep variational symmetric information bottleneck (DVSIB). DSIB,
the deterministic limit of DVSIB, connects to modern contrastive learning
approaches such as Barlow Twins, among others. We evaluate these methods on
Noisy MNIST and Noisy CIFAR-100, showing that algorithms better matched to the
structure of the problem, such as DVSIB and beta-DVCCA, produce better latent spaces
as measured by classification accuracy, dimensionality of the latent variables,
and sample efficiency, and consistently outperform other approaches under
comparable conditions. Additionally, we benchmark against state-of-the-art
models, achieving superior or competitive accuracy. Our results demonstrate
that this framework can seamlessly incorporate diverse multi-view
representation learning algorithms, providing a foundation for designing novel,
problem-specific loss functions.
Eslam Abdelaleem, Ilya Nemenman, K. Michael Martini

http://arxiv.org/abs/2504.12194v1
2025-04-16T15:47:38Z
2025-04-16T15:47:38Z
The Optimal Condition Number for ReLU Function
ReLU is a widely used activation function in deep neural networks. This paper
explores the stability properties of the ReLU map. For any weight matrix
$\boldsymbol{A} \in \mathbb{R}^{m \times n}$ and bias vector $\boldsymbol{b}
\in \mathbb{R}^{m}$ at a given layer, we define the condition number
$\beta_{\boldsymbol{A},\boldsymbol{b}}$ as
$\beta_{\boldsymbol{A},\boldsymbol{b}} =
\frac{\mathcal{U}_{\boldsymbol{A},\boldsymbol{b}}}{\mathcal{L}_{\boldsymbol{A},\boldsymbol{b}}}$,
where $\mathcal{U}_{\boldsymbol{A},\boldsymbol{b}}$
and $\mathcal{L}_{\boldsymbol{A},\boldsymbol{b}}$ are the upper and lower
Lipschitz constants, respectively. We first demonstrate that for any given
$\boldsymbol{A}$ and $\boldsymbol{b}$, the condition number satisfies
$\beta_{\boldsymbol{A},\boldsymbol{b}} \geq \sqrt{2}$. Moreover, when the
weights of the network at a given layer are initialized as random i.i.d.
Gaussian variables and the bias term is set to zero, the condition number
asymptotically approaches this lower bound. This theoretical finding suggests
that Gaussian weight initialization is optimal for preserving distances in the
context of random deep neural network weights.
Yu Xia, Haoyu Zhou
29 pages

http://arxiv.org/abs/2504.12181v1
2025-04-16T15:38:38Z
2025-04-16T15:38:38Z
Battery-aware Cyclic Scheduling in Energy-harvesting Federated Learning
Federated Learning (FL) has emerged as a promising framework for distributed
learning, but its growing complexity has led to significant energy consumption,
particularly from computations on the client side. This challenge is especially
critical in energy-harvesting FL (EHFL) systems, where device availability
fluctuates due to limited and time-varying energy resources. We propose
FedBacys, a battery-aware FL framework that introduces cyclic client
participation based on users' battery levels to cope with these issues.
FedBacys enables clients to save energy and strategically perform local
training just before their designated transmission time by clustering clients
and scheduling their involvement sequentially. This design minimizes redundant
computation, reduces system-wide energy usage, and improves learning stability.
Our experiments demonstrate that FedBacys outperforms existing approaches in
terms of energy efficiency and performance consistency, exhibiting robustness
even under non-i.i.d. training data distributions and with very infrequent
battery charging. This work presents the first comprehensive evaluation of
cyclic client participation in EHFL, incorporating both communication and
computation costs into a unified, resource-aware scheduling strategy.
Eunjeong Jeong, Nikolaos Pappas
This paper is currently under review for presentation at a
peer-reviewed conference

http://arxiv.org/abs/2501.04285v3
2025-04-16T15:24:31Z
2025-01-08T05:17:09Z
Separate Source Channel Coding Is Still What You Need: An LLM-based
Rethinking
Along with the proliferating research interest in Semantic Communication
(SemCom), Joint Source Channel Coding (JSCC) has dominated attention due to
its widely assumed efficiency in delivering information semantics.
Nevertheless, this paper challenges the conventional JSCC paradigm,
and advocates adopting Separate Source Channel Coding (SSCC) to exploit
the additional degrees of freedom it offers for optimization. We demonstrate that
SSCC, with a Large Language Model (LLM) for source
coding and an Error Correction Code Transformer (ECCT) for channel
decoding, offers superior performance over JSCC. Our proposed framework also
effectively highlights the compatibility challenges between SemCom approaches
and digital communication systems, particularly concerning the resource costs
associated with the transmission of high precision floating point numbers.
Through comprehensive evaluations, we establish that empowered by LLM-based
compression and ECCT-enhanced error correction, SSCC remains a viable and
effective solution for modern communication systems. In other words, separate
source and channel coding is still what we need!
Tianqi Ren, Rongpeng Li, Ming-min Zhao, Xianfu Chen, Guangyi Liu, Yang Yang, Zhifeng Zhao, Honggang Zhang

http://arxiv.org/abs/2410.15563v3
2025-04-16T14:51:57Z
2024-10-21T01:13:49Z
Variants of Solovay reducibility
Outside of the left-c.e. reals, Solovay reducibility is considered to be
badly behaved [10.1007/978-0-387-68441-3]. Proposals for variants of Solovay
reducibility that are better suited for the investigation of arbitrary, not
necessarily left-c.e. reals were made by Rettinger and Zheng
[10.1007/978-3-540-27798-9_39], and, recently, by Titov
[10.11588/heidok.00034250] and by Kumabe and co-authors [10.4115/jla.2020.12.2;
10.3233/COM-230486]. These variants all coincide with the original version of
Solovay reducibility on the left-c.e. reals. Furthermore, they are all defined
in terms of translation functions. The latter translate between computable
approximations in the case of Rettinger and Zheng, are monotone in the case of
Titov, and are functions between reals in the case of Kumabe et al.
In what follows, we derive new results on the mentioned variants and their
relation to each other. In particular, we obtain that Solovay reducibility
defined in terms of translation functions on rationals implies Solovay
reducibility defined in terms of translation functions on reals, and we show
that the original version of Solovay reducibility is strictly weaker than its
monotone variant.
Solovay reducibility and its variants mentioned so far have tight connections
to Martin-Löf randomness, the strongest and most central notion of a random
sequence. For the investigation of Schnorr randomness, total variants of
Solovay reducibility have been introduced by Merkle and Titov
[10.48550/arXiv.2407.14869] in 2022 and, independently, by Kumabe et al.
[10.3233/COM-230486] in 2024, the latter again via real-valued translation
functions. In what follows, we show that total Solovay reducibility defined in
terms of rational functions implies total Solovay reducibility defined in terms
of real functions.
Ivan Titov
to be published in the Proceedings of CiE2025

http://arxiv.org/abs/2504.12116v1
2025-04-16T14:26:51Z
2025-04-16T14:26:51Z
Improvement of the square-root low bounds on the minimum distances of
BCH codes and Matrix-product codes
The task of constructing infinite families of self-dual codes with unbounded
lengths and minimum distances exhibiting square-root lower bounds is extremely
challenging, especially when it comes to cyclic codes. Recently, the first
infinite family of Euclidean self-dual binary and nonbinary cyclic codes of
unbounded length whose minimum distances have a square-root lower bound, or a
lower bound better than the square-root lower bound, was constructed in
\cite{Chen23}. Let $q$ be a power of a prime number and
$Q=q^2$. In this paper, we first improve the lower bounds on the minimum
distances of Euclidean and Hermitian duals of BCH codes with length
$\frac{q^m-1}{q^s-1}$ over $\mathbb{F}_q$ and $\frac{Q^m-1}{Q-1}$ over
$\mathbb{F}_Q$ in \cite{Fan23,GDL21,Wang24} for the designed distances in some
ranges, respectively, where $\frac{m}{s}\geq 3$. Then based on matrix-product
construction and some lower bounds on the minimum distances of BCH codes and
their duals, we obtain several classes of Euclidean and Hermitian self-dual
codes, whose minimum distances have square-root lower bounds or
square-root-like lower bounds. Our lower bounds on the minimum distances of
Euclidean and Hermitian self-dual cyclic codes improve many results in
\cite{Chen23}. In addition, our lower bounds on the minimum distances of the
duals of BCH codes are almost $q^s-1$ or $q$ times that of the existing lower
bounds.
Xiaoqiang Wang, Liuyi Li, Yansheng Wu, Dabin Zheng, Shuxian Lu
29 pages, submitted to IEEE