https://arxiv.org/api/yA5yxl0Jo20TbpHR0r3x8kPIlMU
2026-06-24T05:02:17Z
54905
615
15
http://arxiv.org/abs/2506.19037v5
Plan for Speed: Dilated Scheduling for Masked Diffusion Language Models
2026-05-25T06:12:46Z
Masked diffusion language models (MDLMs) promise fast, non-autoregressive text generation, yet existing samplers, which pick tokens to unmask based on model confidence, ignore interactions when unmasking multiple positions in parallel and effectively reduce to slow, autoregressive behavior. We propose the Dilated Unmasking Scheduler (DUS), an inference-only, planner-model-free method that partitions sequence positions into non-adjacent dilated groups and unmasks them in parallel so as to minimize an upper bound on joint entropy gain at each denoising step. By explicitly trading off the number of network calls against generation quality, DUS recovers most of the performance lost under traditional parallel unmasking strategies. Across math (GSM8K, MATH500), code (HumanEval, MBPP), general-knowledge (BBH, MMLU-Pro), and instruction following (IFEval) benchmarks, DUS outperforms confidence-based planners and turns the diffusion-specific quality-speed trade-off into a deterministic, predictable speedup set by the block size $B$, yielding up to $5.8\times$ wall-clock speedup over token-by-token MDLM decoding without modifying the underlying denoiser. Applied as a drop-in post-filter, dilated spacing also improves adaptive samplers. Code is available at https://github.com/omerlux/DUS.
2025-06-23T18:49:23Z
Accepted at ICML 2026
Omer Luxembourg
Haim Permuter
Eliya Nachmani
http://arxiv.org/abs/2605.25422v1
A Token/KV-Cache Communication Media Selection and Resource Allocation Strategy for Multi-Agent Collaboration
2026-05-25T04:50:45Z
The convergence of large language models (LLMs) with 6G networks is fostering a paradigm of autonomous multi-agent cooperation, which in turn is expected to substantially increase east-west traffic. Although latent-space interaction mechanisms can enable more efficient collaboration than symbolic natural-language (NL) exchanges, prior work often abstracts away the associated communication overhead under practical wireless constraints. In embodied multi-agent settings, heterogeneous interaction media incur disparate inference and transmission costs, thereby inducing an inherent end-to-end (E2E) latency trade-off. To address this, we propose a joint design that integrates communication-media selection with wireless resource allocation. Through analytical characterization and simulation-based evaluation, we show that neither token-based transmission nor key-value (KV) cache-based transmission is uniformly optimal across operating regimes, as performance depends critically on system parameters such as available computational resources and channel conditions. Accordingly, we formulate a joint optimization problem aimed at minimizing the E2E latency of multi-agent collaboration and develop a low-complexity joint media selection and resource allocation (JMSRA) algorithm. Numerical results further confirm that, by adaptively coordinating the interaction media and bandwidth allocation over heterogeneous links, the proposed scheme achieves markedly reduced E2E latency relative to conventional NL-only and KV-cache-only baselines, enabling efficient and robust multi-agent collaboration in future wireless networks.
2026-05-25T04:50:45Z
Lipeng Dai
Luping Xiang
Kun Yang
http://arxiv.org/abs/2505.11788v2
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission
2026-05-25T03:16:45Z
To support emerging language-based applications using dispersed and heterogeneous computing resources, the hybrid language model (HLM) offers a promising architecture, where an on-device small language model (SLM) generates draft tokens that are validated and corrected by a remote large language model (LLM). However, the original HLM suffers from substantial communication overhead, as the LLM requires the SLM to upload the full vocabulary distribution for each token. Moreover, both communication and computation resources are wasted when the LLM validates tokens that are highly likely to be accepted. To overcome these limitations, we propose communication-efficient and uncertainty-aware HLM (CU-HLM). In CU-HLM, the SLM transmits truncated vocabulary distributions only when its output uncertainty is high. We validate the feasibility of this opportunistic transmission by discovering a strong correlation between SLM's uncertainty and LLM's rejection probability. Furthermore, we theoretically derive optimal uncertainty thresholds and optimal vocabulary truncation strategies. Simulation results show that, compared to standard HLM, CU-HLM achieves up to 206$\times$ higher token throughput by skipping 74.8% transmissions with 97.4% vocabulary compression, while maintaining 97.4% accuracy.
2025-05-17T02:10:34Z
17 pages, 13 figures, 5 tables; This article has been accepted for publication in IEEE Transactions on Communications. This is the author's accepted version; the final published version will be available via IEEE Xplore
Seungeun Oh
Jinhyuk Kim
Jihong Park
Seung-Woo Ko
Jinho Choi
Tony Q. S. Quek
Seong-Lyun Kim
http://arxiv.org/abs/2404.12589v7
Geometry and factorization of multivariate Markov chains with applications to MCMC acceleration and approximate inference
2026-05-25T02:39:41Z
This paper analyzes the factorizability and geometry of transition matrices of multivariate Markov chains. Specifically, we demonstrate that the induced chains on factors of a product space can be regarded as information projections with respect to the Kullback-Leibler divergence. This perspective yields Han-Shearer type inequalities and submodularity of the entropy rate of Markov chains, as well as applications in the context of large deviations and mixing time comparison. As concrete algorithmic applications in Markov chain Monte Carlo (MCMC) and approximate inference, we provide three illustrations based on lifted MCMC, swapping algorithm and factored filtering to demonstrate projection samplers improve mixing over the original samplers. The projection sampler based on the swapping algorithm resamples the highest-temperature coordinate at stationarity at each step, and we prove that such practice accelerates the mixing time by multiplicative factors related to the number of temperatures and the dimension of the underlying state space when compared with the original swapping algorithm. Through simple numerical experiments on a bimodal target distribution, we show that the projection samplers mix effectively, in contrast to lifted MCMC and the swapping algorithm, which mix less well. In filtering, our proposed factored filtering scheme is able to scale to high dimensions with linear-in-dimension computational cost per step at the price of an approximation error that can be tracked using the distance to independence, compared with the exponential-in-dimension cost per step of the exact filter.
2024-04-19T02:35:03Z
47 pages, 6 figures
Michael C. H. Choi
Youjia Wang
Geoffrey Wolfer
http://arxiv.org/abs/2602.17658v2
MARS: Margin and Semantic-Aware Data Augmentation for Reward Modeling
2026-05-25T00:20:23Z
Reward modeling is central to alignment pipelines such as RLHF, RLAIF, and PPO-based policy optimization, yet its reliability is constrained by limited and heterogeneous human preference data that are expensive to collect at scale. While synthetic augmentation can expand preference supervision, existing methods often augment uniformly or at the representation level, without targeting examples where the reward model is uncertain or prone to mis-ranking. In this paper, we introduce MARS (Margin and Semantic-Aware Data Augmentation for Reward Modeling), an adaptive augmentation framework that prioritizes low-margin preference pairs and uses semantic distance as a second layer for refinement to enhance the contrast between the chosen and rejected responses. Across multiple preference datasets, reward-model backbones, downstream alignment settings, and benchmarks including RewardBench and AlpacaEval, MARS improves both reward-model quality and alignment performance over existing baselines. Our results show that reward-model augmentation is most effective when guided by both model margins and semantic structure.
2026-02-19T18:59:03Z
Payel Bhattacharjee
Osvaldo Simeone
Ravi Tandon
http://arxiv.org/abs/2605.21815v2
Near-Field User Location Inference From Far-Field Power Measurements
2026-05-24T22:35:27Z
Near-field beamfocusing enabled by extremely large-aperture arrays (ELAA) is a promising 6G technique for massive connectivity and high spectrum efficiency. While beamfocusing concentrates energy at an intended user, the radiated field outside the focal point exhibits a structured leakage that varies with the focal-point coordinates. This paper shows that this leakage enables a new form of passive user localization in which distributed far-field sensors measuring only received power can infer the user's location by exploiting this location-dependent power signature. Using the induced noncentral chi-square statistics, we derive a Bayesian Cramér-Rao lower bound (BCRLB) that establishes the fundamental limits of this inference problem. We then evaluate a model-based grid-search estimator and an attention-based permutation-invariant deep learning regressor (DeepSet). Results under both line-of-sight (LoS) and multipath propagation confirm that reliable location inference is feasible, with accuracy improving as more sensors and snapshots are used.
2026-05-20T23:28:35Z
To appear in IEEE Vehicular Technology Conference: VTC2026-Fall
Shima Mashhadi
Tiep M. Hoang
Alireza Vahid
http://arxiv.org/abs/2301.09093v2
RIS-assisted Cell-Free MIMO with Dynamic Arrivals and Departures of Users: A Novel Network Stability Approach
2026-05-24T20:03:03Z
Reconfigurable Intelligent Surfaces (RIS) have recently emerged as a hot research topic, being widely advocated as a candidate technology for next generation wireless communications. These surfaces passively alter the behavior of propagation environments enhancing the performance of wireless communication systems. In this paper, we study the use of RIS in cell-free multiple-input multiple-output (MIMO) setting where distributed service antennas, called Access Points (APs), simultaneously serve the users in the network. While most existing works focus on the physical layer improvements RIS carry, less attention has been paid to the impact of dynamic arrivals and departures of the users. In such a case, ensuring the stability of the network is the main goal. For that, we propose an optimization framework of the phase shifts, for which we derived a low-complexity solution. We then provide a theoretical analysis of the network stability and show that our framework stabilizes the network whenever it is possible. We also prove that a low complexity solution of our framework stabilizes a guaranteed fraction (higher than 78.5%) of the stability region. We provide also numerical results that corroborate the theoretical claims.
2023-01-22T10:21:44Z
Charbel Bou Chaaya
Mohamad Assaad
Tijani Chahed
http://arxiv.org/abs/2605.26163v1
Adversarial Water-Filling: Theory, Algorithms and Foundation Model
2026-05-24T17:37:30Z
Competitive resource allocation problems over frequency and space can be formulated as minimax interaction between transmit power and worst-case interference. This formulation naturally arises in multi-operator low Earth orbit (LEO) satellite spectrum sharing, where transmissions from competing constellations interfere in real-time. Under Gaussian channels, AWF is strongly convex--concave on nondegenerate active channels, whereas discrete constellations yield generally nonconvex mercury/water-filling formulations. In this paper we propose the Adversarial Water-Filling (AWF) problem with corresponding theory and algorithms for these real situations. In addition, we develop a wireless foundation model for AWF to learn the AWF search dynamics. The architecture incorporates permutation-invariant channel representations, a constraint-aware graph neural network (GNN) with sparse message passing, and global latent variables capturing the low-dimensional water level implied by the AWF optimality. Through learned projected extragradient iterations, the model approximates stationary solutions of the constrained minimax problem arising under mercury/water-filling. We further show that, under local regularity and contractivity conditions, the learned AWF dynamics converge locally linearly around regular stationary points. Experiments demonstrate empirical generalization across unseen problem sizes, different constraints, and multiple discrete constellations, while achieving more than one-order-of-magnitude runtime improvements over iterative baselines. The related code can be found at https://github.com/convexsoft/AWF.
2026-05-24T17:37:30Z
Submitted to IEEE Journal of Selected Topics in Signal Processing
Xindi Tong
Chee Wei Tan
H. Vincent Poor
http://arxiv.org/abs/2605.25140v1
Eliminating Blind Spots from Wireless Network by Metasurface: A Blind Approach
2026-05-24T15:39:27Z
Deploying metasurfaces (MTSs) to eliminate wireless blind spots requires jointly determining the physical placement of MTSs and the meta-atom phase shifts. Existing methods typically rely on explicit channel estimation, which incurs prohibitive overhead and is often intractable in real-world networks. To sidestep this bottleneck, we propose RFZero, a channel-state-information (CSI)-free deployment paradigm. Instead of estimating channels, RFZero extracts macro-environmental features from visual photos to guide MTS placement, and leverages reference signal received power (RSRP) feedback for dynamic phase-shift optimization. Most importantly, RFZero operates independently of base stations, thereby enabling seamless plug-and-play implementation. Real-world field tests confirm that RFZero completely eliminates all blind spots in a $100\text{ m}^2$ indoor area using just a pair of $1.5\text{ m}\times 0.9\text{ m}$ MTSs.
2026-05-24T15:39:27Z
7 pages
Wenhai Lai
Mingxiao Li
Kaiming Shen
Liyao Xiang
Zhi-Quan Luo
http://arxiv.org/abs/2605.25104v1
The Uncertainty Principles of Quaternion Fractional Fourier Transform
2026-05-24T14:44:44Z
In this paper, we mainly establish the uncertainty principle (UP) for a function and its quaternion Fractional Fourier transform (QFrFT), as well as the UP for two QFrFTs. Using the polar representation of quaternion-valued signals, we give the UP for QFrFT in both the spatial and directional domains, providing a more precise condition for equality, example is given to verify the results. Furthermore, we extend the time-frequency UP to a frequency-frequency setting.
2026-05-24T14:44:44Z
Ke Cui
Haipan Shi
Xiaomin Tang
http://arxiv.org/abs/2605.25047v1
Design of APSK Constellations Approaching the Communication-Sensing Pareto Boundary for ISAC
2026-05-24T12:45:07Z
We propose a semi-analytical amplitude phase shift keying (APSK) signaling framework for integrated sensing and communication (ISAC), focusing on i.i.d. uniform discrete input distributions for practicality and analytical tractability. First, we establish APSK design criteria in which communication performance is measured by the gap to capacity and linked to the minimum Euclidean distance, while sensing performance is characterized by the symbol-energy variance. Based on these criteria, we propose a family of APSK constellations whose key parameters follow explicit scaling laws. Then we prove that this design achieves a constant gap to capacity independent of the signal-to-noise ratio. Building upon this foundation, we further construct a parametric APSK family that bridges the communication-optimal and sensing-optimal designs, with the communication and sensing (C&S) tradeoff controlled by the number of rings and energy allocation among rings. Simulation results show that the proposed APSK achieves C&S performance very close to the Pareto boundary achieved with time-independent, circularly symmetric, and otherwise unconstrained continuous input distributions.
2026-05-24T12:45:07Z
7 pages,3 figures
Yujie Shao
Min Qiu
Ming-Chun Lee
Yu-Chih Huang
Jinhong Yuan
http://arxiv.org/abs/2605.26157v1
When Does a Neural Receiver Help? Calibration-Drift Benchmarking and Detect-and-Rollback for 5G/6G NR
2026-05-24T11:15:08Z
Convolutional neural receivers such as DeepRx outperform minimum mean-square error physical uplink shared channel detection on in distribution channel and waveform configurations, but their behavior under calibration drift when transmitter or channel parameters depart from the training envelope is poorly characterized.
2026-05-24T11:15:08Z
Ayman Elnashar
http://arxiv.org/abs/2605.24972v1
Integrated Sensing, Communication, and Computing for NR-V2X: A Cross-Layer Resource Allocation Framework Using Multi-Agent Reinforcement Learning
2026-05-24T09:56:57Z
Integrated sensing, communication, and computation (ISCC) is emerging as a unified design paradigm for future vehicular networks that require joint environment perception, safety-critical information exchange, and latency-sensitive task processing. In New Radio Vehicle-to-Everything (NR-V2X) Mode 2, autonomous resource selection is performed through sensing-based semi-persistent scheduling (SB-SPS), which is effective for distributed communication resource reservation but does not explicitly consider sensing-resource demand, task-induced computation workload, and the additional latency introduced by mobile edge computing (MEC) offloading. This paper develops multi-agent proximal policy optimization-based SB-SPS (MAPPO-SPS), an ISCC-aware cross-layer scheduler that jointly adapts SB-SPS reservation, radio-resource partitioning, and overflow-driven computation-offloading decisions at control epochs. The scheduling problem is formulated as a cooperative partially observable Markov game and solved using MAPPO with centralized training and decentralized execution (CTDE). Simulation results show that MAPPO-SPS achieves a balanced tradeoff among CRLB-based sensing accuracy, packet reception ratio (PRR), effective throughput, energy consumption, and end-to-end delay.
2026-05-24T09:56:57Z
Indulekha K. P.
T. G. Venkatesh
http://arxiv.org/abs/2605.24929v1
Estimating Mixture Distributions via Stochastic Mirror Descent
2026-05-24T08:19:42Z
We revisit the classical problem of estimating an unknown distribution from its samples by fitting a mixture model that minimizes cross-entropy loss. Framing the task as a stochastic convex optimization problem over the space of $ M $-component mixture distributions, we propose a family of estimators derived from the stochastic mirror descent (SMD) algorithm. This optimization-based approach provides a principled and flexible framework that generalizes traditional estimators and proposes a variety of novel estimators through the choice of Bregman divergences.
A key advantage of our method is that it scales efficiently with the number of candidate components $ f_i $; that is, one can employ a large set of basis distributions in the mixture model without incurring significant computational overhead. This enables richer approximations and improved estimation accuracy.
Moreover, in the case of categorical distribution (discrete outcomes) our estimators do not require a strict lower bound, in other words our framework does not require the precise knowledge of the support of the distribution.
We demonstrate that, under mild conditions, the proposed $ \varphi $-SMD estimators achieve near-optimal convergence rates in both Kullback-Leibler (KL) divergence and $ \ell_2 $-norm and offer practical benefits when computation is expensive. Our numerical analysis highlights improved performance guaranties over classical estimators, particularly in terms of sample efficiency and scalability.
2026-05-24T08:19:42Z
Mohammadreza Ahmadypour
Tara Javidi
Farinaz Koushanfar
http://arxiv.org/abs/2511.06045v2
Online Learning of Modular Bayesian Deep Receivers: Single-Step Adaptation with Streaming Data
2026-05-24T08:02:38Z
Deep neural network (DNN)-based receivers offer a powerful alternative to classical model-based designs for wireless communication, especially in complex and nonlinear propagation environments. However, their adoption is challenged by the rapid variability of wireless channels, which makes pre-trained static DNN-based receivers ineffective, and by the latency and computational burden of online stochastic gradient descent (SGD)-based learning. In this work, we propose an online learning framework that enables rapid low-complexity adaptation of DNN-based receivers. Our approach is based on two main tenets. First, we cast online learning as Bayesian tracking in parameter space, enabling a single-step adaptation, which deviates from multi-epoch SGD . Second, we focus on modular DNN architectures that enable parallel, online, and localized variational Bayesian updates. Simulations with practical communication channels demonstrate that our proposed online learning framework can maintain a low error rate with markedly reduced update latency and increased robustness to channel dynamics as compared to traditional gradient descent based method.
2025-11-08T15:34:34Z
Under review for publication in the IEEE
Yakov Gusakov
Osvaldo Simeone
Tirza Routtenberg
Nir Shlezinger