https://arxiv.org/api/lcZAdB1tYHjd9+wZ66D1QndBqw82026-06-10T20:32:18Z1293333015http://arxiv.org/abs/2604.21847v1Sampling from the Hardcore Model on Random Regular Bipartite Graphs above the Uniqueness Threshold2026-04-23T16:37:49ZWe design an efficient sampling algorithm to generate samples from the hardcore model on random regular bipartite graphs as long as $λ\lesssim \frac{1}{\sqrtΔ}$, where $Δ$ is the degree. Combined with recent work of Jenssen, Keevash and Perkins this implies an FPRAS for the partition function of the hardcore model on random regular bipartite graphs at any fugacity. Our algorithm is shown by analyzing two new Markov chains that work in complementary regimes. Our proof then proceeds by showing the corresponding simplicial complexes are top-link spectral expanders and appealing to the trickle-down theorem to prove fast mixing.2026-04-23T16:37:49Z35 pagesNicholas KocurekShayan Oveis GharanDante Tjowasihttp://arxiv.org/abs/2604.21831v1Complexity Classes Arising from Circuits over Finite Algebraic Structures2026-04-23T16:23:46ZMost classical results in circuit complexity theory concern circuits over the Boolean domain. Besides their simplicity and the ease of comparing different languages, the actual architecture of computers is also an important motivating factor. On the other hand, by restricting attention to Boolean circuits, we lose sight of the much richer landscape of circuits over larger domains. Our goal is to bridge these two worlds: to use deep algebraic tools to obtain results in computational complexity theory, including circuit complexity, and to apply results from computational complexity to gain a better understanding of the structure of finite algebras.
In this paper, we propose a unifying algebraic framework which we believe will help achieve this goal. Our work is inspired by branching programs and nonuniform deterministic automata introduced by Barrington, as well as by their generalization proposed by Idziak et al. We begin our investigation by studying the languages recognized by natural classes of algebraic structures. In particular, we characterize language classes recognized by circuits over simple algebras and over algebras from congruence modular varieties.2026-04-23T16:23:46ZPiotr KawałekJacek Krzaczkowskihttp://arxiv.org/abs/2605.02923v1Experiments, Computability, and the Existence of Physical Functions2026-04-23T14:14:57ZExperimental science usually relies on laboratory procedures that, after finitely many steps, terminate with numerical reports on physical quantities. This paper argues that such procedures can be understood as algorithmic once the protocol, background conditions, and reporting rules are fixed. Assuming an explicit physical Church--Turing bridge principle, a reproducible experiment therefore computes a map from admissible inputs to outputs, and the corresponding function exists in the sense appropriate to those outputs. Furthermore, computable analysis allows us to explain why this conclusion is compatible with finite-precision measurement since in this case what matters is a systematic approximation to a requested accuracy, not the production of exact real numbers in a single step. Neither protocol dependence nor stochasticity undermines the existence claim. Rather, they specify which map is realized by a given protocol and what additional assumptions are required for stronger claims about a single protocol-independent quantity. The paper therefore separates three questions that are often conflated: whether the function exists, whether it is computable, and when results obtained under different protocols may be treated as measurements of the same quantity.2026-04-23T14:14:57ZIsaac Pérez Castillohttp://arxiv.org/abs/2601.18491v2AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security2026-04-23T11:55:10ZThe rise of AI agents introduces complex safety and security challenges arising from autonomous tool use and environmental interactions. Current guardrail models lack agentic risk awareness and transparency in risk diagnosis. To introduce an agentic guardrail that covers complex and numerous risky behaviors, we first propose a unified three-dimensional taxonomy that orthogonally categorizes agentic risks by their source (where), failure mode (how), and consequence (what). Guided by this structured and hierarchical taxonomy, we introduce a new fine-grained agentic safety benchmark (ATBench) and a Diagnostic Guardrail framework for agent safety and security (AgentDoG). AgentDoG provides fine-grained and contextual monitoring across agent trajectories. More Crucially, AgentDoG can diagnose the root causes of unsafe actions and seemingly safe but unreasonable actions, offering provenance and transparency beyond binary labels to facilitate effective agent alignment. AgentDoG variants are available in three sizes (4B, 7B, and 8B parameters) across Qwen and Llama model families. Extensive experimental results demonstrate that AgentDoG achieves state-of-the-art performance in agentic safety moderation in diverse and complex interactive scenarios. All models and datasets are openly released.2026-01-26T13:45:41Z40 pages, 26 figuresDongrui LiuQihan RenChen QianShuai ShaoYuejin XieYu LiZhonghao YangHaoyu LuoPeng WangQingyu LiuBinxin HuLing TangJilin MeiDadi GuoLeitao YuanJunyao YangGuanxu ChenQihao LinYi YuBo ZhangJiaxuan GuoJie ZhangWenqi ShaoHuiqi DengZhiheng XiWenjie WangWenxuan WangWen ShenZhikai ChenHaoyu XieJialing TaoJuntao DaiJiaming JiZhongjie BaLinfeng ZhangYong LiuQuanshi ZhangLei ZhuZhihua WeiHui XueChaochao LuJing ShaoXia Huhttp://arxiv.org/abs/2604.21531v1Kernelization Bounds for Constrained Coloring2026-04-23T10:53:43ZWe study the kernel complexity of constraint satisfaction problems over a finite domain, parameterized by the number of variables, whose constraint language consists of two relations: the non-equality relation and an additional permutation-invariant relation $R$. We establish a conditional lower bound on the kernel size in terms of the largest arity of an OR relation definable from $R$. Building on this, we investigate the kernel complexity of uniformly rainbow free coloring problems. In these problems, for fixed positive integers $d$, $\ell$, and $q \geq d$, we are given a graph $G$ on $n$ vertices and a collection $\cal F$ of $\ell$-tuples of $d$-subsets of its vertex set, and the goal is to decide whether there exists a proper coloring of $G$ with $q$ colors such that no $\ell$-tuple in $\cal F$ is uniformly rainbow, that is, no tuple has all its sets colored with the same $d$ distinct colors. We determine, for all admissible values of $d$, $\ell$, and $q$, the infimum over all values $η$ for which the problem admits a kernel of size $O(n^η)$, under the assumption $\mathsf{NP} \nsubseteq \mathsf{coNP/poly}$. As applications, we obtain nearly tight bounds on the kernel complexity of various coloring problems under diverse settings and parameterizations. This includes graph coloring problems parameterized by the vertex-deletion distance to a disjoint union of cliques, resolving a question of Schalken (2020), as well as uniform hypergraph coloring problems parameterized by the number of vertices, extending results of Jansen and Pieterse (2019) and Beukers (2021).2026-04-23T10:53:43Z32 pagesIshay Havivhttp://arxiv.org/abs/2510.08814v2A Quantale-Weakness Route to $P \neq NP$ via CD Evidence Normalization and Gauge-Buffered Locked Ensembles2026-04-22T18:01:05ZWe present a proof architecture for \(P \neq NP\) based on an upper--lower clash in polytime-capped conditional description length. We construct an efficiently samplable family of SAT instances \(Y\) such that every satisfying witness for \(Y\) yields the same global message \(M(Y)\). If \(P=NP\), then a standard polynomial-time SAT self-reduction recovers \(M(Y)\) from \(Y\), so \[ K_{\mathrm{poly}}(M(Y)\mid Y)=O(1). \]
The lower-bound side shows the opposite. For the same ensemble, no fixed polynomial-time observer can gain substantial predictive advantage on a linear number of selected message coordinates. The argument treats computation as an evidence-producing process: predictive advantage is converted into constructible-dual evidence skew and then into pairwise distinctions between message-opposite worlds. A normalization theorem shows that every target-relevant non-neutral evidence leaf is either a safe-buffer observation or a hidden-gauge observation. Safe-buffer observations have negligible leakage, while hidden-gauge observations are limited by gauge-rank accounting. This yields an atomic evidence budget implying that total message-resolving advantage is \(o(t)\) across \(t\) selected coordinates.
Boundary-law mixing gives the near-random baseline for the visible surface. Combining this with the evidence budget gives product small-success and then, by Compression-from-Success, \[ K_{\mathrm{poly}}(M(Y)\mid Y)\ge Ω(t) \] with high probability. This contradicts the constant upper bound from \(P=NP\). Therefore \(P \neq NP\).2025-10-09T21:01:17ZBen Goertzelhttp://arxiv.org/abs/2504.03605v2Constant Rate Isometric Embeddings of Hamming Metric into Edit Metric2026-04-22T17:23:04ZA function $\varphi:\{0,1\}^n \to \{0,1\}^N$ is called an isometric embedding of the $n$-dimensional Hamming metric space to the $N$-dimensional edit metric space if, for all $x,y\in\{0,1\}^n$, the Hamming distance between $x$ and $y$ is equal to the edit distance between $\varphi(x)$ and $\varphi(y)$. The rate of such an embedding is defined as the ratio $n/N$.
It is well known in the literature how to construct isometric embeddings with rate $Ω(1/\log n)$. However, achieving even near-isometric embeddings with positive constant rate has remained elusive until now.
In this paper, we present an isometric embedding with rate $1/8$ by discovering connections to synchronization strings, which were studied in the context of insertion-deletion codes (Haeupler-Shahrasbi [JACM'21]). At a technical level, we introduce a framework for obtaining high-rate isometric embeddings using a novel object called misaligners. As an immediate consequence of our constant-rate isometric embedding, we improve known conditional lower bounds for various optimization problems in the edit metric, now with optimal dependence on the dimension.
We complement our results by showing that no isometric embedding $\varphi:\{0,1\}^n \to \{0,1\}^N$ can have rate greater than $15/32$ for all positive integers $n$. En route to proving this upper bound, we uncover fundamental structural properties necessary for every Hamming-to-edit isometric embedding. We also prove similar upper and lower bounds for embeddings over larger alphabets.
Finally, we consider embeddings $\varphi:Σ_{\mathrm{in}}^n \to Σ_{\mathrm{out}}^N$ between different input and output alphabets, where the rate is given by $\frac{n\log|Σ_{\mathrm{in}}|}{N\log|Σ_{\mathrm{out}}|}$. In this setting, we show that the rate can be made arbitrarily close to $1$.2025-04-04T17:21:25ZSudatta BhattacharyaSanjana DeyElazar GoldenbergMursalin HabibBernhard HaeuplerKarthik C. S.Michal Kouckýhttp://arxiv.org/abs/2501.10633v2Answering Related Questions2026-04-22T12:34:26ZWe introduce the meta-problem Sidestep$(Π, \mathsf{dist}, d)$ for a problem $Π$, a metric $\mathsf{dist}$ over its inputs, and a map $d: \mathbb N \to \mathbb R_+ \cup \{\infty\}$. A solution to Sidestep$(Π, \mathsf{dist}, d)$ on an input $I$ of $Π$ is a pair $(J, Π(J))$ such that $\mathsf{dist}(I,J) \leqslant d(|I|)$ and $Π(J)$ is a correct answer to $Π$ on input $J$. This formalizes the notion of answering a related question (or sidestepping the question), for which we give some motivations, and compare it to the neighboring concepts of smoothed analysis, certified algorithms, planted problems, edition problems, and approximation algorithms. Informally, we call hardness radius the ``largest'' $d$ such that Sidestep$(Π, \mathsf{dist}, d)$ is NP-hard. This framework calls for establishing the hardness radius of problems $Π$ of interest for the relevant distances $\mathsf{dist}$.
We exemplify it with graph problems and two distances $\mathsf{dist}_Δ$ and $\mathsf{dist}_e$ (the edge edit distance) such that $\mathsf{dist}_Δ(G,H)$ (resp. $\mathsf{dist}_e(G,H)$) is the maximum degree (resp. number of edges) of the symmetric difference of $G$ and $H$ if these graphs are on the same vertex set, and $+\infty$ otherwise. We show that the decision problems Independent Set, Clique, Vertex Cover, Coloring, Clique Cover have hardness radius $n^{\frac{1}{2}-o(1)}$ for $\mathsf{dist}_Δ$, and $n^{\frac{4}{3}-o(1)}$ for $\mathsf{dist}_e$, that Hamiltonian Cycle has hardness radius 0 for $\mathsf{dist}_Δ$, and somewhere between $n^{\frac{1}{2}-o(1)}$ and $n/3$ for $\mathsf{dist}_e$, and that Dominating Set has hardness radius $n^{1-o(1)}$ for $\mathsf{dist}_e$. We leave several open questions.2025-01-18T02:25:18Z20 pages, 2 figuresÉdouard Bonnethttp://arxiv.org/abs/2202.08214v3Lower Bounds for Subset Sum in Resolution with Modular Counting2026-04-22T10:26:45ZIn this paper we prove lower bounds for sizes of refutations of unsatisfiable vector Subset Sum instances $\overrightarrow{a}_1 x_1 + \dots + \overrightarrow{a}_n x_n = \overrightarrow{b}$ in the proof system Res(lin$_{\mathbb{F}_q}$) where $char(\mathbb{F}_{q})\geq 5$. As a basis for the hardness criterion for such instances we choose the property of the matrix $A$ with columns $(\overrightarrow{a}_1, \ldots, \overrightarrow{a}_n)$ to be (the transpose of) the generating matrix for a good error-correcting code $C_{A} := \{x\cdot A\, |\, x \in \mathbb{F}_{q}^k\}\subset \mathbb{F}_{q}^n$ and prove the following lower bounds:
1) For a dag-like fragment of Res(lin$_{\mathbb{F}_q}$). We introduce the notion of $(s,r)$-robustness for Subset Sum instances, which in particular implies that $A$ defines an error-correcting code with the minimal distance $s\geq r$. For $(s,r)$-robust instances we prove $2^{Ω(r)}$ lower bound for sizes of refutations in a dag-like fragment of Res(lin$_{\mathbb{F}_q}$). We show that random instances are $(n / 3, Ω\left((n/(q + 1)\ln q))^{1/3}\right))$-robust and that specific examples achieving these bounds can be constructed using algebraic geometry codes.
2) For tree-like Res(lin$_{\mathbb{F}_q}$) refutations we show the size lower bound $2^{Ω({((q+1)\ln q)^{-1/3}}d^{1/5})}$ for any Subset Sum instance where $d$ is the minimal distance of $C_{A}$.2022-02-16T17:51:17ZFedor Parthttp://arxiv.org/abs/2511.17227v3A Lifting Theorem for Hybrid Classical-Quantum Communication Complexity2026-04-22T06:56:38ZWe investigates a model of hybrid classical-quantum communication complexity, in which two parties first exchange classical messages and subsequently communicate using quantum messages. We study the trade-off between the classical and quantum communication for composed functions of the form $f\circ G^n$, where $f:\{0,1\}^n\to\{\pm1\}$ and $G$ is an inner product function of $Θ(\log n)$ bits. To prove the trade-off, we establish a novel lifting theorem for hybrid communication complexity. This theorem unifies two previously separate lifting paradigms: the query-to-communication lifting framework for classical communication complexity and the approximate-degree-to-generalized-discrepancy lifting methods for quantum communication complexity. Our hybrid lifting theorem therefore offers a new framework for proving lower bounds in hybrid classical-quantum communication models.
As a corollary, we show that any hybrid protocol communicating $c$ classical bits followed by $q$ qubits to compute $f\circ G^n$ must satisfy $c+q^2=Ω\big(\max\{\mathrm{deg}(f),\mathrm{bs}(f)\}\cdot\log n\big)$, where $\mathrm{deg}(f)$ is the degree of $f$ and $\mathrm{bs}(f)$ is the block sensitivity of $f$. For read-once formula $f$, this yields an almost tight trade-off: either they have to exchange $Θ\big(n\cdot\log n\big)$ classical bits or $\widetildeΘ\big(\sqrt n\cdot\log n\big)$ qubits, showing that classical pre-processing cannot significantly reduce the quantum communication required. To the best of our knowledge, this is the first non-trivial trade-off between classical and quantum communication in hybrid two-way communication complexity.2025-11-21T13:14:04Z27 pages, 1 figure. accepted by ICALP 2026Xudong WuGuangxu YangPenghui Yaohttp://arxiv.org/abs/2604.19872v1Border subrank of higher order tensors and algebras2026-04-21T18:00:04ZWe determine the border subrank of higher order structure tensors of several families of algebras, and in particular obtain the following results. (1) We determine tight bounds on the border subrank of $k$-fold matrix multiplication and $k$-fold upper triangular matrix multiplication for all $k$. (2) We determine the border subrank of the higher order structure tensors of truncated polynomial algebras, null algebras, and apolar algebras of a quadric. (3) We determine the border subrank of the higher order structure tensors of the Lie algebra $\mathfrak{sl}_2$ for all orders. (4) We prove that degeneration of structure tensors of algebras propagates from higher to lower order. Along the way, we investigate which upper bound methods (geometric rank, $G$-stable rank, socle degree) are effective in which settings, and how they relate. Our work extends the results of Strassen (J.~Reine Angew.~Math., 1987, 1991), who determined the asymptotic subrank of these algebras for tensors of order three, in two directions: we determine the border subrank itself rather than its asymptotic version, and we consider higher order structure tensors.2026-04-21T18:00:04Z35 pages + one appendixChia-Yu ChangFulvio GesmundoJeroen Zuiddamhttp://arxiv.org/abs/2603.06315v4Intrinsic Information Flow in Structureless NP Search2026-04-21T17:49:05ZRather than measuring NP search in terms of Turing-machine time, we reinterpret witness recovery as an information-acquisition process: the hidden witness is the sole source of uncertainty, and identification requires sufficient reduction of this uncertainty through a rate-limited access interface in the sense of Shannon.
To make this perspective explicit, we analyze an extreme regime, the \emph{psocid model}, in which the witness is accessible only via equality probes $[π= w^\star]$ under a uniform, structureless prior. Each probe reveals at most $O(N/2^N)$ bits of mutual information, so polynomially many probes accumulate only $o(1)$ total information. By Fano's inequality, reliable recovery requires $Ω(N)$ bits, creating a fundamental mismatch between the information required for recovery and that obtainable through the interface.
The psocid setting isolates a fully symmetric search regime in which no intermediate computation yields global eliminative leverage, thereby exposing an intrinsic informational origin of exponential search complexity.2026-03-06T14:17:14ZConceptual reframing of NP witness recovery via information-theoretic constraints; introduces an equality-only access model and proves an impossibility via Fano's inequalityJing-Yuan Weihttp://arxiv.org/abs/2604.19717v1Qubit Routing for (Almost) Free2026-04-21T17:43:44ZIn this paper, we give a mathematical proof that bounds the number of CNOT gates required to synthesize an $n$ qubit phase polynomial with $g$ terms to be at least $O(\frac{gn}{\max (\log g, 1)})$ and at most $O(gn)$. However, when targeting restricted hardware, not all CNOTs are allowed. If we were to use SWAP-based methods to route the qubits on the architecture such that the earlier synthesized gates are natively allowed, we increase the number of CNOTs by a routing overhead factor of $O(\log n) \leq α\leq O(n \log^2 n)$. However, if we only synthesize allowed gates, we do not need to route any qubits. Moreover, in that case the routing overhead factor is $1 \leq α\leq 4 \simeq O(1)$. Additionally, since phase polynomials and Hadamard gates together form a universal gate set, we get qubit routing for almost free.2026-04-21T17:43:44Z14 pages, rough draftArianne Meijer-van de Griendhttp://arxiv.org/abs/2603.09172v5Reinforced Generation of Combinatorial Structures: Ramsey Numbers2026-04-21T16:45:32ZWe present improved lower bounds for nine classical Ramsey numbers: $\mathbf{R}(3, 13)$ is increased from $60$ to $61$, $\mathbf{R}(3, 18)$ from $99$ to $100$, $\mathbf{R}(4, 13)$ from $138$ to $139$, $\mathbf{R}(4, 14)$ from $147$ to $148$, $\mathbf{R}(4, 15)$ from $158$ to $159$, $\mathbf{R}(4, 16)$ from $170$ to $174$, $\mathbf{R}(4, 18)$ from $205$ to $209$, $\mathbf{R}(4, 19)$ from $213$ to $219$, and $\mathbf{R}(4, 20)$ from $234$ to $237$. These results were achieved using AlphaEvolve, an LLM-based code mutation agent. Beyond these new results, we successfully recovered lower bounds for all Ramsey numbers known to be exact, and matched the best known lower bounds across many other cases. These include bounds for which previous work does not detail the algorithms used. Virtually all known Ramsey lower bounds are derived computationally, with bespoke search algorithms each delivering a handful of results. AlphaEvolve is a single meta-algorithm yielding search algorithms for all of our results.2026-03-10T04:20:40ZAnsh NagdaPrabhakar RaghavanAbhradeep Thakurtahttp://arxiv.org/abs/2604.19625v1Coherent-State Propagation: A Computational Framework for Simulating Bosonic Quantum Systems2026-04-21T16:13:57ZWe introduce coherent-state propagation, a computational framework for simulating bosonic systems. We focus on bosonic circuits composed of displaced linear optics augmented by Kerr nonlinearities, a universal model of bosonic quantum computation that is also physically motivated by driven Bose-Hubbard dynamics. The method works in the Schrödinger picture representing the evolving state as a sparse superposition of coherent states. We develop approximation strategies that keep the simulation cost tractable in physically relevant regimes, notably when the number of Kerr gates is small or the Kerr nonlinearities are weak, and prove rigorous guarantees for both observable estimation and sampling. In particular, bosonic circuits with logarithmically many Kerr gates admit quasi-polynomial-time classical simulation at exponentially small error in trace distance. We further identify a weak-nonlinearity regime in which the runtime is polynomial for arbitrarily small constant precision. We complement these results with numerical benchmarks on the Bose-Hubbard model with all-to-all connectivity. The method reproduces Fock-basis and matrix-product-state reference data, suggesting that it offers a useful route to the classical simulation of bosonic systems.2026-04-21T16:13:57Z56 pages, 6 figuresNikita GuseynovZoë HolmesArmando Angrisani