https://arxiv.org/api/YFXaN5E8xTzIq4J0JgAMXSm3MGo2026-06-10T06:00:53Z1293312015http://arxiv.org/abs/2408.15377v4On Approximability of Satisfiable k-CSPs: V2026-05-19T08:17:47ZWe propose a framework of algorithm vs. hardness for all Max-CSPs and demonstrate it for a large class of predicates. This framework extends the work of Raghavendra [STOC, 2008], who showed a similar result for almost satisfiable Max-CSPs.
Our framework is based on a new hybrid approximation algorithm, which uses a combination of the Gaussian elimination technique (i.e., solving a system of linear equations over an Abelian group) and the semidefinite programming relaxation. We complement our algorithm with a matching dictator vs. quasirandom test that has perfect completeness.
The analysis of our dictator vs. quasirandom test is based on a novel invariance principle, which we call the mixed invariance principle. Our mixed invariance principle is an extension of the invariance principle of Mossel, O'Donnell and Oleszkiewicz [Annals of Mathematics, 2010] which plays a crucial role in Raghavendra's work. The mixed invariance principle allows one to relate 3-wise correlations over discrete probability spaces with expectations over spaces that are a mixture of Guassian spaces and Abelian groups, and may be of independent interest.2024-08-27T19:30:26Z89 pages. This is the TheoretiCS journal versionTheoretiCS, Volume 5 (May 20, 2026) theoretics:16123Amey BhangaleSubhash KhotDor Minzer10.46298/theoretics.26.9http://arxiv.org/abs/2603.14749v2The Counting General Dominating Set Framework2026-05-19T04:51:52ZWe introduce a new framework of counting problems called #GDS that encompasses #$(σ, ρ)$-Set, a class of domination-type problems that includes counting dominating sets and counting total dominating sets. We explore the intricate relation between #GDS and the well-known Holant. We adapt the technique of gadget construction of Holant to the #GDS framework; using this technique, we prove the #P-completeness of counting dominating sets for 3-regular planar bipartite simple graphs. Through a generalization of a Holant dichotomy, and a special reduction method via symmetric bipartite graphs, we also prove the #P-completeness of counting total dominating sets for the same graph class.2026-03-16T02:36:02ZJiayi ZhengBoning Menghttp://arxiv.org/abs/2605.19181v1Risk of Bad Tails: CVaR-Aware Pandora's Box and Prophet Inequality2026-05-18T23:05:25ZWe study Conditional Value-at-Risk (CVaR) variants of two canonical sequential decision problems: Pandora's box and the prophet inequality. For Pandora's box, the risk-aware problem retains an exact Weitzman-style index solution after a one-dimensional variational reduction. For the prophet inequality, the picture is different: for every CVaR level $α\in(0,1)$, no positive constant approximation guarantee can hold without distributional structure, in sharp contrast with the risk-neutral case $α=1$, and we characterize the tight instance-dependent guarantee. Already in two-item hard instances, the prophet's CVaR benchmark can be made arbitrarily large while every online policy's CVaR remains bounded. This impossibility is due to the nature of CVaR objective: it measures only the worst $α$-fraction of outcomes, so any compromise an online policy makes to preserve the chance of a large payoff in the upper $(1-α)$-fraction does not help its CVaR. It turns out that additional distributional structure restores a uniform result: under continuous reward distributions satisfying a recentered increasing-failure-rate-average (IFRA) condition, a threshold policy achieves an explicit constant bound.2026-05-18T23:05:25ZJingwei Jihttp://arxiv.org/abs/2604.20575v3A Quadratic Lower Bound for Noncommutative Circuits2026-05-18T18:29:01ZWe prove that every fan-in $2$ noncommutative arithmetic circuit computing the palindrome polynomial has size $Ω(nd)$. In particular, when $d=n$ we obtain an $Ω(n^2)$ lower bound. The proof builds on and refines a previous work of the author. Key ideas in the proof were generated by Gemini 3.1 Pro.2026-04-22T13:55:01Z9 pages. Improved parametrization, proof now works for small dPratik Shastrihttp://arxiv.org/abs/2512.02691v3A Tight Double-Exponentially Lower Bound for High-Multiplicity Bin Packing2026-05-18T14:43:04ZConsider a high-multiplicity Bin Packing instance $I$ with $d$ distinct item types. In 2014, Goemans and Rothvoss gave an algorithm with runtime ${{|I|}^2}^{O(d)}$ for this problem~[SODA'14], where $|I|$ denotes the encoding length of the instance $I$. Although Jansen and Klein~[SODA'17] later developed an algorithm that improves upon this runtime in a special case, it has remained a major open problem by Goemans and Rothvoss~[J.ACM'20] whether the doubly exponential dependency on $d$ is necessary.
We solve this open problem by showing that unless the ETH fails, there is no algorithm solving the high-multiplicity Bin Packing problem in time ${{|I|}^2}^{o(d)}$. To prove this, we introduce a novel reduction from 3-SAT. The core of our construction is efficiently encoding all information from a 3-SAT instance with $n$ variables into an ILP with $O(\log(n))$ variables and constraints.
This result confirms that the Goemans and Rothvoss algorithm is essentially best-possible for Bin Packing parameterized by the number $d$ of item sizes in the context of XP time algorithms.2025-12-02T12:24:08Zto appear in ICALP 2026Klaus JansenFelix OhnesorgeLis Pirottonhttp://arxiv.org/abs/2605.18241v1An Entropy-Governed Speedup for Quantum Algorithms on Local Hamiltonians2026-05-18T11:36:43ZLow-energy estimation and state preparation for general $k$-local Hamiltonians are fundamental challenges in quantum complexity theory. For constant relative accuracy, Buhrman et al. (PRL 2025) recently broke the natural Grover bound $O(2^{n/2})$, where $n$ denotes the number of qubits, for both problems. In this paper, for any sufficiently small parameter $d\ge 0$, we present an even faster quantum algorithm that outputs a quantum state with energy bounded by the minimum energy over all depth-$d$ states (i.e., states obtained by applying a depth-$d$ circuit to the all-zero state), together with an estimate of this energy. For the class of Hamiltonians with depth-$d$ ground states, our algorithm furthermore achieves exactly the same energy guarantees as Buhrman et al. Our results also provide insight into the distinction between strongly entangled states and those admitting efficient classical descriptions.2026-05-18T11:36:43Z12 pages, 1 figure, 1 tableRanitha MataraarachchiNagoya University, JapanFrançois Le GallNagoya University, JapanSuguru TamakiUniversity of Hyogo, Japanhttp://arxiv.org/abs/2510.15712v3PLS-complete problems with lexicographic cost functions: Max-$k$-SAT and Abelian Permutation Orbit Minimization2026-05-18T09:56:28ZHow hard is it to find a local optimum? If we are given a graph and want to find a locally maximal cut--meaning that the number of edges in the cut can't be improved by moving a single vertex from one side to the other--then just iterating improving steps finds a local maximum in $ |E|$ steps. If, on the other hand, the edges are weighted, this problem becomes hard for the class PLS (Polynomial Local Search). We are interested in optimization problems with lexicographic costs. For Max-Cut this would mean that the edges $e_1,\dots, e_m$ have costs $c(e_i) = 2^i$. For such a cost function finding a global Max-Cut is easy. In contrast, we show that it is PLS-complete to find an assignment for a 4-CNF formula that is locally maximal (when the clauses have lexicographic weights); and also for a 3-CNF when we allow switching two variables at a time. We use these results to answer a question in Scheder and Tantow, who showed that finding a lexicographic local minimum of a string $s \in \{0,1\}^n$ under the action of a list of given permutations $π_1, \dots, π_k \in S_{n}$ is PLS-complete. They ask whether the problem stays PLS-complete when the $π_1,\dots,π_k$ commute, i.e., generate an Abelian subgroup $G$ of $S_n$. We show that it does, and in fact stays PLS-complete even (1) when every element in $G$ has order two or (2) when $G$ is cyclic. Additionally, we use it to further investigate the complexity of computing pure $α$-Nash equilibria in congestion games. Using lexicographic 4-SAT, we obtain a simple proof of the PLS-completeness originally shown by Skopalik and Vöcking that can be extended to exponential and polynomial delay functions with positive coefficients. The number of strategies per player and players per resource is bounded. However, the degree of the polynomials is not bounded by a constant.2025-10-17T14:56:50Z23 pages, abstract shortened to fulfill arxiv requirementsDominik SchederJohannes Tantowhttp://arxiv.org/abs/2605.18125v1Complexity of Finding and Enumerating Interconnection Trees2026-05-18T09:31:19ZWe study the problem of connecting the parts of a multipartite graph using a minimum number of edges under a matching constraint. We introduce interconnection trees, defined as matchings whose projections onto the quotient graph form a spanning tree. Motivated by applications in chemoinformatics, we investigate the decision, counting, and enumeration variants of this problem.
We show that the decision problem is $NP$-complete. Nevertheless, it becomes tractable in several structured settings: it is fixed-parameter tractable in the number of parts, and admits polynomial or linear-time algorithms on complete, quasi-complete, and $t$-quasi-complete multipartite graphs. We also study enumeration, for which we design efficient flashlight-search based algorithms with optimal delay for complete multipartite graphs, and a weight-guided heuristic that prioritizes low-weight solutions and performs well in practice.2026-05-18T09:31:19Z18 pages, 3 figures, 2 tablesNoé DemangeYann Strozeckihttp://arxiv.org/abs/2605.18079v1The Expressive Power of Low Precision Softmax Transformers with (Summarized) Chain-of-Thought2026-05-18T08:57:53ZExisting expressivity results for transformers typically rely on hardmax attention, high precision, and other architectural modifications that disconnect them from the models used in practice. We bridge this gap by analyzing standard transformer decoders with softmax attention and rounding of activations and attention weights, while allowing depth and width to grow logarithmically with the context length. As an intermediate step, we construct hardmax transformers with ternary activations and well-separated attention scores that simulate Turing machines using Chain-of-Thought (CoT). This lets us convert the constructions to equivalent softmax transformers without the unrealistic parameter magnitudes or activation precision that prior approaches would require. Using the same technique, we analyze a recently proposed summarized CoT paradigm and show that it simulates Turing machines more efficiently, with model size scaling logarithmically in a space bound rather than a time bound. We empirically test predictions made by our results on a Sudoku reasoning task and find better alignment with learnability than for prior high-precision results. Our code is available at https://github.com/moritzbroe/transformer-expressivity.2026-05-18T08:57:53ZAccepted to ICML 2026Moritz BrösamleStephan Ecksteinhttp://arxiv.org/abs/2509.22849v2Parameterized Hardness of Zonotope Containment and Neural Network Verification2026-05-18T08:34:08ZNeural networks with ReLU activations are a widely used model in machine learning. It is thus important to have a profound understanding of the properties of the functions computed by such networks. Recently, there has been increasing interest in the (parameterized) computational complexity of determining these properties. In this work, we close several gaps and resolve an open problem posted by Froese et al. [COLT '25] regarding the parameterized complexity of various problems related to network verification. In particular, we prove that deciding positivity (and thus surjectivity) of a function $f\colon\mathbb{R}^d\to\mathbb{R}$ computed by a 2-layer ReLU network is W[1]-hard when parameterized by $d$. This result also implies that zonotope (non-)containment is W[1]-hard with respect to $d$, a problem that is of independent interest in computational geometry, control theory, and robotics. Moreover, we show that approximating the maximum within any multiplicative factor in 2-layer ReLU networks, computing the $L_p$-Lipschitz constant for $p\in(0,\infty]$ in 2-layer networks, and approximating the $L_p$-Lipschitz constant in 3-layer networks are NP-hard and W[1]-hard with respect to $d$. Notably, our hardness results are the strongest known so far and imply that the naive enumeration-based methods for solving these fundamental problems are all essentially optimal under the Exponential Time Hypothesis.2025-09-26T18:59:59Z20 pages, 5 figures, paper accepted at ICLR 2026Vincent FroeseMoritz GrilloChristoph HertrichMoritz Stargallahttp://arxiv.org/abs/2603.07280v6Automated Lower Bounds for Small Matrix Multiplication Complexity over Finite Fields2026-05-17T21:42:57ZWe develop an automated framework for proving lower bounds on the bilinear complexity of matrix multiplication over finite fields. Our approach systematically combines orbit classification of the restricted first matrix and dynamic programming over these orbits with recursive substitution strategies, culminating in efficiently verifiable proof certificates.
Using this framework, we obtain several new lower bounds for various small matrix formats. Most notably, we prove that the bilinear complexity of multiplying two $3 \times 3$ matrices over $\mathbb{F}_2$ is at least $20$, improving upon the longstanding lower bound of $19$ (Bläser 2003). Our search program finds the proof in under an hour on a laptop, and the resulting certificate verifies in seconds.2026-03-07T16:57:11ZChengu Wanghttp://arxiv.org/abs/2605.17572v1Modelling Network Resilience: The Complexity of Some Graph Division Games2026-05-17T18:00:49ZMotivated by the controller placement problems in software-defined networks and the fair division principles of classical "cake cutting", we investigate the following two-player zero-sum game. In our model, a defender places a limited number of controllers on graph vertices, while an attacker deletes a limited number of vertices. The defender score is the total number of surviving vertices reachable from any remaining controller. We formalize the computational problems associated with various game dynamics (defender plays first; attacker plays first; players play simultaneously; pure or mixed strategies).
We show that these natural problems are $\mathsf{NP}$-complete or $Σ^\mathsf{P}_2$-complete, depending on the specific variant. These hardness results provide limitations for optimal controller placement algorithms under different notions of quality of a solution. Finally, we present structural insights that yield efficient algorithms for restricted graph classes (namely interval graphs and graphs of bounded treewidth).2026-05-17T18:00:49ZGrzegorz GutowskiKonstanty Junosza-SzaniawskiAntonio LauerbachAlexander Wolffhttp://arxiv.org/abs/2510.16609v3Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods2026-05-17T14:04:24ZTest-time augmentation, such as Retrieval-Augmented Generation (RAG) or tool use, critically depends on an interplay between a model's parametric knowledge and externally retrieved information. However, the theoretical underpinnings of this relationship remain poorly understood. Specifically, it is not clear how much pre-training knowledge is required to answer queries with a small number of augmentation steps, which is a desirable property in practice. To address this question, we formulate multi-step reasoning as an $s$-$t$ connectivity problem on a knowledge graph. We represent a model's pre-training parametric knowledge as a partial, potentially noisy subgraph. We view augmentation as querying an oracle for true edges that augment the model's knowledge. Then, we characterize the necessary and sufficient number of augmentation steps for the model to generate an accurate answer given partial prior knowledge. One key result shows a phase transition: if the prior knowledge graph over $n$ vertices is disconnected into small components, then finding a path via augmentation is inefficient and requires $Ω(\sqrt{n})$ queries. On the other hand, once the density of correct knowledge surpasses a threshold, forming a giant component, we can find paths with an expected constant number of queries.2025-10-18T18:17:25ZAvrim BlumDaniel HsuCyrus RashtchianDonya Salesshttp://arxiv.org/abs/2605.12893v2LFPL: Revisited and Mechanized2026-05-17T04:22:36ZHofmann (1999) introduced the functional programming language LFPL to characterize the functions computable in polynomial time using an affine type system. LFPL enables a natural programming style, including nested recursion, and has inspired the development of type systems for automatic cost analysis, linear dependent type theories, and efficient memory management in functional programming languages. Despite its prominence, there does not exist a self-contained presentation, let alone a full mechanization, of LFPL and its core metatheory. This article presents a modern account and mechanization of LFPL and its metatheory with the goal of being self-contained and accessible while streamlining the strongest-known soundness and completeness results. The soundness proof works with the language LFPL+, which extends LFPL with additional language features. The proof is novel, adapting a technique by Aehlig and Schwichtenberg (2002) to construct explicit polynomials that bound the cost of an LFPL+ expression with respect to a big-step cost semantics. The completeness proof shows that LFPL programs can simulate polynomial-time Turing machines while only relying on restricted forms of linear functions and lists. It has the same structure as the original proof by Hofmann (2002) but greatly simplifies the core argument with a novel stack-like data structure that is implemented with first-class functions and lists. The mechanization includes the full soundness and completeness proofs, and serves as one of the first case studies of mechanized metatheory in the recently developed proof assistant Istari.2026-05-13T02:15:26ZThis is the extended version of the article with the same title that appeared at the Forty-First Annual Symposium on Logic in Computer Science (LICS 2026). The difference to the LICS version is that the extended version contains an appendix with additional technical detailsNathaniel GloverJan Hoffmannhttp://arxiv.org/abs/2605.20236v1Information Redistribution Under Reductions in NP Search2026-05-17T02:11:06ZUsing reductions from structured P-matrix violation search to classical NP-complete formulations such as 3-SAT and Subset Sum, we examine the relationship between representational expansion, auxiliary variables, local inferability, and information accessibility. Rather than viewing reductions purely as computational transformations, we interpret them as mechanisms that redistribute hidden witness information across enlarged representations. From this perspective, reductions, gadgets, and auxiliary structures may expose globally encoded witness information to local propagation and inference, while search algorithms act as decoding procedures attempting to recover the original hidden witness. The resulting observations suggest that representational expansion may improve local inferability by introducing auxiliary variables and consistency structures, while preserving the need to recover the underlying witness information. This work is exploratory in nature and proposes a conceptual framework for understanding how reductions reshape information accessibility in NP search.2026-05-17T02:11:06Z18 pages, Exploratory paper on information accessibility, reductions, and local inferability in NP searchJing-Yuan Wei