http://arxiv.org/abs/2508.21005v2
Measuring Ransomware Lateral Movement Susceptibility via Privilege-Weighted Adjacency Matrix Exponentiation
2025-11-07T17:29:36Z
Ransomware impact hinges on how easily an intruder can move laterally and spread to the maximum number of assets. We present a graph-theoretic formulation that casts lateral movement as a path-closure problem over a probability semiring to measure lateral-movement susceptibility and estimate blast radius. We build a directed multigraph where vertices represent assets and edges represent reachable services (e.g., RDP/SSH) between them. We model lateral movement as a probabilistic process using a pivot potential factor $π(s)$ for each service, with step successes composed via a probabilistic path operator $\otimes$ and alternative paths aggregated via a probabilistic union $\oplus$ (noisy-OR). This yields a monotone fixed-point (iterative) computation of a $K$-hop compromise probability matrix that captures how compromise propagates through the network. Metrics derived from this model include: (1) Lateral-Movement Susceptibility (LMS$_K$): the average probability of a successful lateral movement between any two assets (0-1 scale); and (2) Blast-Radius Estimate (BRE$_K$): the expected percentage of assets compromised in an average attack scenario. Interactive services (SSH 22, RDP 3389) receive higher $π(s)$ than app-only ports (MySQL 3306, MSSQL 1433), which seldom enable pivoting without an RCE. Across anonymized enterprise snapshots, pruning high-$π(s)$ edges yields the largest LMS$_K$/BRE$_K$ drop, aligning with CISA guidance, MITRE ATT\&CK (TA0008: Lateral Movement), and NIST SP~800-207.
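The $K$-hop computation described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact operators, so the sketch assumes that $\otimes$ is a plain product of step probabilities, that $\oplus$ is the noisy-OR $1 - \prod (1 - p)$, and that LMS$_K$ is the mean of the off-diagonal entries. The `PIVOT_POTENTIAL` values and all function names are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical pivot potentials pi(s); the numbers are illustrative assumptions,
# not values taken from the paper.
PIVOT_POTENTIAL: Dict[str, float] = {"ssh": 0.9, "rdp": 0.9, "mysql": 0.2, "mssql": 0.2}

Matrix = List[List[float]]


def noisy_or(probs: List[float]) -> float:
    """Probabilistic union (assumed form of the oplus operator): 1 - prod(1 - p)."""
    miss = 1.0
    for p in probs:
        miss *= 1.0 - p
    return 1.0 - miss


def one_hop_matrix(n: int, edges: List[Tuple[int, int, str]]) -> Matrix:
    """Collapse parallel service edges (u, v, service) into one-hop probabilities."""
    per_pair: Dict[Tuple[int, int], List[float]] = {}
    for u, v, service in edges:
        per_pair.setdefault((u, v), []).append(PIVOT_POTENTIAL[service])
    p1 = [[0.0] * n for _ in range(n)]
    for (u, v), probs in per_pair.items():
        if u != v:
            p1[u][v] = noisy_or(probs)
    return p1


def k_hop_matrix(p1: Matrix, k: int) -> Matrix:
    """Iterate P_{t+1}[i][j] = P_1[i][j] (+) noisy-OR over m of P_t[i][m] * P_1[m][j]."""
    n = len(p1)
    p = [row[:] for row in p1]
    for _ in range(k - 1):
        nxt = [[0.0] * n for _ in range(n)]
        for i in range(n):
            for j in range(n):
                if i == j:
                    continue
                # Paths that reach j through an intermediate asset m.
                via = [p[i][m] * p1[m][j] for m in range(n) if m not in (i, j)]
                nxt[i][j] = noisy_or([p1[i][j]] + via)
        p = nxt
    return p


def lms(p: Matrix) -> float:
    """Mean off-diagonal compromise probability (assumed reading of LMS_K)."""
    n = len(p)
    total = sum(p[i][j] for i in range(n) for j in range(n) if i != j)
    return total / (n * (n - 1))


# Three assets: 0 reaches 1 over SSH and MySQL, 1 reaches 2 over RDP.
edges = [(0, 1, "ssh"), (0, 1, "mysql"), (1, 2, "rdp")]
p1 = one_hop_matrix(3, edges)
p2 = k_hop_matrix(p1, 2)
print(round(p2[0][2], 3), round(lms(p2), 3))
```

Because every update includes the one-hop term, the entries never decrease from one iteration to the next, which is the monotone behaviour the abstract refers to. Treating alternative paths as independent, as noisy-OR does, can overestimate the probability when paths share edges.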
The framework evaluates (micro)segmentation and helps prioritize controls that reduce lateral-movement susceptibility and shrink blast radius.
2025-08-28T17:07:34Z
16 pages, 14 figures
Satyam Tyagi, Ganesh Murugesan

http://arxiv.org/abs/2511.05295v1
Language Generation and Identification From Partial Enumeration: Tight Density Bounds and Topological Characterizations
2025-11-07T14:56:04Z
The success of large language models (LLMs) has motivated formal theories of language generation and learning. We study the framework of \emph{language generation in the limit}, where an adversary enumerates strings from an unknown language $K$ drawn from a countable class, and an algorithm must generate unseen strings from $K$. Prior work showed that generation is always possible, and that some algorithms achieve positive lower density, revealing a \emph{validity--breadth} trade-off between correctness and coverage. We resolve a main open question in this line, proving a tight bound of $1/2$ on the best achievable lower density. We then strengthen the model to allow \emph{partial enumeration}, where the adversary reveals only an infinite subset $C \subseteq K$. We show that generation in the limit remains achievable, and if $C$ has lower density $α$ in $K$, the algorithm's output achieves density at least $α/2$, matching the upper bound. This generalizes the $1/2$ bound to the partial-information setting, where the generator must recover within a factor $1/2$ of the revealed subset's density. We further revisit the classical Gold--Angluin model of \emph{language identification} under partial enumeration.
We characterize when identification in the limit is possible -- when hypotheses $M_t$ eventually satisfy $C \subseteq M \subseteq K$ -- and in the process give a new topological formulation of Angluin's characterization, showing that her condition is precisely equivalent to an appropriate topological space having the $T_D$ separation property.
2025-11-07T14:56:04Z
Jon Kleinberg, Fan Wei

http://arxiv.org/abs/2511.04558v2
Boolean function monotonicity testing requires (almost) $n^{1/2}$ queries
2025-11-07T14:26:53Z
We show that for any constant $c>0$, any (two-sided error) adaptive algorithm for testing monotonicity of Boolean functions must have query complexity $Ω(n^{1/2-c})$. This improves the $\tilde{Ω}(n^{1/3})$ lower bound of [CWX17] and almost matches the $\tilde{O}(\sqrt{n})$ upper bound of [KMS18].
2025-11-06T17:08:33Z
Mark Chen, Xi Chen, Hao Cui, William Pires, Jonah Stockwell

http://arxiv.org/abs/2507.15616v2
On zeros and algorithms for disordered systems: mean-field spin glasses
2025-11-06T17:34:54Z
Spin glasses are fundamental probability distributions at the core of statistical physics, the theory of average-case computational complexity, and modern high-dimensional statistical inference. In the mean-field setting, we design deterministic quasipolynomial-time algorithms for estimating the partition function to arbitrarily high accuracy for all inverse temperatures in the second moment regime. In particular, for the Sherrington--Kirkpatrick model, our algorithms succeed for the entire replica-symmetric phase. To achieve this, we study the locations of the zeros of the partition function. Notably, our methods are conceptually simple, and apply equally well to the spherical case and the case of Ising spins.
2025-07-21T13:41:07Z
Compared to the previous version, we establish an improved zero-free result for the second moment regime
Ferenc Bencs, Brice Huang, Daniel Z.
Lee, Kuikui Liu, Guus Regts

http://arxiv.org/abs/2504.13063v2
An exact approach for the multi-depot electric vehicle scheduling problem
2025-11-06T12:33:59Z
The "avoid - shift - improve" framework and the European Clean Vehicles Directive set the path for improving the efficiency and ultimately decarbonizing the transport sector. While electric buses have already been adopted in several cities, regional bus lines may pose additional challenges due to the potentially longer distances they have to travel. In this work, we model and solve the electric bus scheduling problem, lexicographically minimizing the size of the bus fleet, the number of charging stops, and the total energy consumed, to provide decision support for bus operators planning to replace their diesel-powered fleet with zero emission vehicles. We propose a graph representation which allows partial charging without explicitly relying on time variables and derive 3-index and 2-index mixed-integer linear programming formulations for the multi-depot electric vehicle scheduling problem. While the 3-index model can be solved by an off-the-shelf solver directly, the 2-index model relies on an exponential number of constraints to ensure the correct depot pairing. These are separated in a cutting plane fashion. We propose a set of instances with up to 80 service trips to compare the two approaches, showing that, with a small number of depots, the compact 3-index model performs very well. However, as the number of depots increases, the developed branch-and-cut algorithm proves to be of value. These findings not only offer algorithmic insights; the developed approaches also provide actionable guidance for transit agencies and operators, allowing them to quantify trade-offs between fleet size, energy efficiency, and infrastructure needs under realistic operational conditions.
2025-04-17T16:18:56Z
Xenia Haslinger, Elisabeth Gaar, Sophie N.
Parragh

http://arxiv.org/abs/2511.03864v1
Induced matching treewidth and tree-independence number, revisited
2025-11-05T21:09:44Z
We study two graph parameters defined via tree decompositions: tree-independence number and induced matching treewidth. Both parameters are defined similarly to treewidth, but with respect to different measures of a tree decomposition $\mathcal{T}$ of a graph $G$: for tree-independence number, the measure is the maximum size of an independent set in $G$ included in some bag of $\mathcal{T}$, while for the induced matching treewidth, the measure is the maximum size of an induced matching in $G$ such that some bag of $\mathcal{T}$ contains at least one endpoint of every edge of the matching.
While the induced matching treewidth of any graph is bounded from above by its tree-independence number, the family of complete bipartite graphs shows that small induced matching treewidth does not imply small tree-independence number. On the other hand, Abrishami, Briański, Czyżewska, McCarty, Milanič, Rzążewski, and Walczak~[SIAM Journal on Discrete Mathematics, 2025] showed that, if a fixed biclique $K_{t,t}$ is excluded as an induced subgraph, then the tree-independence number is bounded from above by some function of the induced matching treewidth. The function resulting from their proof is exponential even for fixed $t$, as it relies on multiple applications of Ramsey's theorem. In this note we show, using the Kővári-Sós-Turán theorem, that for any class of $K_{t,t}$-free graphs, the two parameters are in fact polynomially related.
2025-11-05T21:09:44Z
Noga Alon, Martin Milanič, Paweł Rzążewski

http://arxiv.org/abs/2511.03592v1
Characterizations of undirected 2-quasi best match graphs
2025-11-05T16:11:22Z
Bipartite best match graphs (BMG) and their generalizations arise in mathematical phylogenetics as combinatorial models describing evolutionary relationships among related genes in a pair of species. In this work, we characterize the class of \emph{undirected 2-quasi-BMGs} (un2qBMGs), which form a proper subclass of the $P_6$-free chordal bipartite graphs. We show that un2qBMGs are exactly the class of bipartite graphs free of $P_6$, $C_6$, and the eight-vertex Sunlet$_4$ graph. Equivalently, a bipartite graph $G$ is un2qBMG if and only if every connected induced subgraph contains a ``heart-vertex'' which is adjacent to all the vertices of the opposite color. We further provide an $O(|V(G)|^3)$-time algorithm for the recognition of un2qBMGs that, in the affirmative case, constructs a labeled rooted tree that ``explains'' $G$.
Finally, since un2qBMGs coincide with the $(P_6,C_6)$-free bi-cographs, they can also be recognized in linear time.
2025-11-05T16:11:22Z
24 pages, 6 figures
Annachiara Korchmaros, Guillaume E. Scholz, Peter F. Stadler

http://arxiv.org/abs/2510.22193v2
(Approximate) Matrix Multiplication via Convolutions
2025-11-05T13:05:52Z
We study the capability of the Fast Fourier Transform (FFT) to accelerate exact and approximate matrix multiplication without using Strassen-like divide-and-conquer. We present a simple exact algorithm running in $O(n^{2.89})$ time, which only sums a few convolutions (FFTs) in $\mathbb{Z}_{m}^{k}$, building on the work of Cohn, Kleinberg, Szegedy and Umans (2005). As a corollary, combining this algorithm with linear sketching breaks the longstanding linear speed-accuracy tradeoff for "combinatorial" approximate matrix multiplication (AMM, Pagh'13, Sarlos'06, Clarkson-Woodruff'13), achieving error $\frac{1}{r^{1.1}}\left\lVert \mathbf{A} \right\rVert_{F}^{2}\left\lVert \mathbf{B}\right\rVert_{F}^{2}$ in $O(rn^{2})$ time, using nothing but FFTs.
Motivated by the rich literature on approximating polynomials, our main contribution in this paper is extending the group-theoretic framework of Cohn and Umans (2003) to approximate matrix multiplication (AMM). Specifically, we introduce and study an approximate notion of the Triple Product Property, which in the abelian case is equivalent to finding a sumset which minimizes (multi-)intersections with an arithmetic progression. We prove tight bounds on this quantity for abelian groups (yielding a simple and practical AMM algorithm via polynomial multiplication), and establish a weaker lower bound for non-abelian groups, extending a lemma of Gowers. Finally, we propose a concrete approach that uses low-degree approximation of multi-variate polynomials for AMM, which we believe will lead to practical, non-asymptotic AMM algorithms in real-world applications, most notably LLM inference.
2025-10-25T07:19:52Z
Yahel Uffenheimer, Omri Weinstein

http://arxiv.org/abs/2511.03335v1
Extension of the Gyárfás-Sumner conjecture to signed graphs
2025-11-05T10:08:15Z
The balanced chromatic number of a signed graph G is the minimum number of balanced sets that cover all vertices of G. Studying structural conditions which imply bounds on the balanced chromatic number of signed graphs is among the most fundamental problems in graph theory. In this work, we initiate the study of coloring hereditary classes of signed graphs. More precisely, we say that a set F = {F_1, F_2, ..., F_l} is a GS (for Gyárfás-Sumner) set if there exists a constant c such that signed graphs with no induced subgraph switching equivalent to a member of F admit a balanced c-coloring. The focus of this work is to study GS sets of order 2. We show that if F is a GS set of order 2, then F_1 is either (K_3, -) or (K_4, -), and F_2 is a linear forest. In the case of F_1 = (K_3, -), we show that any choice of a linear forest for F_2 works.
In the case of F_1 = (K_4, -), we show that if each connected component of F_2 is a path of length at most 4, then {F_1, F_2} is a GS set.
2025-11-05T10:08:15Z
Guillaume Aubian, Allen Ibiapina, Luis Kuffner, Reza Naserasr, Cyril Pujol, Cléophée Robin, Huan Zhou

http://arxiv.org/abs/2511.02983v1
Towards a geometric characterization of unbounded integer cubic optimization problems via thin rays
2025-11-04T20:42:35Z
We study geometric characterizations of unbounded integer polynomial optimization problems. While unboundedness along a ray fully characterizes unbounded integer linear and quadratic optimization problems, we show that this is not the case for cubic polynomials. To overcome this, we introduce thin rays, which are rays with an arbitrarily small neighborhood, and prove that they characterize unboundedness for integer cubic optimization problems in dimension up to three, and we conjecture that the same holds in all dimensions. Our techniques also provide a complete characterization of unbounded integer quadratic optimization problems in arbitrary dimension, without assuming rational coefficients. These results underscore the significance of thin rays and offer new tools for analyzing integer polynomial optimization problems beyond the quadratic case.
2025-11-04T20:42:35Z
Alberto Del Pia

http://arxiv.org/abs/2510.15168v2
A Courcelle-Type Metatheorem for Rank-Bounded Unconstrained Binary Optimization
2025-11-04T19:33:25Z
We present the first uniform XP exact algorithm for unconstrained binary optimization of quadratic, polynomial, fractional, and other objectives under a single parameter, the differentially affine (DA) rank $r$. An objective $f: \{0,1\}^n \to \mathbb{R}$ has DA rank $r$ if there is a feature map $ψ: \{0,1\}^n \to \mathbb{R}^r$ such that each coordinate flip has finite gain $Δ_{\pm e_i}f(x)=\langle v_{\pm e_i},ψ(x)\rangle+β_{\pm e_i}$.
Our algorithm enumerates the $O((2n)^r)$ chambers of the induced hyperplane arrangement and applies a two-sided local-optimality test: a solution exists on a chamber and is unique iff $\operatorname{sign}Δ_{+e_i}=-\operatorname{sign}Δ_{-e_i}$ for all $i$, in which case $x_i^\star=1$ iff $Δ_{+e_i}>0$. This yields $n^{O(r)}$ time with $O(n)$ decoding per chamber. The framework uniformly covers a wide range of nonlinear functions, including all rank-$r$ quadratics, low-Waring-rank pseudo-Boolean polynomials, finite products/ratios on positive domains, finite-basis separable sums via explicit lifts, Taylor-series approximations of analytic functions, and compositions of all the foregoing. Applications include Ising spin models, optimal experimental design, portfolio optimization, and robust statistics. Prior to our work, only specialized subcases involving sparsity, convexity, submodularity, etc. were known to be tractable. Analogous in spirit to Courcelle's theorem (MSO on bounded treewidth graphs) and Grohe's meta-theorems for constraint satisfaction, our result replaces logical width with analytic rank for nonlinear pseudo-Boolean optimization.
2025-10-16T22:10:38Z
Marc Harary

http://arxiv.org/abs/2511.02705v1
A Simple and Fast $(3+\varepsilon)$-approximation for Constrained Correlation Clustering
2025-11-04T16:25:06Z
In Constrained Correlation Clustering, the goal is to cluster a complete signed graph in a way that minimizes the number of negative edges inside clusters plus the number of positive edges between clusters, while respecting hard constraints on how to cluster certain friendly or hostile node pairs. Fischer et al. [FKKT25a] recently developed a $\tilde{O}(n^3)$-time 16-approximation algorithm for this problem. We settle an open question posed by these authors by designing an algorithm that is equally fast but brings the approximation factor down to $(3+\varepsilon)$ for arbitrary constant $\varepsilon > 0$.
Although several new algorithmic steps are needed to obtain our improved approximation, our approach maintains many advantages in terms of simplicity. In particular, it relies mainly on rounding a (new) covering linear program, which can be approximated quickly and combinatorially. Furthermore, the rounding step amounts to applying the very familiar Pivot algorithm to an auxiliary graph. Finally, we develop much simpler algorithms for instances that involve only friendly or only hostile constraints.
2025-11-04T16:25:06Z
Accepted for publication at the 2026 SIAM Symposium on Simplicity in Algorithms (SOSA26)
Nate Veldt

http://arxiv.org/abs/2511.02442v1
Emerging consecutive pattern avoidance
2025-11-04T10:15:52Z
In this note we study the {\em asymptotic popularity}, that is, the limit probability of finding a given consecutive pattern at a random position in a random permutation, in the eighteen classes of permutations avoiding at least two length 3 consecutive patterns. We show that for ten classes, this popularity can be readily deduced from the structure of permutations. By combining analytical and bijective approaches, we study in detail two more involved cases. The problem remains open for five classes.
2025-11-04T10:15:52Z
10 pages
Nathanaël Hassler, Sergey Kirgizov

http://arxiv.org/abs/2511.02406v1
Arithmetic Circuits and Neural Networks for Regular Matroids
2025-11-04T09:37:14Z
We prove that there exist uniform $(+,\times,/)$-circuits of size $O(n^3)$ to compute the basis generating polynomial of regular matroids on $n$ elements. By tropicalization, this implies that there exist uniform $(\max,+,-)$-circuits and ReLU neural networks of the same size for weighted basis maximization of regular matroids. As a consequence in linear programming theory, we obtain a first example where taking the difference of two extended formulations can be more efficient than the best known individual extended formulation of size $O(n^6)$ by Aprile and Fiorini.
Such differences have recently been introduced as virtual extended formulations. The proof of our main result relies on a fine-tuned version of Seymour's decomposition of regular matroids which allows us to identify and maintain graphic substructures to which we can apply a local version of the star-mesh transformation.
2025-11-04T09:37:14Z
Christoph Hertrich, Stefan Kober, Georg Loho

http://arxiv.org/abs/2511.02096v1
Numbering Combinations for Compact Representation of Many-to-Many Relationship Sets
2025-11-03T22:10:20Z
In this paper we propose an approach to implement a specific relationship set between two entities, called a combinatorial relationship set. For the combinatorial relationship set B between entity sets G and I, the mapping cardinality is many-to-many. Additionally, entities from G can be uniquely encoded with a pair of values (h, k) generated with the procedure for numbering combinations of entities from I. The encoding procedure is based on the combinatorial number system, which provides a representation of all possible k-combinations of a set of n elements by a single number. In general, many-to-many relationship sets are represented by a relation or table, while the combinatorial relationship set is not physically stored as a separate table. Instead, all information is encapsulated in a single column added to G. The new column is a candidate key in G. An additional operation named Rank-Join is presented as an extension of the fundamental relational algebra to combine information from g and i associated with a combinatorial relationship set. The motivation for the combinatorial relationship set originates from challenges in designing and implementing multivalued dimensions and bridge tables in data-warehouse models.
2025-11-03T22:10:20Z
Savo Tomovic
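
The combinatorial number system mentioned in the last abstract can be illustrated with a short sketch. It shows the standard ranking of a k-combination of non-negative integers, rank = C(c_1, 1) + C(c_2, 2) + ... + C(c_k, k) for c_1 < c_2 < ... < c_k, together with its inverse. The function names are hypothetical, and reading the abstract's pair (h, k) as (rank, combination size) is an assumption; the paper's actual encoding procedure and its Rank-Join operation are not reproduced here.

```python
from math import comb
from typing import List, Sequence, Tuple


def rank_combination(combo: Sequence[int]) -> int:
    """Rank of a k-combination of non-negative integers (combinatorial number system)."""
    items = sorted(set(combo))
    if len(items) != len(combo) or (items and items[0] < 0):
        raise ValueError("combination must contain distinct non-negative integers")
    # With c_1 < c_2 < ... < c_k, the rank is the sum of C(c_i, i).
    return sum(comb(c, i) for i, c in enumerate(items, start=1))


def unrank_combination(rank: int, k: int) -> List[int]:
    """Inverse of rank_combination: the sorted k-combination with the given rank."""
    items = []
    for i in range(k, 0, -1):
        # Greedily pick the largest c with C(c, i) <= remaining rank.
        c = i - 1
        while comb(c + 1, i) <= rank:
            c += 1
        items.append(c)
        rank -= comb(c, i)
    return sorted(items)


def encode(combo: Sequence[int]) -> Tuple[int, int]:
    """Hypothetical (h, k) pair: rank of the combination and its size."""
    return rank_combination(combo), len(combo)


# An entity of G associated with items 0 and 3 of I is stored as a single pair.
h, k = encode([0, 3])
print(h, k, unrank_combination(h, k))
```

Storing the size alongside the rank matters because the rank alone is ambiguous: each combination size has its own numbering starting from 0.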