https://arxiv.org/api/z6AxBsijfJHX2/DQif3mAnppWaA2026-06-13T20:44:09Z313813515http://arxiv.org/abs/2603.02398v1Fast Matrix Multiplication in Small Formats: Discovering New Schemes with an Open-Source Flip Graph Framework2026-03-02T21:20:42ZAn open-source C++ framework for discovering fast matrix multiplication schemes using the flip graph approach is presented. The framework supports multiple coefficient rings -- binary ($\mathbb{Z}_2$), modular ternary ($\mathbb{Z}_3$) and integer ternary ($\mathbb{Z}_T = \{-1,0,1\}$) -- and implements both fixed-dimension and meta-dimensional search operators. Using efficient bit-level encoding of coefficient vectors and OpenMP parallelism, the tools enable large-scale exploration on commodity hardware. The study covers 680 schemes ranging from $(2 \times 2 \times 2)$ to $(16 \times 16 \times 16)$, with 276 schemes now in $\mathbb{Z}_T$ coefficients and 117 in integer coefficients. With this framework, the multiplicative complexity (rank) is improved for 79 matrix multiplication schemes. Notably, a new $4 \times 4 \times 10$ scheme requiring only 115 multiplications is discovered, achieving $ω\approx 2.80478$ and beating Strassen's exponent for this specific size. Additionally, 93 schemes are rediscovered in ternary coefficients that were previously known only over rationals or integers, and 68 schemes in integer coefficients that previously required fractions. All tools and discovered schemes are made publicly available to enable reproducible research.2026-03-02T21:20:42ZA. I. Perminovhttp://arxiv.org/abs/2603.01984v1ViTex: Visual Texture Control for Multi-Track Symbolic Music Generation via Discrete Diffusion Models2026-03-02T15:39:30ZIn automatic music generation, a central challenge is to design controls that enable meaningful human-machine interaction. Existing systems often rely on extrinsic inputs such as text prompts or metadata, which do not allow humans to directly shape the composition. While prior work has explored intrinsic controls such as chords or hierarchical structure, these approaches mainly address piano or vocal-accompaniment settings, leaving multitrack symbolic music largely underexplored. We identify instrumentation, the choice of instruments and their roles, as a natural dimension of control in multi-track composition, and propose ViTex, a visual representation of instrumental texture. In ViTex, color encodes instrument choice, spatial position represents pitch and time, and stroke properties capture local textures. Building on this representation, we develop a discrete diffusion model conditioned on ViTex and chord progressions to generate 8-measure multi-track symbolic music, enabling explicit texture-level control while maintaining strong unconditional generation quality. The demo page and code are avaliable at https://vitex2025.github.io/.2026-03-02T15:39:30ZXiaoyu YiQi HeGus XiaZiyu Wanghttp://arxiv.org/abs/2602.23211v2Coalgebraic analysis of social systems2026-02-28T18:40:17ZThe algebraic analysis of social systems, or algebraic social network analysis, refers to a collection of methods designed to extract information about the structure of a social system represented as a directed graph. Central among these are methods to determine the roles that exist within a given system, and the positions. The analysis of roles and positions is highly developed for social systems that involve only pairwise interactions among actors - however, in contemporary social network analysis it is increasingly common to use models that can take into account higher-order interactions as well. In this paper we take a category-theoretic approach to the question of how to lift role and positional analysis from graphs to hypergraphs, which can accommodate higher-order interactions. We use the framework of universal coalgebra - a 'theory of systems' with origins in computer science and logic - to formalize the main concepts of role and positional analysis and extend them to a large class of structures that includes both graphs and hypergraphs. As evidence for the validity of our definitions, we prove a very general functoriality theorem that specializes, in the case of graphs, to a folkloric observation about the compatibility of positional and role analysis.2026-02-26T16:55:22Z44 pages, 8 figures; new version to fix cref rendering issue, new colors in figures, minor changesNima MotamedNina OtterEmily Roffhttp://arxiv.org/abs/2602.23561v1VaSST: Variational Inference for Symbolic Regression using Soft Symbolic Trees2026-02-27T00:07:31ZSymbolic regression has recently gained traction in AI-driven scientific discovery, aiming to recover explicit closed-form expressions from data that reveal underlying physical laws. Despite recent advances, existing methods remain dominated by heuristic search algorithms or data-intensive approaches that assume low-noise regimes and lack principled uncertainty quantification. Fully probabilistic formulations are scarce, and existing Markov chain Monte Carlo-based Bayesian methods often struggle to efficiently explore the highly multimodal combinatorial space of symbolic expressions. We introduce VaSST, a scalable probabilistic framework for symbolic regression based on variational inference. VaSST employs a continuous relaxation of symbolic expression trees, termed soft symbolic trees, where discrete operator and feature assignments are replaced by soft distributions over allowable components. This relaxation transforms the combinatorial search over an astronomically large symbolic space into an efficient gradient-based optimization problem while preserving a coherent probabilistic interpretation. The learned soft representations induce posterior distributions over symbolic structures, enabling principled uncertainty quantification. Across simulated experiments and Feynman Symbolic Regression Database within SRBench, VaSST achieves superior performance in both structural recovery and predictive accuracy compared to state-of-the-art symbolic regression methods.2026-02-27T00:07:31Z38 pages, 5 figures, 35 tables, SubmittedSomjit RoyPritam DeyBani K. Mallickhttp://arxiv.org/abs/2506.09950v4Oracle-Based Multistep Strategy for Solving Polynomial Systems Over Finite Fields and Algebraic Cryptanalysis of the Aradi Cipher2026-02-26T09:04:56ZThe multistep solving strategy consists in a divide-and-conquer approach: when a multivariate polynomial system is computationally infeasible to solve directly, one variable is assigned over the elements of the base finite field, and the procedure is recursively applied to the resulting simplified systems. In a previous work by the same authors (among others), this approach proved effective in the algebraic cryptanalysis of the Trivium cipher. In this paper, we present a new formulation of the corresponding algorithm based on a Depth-First Search strategy, along with a novel complexity analysis leveraging tree structures. We also introduce the notion of an ``oracle function'', which is intended to determine whether evaluating a new variable is required to simplify the current polynomial system. This notion allows us to unify all previously proposed variants of the multistep strategy, including the classical hybrid approach, by appropriately selecting the oracle function. Finally, we employ the multistep solving strategy in the cryptanalysis of the NSA's recently introduced low-latency block cipher Aradi, achieving a first full-round algebraic attack that exposes structural features in its symbolic model.2025-06-11T17:18:25Z20 pages. To appear in Advances in Mathematics of CommunicationsRoberto La ScalaSharwan Kumar Tiwarihttp://arxiv.org/abs/2602.22371v1Quadratization of Autonomous Partial Differential Equations: Theory and Algorithms2026-02-25T20:08:07ZQuadratization for partial differential equations (PDEs) is a process that transforms a nonquadratic PDE into a quadratic form by introducing auxiliary variables. This symbolic transformation has been used in diverse fields to simplify the analysis, simulation, and control of nonlinear and nonquadratic PDE models. This paper presents a rigorous definition of PDE quadratization, theoretical results for the PDE quadratization problem of spatially one-dimensional PDEs-including results on existence and complexity-and introduces QuPDE, an algorithm based on symbolic computation and discrete optimization that outputs a quadratization for any spatially one-dimensional polynomial or rational PDE. This algorithm is the first computational tool to find quadratizations for PDEs to date. We demonstrate QuPDE's performance by applying it to fourteen nonquadratic PDEs in diverse areas such as fluid mechanics, space physics, chemical engineering, and biological processes. QuPDE delivers a low-order quadratization in each case, uncovering quadratic transformations with fewer auxiliary variables than those previously discovered in the literature for some examples, and finding quadratizations for systems that had not been transformed to quadratic form before.2026-02-25T20:08:07ZAlbani OlivieriGleb PogudinBoris Kramerhttp://arxiv.org/abs/2603.03346v1Physics-constrained symbolic regression for discovering closed-form equations of multimodal water retention curves from experimental data2026-02-24T18:48:15ZModeling the unsaturated behavior of porous materials with multimodal pore size distributions presents significant challenges, as standard hydraulic models often fail to capture their complex, multi-scale characteristics. A common workaround involves superposing unimodal retention functions, each tailored to a specific pore size range; however, this approach requires separate parameter identification for each mode, which limits interpretability and generalizability, especially in data-sparse scenarios. In this work, we introduce a fundamentally different approach: a physics-constrained machine learning framework designed for meta-modeling, enabling the automatic discovery of closed-form mathematical expressions for multimodal water retention curves directly from experimental data. Mathematical expressions are represented as binary trees and evolved via genetic programming, while physical constraints are embedded into the loss function to guide the symbolic regressor toward solutions that are physically consistent and mathematically robust. Our results demonstrate that the proposed framework can discover closed-form equations that effectively represent the water retention characteristics of porous materials with varying pore structures. To support third-party validation, application, and extension, we make the full implementation publicly available in an open-source repository.2026-02-24T18:48:15ZYejin KimHyoung Suk Suhhttp://arxiv.org/abs/2602.19088v2A Formal Framework for Predicting Distributed System Performance under Faults (Extended Version)2026-02-24T07:53:48ZToday's distributed systems operate in complex environments that inevitably involve faults and even adversarial behaviors. Predicting their performance under such environments directly from formal designs remains a longstanding challenge. We present the first formal framework that systematically enables performance prediction of distributed systems across diverse faulty scenarios. Our framework features a fault injector together with a wide range of faults, reusable as a library, and model compositions that integrate the system and the fault injector into a unified model suitable for statistical analysis of performance properties such as throughput and latency. We formalize the framework in Maude and implement it as an automated tool, PERF. Applied to representative distributed systems, PERF accurately predicts system performance under varying fault settings, with estimations from formal designs consistent with evaluations on real deployments.2026-02-22T08:04:56Z32 pages, 3 figures. Accepted by FM 2026Ziwei ZhouSi LiuZhou ZhouPeixin WangMIn Zhanghttp://arxiv.org/abs/2602.20557v1GENSR: Symbolic Regression Based in Equation Generative Space2026-02-24T05:14:34ZSymbolic Regression (SR) tries to reveal the hidden equations behind observed data. However, most methods search within a discrete equation space, where the structural modifications of equations rarely align with their numerical behavior, leaving fitting error feedback too noisy to guide exploration. To address this challenge, we propose GenSR, a generative latent space-based SR framework following the `map construction -> coarse localization -> fine search'' paradigm. Specifically, GenSR first pretrains a dual-branch Conditional Variational Autoencoder (CVAE) to reparameterize symbolic equations into a generative latent space with symbolic continuity and local numerical smoothness. This space can be regarded as a well-structured `map'' of the equation space, providing directional signals for search. At inference, the CVAE coarsely localizes the input data to promising regions in the latent space. Then, a modified CMA-ES refines the candidate region, leveraging smooth latent gradients. From a Bayesian perspective, GenSR reframes the SR task as maximizing the conditional distribution $p(\mathrm{Equ.} \mid \mathrm{Num.})$, with CVAE training achieving this objective through the Evidence Lower Bound (ELBO). This new perspective provides a theoretical guarantee for the effectiveness of GenSR. Extensive experiments show that GenSR jointly optimizes predictive accuracy, expression simplicity, and computational efficiency, while remaining robust under noise.2026-02-24T05:14:34ZQian LiYuxiao HuJuncheng LiuYuntian Chenhttp://arxiv.org/abs/2506.07751v4AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking2026-02-23T18:25:13ZRecent studies have shown that large language models (LLMs), especially smaller ones, often lack robustness in grade school math (GSM) reasoning. In particular, they tend to experience performance drops when faced with distribution shifts, such as changes to numerical or nominal variables, or insertions of distracting clauses. A possible strategy to address this involves generating synthetic data to further "instantiate" reasoning problems on potential variations. In this work, we instead focus on the strategy of "abstracting" reasoning problems. This not only helps counteract distribution shifts but also facilitates the connection to symbolic tools for deriving solutions. Focusing on GSM, we find that this abstraction process is better acquired through reinforcement learning (RL) than just supervised fine-tuning, which often fails to produce faithful abstractions. Our method, AbstRaL -- which promotes abstract reasoning in LLMs using RL on granular abstraction data -- significantly mitigates performance degradation on recent GSM perturbation benchmarks. Besides, improving GSM robustness via AbstRaL is shown to also implicitly benefit LLMs' capabilities on OOD mathematical and general reasoning tasks, indicating that abstract thinking broadly enables better generalizability.2025-06-09T13:34:50ZICLR 2026Silin GaoAntoine BosselutSamy BengioEmmanuel Abbehttp://arxiv.org/abs/2602.19886v1Order Bounds for Hypergeometric and q-Hypergeometric Creative Telescoping2026-02-23T14:31:44ZLeveraging a general framework adapted from symbolic integration, a unified reduction-based algorithm for computing telescopers of minimal order for hypergeometric and q-hypergeometric terms has been recently developed. In this paper, we conduct a deeper exploration and put forth a new argument for the termination of the algorithm. This not only provides an independent proof of existence of telescopers, but also allows us to derive unified upper and lower bounds on the order of telescopers for hypergeometric terms and their q-analogues. Compared with known bounds in the literature, our bounds, in the hypergeometric case, are exactly the same as the tight ones obtained in 2016; while in the q-hypergeometric case, no lower bounds were known before, and our upper bound is sometimes better and never worse than the known one.2026-02-23T14:31:44ZHui Huanghttp://arxiv.org/abs/2602.19686v1A Flow Extension to Coroutine Types for Deadlock Detection in Go2026-02-23T10:31:22ZCoroutines, as an abstract programming construct, are a generalization of functions that can suspend execution part- way for later resumption. Coroutine Types are behavioral types to model interactions of coroutines with a single receiving operation followed by a single yielding operation. Coroutine Types have been applied to model-driven engineering, smart contracts, and test case generation. We contribute a Flow extension to Coroutine Types, so that coroutines with more than one receiving and yielding operation can be modeled. We accordingly revise the reduction rules of Coroutine Types. To show the usefulness of the Flow extension, we contribute a type system that maps expressions of the Go programming language to Coroutine Types. If the reduction result is 0, the two channel operations are paired properly and the program has no deadlocks. We choose Go because it is a popular programming language for distributed systems, but a frequent kind of bugs in Go is deadlocks due to the wrong use of concurrency features. We concentrate on the most commonly used semantics in Go: unbuffered channels with the keywords go and defer. Our Flow extension and the type system recognize 17 patterns of channels and goroutine interactions, including mismatched receivers and senders, nested goroutines, etc. We also integrate the Z3 SMT solver to take account of conditional execution and type inheritance. Other static or dynamic deadlock detectors crashed or gave wrong predictions in some patterns. Therefore, our type-based deadlock analyzer not only fills the gap in the landscape of value-based detection, but also complements existing detectors.2026-02-23T10:31:22ZAccepted in ICSESS 2025, MacaoQiqi Jason GuLixue LiuWei Kehttp://arxiv.org/abs/2602.19255v1Statistical Analysis of Hairpins and BasePairs in RNA Secondary Structures2026-02-22T16:11:22ZWe derive precise asymptotic expressions for the expectations, variances, covariance, and quite a few further mixed moments for the number of hairpins and the number of basepairs in RNA secondary structures, and give convincing evidence that the central-scaled distribution of the pair of random variables (hairpins, basepairs) tends in distribution to the bi-variate normal distribution with correlation $\sqrt{5 \sqrt{5} -11}/2= 0.2123322205\dots$2026-02-22T16:11:22ZAJ BuManuel KauersDoron Zeilbergerhttp://arxiv.org/abs/2602.18916v1Adaptive Collaboration of Arena-Based Argumentative LLMs for Explainable and Contestable Legal Reasoning2026-02-21T17:47:13ZLegal reasoning requires not only high accuracy but also the ability to justify decisions through verifiable and contestable arguments. However, existing Large Language Model (LLM) approaches, such as Chain-of-Thought (CoT) and Retrieval-Augmented Generation (RAG), often produce unstructured explanations that lack a formal mechanism for verification or user intervention. To address this limitation, we propose Adaptive Collaboration of Argumentative LLMs (ACAL), a neuro-symbolic framework that integrates adaptive multi-agent collaboration with an Arena-based Quantitative Bipolar Argumentation Framework (A-QBAF). ACAL dynamically deploys expert agent teams to construct arguments, employs a clash resolution mechanism to adjudicate conflicting claims, and utilizes uncertainty-aware escalation for borderline cases. Crucially, our framework supports a Human-in-the-Loop (HITL) contestability workflow, enabling users to directly audit and modify the underlying reasoning graph to influence the final judgment. Empirical evaluations on the LegalBench benchmark demonstrate that ACAL outperforms strong baselines across Gemini-2.5-Flash-Lite and Gemini-2.5-Flash architectures, effectively balancing efficient predictive performance with structured transparency and contestability. Our implementation is available at: https://github.com/loc110504/ACAL.2026-02-21T17:47:13ZHoang-Loc CaoPhuc HoTruong Thanh Hung NguyenPhuc Truong Loc NguyenDinh Thien Loc NguyenHung Caohttp://arxiv.org/abs/1811.10062v5On Exact Reznick, Hilbert-Artin and Putinar's Representations2026-02-20T22:12:08ZWe consider the problem of computing exact sums of squares (SOS) decompositions for certain classes of non-negative multivariate polynomials, relying on semidefinite programming (SDP) solvers.
We provide a hybrid numeric-symbolic algorithm computing exact rational SOS decompositions with rational coefficients for polynomials lying in the interior of the SOS cone. The first step of this algorithm computes an approximate SOS decomposition for a perturbation of the input polynomial with an arbitrary-precision SDP solver. Next, an exact SOS decomposition is obtained thanks to the perturbation terms and a compensation phenomenon. We prove that bit complexity estimates on output size and runtime are both singly exponential in the cardinality of the Newton polytope (or doubly exponential in the number of variables). Next, we apply this algorithm to compute exact Reznick, Hilbert-Artin's representation and Putinar's representations respectively for positive definite forms and positive polynomials over basic compact semi-algebraic sets. We also report on practical experiments done with the implementation of these algorithms and existing alternatives such as the critical point method and cylindrical algebraic decomposition.2018-11-25T17:51:55Z35 pages, 4 tables, extended version of the paper from ISSAC'18 conference (available at arXiv::1802.10339), fixed the statement and proof of Proposition 24Victor MagronMohab Safey El Din