https://arxiv.org/api/8Zk4sefjC2r6dumx1igh9OOa2mw 2026-04-01T08:33:10Z 3072 60 15 http://arxiv.org/abs/2602.17826v1 Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge 2026-02-19T20:45:16Z Language models exhibit fundamental limitations -- hallucination, brittleness, and lack of formal grounding -- that are particularly problematic in high-stakes specialist fields requiring verifiable reasoning. I investigate whether formal domain ontologies can enhance language model reliability through retrieval-augmented generation. Using mathematics as proof of concept, I implement a neuro-symbolic pipeline leveraging the OpenMath ontology with hybrid retrieval and cross-encoder reranking to inject relevant definitions into model prompts. Evaluation on the MATH benchmark with three open-source models reveals that ontology-guided context improves performance when retrieval quality is high, but irrelevant context actively degrades it -- highlighting both the promise and challenges of neuro-symbolic approaches. 2026-02-19T20:45:16Z Submitted to NeuS 2026. Supplementary materials and code: https://doi.org/10.5281/zenodo.18665030 Marcelo Labre http://arxiv.org/abs/2602.11041v2 Exploiting the Structure in Tensor Decompositions for Matrix Multiplication 2026-02-19T20:18:59Z We present a new algorithm for fast matrix multiplication using tensor decompositions which have special features. Thanks to these features we obtain exponents lower than what the rank of the tensor decomposition suggests. In particular for $6\times 6$ matrix multiplication we reduce the exponent of the recent algorithm by Moosbauer and Poole from $2.8075$ to $2.8016$, while retaining a reasonable leading coefficient. 2026-02-11T17:12:34Z Manuel Kauers Jakob Moosbauer Isaac Wood http://arxiv.org/abs/2511.09943v2 SeQuant Framework for Symbolic and Numerical Tensor Algebra. I. Core Capabilities 2026-02-19T13:13:51Z SeQuant is an open-source library for symbolic algebra of tensors over commutative (scalar) and non-commutative (operator) rings. The key innovation supporting most of its functionality is a graph-theoretic tensor network (TN) canonicalizer that can handle tensor networks with symmetries faster than their standard group-theoretic counterparts. The TN canonicalizer is used for routine simplification of conventional tensor expressions, for optimizing application of Wick's theorem (used to canonicalize products of tensors over operator fields), and for manipulation of the intermediate representation leading to the numerical evaluation. Notable features of SeQuant include support for noncovariant tensor networks (which often arise from tensor decompositions) and for tensors with modes that depend parametrically on indices of other tensor modes (such dependencies between degrees of freedom are naturally viewed as nesting of tensors, "tensors of tensors" arising in block-wise data compressions in data science and modern quantum simulation). SeQuant blurs the line between pure symbolic manipulation/code generation and numerical evaluation by including compiler-like components to optimize and directly interpret tensor expressions using external numerical tensor algebra frameworks. The SeQuant source code is available at https://github.com/ValeevGroup/SeQuant. 2025-11-13T04:17:05Z Bimal Gaudel Robert G. Adam Ajay Melekamburath Conner Masteran Nakul Teke Azam Besharatnik Andreas Köhn Edward F. Valeev http://arxiv.org/abs/2602.15603v1 Symbolic recovery of PDEs from measurement data 2026-02-17T14:20:36Z Models based on partial differential equations (PDEs) are powerful for describing a wide range of complex relationships in the natural sciences. Accurately identifying the PDE model, which represents the underlying physical law, is essential for a proper understanding of the problem. This reconstruction typically relies on indirect and noisy measurements of the system's state and, without specifically tailored methods, rarely yields symbolic expressions, thereby hindering interpretability. In this work, we address this issue by considering existing neural network architectures based on rational functions for the symbolic representation of physical laws. These networks leverage the approximation power of rational functions while also benefiting from their flexibility in representing arithmetic operations. Our main contribution is an identifiability result, showing that, in the limit of noiseless, complete measurements, such symbolic networks can uniquely reconstruct the simplest physical law within the PDE model. Specifically, reconstructed laws remain expressible within the symbolic network architecture, with regularization-minimizing parameterizations promoting interpretability and sparsity in case of $L^1$-regularization. In addition, we provide regularity results for symbolic networks. Empirical validation using the ParFam architecture supports these theoretical findings, providing evidence for the practical reconstructibility of physical laws. 2026-02-17T14:20:36Z Erion Morina Philipp Scholl Martin Holler http://arxiv.org/abs/2602.15539v1 Dynamic Training-Free Fusion of Subject and Style LoRAs 2026-02-17T12:42:30Z Recent studies have explored the combination of multiple LoRAs to simultaneously generate user-specified subjects and styles. However, most existing approaches fuse LoRA weights using static statistical heuristics that deviate from LoRA's original purpose of learning adaptive feature adjustments and ignore the randomness of sampled inputs. To address this, we propose a dynamic training-free fusion framework that operates throughout the generation process. During the forward pass, at each LoRA-applied layer, we dynamically compute the KL divergence between the base model's original features and those produced by subject and style LoRAs, respectively, and adaptively select the most appropriate weights for fusion. In the reverse denoising stage, we further refine the generation trajectory by dynamically applying gradient-based corrections derived from objective metrics such as CLIP and DINO scores, providing continuous semantic and stylistic guidance. By integrating these two complementary mechanisms-feature-level selection and metric-guided latent adjustment-across the entire diffusion timeline, our method dynamically achieves coherent subject-style synthesis without any retraining. Extensive experiments across diverse subject-style combinations demonstrate that our approach consistently outperforms state-of-the-art LoRA fusion methods both qualitatively and quantitatively. 2026-02-17T12:42:30Z Qinglong Cao Yuntian Chen Chao Ma Xiaokang Yang http://arxiv.org/abs/2602.07672v2 Debugging code world models 2026-02-14T20:46:17Z Code World Models (CWMs) are language models trained to simulate program execution by predicting explicit runtime state after every executed command. This execution-based world modeling enables internal verification within the model, offering an alternative to natural language chain-of-thought reasoning. However, the sources of errors and the nature of CWMs' limitations remain poorly understood. We study CWMs from two complementary perspectives: local semantic execution and long-horizon state tracking. On real-code benchmarks, we identify two dominant failure regimes. First, dense runtime state reveals produce token-intensive execution traces, leading to token-budget exhaustion on programs with long execution histories. Second, failures disproportionately concentrate in string-valued state, which we attribute to limitations of subword tokenization rather than program structure. To study long-horizon behavior, we use a controlled permutation-tracking benchmark that isolates state propagation under action execution. We show that long-horizon degradation is driven primarily by incorrect action generation: when actions are replaced with ground-truth commands, a Transformer-based CWM propagates state accurately over long horizons, despite known limitations of Transformers in long-horizon state tracking. These findings suggest directions for more efficient supervision and state representations in CWMs that are better aligned with program execution and data types. 2026-02-07T19:32:15Z 8 pages, 4 figures, under review in conference Babak Rahmani http://arxiv.org/abs/2602.05100v2 Rule-Based Spatial Mixture-of-Experts U-Net for Explainable Edge Detection 2026-02-13T17:56:58Z Deep learning models like U-Net and its variants, have established state-of-the-art performance in edge detection tasks and are used by Generative AI services world-wide for their image generation models. However, their decision-making processes remain opaque, operating as "black boxes" that obscure the rationale behind specific boundary predictions. This lack of transparency is a critical barrier in safety-critical applications where verification is mandatory. To bridge the gap between high-performance deep learning and interpretable logic, we propose the Rule-Based Spatial Mixture-of-Experts U-Net (sMoE U-Net). Our architecture introduces two key innovations: (1) Spatially-Adaptive Mixture-of-Experts (sMoE) blocks integrated into the decoder skip connections, which dynamically gate between "Context" (smooth) and "Boundary" (sharp) experts based on local feature statistics; and (2) a Takagi-Sugeno-Kang (TSK) Fuzzy Head that replaces the standard classification layer. This fuzzy head fuses deep semantic features with heuristic edge signals using explicit IF-THEN rules. We evaluate our method on the BSDS500 benchmark, achieving an Optimal Dataset Scale (ODS) F-score of 0.7628, effectively matching purely deep baselines like HED (0.7688) while outperforming the standard U-Net (0.7437). Crucially, our model provides pixel-level explainability through "Rule Firing Maps" and "Strategy Maps," allowing users to visualize whether an edge was detected due to strong gradients, high semantic confidence, or specific logical rule combinations. 2026-02-04T22:33:18Z Bharadwaj Dogga Kaaustaaub Shankar Gibin Raju Wilhelm Louw Kelly Cohen http://arxiv.org/abs/2511.05849v2 EGG-SR: Embedding Symbolic Equivalence into Symbolic Regression via Equality Graph 2026-02-12T18:38:11Z Symbolic regression seeks to uncover physical laws from experimental data by searching for closed-form expressions, which is an important task in AI-driven scientific discovery. Yet the exponential growth of the search space of expression renders the task computationally challenging. A promising yet underexplored direction for reducing the search space and accelerating training lies in *symbolic equivalence*: many expressions, although syntactically different, define the same function -- for example, $\log(x_1^2x_2^3)$, $\log(x_1^2)+\log(x_2^3)$, and $2\log(x_1)+3\log(x_2)$. Existing algorithms treat such variants as distinct outputs, leading to redundant exploration and slow learning. We introduce EGG-SR, a unified framework that integrates symbolic equivalence into a class of modern symbolic regression methods, including Monte Carlo Tree Search (MCTS), Deep Reinforcement Learning (DRL), and Large Language Models (LLMs). EGG-SR compactly represents equivalent expressions through the proposed EGG module (via equality graphs), accelerating learning by: (1) pruning redundant subtree exploration in EGG-MCTS, (2) aggregating rewards across equivalent generated sequences in EGG-DRL, and (3) enriching feedback prompts in EGG-LLM. Theoretically, we show the benefit of embedding EGG into learning: it tightens the regret bound of MCTS and reduces the variance of the DRL gradient estimator. Empirically, EGG-SR consistently enhances a class of symbolic regression models across several benchmarks, discovering more accurate expressions within the same time limit. Project page is at: https://nan-jiang-group.github.io/egg-sr. 2025-11-08T04:39:11Z Camera-ready version accepted for ICLR 2026 Nan Jiang Ziyi Wang Yexiang Xue http://arxiv.org/abs/2510.23411v2 Border Bases in the Rational Weyl Algebra 2026-02-12T14:35:50Z Border bases are a generalization of Gröbner bases for zero-dimensional ideals in polynomial rings. In this article, we introduce border bases for a non-commutative ring of linear differential operators, namely the rational Weyl algebra. We elaborate on their properties and present algorithms to compute with them. We apply this theory to represent integrable connections as cyclic $D$-modules explicitly. As an application, we visit differential equations behind a string, a Feynman as well as a cosmological integral. We also address the classification of particular $D$-ideals of a fixed holonomic rank, namely the case of linear PDEs with constant coefficients as well as Frobenius ideals. Our approach rests on the theory of Hilbert schemes of points in affine space. 2025-10-27T15:15:16Z 30 pages, comments welcome Carlos Rodriguez Anna-Laura Sattelberger http://arxiv.org/abs/2603.00073v1 A Separation Method of the Positivity of A Quartic Polynomial 2026-02-12T04:48:27Z Although the positivity of a quartic polynomial is a well-researched topic, existing conditions are often highly complex. Some necessary and sufficient conditions for the positivity of a quartic polynomial are presented through a separation method based on Ferrari's technique of solving a quartic equation. We apply the result to the problem of the projection of the coefficient space. 2026-02-12T04:48:27Z Taehun Kim Jung Chan Lee ByoungSeon Choi http://arxiv.org/abs/2603.19241v1 Engineering-Oriented Symbolic Regression: LLMs as Physics Agents for Discovery of Simulation-Ready Constitutive Laws 2026-02-12T00:38:47Z The discovery of constitutive laws for complex materials has historically faced a dichotomy between high-fidelity data-driven approaches, which demand prohibitive full-field experimental data, and traditional engineering fitting, which often yields numerically unstable models outside calibration regimes. In this work, we propose an Engineering-Oriented Symbolic Regression (EO-SR) framework that bridges this gap by leveraging Large Language Models (LLMs) as "Physics-Informed Agents." Unlike unconstrained symbolic regression, our framework utilizes an LLM Agent to zero-shot synthesize executable physical constraints -- specifically thermodynamic consistency and frame indifference -- transforming the search process from mathematical curve-fitting into a physics-governed discovery engine. We validate this approach on the hyperelastic modeling of rubber-like materials using standard Treloar datasets. The framework autonomously identifies a novel hybrid constitutive law that combines a Mooney-Rivlin linear base with a rational locking term. This discovered model not only achieves high predictive accuracy across multi-axial deformation modes (including zero-shot prediction of pure shear) but also guarantees unconditional convexity. Finite element validation demonstrates that while industry-standard models (e.g., Ogden N=3) fail due to numerical singularities under severe transverse compression, the EO-SR-discovered model maintains robust convergence. This study establishes a generalized, low-barrier pathway for discovering simulation-ready constitutive closures that satisfy both data accuracy and rigorous physical laws. 2026-02-12T00:38:47Z Yue Wu Tianhao Su Mingchuan Zhao Shunbo Hu Deng Pan http://arxiv.org/abs/2602.10632v1 The Neurosymbolic Frontier of Nonuniform Ellipticity: Formalizing Sharp Schauder Theory via Topos-Theoretic Reasoning Models 2026-02-11T08:24:57Z This white paper presents a critical synthesis of the recent breakthrough in nonuniformly elliptic regularity theory and the burgeoning field of neurosymbolic large reasoning models (LRMs). We explore the resolution of the long-standing sharp growth rate conjecture in Schauder theory, achieved by Cristiana De Filippis and Giuseppe Mingione, which identifies the exact threshold $q/p < 1 + α/n$ for gradient Hölder continuity. Central to this mathematical achievement is the ``ghost equation'' methodology, a sophisticated auxiliary derivation that bypasses the non-differentiability of classical Euler-Lagrange systems. We propose that the next era of mathematical discovery lies in the integration of these pure analytical constructs with LRMs grounded in topos theory and formal verification frameworks such as Safe and Typed Chain-of-Thought (PC-CoT). By modeling the reasoning process as a categorical colimit in a slice topos, we demonstrate how LRMs can autonomously navigate the ``Dark Side'' of the calculus of variations, providing machine-checkable proofs for regularity bounds in complex, multi-phase physical systems. 2026-02-11T08:24:57Z Suyash Mishra http://arxiv.org/abs/2602.09702v1 On semidefinite-representable sets over valued fields 2026-02-10T12:01:20Z Polyhedra and spectrahedra over the real numbers, or more generally their images under linear maps, are respectively the feasible sets of linear and semidefinite programming, and form the family of semidefinite-representable sets. This paper studies analogues of these sets, as well as the associated optimization problems, when the data are taken over a valued field $K$. For $K$-polyhedra and linear programming over $K$ we present an algorithm based on the computation of Smith normal forms. We prove that fundamental properties of semidefinite-representable sets extend to the valued setting. In particular, we exhibit examples of non-polyhedral $K$-spectrahedra, as well as sets that are semidefinite-representable over $K$ but are not $K$-spectrahedra. 2026-02-10T12:01:20Z 9 pages, 1 figure Corentin Cornou Simone Naldi Tristan Vaccon http://arxiv.org/abs/2602.09577v1 Computational Explorations on Semifields 2026-02-10T09:26:50Z A finite semifield is a division algebra over a finite field where multiplication is not necessarily associative. We consider here the complexity of the multiplication in small semifields and finite field extensions. For this operation, the number of required base field multiplications is the tensor rank, or the multiplicative complexity. The other base field operations are additions and scalings by constants, which together we refer to as the additive complexity. When used recursively, the tensor rank determines the exponent while the other operations determine the constant of the associated asymptotic complexity bounds. For small extensions, both measures are of similar importance. In this paper, we establish the tensor rank of some semifields and finite fields of characteristics 2 and 3. We also propose new upper and lower bounds on their additive complexity, and give new associated algorithms improving on the state-of-the-art in terms of overall complexity. We achieve this by considering short straight line programs for encoding linear codes with given parameters. 2026-02-10T09:26:50Z Jean-Guillaume Dumas UGA, LJK, CASC Stefano Lia UCD John Sheekey UCD http://arxiv.org/abs/2204.05138v3 Artificial Intelligence Software Structured to Simulate Human Working Memory, Mental Imagery, and Mental Continuity 2026-02-09T23:30:19Z This article presents an artificial intelligence (AI) architecture intended to simulate the iterative updating of the human working memory system. It features several interconnected neural networks designed to emulate the specialized modules of the cerebral cortex. These are structured hierarchically and integrated into a global workspace. They are capable of temporarily maintaining high-level representational patterns akin to the psychological items maintained in working memory. This maintenance is made possible by persistent neural activity in the form of two modalities: sustained neural firing (resulting in a focus of attention) and synaptic potentiation (resulting in a short-term store). Representations held in persistent activity are recursively replaced resulting in incremental changes to the content of the working memory system. As this content gradually evolves, successive processing states overlap and are continuous with one another. The present article will explore how this architecture can lead to iterative shift in the distribution of coactive representations, ultimately leading to mental continuity between processing states, and thus to human-like thought and cognition. Taken together, these components outline a biologically motivated route toward synthetic consciousness or artificial sentience and subjectivity. 2022-03-29T22:23:36Z Jared Edward Reser