http://arxiv.org/abs/2406.08480v3
Linear equations with monomial constraints and decision problems in abelian-by-cyclic groups
2024-09-06T14:47:14Z
We show that it is undecidable whether a system of linear equations over the Laurent polynomial ring $\mathbb{Z}[X^{\pm}]$ admits solutions where a specified subset of variables take values in the set of monomials $\{X^z \mid z \in \mathbb{Z}\}$. In particular, we construct a finitely presented $\mathbb{Z}[X^{\pm}]$-module, where it is undecidable whether a linear equation $X^{z_1} \boldsymbol{f}_1 + \cdots + X^{z_n} \boldsymbol{f}_n = \boldsymbol{f}_0$ has solutions $z_1, \ldots, z_n \in \mathbb{Z}$. This contrasts with the decidability of the case $n = 1$, which can be deduced from Noskov's Lemma.
We apply this result to settle a number of problems in computational group theory. We show that it is undecidable whether a system of equations has solutions in the wreath product $\mathbb{Z} \wr \mathbb{Z}$, providing a negative answer to an open problem of Kharlampovich, López and Miasnikov (2020). We show that there exists a finitely generated abelian-by-cyclic group in which the problem of solving a single quadratic equation is undecidable. We also construct a finitely generated abelian-by-cyclic group, different to that of Mishchenko and Treier (2017), in which the Knapsack Problem is undecidable. In contrast, we show that the problem of Coset Intersection is decidable in all finitely generated abelian-by-cyclic groups.
2024-06-12T17:59:14Z
Corrected an error in Lemma 6.8. Supersedes arXiv:2309.08811
Ruiwen Dong

http://arxiv.org/abs/2401.00408v2
Computing greatest common divisor of several parametric univariate polynomials via generalized subresultant polynomials
2024-09-06T06:46:15Z
In this paper, we tackle the following problem: compute the gcd for several univariate polynomials with parametric coefficients. It amounts to partitioning the parameter space into ``cells'' so that the gcd has a uniform expression over each cell and constructing a uniform expression of gcd in each cell. We tackle the problem as follows. We begin by making a natural and obvious extension of subresultant polynomials of two polynomials to several polynomials. Then we develop the following structural theories about them.
1. We generalize Sylvester's theory to several polynomials, in order to obtain an elegant relationship between generalized subresultant polynomials and the gcd of several polynomials, yielding an elegant algorithm.
2. We generalize Habicht's theory to several polynomials, in order to obtain a systematic relationship between generalized subresultant polynomials and pseudo-remainders, yielding an efficient algorithm.
Using the generalized theories, we present a simple (structurally elegant) algorithm which is significantly more efficient (both in the output size and computing time) than algorithms based on previous approaches.
2023-12-31T06:32:54Z
Hoon Hong, Jing Yang

http://arxiv.org/abs/2408.17334v2
Role of Data-driven Regional Growth Model in Shaping Brain Folding Patterns
2024-09-04T22:30:36Z
The surface morphology of the developing mammalian brain is crucial for understanding brain function and dysfunction. Computational modeling offers valuable insights into the underlying mechanisms for early brain folding. Recent findings indicate significant regional variations in brain tissue growth, while the role of these variations in cortical development remains unclear. In this study, we explored for the first time how regional cortical growth affects brain folding patterns using computational simulation. We first developed growth models for typical cortical regions using machine learning (ML)-assisted symbolic regression, based on longitudinal real surface expansion and cortical thickness data from prenatal and infant brains derived from over 1,000 MRI scans of 735 pediatric subjects with ages ranging from 29 post-menstrual weeks to 24 months. These models were subsequently integrated into computational software to simulate cortical development with anatomically realistic geometric models. We comprehensively quantified the resulting folding patterns using multiple metrics such as mean curvature, sulcal depth, and gyrification index. Our results demonstrate that regional growth models generate complex brain folding patterns that more closely match actual brain structures, both quantitatively and qualitatively, compared to conventional uniform growth models. Growth magnitude plays a dominant role in shaping folding patterns, while growth trajectory has a minor influence. Moreover, multi-region models better capture the intricacies of brain folding than single-region models.
Our results underscore the necessity and importance of incorporating regional growth heterogeneity into brain folding simulations, which could enhance early diagnosis and treatment of cortical malformations and neurodevelopmental disorders such as cerebral palsy and autism.
2024-08-30T14:49:10Z
43 pages, 16 figures
Jixin Hou, Zhengwang Wu, Xianyan Chen, Li Wang, Dajiang Zhu, Tianming Liu, Gang Li, Xianqiao Wang

http://arxiv.org/abs/2406.19121v3
Towards Learning Abductive Reasoning using VSA Distributed Representations
2024-08-30T06:17:46Z
We introduce the Abductive Rule Learner with Context-awareness (ARLC), a model that solves abstract reasoning tasks based on Learn-VRF. ARLC features a novel and more broadly applicable training objective for abductive reasoning, resulting in better interpretability and higher accuracy when solving Raven's progressive matrices (RPM). ARLC allows both programming domain knowledge and learning the rules underlying a data distribution. We evaluate ARLC on the I-RAVEN dataset, showcasing state-of-the-art accuracy across both in-distribution and out-of-distribution (unseen attribute-rule pairs) tests. ARLC surpasses neuro-symbolic and connectionist baselines, including large language models, despite having orders of magnitude fewer parameters. We show ARLC's robustness to post-programming training by incrementally learning from examples on top of programmed knowledge, which only improves its performance and does not result in catastrophic forgetting of the programmed solution. We validate ARLC's seamless transfer learning from a 2x2 RPM constellation to unseen constellations.
Our code is available at https://github.com/IBM/abductive-rule-learner-with-context-awareness.
2024-06-27T12:05:55Z
Accepted at the 18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy) 2024 [Spotlight]
Giacomo Camposampiero, Michael Hersche, Aleksandar Terzić, Roger Wattenhofer, Abu Sebastian, Abbas Rahimi

http://arxiv.org/abs/2408.15917v1
Comprehensive Systems for Primary Decompositions of Parametric Ideals
2024-08-28T16:31:26Z
We present an effective method for computing parametric primary decomposition via comprehensive Gröbner systems. In general, it is very difficult to compute a parametric primary decomposition of a given ideal in the polynomial ring with rational coefficients $\mathbb{Q}[A,X]$ where $A$ is the set of parameters and $X$ is the set of ordinary variables. One cause of the difficulty is related to the irreducibility of the specialized polynomial. Thus, we introduce a new notion of ``feasibility'' on the stability of the structure of the ideal in terms of its primary decomposition, and we give a new algorithm for computing a so-called comprehensive system consisting of pairs $(C, \mathcal{Q})$, where for each parameter value in $C$, the ideal has the stable decomposition $\mathcal{Q}$. We may call this comprehensive system a parametric primary decomposition of the ideal. One can also compute a dense set $\mathcal{O}$ such that $\varphi_\alpha(\mathcal{Q})$ is a primary decomposition for any $\alpha\in C\cap \mathcal{O}$ via irreducible polynomials. In addition, we give several computational examples to examine the effectiveness of our new decomposition.
2024-08-28T16:31:26Z
Yuki Ishihara, Kazuhiro Yokoyama

http://arxiv.org/abs/2405.07488v2
Predictive Modeling of Flexible EHD Pumps using Kolmogorov-Arnold Networks
2024-08-27T08:44:20Z
We present a novel approach to predicting the pressure and flow rate of flexible electrohydrodynamic (EHD) pumps using the Kolmogorov-Arnold Network (KAN).
Inspired by the Kolmogorov-Arnold representation theorem, KAN replaces fixed activation functions with learnable spline-based activation functions, enabling it to approximate complex nonlinear functions more effectively than traditional models like Multi-Layer Perceptron (MLP) and Random Forest (RF). We evaluated KAN on a dataset of flexible EHD pump parameters and compared its performance against RF and MLP models. KAN achieved superior predictive accuracy, with Mean Squared Errors of 12.186 and 0.001 for pressure and flow rate predictions, respectively. The symbolic formulas extracted from KAN provided insights into the nonlinear relationships between input parameters and pump performance. These findings demonstrate that KAN offers exceptional accuracy and interpretability, making it a promising alternative for predictive modeling in electrohydrodynamic pumping.
2024-05-13T06:04:26Z
Biomimetic Intelligence and Robotics, Volume 4, Issue 4, 2024, 100184
Yanhong Peng, Yuxin Wang, Fangchao Hu, Miao He, Zebing Mao, Xia Huang, Jun Ding
10.1016/j.birob.2024.100184

http://arxiv.org/abs/2310.05466v2
Geometry of the signed support of a multivariate polynomial and Descartes' rule of signs
2024-08-27T07:52:48Z
We investigate the signed support, that is, the set of the exponent vectors and the signs of the coefficients, of a multivariate polynomial $f$. We describe conditions on the signed support ensuring that the semi-algebraic set, denoted as $\{ f < 0 \}$, containing points in the positive real orthant where $f$ takes negative values, has at most one connected component. These results generalize Descartes' rule of signs in the sense that they provide a bound which is independent of the values of the coefficients and the degree of the polynomial. Based on how the exponent vectors lie on the faces of the Newton polytope, we give a recursive algorithm that verifies a sufficient condition for the set $\{ f < 0 \}$ to have one connected component.
We apply the algorithm to reaction networks in order to prove that the parameter region of multistationarity of a ubiquitous network comprising phosphorylation cycles is connected.
2023-10-09T07:22:21Z
Final version to appear in SIAM Journal on Applied Algebra and Geometry
Máté L. Telek

http://arxiv.org/abs/2407.06889v3
A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM
2024-08-27T00:03:31Z
Autonomous robots, autonomous vehicles, and humans wearing mixed-reality headsets require accurate and reliable tracking services for safety-critical applications in dynamically changing real-world environments. However, the existing tracking approaches, such as Simultaneous Localization and Mapping (SLAM), do not adapt well to environmental changes and boundary conditions despite extensive manual tuning. On the other hand, while deep learning-based approaches can better adapt to environmental changes, they typically demand substantial data for training and often lack flexibility in adapting to new domains. To solve this problem, we propose leveraging the neurosymbolic program synthesis approach to construct adaptable SLAM pipelines that integrate the domain knowledge from traditional SLAM approaches while leveraging data to learn complex relationships. While the approach can synthesize end-to-end SLAM pipelines, we focus on synthesizing the feature extraction module. We first devise a domain-specific language (DSL) that can encapsulate domain knowledge on the important attributes for feature extraction and the real-world performance of various feature extractors. Our neurosymbolic architecture then undertakes adaptive feature extraction, optimizing parameters via learning while employing symbolic reasoning to select the most suitable feature extractor. Our evaluations demonstrate that our approach, neurosymbolic Feature EXtraction (nFEX), yields higher-quality features.
It also reduces the pose error observed for the state-of-the-art baseline feature extractors ORB and SIFT by up to 90% and up to 66%, respectively, thereby enhancing the system's efficiency and adaptability to novel environments.
2024-07-09T14:18:35Z
8 pages, 6 figures, and 5 tables. Published at the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Corresponding author: Yasra Chandio (ychandio@umass.edu)
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Yasra Chandio, Momin A. Khan, Khotso Selialia, Luis Garcia, Joseph DeGol, Fatima M. Anwar

http://arxiv.org/abs/2301.02512v3
D-Algebraic Functions
2024-08-26T14:05:55Z
Differentially-algebraic (D-algebraic) functions are solutions of polynomial equations in the function, its derivatives, and the independent variables. We revisit closure properties of these functions by providing constructive proofs. We present algorithms to compute algebraic differential equations for compositions and arithmetic manipulations of univariate D-algebraic functions and derive bounds for the order of the resulting differential equations. We apply our methods to examples in the sciences.
2023-01-06T13:55:45Z
32 pages, comments welcome
Rida Ait El Manssour, Anna-Laura Sattelberger, Bertrand Teguia Tabuguia

http://arxiv.org/abs/2304.05061v2
Algebraic solutions of linear differential equations: an arithmetic approach
2024-08-22T22:23:02Z
Given a linear differential equation with coefficients in $\mathbb{Q}(x)$, an important question is to know whether its full space of solutions consists of algebraic functions, or at least if one of its specific solutions is algebraic. After presenting motivating examples coming from various branches of mathematics, we advertise in an elementary way a beautiful local-global arithmetic approach to these questions, initiated by Grothendieck in the late sixties.
This approach has deep ramifications and leads to the still unsolved Grothendieck-Katz $p$-curvature conjecture.
2023-04-11T08:45:17Z
52 pages, 2 figures, to appear in the Bulletin of the AMS
Alin Bostan, Xavier Caruso, Julien Roques

http://arxiv.org/abs/2402.00854v4
SymbolicAI: A framework for logic-based approaches combining generative models and solvers
2024-08-21T22:07:31Z
We introduce SymbolicAI, a versatile and modular framework employing a logic-based approach to concept learning and flow management in generative processes. SymbolicAI enables the seamless integration of generative models with a diverse range of solvers by treating large language models (LLMs) as semantic parsers that execute tasks based on both natural and formal language instructions, thus bridging the gap between symbolic reasoning and generative AI. We leverage probabilistic programming principles to tackle complex tasks, and utilize differentiable and classical programming paradigms with their respective strengths. The framework introduces a set of polymorphic, compositional, and self-referential operations for multi-modal data that connects multi-step generative processes and aligns their outputs with user objectives in complex workflows. As a result, we can transition between the capabilities of various foundation models with in-context learning capabilities and specialized, fine-tuned models or solvers proficient in addressing specific problems. Through these operations based on in-context learning our framework enables the creation and evaluation of explainable computational graphs. Finally, we introduce a quality measure and its empirical score for evaluating these computational graphs, and propose a benchmark that compares various state-of-the-art LLMs across a set of complex workflows. We refer to the empirical score as the "Vector Embedding for Relational Trajectory Evaluation through Cross-similarity", or VERTEX score for short.
The framework codebase and benchmark are linked below.
2024-02-01T18:50:50Z
46 pages, 13 figures, external resources: framework is available at https://github.com/ExtensityAI/symbolicai and benchmark at https://github.com/ExtensityAI/benchmark
Marius-Constantin Dinu, Claudiu Leoveanu-Condrei, Markus Holzleitner, Werner Zellinger, Sepp Hochreiter

http://arxiv.org/abs/2402.07345v2
Computing Krylov iterates in the time of matrix multiplication
2024-08-20T11:43:32Z
Krylov methods rely on iterated matrix-vector products $A^k u_j$ for an $n\times n$ matrix $A$ and vectors $u_1,\ldots,u_m$. The space spanned by all iterates $A^k u_j$ admits a particular basis -- the \emph{maximal Krylov basis} -- which consists of iterates of the first vector $u_1, Au_1, A^2u_1,\ldots$, until reaching linear dependency, then iterating similarly the subsequent vectors until a basis is obtained. Finding minimal polynomials and Frobenius normal forms is closely related to computing maximal Krylov bases. The fastest way to produce these bases was, until this paper, Keller-Gehrig's 1985 algorithm whose complexity bound $O(n^\omega\log(n))$ comes from repeated squarings of $A$ and logarithmically many Gaussian eliminations. Here $\omega>2$ is a feasible exponent for matrix multiplication over the base field. We present an algorithm computing the maximal Krylov basis in $O(n^\omega\log\log(n))$ field operations when $m \in O(n)$, and even $O(n^\omega)$ as soon as $m\in O(n/\log(n)^c)$ for some fixed real $c>0$. As a consequence, we show that the Frobenius normal form together with a transformation matrix can be computed deterministically in $O(n^\omega(\log\log(n))^2)$, and therefore matrix exponentiation~$A^k$ can be performed in the latter complexity if $\log(k) \in O(n^{\omega-1-\varepsilon})$ for some fixed $\varepsilon>0$.
A key idea for these improvements is to rely on fast algorithms for $m\times m$ polynomial matrices of average degree $n/m$, involving high-order lifting and minimal kernel bases.
2024-02-12T00:23:41Z
10 pages, 5 algorithms
Vincent Neiger, Clément Pernet, Gilles Villard
10.1145/3666000.3669715

http://arxiv.org/abs/2408.09176v1
Cognitive LLMs: Towards Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making
2024-08-17T11:49:53Z
Resolving the dichotomy between the human-like yet constrained reasoning processes of Cognitive Architectures and the broad but often noisy inference behavior of Large Language Models (LLMs) remains a challenging but exciting pursuit, for enabling reliable machine reasoning capabilities in production systems. Because Cognitive Architectures are famously developed for the purpose of modeling the internal mechanisms of human cognitive decision-making at a computational level, new investigations consider the goal of informing LLMs with the knowledge necessary for replicating such processes, e.g., guided perception, memory, goal-setting, and action. Previous approaches that use LLMs for grounded decision-making struggle with complex reasoning tasks that require slower, deliberate cognition over fast and intuitive inference -- reporting issues related to the lack of sufficient grounding, as in hallucination. To resolve these challenges, we introduce LLM-ACTR, a novel neuro-symbolic architecture that provides human-aligned and versatile decision-making by integrating the ACT-R Cognitive Architecture with LLMs. Our framework extracts and embeds knowledge of ACT-R's internal decision-making process as latent neural representations, injects this information into trainable LLM adapter layers, and fine-tunes the LLMs for downstream prediction.
Our experiments on novel Design for Manufacturing tasks show both improved task performance and improved grounded decision-making capability of our approach, compared to LLM-only baselines that leverage chain-of-thought reasoning strategies.
2024-08-17T11:49:53Z
20 pages, 8 figures, 2 tables
Siyu Wu, Alessandro Oltramari, Jonathan Francis, C. Lee Giles, Frank E. Ritter

http://arxiv.org/abs/2408.07840v1
ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction Based on Large Language Model
2024-08-14T22:28:19Z
In the realm of event prediction, temporal knowledge graph forecasting (TKGF) stands as a pivotal technique. Previous approaches face the challenges of not utilizing experience during testing and relying on a single short-term history, which limits adaptation to evolving data. In this paper, we introduce the Online Neural-Symbolic Event Prediction (ONSEP) framework, which innovates by integrating dynamic causal rule mining (DCRM) and dual history augmented generation (DHAG). DCRM dynamically constructs causal rules from real-time data, allowing for swift adaptation to new causal relationships. In parallel, DHAG merges short-term and long-term historical contexts, leveraging a bi-branch approach to enrich event prediction. Our framework demonstrates notable performance enhancements across diverse datasets, with significant Hit@k (k=1,3,10) improvements, showcasing its ability to augment large language models (LLMs) for event prediction without necessitating extensive retraining.
The ONSEP framework not only advances the field of TKGF but also underscores the potential of neural-symbolic approaches in adapting to dynamic data environments.
2024-08-14T22:28:19Z
16 pages, ACL 2024 Findings
Xuanqing Yu, Wangtao Sun, Jingwei Li, Kang Liu, Chengbao Liu, Jie Tan

http://arxiv.org/abs/2408.07815v1
Algebraic Representations for Faster Predictions in Convolutional Neural Networks
2024-08-14T21:10:05Z
Convolutional neural networks (CNNs) are a popular choice of model for tasks in computer vision. When CNNs are made with many layers, resulting in a deep neural network, skip connections may be added to create an easier gradient optimization problem while retaining model expressiveness. In this paper, we show that arbitrarily complex, trained, linear CNNs with skip connections can be simplified into a single-layer model, resulting in greatly reduced computational requirements during prediction time. We also present a method for training nonlinear models with skip connections that are gradually removed throughout training, giving the benefits of skip connections without requiring computational overhead during prediction time. These results are demonstrated with practical examples on Residual Networks (ResNet) architecture.
2024-08-14T21:10:05Z
Accepted for publication in the proceedings of the 27th International Workshop on Computer Algebra in Scientific Computing (CASC 2024)
Johnny Joyce, Jan Verschelde
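
The single-layer simplification claimed in the last abstract above (arXiv:2408.07815) can be illustrated with a minimal sketch. It is not the authors' method or code: it assumes a one-dimensional, single-channel signal, odd-length kernels, and circular (periodic) padding so that the algebra is exact at the boundaries, and the helper names `circ_conv`, `residual_block`, `compose`, and `collapse` are hypothetical. Under those assumptions, a linear block with a skip connection, x + conv(x, k), is itself a convolution whose kernel is k with 1 added at its centre, and stacking such blocks corresponds to convolving their kernels together.

```python
def circ_conv(x, k):
    """Circular 1D convolution; kernel k has odd length and is centred at len(k) // 2."""
    n, c = len(x), len(k) // 2
    return [sum(k[j] * x[(i - (j - c)) % n] for j in range(len(k)))
            for i in range(n)]


def residual_block(x, k):
    """Linear layer with a skip connection: x + conv(x, k)."""
    return [xi + yi for xi, yi in zip(x, circ_conv(x, k))]


def compose(a, b):
    """Kernel of conv(conv(x, a), b): the full convolution of the two kernels."""
    out = [0] * (len(a) + len(b) - 1)
    for p, ap in enumerate(a):
        for q, bq in enumerate(b):
            out[p + q] += ap * bq
    return out


def collapse(kernels):
    """Fold a stack of linear residual blocks into one equivalent kernel."""
    total = [1]  # identity kernel (unit impulse)
    for k in kernels:
        with_skip = list(k)
        with_skip[len(k) // 2] += 1  # the skip connection adds a unit impulse at the centre
        total = compose(total, with_skip)
    return total


# Illustrative integer data (made up for this sketch), so the comparison is exact.
x = [3, -1, 4, 1, -5, 9, 2, -6, 5, 3]
kernels = [[1, -2, 1], [0, 2, -1], [2, 0, 1]]

# Deep model: three linear residual blocks applied in sequence.
deep = x
for k in kernels:
    deep = residual_block(deep, k)

# Collapsed model: one convolution with the precomputed kernel.
single = circ_conv(x, collapse(kernels))
assert deep == single
```

The collapsed kernel is computed once, so prediction costs one convolution regardless of depth. With zero padding instead of circular padding the two results would generally differ near the boundaries, and the sketch does not cover nonlinear activations.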