https://arxiv.org/api/lkOUVRVMlkPYOLr4dMx9eo67hsQ2026-06-15T00:09:04Z313852515http://arxiv.org/abs/2412.07939v2A Monadic Calculus with Episodic Flows2025-01-21T23:20:19ZWe define computational atoms named "actions" equipped primarily with three operations: reduction, collection, and inspection. We show how actions can be used for decision-making algorithms from simple axioms. We describe the encodings of typical data structures as actions, and provide a method of analysis for algorithms on the basis of data mutation.2024-12-10T21:51:52Z18 pages, 2 figuresSotirios Henninghttp://arxiv.org/abs/2411.16348v2Extracting Linear Relations from Gröbner Bases for Formal Verification of And-Inverter Graphs2025-01-20T12:07:57ZFormal verification techniques based on computer algebra have proven highly effective for circuit verification. The circuit, given as an and-inverter graph, is encoded as a set of polynomials that automatically generates a Gröbner basis with respect to a lexicographic term ordering. Correctness of the circuit can be derived by computing the polynomial remainder of the specification. However, the main obstacle is the monomial blow-up during the rewriting of the specification, which leads to the development of dedicated heuristics to overcome this issue. In this paper, we investigate an orthogonal approach and focus the computational effort on rewriting the Gröbner basis itself. Our goal is to ensure the basis contains linear polynomials that can be effectively used to rewrite the linearized specification. We first prove the soundness and completeness of this technique and then demonstrate its practical application. Our implementation of this method shows promising results on benchmarks related to multiplier verification.2024-11-25T12:55:49ZAccepted at 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS) 2025Daniela KaufmannJérémy Berthomieuhttp://arxiv.org/abs/2501.09729v1Generating particle physics Lagrangians with transformers2025-01-16T18:25:50ZIn physics, Lagrangians provide a systematic way to describe laws governing physical systems. In the context of particle physics, they encode the interactions and behavior of the fundamental building blocks of our universe. By treating Lagrangians as complex, rule-based constructs similar to linguistic expressions, we trained a transformer model -- proven to be effective in natural language tasks -- to predict the Lagrangian corresponding to a given list of particles. We report on the transformer's performance in constructing Lagrangians respecting the Standard Model $\mathrm{SU}(3)\times \mathrm{SU}(2)\times \mathrm{U}(1)$ gauge symmetries. The resulting model is shown to achieve high accuracies (over 90\%) with Lagrangians up to six matter fields, with the capacity to generalize beyond the training distribution, albeit within architectural constraints. We show through an analysis of input embeddings that the model has internalized concepts such as group representations and conjugation operations as it learned to generate Lagrangians. We make the model and training datasets available to the community. An interactive demonstration can be found at: \url{https://huggingface.co/spaces/JoseEliel/generate-lagrangians}.2025-01-16T18:25:50Z32 pages, 11 figues, 18 tablesYong Sheng KoayRikard EnbergStefano MorettiEliel Camargo-Molinahttp://arxiv.org/abs/2501.09201v1Towards Semantics Lifting for Scientific Computing: A Case Study on FFT2025-01-15T23:24:32ZThe rise of automated code generation tools, such as large language models (LLMs), has introduced new challenges in ensuring the correctness and efficiency of scientific software, particularly in complex kernels, where numerical stability, domain-specific optimizations, and precise floating-point arithmetic are critical. We propose a stepwise semantics lifting approach using an extended SPIRAL framework with symbolic execution and theorem proving to statically derive high-level code semantics from LLM-generated kernels. This method establishes a structured path for verifying the source code's correctness via a step-by-step lifting procedure to high-level specification. We conducted preliminary tests on the feasibility of this approach by successfully lifting GPT-generated fast Fourier transform code to high-level specifications.2025-01-15T23:24:32ZAccepted at the Theory and Practice of Static Analysis Workshop (TPSA), in conjunction with the ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025Naifeng ZhangSanil RaoMike FranusichFranz Franchettihttp://arxiv.org/abs/2501.08086v1NOMTO: Neural Operator-based symbolic Model approximaTion and discOvery2025-01-14T12:55:48ZWhile many physical and engineering processes are most effectively described by non-linear symbolic models, existing non-linear symbolic regression (SR) methods are restricted to a limited set of continuous algebraic functions, thereby limiting their applicability to discover higher order non-linear differential relations. In this work, we introduce the Neural Operator-based symbolic Model approximaTion and discOvery (NOMTO) method, a novel approach to symbolic model discovery that leverages Neural Operators to encompass a broad range of symbolic operations. We demonstrate that NOMTO can successfully identify symbolic expressions containing elementary functions with singularities, special functions, and derivatives. Additionally, our experiments demonstrate that NOMTO can accurately rediscover second-order non-linear partial differential equations. By broadening the set of symbolic operations available for discovery, NOMTO significantly advances the capabilities of existing SR methods. It provides a powerful and flexible tool for model discovery, capable of capturing complex relations in a variety of physical systems.2025-01-14T12:55:48ZSergei GarmaevSiddhartha MishraOlga Finkhttp://arxiv.org/abs/2501.07123v1Inferring Interpretable Models of Fragmentation Functions using Symbolic Regression2025-01-13T08:25:14ZMachine learning is rapidly making its path into natural sciences, including high-energy physics. We present the first study that infers, directly from experimental data, a functional form of fragmentation functions. The latter represent a key ingredient to describe physical observables measured in high-energy physics processes that involve hadron production, and predict their values at different energy. Fragmentation functions can not be calculated in theory and have to be determined instead from data. Traditional approaches rely on global fits of experimental data using a pre-assumed functional form inspired from phenomenological models to learn its parameters. This novel approach uses a ML technique, namely symbolic regression, to learn an analytical model from measured charged hadron multiplicities. The function learned by symbolic regression resembles the Lund string function and describes the data well, thus representing a potential candidate for use in global FFs fits. This study represents an approach to follow in such QCD-related phenomenology studies and more generally in sciences.2025-01-13T08:25:14ZNour MakkeSanjay Chawlahttp://arxiv.org/abs/2501.06707v1ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing system2025-01-12T04:23:34ZELIZA, created by Joseph Weizenbaum at MIT in the early 1960s, is usually considered the world's first chatbot. It was developed in MAD-SLIP on MIT's CTSS, the world's first time-sharing system, on an IBM 7094. We discovered an original ELIZA printout in Prof. Weizenbaum's archives at MIT, including an early version of the famous DOCTOR script, a nearly complete version of the MAD-SLIP code, and various support functions in MAD and FAP. Here we describe the reanimation of this original ELIZA on a restored CTSS, itself running on an emulated IBM 7094. The entire stack is open source, so that any user of a unix-like OS can run the world's first chatbot on the world's first time-sharing system.2025-01-12T04:23:34ZIn reviewRupert LaneAnthony HayArthur SchwarzDavid M. BerryJeff Shragerhttp://arxiv.org/abs/2501.06699v1Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users' Questions2025-01-12T03:32:12ZMuch has been discussed about how Large Language Models, Knowledge Graphs and Search Engines can be combined in a synergistic manner. A dimension largely absent from current academic discourse is the user perspective. In particular, there remain many open questions regarding how best to address the diverse information needs of users, incorporating varying facets and levels of difficulty. This paper introduces a taxonomy of user information needs, which guides us to study the pros, cons and possible synergies of Large Language Models, Knowledge Graphs and Search Engines. From this study, we derive a roadmap for future research.2025-01-12T03:32:12ZAidan HoganXin Luna DongDenny VrandečićGerhard Weikumhttp://arxiv.org/abs/2501.05318v1Recursive matrix algorithms, distributed dynamic control, scaling, stability2025-01-09T15:39:39ZThe report is devoted to the concept of creating block-recursive matrix algorithms for computing on a supercomputer with distributed memory and dynamic decentralized control.2025-01-09T15:39:39Z6 paged2019 Computer Science and Information Technologies (CSIT), Yerevan, Armenia, 2019, pp. 112-115Gennadi Malaschonok10.1109/CSITechnol.2019.8895255http://arxiv.org/abs/2501.03381v1THOI: An efficient and accessible library for computing higher-order interactions enhanced by batch-processing2025-01-06T20:55:59ZComplex systems are characterized by nonlinear dynamics, multi-level interactions, and emergent collective behaviors. Traditional analyses that focus solely on pairwise interactions often oversimplify these systems, neglecting the higher-order interactions critical for understanding their full collective dynamics. Recent advances in multivariate information theory provide a principled framework for quantifying these higher-order interactions, capturing key properties such as redundancy, synergy, shared randomness, and collective constraints. However, two major challenges persist: accurately estimating joint entropies and addressing the combinatorial explosion of interacting terms. To overcome these challenges, we introduce THOI (Torch-based High-Order Interactions), a novel, accessible, and efficient Python library for computing high-order interactions in continuous-valued systems. THOI leverages the well-established Gaussian copula method for joint entropy estimation, combined with state-of-the-art batch and parallel processing techniques to optimize performance across CPU, GPU, and TPU environments. Our results demonstrate that THOI significantly outperforms existing tools in terms of speed and scalability. For larger systems, where exhaustive analysis is computationally impractical, THOI integrates optimization strategies that make higher-order interaction analysis feasible. We validate THOI accuracy using synthetic datasets with parametrically controlled interactions and further illustrate its utility by analyzing fMRI data from human subjects in wakeful resting states and under deep anesthesia. Finally, we analyzed over 900 real-world and synthetic datasets, establishing a comprehensive framework for applying higher-order interaction (HOI) analysis in complex systems.2025-01-06T20:55:59Z22 pages, 6 figuresLaouen BelloliPedro MedianoRodrigo CofréDiego Fernandez SlezakRubén Herzoghttp://arxiv.org/abs/2101.03482v3The Proper Basis for Polynomial Ideals2025-01-03T15:43:40ZWe define a new type of ideal basis called the proper basis that improves both Gröbner basis and Buchberger's algorithm. Let $x_1$ be the least variable of a monomial ordering in a polynomial ring $K[x_1,\dotsc,x_n]$ over a field $K$. The Gröbner basis of a zero-dimensional polynomial ideal contains a univariate polynomial in $x_1$. The proper basis is defined and computed in the variables $\tilde{\bm{x}}:=(x_2,\dotsc,x_n)$ with $x_1$ serving as a parameter in the algebra $K[x_1][\tilde{\bm{x}}]$. Its algorithm is more efficient than not only Buchberger's algorithm whose elimination of $\tilde{\bm{x}}$ unnecessarily involves the least variable $x_1$ but also Möller's algorithm due to its polynomial division mechanism. This is corroborated by a series of benchmark testings herein. The proper basis is in a modular form and neater than Gröbner basis and hence reduces its coefficient swell problem. It is expected that all the state of the art algorithms improving Buchberger's algorithm over the last decades can be further improved if we apply them to the proper basis.2021-01-10T06:29:49Z15 pages. The length of the previous version is shortened to 15 pages in its current formSheng-Ming Mahttp://arxiv.org/abs/2501.00563v1Motives meet SymPy: studying $λ$-ring expressions in Python2024-12-31T17:49:53ZWe present a new Python package called "motives", a symbolic manipulation package based on SymPy capable of handling and simplifying motivic expressions in the Grothendieck ring of Chow motives and other types of $λ$-rings. The package is able to manipulate and compare arbitrary expressions in $λ$-rings and, in particular, it contains explicit tools for manipulating motives of several types of commonly used moduli schemes and moduli stacks of decorated bundles on curves. We have applied this new tool to advance in the verification of Mozgovoy's conjectural formula for the motive of the moduli space of twisted Higgs bundles, proving that it holds in rank 2 and 3 for any curve of genus up to 18 and any twisting bundle of small degree.2024-12-31T17:49:53Z19 pages, 2 figures. The code of the library described in the paper is publicly hosted at https://github.com/CIAMOD/motivesDaniel SanchezDavid AlfayaJaime Pizarrosohttp://arxiv.org/abs/2412.20973v1A Qualitative Analysis of Kernel Extension for Higher Order Proof Checking2024-12-30T14:24:54ZFor the sake of reliability, the kernels of Interactive Theorem Provers (ITPs) are generally kept relatively small. On top of the kernel, additional symbols and inference rules are defined. This paper presents an analysis of how kernel extension reduces the size of proofs and impacts proof checking.2024-12-30T14:24:54ZThe paper was presented in the student session of the European Summer School in Logic, Language, and Information (ESSLLI) in 2016Shuai Wanghttp://arxiv.org/abs/2412.20332v1An Algorithm for Discriminating the Complete Multiplicities of a Parametric Univariate Polynomial2024-12-29T03:19:21ZIn this paper, we tackle the parametric complete multiplicity problem for a univariate polynomial. Our approach to the parametric complete multiplicity problem has a significant difference from the classical method, which relies on repeated gcd computation. Instead, we introduce a novel technique that uses incremental gcds of the given polynomial and its high-order derivatives. This approach, formulated as non-nested subresultants, sidesteps the exponential expansion of polynomial degrees in the generated condition. We also uncover the hidden structure between the incremental gcds and pseudo-remainders. Our analysis reveals that the conditions produced by our new algorithm are simpler than those generated by the classical approach in most cases. Experiments show that our algorithm is faster than the one based on repeated gcd computation for problems with relatively big size.2024-12-29T03:19:21ZSimin QinBican XiaJing Yanghttp://arxiv.org/abs/2403.06294v3ArgMed-Agents: Explainable Clinical Decision Reasoning with LLM Disscusion via Argumentation Schemes2024-12-28T17:40:48ZThere are two main barriers to using large language models (LLMs) in clinical reasoning. Firstly, while LLMs exhibit significant promise in Natural Language Processing (NLP) tasks, their performance in complex reasoning and planning falls short of expectations. Secondly, LLMs use uninterpretable methods to make clinical decisions that are fundamentally different from the clinician's cognitive processes. This leads to user distrust. In this paper, we present a multi-agent framework called ArgMed-Agents, which aims to enable LLM-based agents to make explainable clinical decision reasoning through interaction. ArgMed-Agents performs self-argumentation iterations via Argumentation Scheme for Clinical Discussion (a reasoning mechanism for modeling cognitive processes in clinical reasoning), and then constructs the argumentation process as a directed graph representing conflicting relationships. Ultimately, use symbolic solver to identify a series of rational and coherent arguments to support decision. We construct a formal model of ArgMed-Agents and present conjectures for theoretical guarantees. ArgMed-Agents enables LLMs to mimic the process of clinical argumentative reasoning by generating explanations of reasoning in a self-directed manner. The setup experiments show that ArgMed-Agents not only improves accuracy in complex clinical decision reasoning problems compared to other prompt methods, but more importantly, it provides users with decision explanations that increase their confidence.2024-03-10T19:47:00ZShengxin HongLiang XiaoXin ZhangJianxia Chen