http://arxiv.org/abs/2510.03621v2 A flux-based approach for analyzing the disguised toric locus of reaction networks 2026-01-14T21:26:08Z

Dynamical systems with polynomial right-hand sides are very important in various applications, e.g., in biochemistry and population dynamics. The mathematical study of these dynamical systems is challenging due to the possibility of multistability, oscillations, and chaotic dynamics. One important tool for this study is the concept of reaction systems, which are dynamical systems generated by reaction networks for some choices of parameter values. Among these, disguised toric systems are remarkably stable: they have a unique attracting fixed point, and cannot give rise to oscillations or chaotic dynamics. The computation of the set of parameter values for which a network gives rise to disguised toric systems (i.e., the disguised toric locus of the network) is an important but difficult task. We introduce new ideas based on network fluxes for studying the disguised toric locus. We prove that the disguised toric locus of any network $G$ is a contractible manifold with boundary, and introduce an associated graph $G^{\max}$ that characterizes its interior. These theoretical tools allow us, for the first time, to compute the full disguised toric locus for many networks of interest.

2025-10-04T02:06:55Z resolved TexLive 2025 <-> cleveref issue Balázs Boros Gheorghe Craciun Oskar Henriksson Jiaxin Jin Diego Rojas La Luz

http://arxiv.org/abs/2601.07415v1 PLANET v2.0: A comprehensive Protein-Ligand Affinity Prediction Model Based on Mixture Density Network 2026-01-12T10:56:29Z

Drug discovery is a time-consuming and costly process, and virtual screening can accelerate it. The precision of scoring functions, one of the tools guiding virtual screening, is closely tied to screening efficiency. In our previous study, we developed a graph neural network model called PLANET (Protein-Ligand Affinity prediction NETwork), but it suffers from a defect in representing protein-ligand contact maps. Incorrect binding modes inevitably lead to poor affinity predictions, so accurate prediction of the protein-ligand contact map is needed to improve PLANET. In this study, we propose PLANET v2.0 as an upgraded version. The model is trained with a multi-objective training strategy and incorporates a Mixture Density Network to predict binding modes. In addition to the probability density distributions of non-covalent interactions, we employ another Gaussian mixture model to describe the relationship between distance and energy for each interaction pair, and predict protein-ligand affinity in a manner analogous to computing a mathematical expectation. On the CASF-2016 benchmark, PLANET v2.0 demonstrates excellent scoring power, ranking power, and docking power. Its screening power is notably improved compared to PLANET and Glide SP, and it performs robustly in validation on a commercial ultra-large-scale dataset. Given its efficiency and accuracy, PLANET v2.0 can hopefully become one of the practical tools for virtual screening workflows. PLANET v2.0 is freely available at https://www.pdbbind-plus.org.cn/planetv2.

2026-01-12T10:56:29Z Haotian Gao Xiangying Zhang Jingyuan Li Xinchong Chen Haojie Wang Yifei Qi Renxiao Wang

http://arxiv.org/abs/2601.06712v1 In-context learning emerges in chemical reaction networks without attention 2026-01-10T22:40:02Z

We investigate whether chemical processes can perform in-context learning (ICL), a mode of computation typically associated with transformer architectures. ICL allows a system to infer task-specific rules from a sequence of examples without relying solely on fixed parameters. Traditional ICL relies on a pairwise attention mechanism which is not obviously implementable in chemical systems. However, we show theoretically and numerically that chemical processes can achieve ICL through a mechanism we call subspace projection, in which the entire input vector is mapped onto comparison subspaces, with the dominant projection determining the computational output. We illustrate this mechanism analytically in small chemical systems and show numerically that performance is robust to input encoding and dynamical choices, with the number of tunable degrees of freedom in the input encoding as a key limitation. Our results provide a blueprint for realizing ICL in chemical or other physical media and suggest new directions for designing adaptive synthetic chemical systems and understanding possible biological computation in cells.

2026-01-10T22:40:02Z 19 pages, 9 figures Carlos Floyd Hector Manuel Lopez Rios Aaron R. Dinner Suriyanarayanan Vaikuntanathan

http://arxiv.org/abs/2509.17594v3 A Sensitivity Analysis Methodology for Rule-Based Stochastic Chemical Systems 2026-01-09T10:07:27Z

In this study, we introduce a sensitivity analysis methodology for stochastic systems in chemistry, where dynamics are often governed by random processes. Our approach is based on gradient estimation via finite differences, averaging simulation outcomes, and analyzing variability under intrinsic noise. We characterize gradient uncertainty as an angular range within which all plausible gradient directions are expected to lie. A key feature of our approach is that this uncertainty measure adaptively guides the number of simulations performed for each nominal-perturbation pair of points in order to minimize unnecessary computations while maintaining robustness. Systematically exploring a range of parameter values across the parameter space, rather than focusing on a single value, allows us to identify not only sensitive parameters but also regions of parameter space associated with different levels of sensitivity. These results are visualized through vector field plots to offer an intuitive representation of local sensitivity across parameter space. Additionally, global sensitivity coefficients over sampled points in the parameter space are computed to capture overall trends. Flexibility regarding the choice of output observable measures is another key feature of our method: while traditional sensitivity analyses often focus on species concentrations, our framework allows for the definition of a large range of problem-specific observables. This makes it broadly applicable in diverse chemical and biochemical scenarios. We demonstrate our approach on two systems: classical Michaelis-Menten kinetics and a rule-based model of the formose reaction, using the cheminformatics software MØD for Gillespie-based stochastic simulations.
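The core step described in this abstract, finite-difference gradient estimation on averaged stochastic simulation outcomes, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the birth-death model, the function names, the relative step size, and the fixed number of runs are all hypothetical, and the adaptive, angle-based rule for choosing the number of simulations is not reproduced. It uses a hand-written Gillespie loop rather than MØD.

```python
import random

def gillespie_birth_death(k_birth, k_death, t_end, rng, x0=0):
    """Toy Gillespie run for 0 -> X (rate k_birth) and X -> 0 (rate k_death * x).

    Returns the copy number of X at time t_end (the observable used here).
    """
    t, x = 0.0, x0
    while True:
        a_birth = k_birth
        a_death = k_death * x
        a_total = a_birth + a_death
        if a_total <= 0.0:
            return x
        t += rng.expovariate(a_total)  # waiting time to the next event
        if t > t_end:
            return x
        if rng.random() * a_total < a_birth:
            x += 1
        else:
            x -= 1

def mean_observable(params, n_runs, seed, t_end=20.0):
    """Average the observable over n_runs independent simulations."""
    rng = random.Random(seed)
    total = sum(
        gillespie_birth_death(params[0], params[1], t_end, rng)
        for _ in range(n_runs)
    )
    return total / n_runs

def finite_difference_gradient(params, rel_step=0.2, n_runs=400, seed=0):
    """Forward finite differences between a nominal point and perturbed points."""
    base = mean_observable(params, n_runs, seed)
    grad = []
    for i, p in enumerate(params):
        shifted = list(params)
        shifted[i] = p * (1.0 + rel_step)
        perturbed = mean_observable(shifted, n_runs, seed + 1 + i)
        grad.append((perturbed - base) / (p * rel_step))
    return grad

# Nominal point (k_birth, k_death) = (10, 1); the stationary mean is k_birth / k_death.
print(finite_difference_gradient([10.0, 1.0]))
```

Because each mean is itself noisy, the estimated gradient direction fluctuates between repetitions; that fluctuation is what the abstract's angular uncertainty measure is meant to quantify.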

2025-09-22T11:17:27Z Erika M. Herrera Machado Jakob L. Andersen Rolf Fagerberg Daniel Merkle

http://arxiv.org/abs/2601.04335v1 Thermodynamic Constraints Drive Hierarchical Preemption in Cellular Decision-Making: A Hybrid Petri Net Framework with Application to Bacillus subtilis Sporulation 2026-01-07T19:15:23Z

Cellular decision-making under stress involves rapid pathway selection despite energy scarcity. Here we demonstrate that thermodynamic constraints actively drive energy-efficient sporulation, where continuous metabolic sources enable system robustness through dynamic energy management. Using hybrid Petri nets (stochastic transitions with continuous sources) to model Bacillus subtilis sporulation, we show that stress conditions (ATP = 300 mM, 94% depletion) enable sporulation completion with extreme energy efficiency: 0.73 mM ATP per mature spore versus 11.6 mM ATP under normal conditions--a 16-fold efficiency gain. Despite ATP dropping to 1 mM (99.7% depletion) during the crisis, continuous ATP regeneration rescues the system, producing 67 mM mature spores (89% of normal yield) with only 49 mM total ATP consumption. This efficiency emerges from the interplay between stochastic regulatory transitions and continuous metabolic sources, where GTP accumulation (+4974 mM, 166% increase) provides an energy buffer while ATP regeneration (+240 mM) prevents complete depletion. The hybrid Petri net formalism--combining stochastic transitions for regulatory events with continuous sources for metabolic flux--extended with thermodynamic constraints through inhibitor arcs and energy-coupled rate functions, provides the mathematical foundation enabling this discovery by integrating discrete regulatory logic with continuous energy dynamics in a resource-aware concurrency model.

2026-01-07T19:15:23Z 9 pages, 2 figures, 2 tables. Includes supplementary analysis and data availability statement. Model files and simulation code available at https://github.com/simao-eugenio/shypn Eugenio Simao

http://arxiv.org/abs/2601.01850v2 Allostery Beyond Amplification: Temporal Regulation of Signaling Information 2026-01-07T17:12:18Z

Allostery is a fundamental mechanism of protein regulation and is commonly interpreted as modulating enzymatic activity or product abundance. Here we show that this view is incomplete. Using a stochastic model of allosteric regulation combined with an information-theoretic analysis, we quantify the mutual information between an enzyme's regulatory state and the states of downstream signaling components. Beyond controlling steady-state production levels, allostery also regulates the timing and duration over which information is transmitted. By tuning the temporal operating regime of signaling pathways, allosteric regulation enables distinct dynamical outcomes from identical molecular components, providing a physical mechanism for temporal information flow, signaling specificity, and coordination without changes in metabolic pathways.

2026-01-05T07:27:34Z Pedro Pessoa Steve Pressé S. Banu Ozkan

http://arxiv.org/abs/2601.04016v1 Restoring information in aged gene regulatory networks by single knock-ins 2026-01-07T15:31:15Z

A hallmark of aging is loss of information in gene regulatory networks. These networks are tightly connected, raising the question of whether information could be restored by perturbing single genes. We develop a simple theoretical framework for information transmission in gene regulatory networks that describes the information gained or lost when a gene is "knocked in" (exogenously expressed). Applying the framework to gene expression data from muscle cells in young and old mice, we find that single knock-ins can restore network information by up to 10%. Our work advances the study of information flow in networks and identifies potential gene targets for rejuvenation.

2026-01-07T15:31:15Z 7 pages, 6 figures Ryan LeFebre Fabrisia Ambrosio Andrew Mugler

http://arxiv.org/abs/2601.03704v1 Investigating Knowledge Distillation Through Neural Networks for Protein Binding Affinity Prediction 2026-01-07T08:43:08Z

The trade-off between predictive accuracy and data availability makes it difficult to predict protein--protein binding affinity accurately. The lack of experimentally resolved protein structures limits the performance of structure-based machine learning models, which generally outperform sequence-based methods. In order to overcome this constraint, we suggest a regression framework based on knowledge distillation that uses protein structural data during training and only needs sequence data during inference. The suggested method uses binding affinity labels and intermediate feature representations to jointly supervise the training of a sequence-based student network under the guidance of a structure-informed teacher network. Leave-One-Complex-Out (LOCO) cross-validation was used to assess the framework on a non-redundant protein--protein binding affinity benchmark dataset. A maximum Pearson correlation coefficient (P_r) of 0.375 and an RMSE of 2.712 kcal/mol were obtained by sequence-only baseline models, whereas a P_r of 0.512 and an RMSE of 2.445 kcal/mol were obtained by structure-based models. With a P_r of 0.481 and an RMSE of 2.488 kcal/mol, the distillation-based student model greatly enhanced sequence-only performance. Improved agreement and decreased bias were further confirmed by thorough error analyses. With the potential to close the performance gap between sequence-based and structure-based models as larger datasets become available, these findings show that knowledge distillation is an efficient method for transferring structural knowledge to sequence-based predictors. The source code for running inference with the proposed distillation-based binding affinity predictor can be accessed at https://github.com/wajidarshad/ProteinAffinityKD.

2026-01-07T08:43:08Z Wajid Arshad Abbasi Syed Ali Abbas Maryum Bibi Saiqa Andleeb Muhammad Naveed Akhtar

http://arxiv.org/abs/2503.14437v3 Functional classification of metabolic networks 2026-01-07T00:10:43Z

Chemical reaction networks underpin biological and physical phenomena across scales, from microbial interactions to planetary atmosphere dynamics. Bacterial communities exhibit complex competitive interactions for resources, human organs and tissues demonstrate specialized biochemical functions, and planetary atmospheres can display diverse organic and inorganic chemical processes. Despite their complexities, comparing these networks methodically remains a challenge due to the vast underlying degrees of freedom. In biological systems, comparative genomics has been pivotal in tracing evolutionary trajectories and classifying organisms via DNA sequences. However, purely genomic classifications often fail to capture functional roles within ecological systems. Metabolic changes driven by nutrient availability highlight the need for classification schemes that integrate metabolic information. Here we introduce and apply a computational framework for a classification scheme of organisms that compares matrix representations of chemical reaction networks using the Grassmann distance, corresponding to measuring distances between the nullspaces of stoichiometric matrices. Applying this framework to human gut microbiome data confirms that metabolic distances are distinct from phylogenetic distances, underscoring the limitations of genetic information in metabolic classification. Importantly, our analysis of metabolic distances reveals functional groups of organisms enriched or depleted in specific metabolic processes and shows robustness to metabolically silent genetic perturbations. The generalizability of metabolic Grassmann distances is illustrated by application to chemical reaction networks in human tissue and planetary atmospheres, highlighting its potential for advancing functional comparisons across diverse chemical reaction systems.
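To make the distance described in this abstract concrete, the sketch below computes a Grassmann (geodesic) distance between the nullspaces of two stoichiometric matrices from their principal angles. It is an illustration under assumptions, not the authors' framework: the two small matrices, the function names, and the tolerance are hypothetical, NumPy is assumed to be available, and the sketch only handles nullspaces of equal dimension, whereas the paper may treat differing dimensions in its own way.

```python
import numpy as np

def null_space_basis(matrix, tol=1e-10):
    """Orthonormal basis (as columns) of the right nullspace, obtained via SVD."""
    _, s, vt = np.linalg.svd(np.asarray(matrix, dtype=float), full_matrices=True)
    rank = int(np.sum(s > tol))
    return vt[rank:].T

def grassmann_distance(basis_a, basis_b):
    """Geodesic distance: the 2-norm of the principal angles between two subspaces."""
    if basis_a.shape != basis_b.shape:
        raise ValueError("this sketch assumes subspaces of equal dimension")
    cosines = np.linalg.svd(basis_a.T @ basis_b, compute_uv=False)
    angles = np.arccos(np.clip(cosines, -1.0, 1.0))
    return float(np.linalg.norm(angles))

# Rows are species (X, Y); columns are reactions (r1, r2, r3).
# Network A: 0 -> X, X -> Y, Y -> 0. Steady-state fluxes span (1, 1, 1).
N_a = [[1, -1, 0],
       [0, 1, -1]]
# Network B: 0 -> X, X -> Y, 2Y -> 0. Steady-state fluxes span (2, 2, 1).
N_b = [[1, -1, 0],
       [0, 1, -2]]

d_ab = grassmann_distance(null_space_basis(N_a), null_space_basis(N_b))
print(d_ab)
```

Here both nullspaces are one-dimensional, so the distance reduces to the angle between the two steady-state flux directions.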

2025-03-18T17:13:58Z 23 pages, 14 figures, 5 appendices; expanded methodology, theoretical and computational details added, conclusions unchanged Jorge Reyes Jörn Dunkel

http://arxiv.org/abs/2601.02787v1 Simple chemical systems with chaos 2026-01-06T07:47:26Z

A number of simple chaotic three-dimensional dynamical systems (DSs) with quadratic polynomials on the right-hand sides are reported in the literature, containing exactly 5 or 6 monomials of which only 1 or 2 are quadratic. However, none of these simple systems are chemical dynamical systems (CDSs) - a special subset of polynomial DSs that model the dynamics of mass-action chemical reaction networks (CRNs). In particular, only a small number of three-dimensional quadratic CDSs with chaos are reported, all of which have at least 9 monomials and at least 3 quadratics, with CRNs containing at least 7 reactions and at least 3 quadratic ones. To bridge this gap, in this paper we prove some basic properties of chaotic CDSs, including that those in three dimensions have at least 6 monomials, at least one of which is negative and quadratic. We then use these results to computationally find 20 chaotic three-dimensional CDSs with 6 monomials and as few as 4 quadratics, or 7 monomials and as few as 2 quadratics. At the CRN level, some of these systems have 4 reactions of which only 3 are quadratic, or 5 reactions with only 2 being quadratic. These results quantify structural complexity of chaotic CDSs, and indicate that they are ubiquitous.
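As a small illustration of how a mass-action reaction network generates a polynomial right-hand side, the sketch below evaluates the vector field of a toy two-species network. The network, rate constants, and function name are hypothetical and chosen for readability; the network is two-dimensional and not chaotic, and it is not one of the systems reported in the paper.

```python
def mass_action_rhs(species, reactions, x):
    """Evaluate dx/dt for a mass-action network at concentrations x.

    Each reaction is (reactants, products, rate_constant), with reactants and
    products given as dicts mapping species name to stoichiometric coefficient.
    """
    conc = dict(zip(species, x))
    dxdt = {s: 0.0 for s in species}
    for reactants, products, k in reactions:
        rate = k
        for s, n in reactants.items():
            rate *= conc[s] ** n  # one monomial per reaction
        for s in species:
            dxdt[s] += rate * (products.get(s, 0) - reactants.get(s, 0))
    return [dxdt[s] for s in species]

species = ["X", "Y"]
reactions = [
    ({"X": 1}, {"X": 2}, 1.0),          # X -> 2X      contributes +x to dX/dt
    ({"X": 1, "Y": 1}, {"Y": 2}, 0.5),  # X + Y -> 2Y  contributes -0.5xy and +0.5xy
    ({"Y": 1}, {}, 2.0),                # Y -> 0       contributes -2y to dY/dt
]

print(mass_action_rhs(species, reactions, [2.0, 3.0]))  # prints [-1.0, -3.0]
```

The term -0.5xy in dX/dt is an example of a negative quadratic monomial: under mass action, a negative monomial in a species' equation must contain that species as a factor, which is the structural restriction separating CDSs from general polynomial systems.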

2026-01-06T07:47:26Z Tomislav Plesa Julien Clinton Sprott

http://arxiv.org/abs/2601.00515v2 The Physics of Causation 2026-01-06T00:32:05Z

Assembly theory (AT) introduces a concept of causation as a material property, constitutive of a metrology of evolution and selection. The physical scale for causation is quantified with the assembly index, defined as the minimum number of steps necessary for a distinguishable object to exist, where steps are assembled recursively. Observing countable copies of high assembly index objects indicates that a mechanism to produce them is persistent, such that the object's environment builds a memory that traps causation within a contingent chain. Copy number and assembly index underlie the standardized metrology for detecting causation (assembly index), and evidence of contingency (copy number). Together, these allow the precise definition of a selective threshold in assembly space, understood as the set of all causal possibilities. This threshold demarcates life (and its derivative agential, intelligent and technological forms) as structures with persistent copies beyond the threshold. In introducing a fundamental concept of material causation to explain and measure life, AT represents a departure from prior theories of causation, such as interventional ones, which have so far proven incompatible with fundamental physics. We discuss how AT's concept of causation provides the foundation for a theory of physics where novelty, contingency and the potential for open-endedness are fundamental, and determinism is emergent along assembled lineages.

2026-01-02T00:20:53Z 58 pages, 7 Figures, 68 references Leroy Cronin Sara I. Walker

http://arxiv.org/abs/2601.01337v1 HyperNetWalk: A Unified Framework for Personalized and Population-Level Cancer Driver Gene Identification via Multi-Network Hypergraph Diffusion 2026-01-04T02:49:51Z

Identifying cancer driver genes is crucial for understanding tumor biology and developing precision therapies. However, existing computational methods often rely on single biological networks or population-level mutation patterns, limiting their ability to identify patient-specific drivers and leverage the complementary information from multiple network types. Here, we present HyperNetWalk, a novel computational framework that integrates multiple biological networks and hypergraph diffusion to identify driver genes at both personalized and cohort levels. In the first stage, HyperNetWalk integrates protein-protein interaction networks, gene regulatory networks, and dynamic co-expression networks through sample-independent random walks on patient-specific subnetworks to capture topological importance and expression perturbation effects. In the second stage, it refines predictions through hypergraph-based random walks that leverage cross-sample information while preserving individual mutational contexts. Comprehensive evaluation on 12 TCGA cancer types demonstrates that HyperNetWalk achieves superior or competitive performance compared to state-of-the-art methods in both personalized and cohort-level predictions. Notably, HyperNetWalk successfully identifies known driver genes with high precision while revealing cancer type-specific drivers that reflect distinct biological mechanisms. Our framework provides a unified solution for personalized and population-based driver gene identification, offering valuable insights for precision oncology and therapeutic target discovery.

2026-01-04T02:49:51Z 31 pages, 4 main figures, 7 supplementary figures. Code is available at https://github.com/xqxu921/HyperNetWalk Xueqing Xu Yonghang Gao Duanchen Sun Ling-Yun Wu

http://arxiv.org/abs/2512.24427v1 Epigenetic Control and Reprogramming-Induced Potential Landscapes of Gene Regulatory Networks: A Quantitative Theoretical Approach 2025-12-30T19:06:12Z

We develop an extended Dynamical Mean Field Theory framework to analyze gene regulatory networks (GRNs) incorporating epigenetic modifications. Building on the Hopfield network model analogy to spin glass systems, our approach introduces dynamic terms representing DNA methylation and histone modification to capture their regulatory influence on gene expression. The resulting formulation reduces high-dimensional GRN dynamics to effective stochastic equations, enabling the characterization of both stable and oscillatory states in epigenetically regulated systems. This framework provides a tractable and quantitative method for linking gene regulatory dynamics with epigenetic control, offering new theoretical insights into developmental processes and cell fate decisions.

2025-12-30T19:06:12Z 18 pages, 7 figures Sascha H. Hauck Sandip Saha Narsis A. Kiani Jesper N. Tegner

http://arxiv.org/abs/2601.00036v1 Unifying Weak Independence and Signal Hierarchy Theory: Extended Biological Petri Net Formalism with Application to Vibrio fischeri Quorum Sensing 2025-12-30T17:43:26Z

Biological Petri Nets (Bio-PNs) require extensions beyond classical formalism to capture biochemical reality: multiple reactions simultaneously affect shared metabolites through convergent production or regulatory coupling, while signal places carry hierarchical control information distinct from material flow. We present a unified 13-tuple Extended Bio-PN formalism integrating two complementary theories: Weak Independence Theory (enabling coupled parallelism despite place-sharing) and Signal Hierarchy Theory (separating information flow from mass transfer). The extended definition adds signal partition (Psi subset P), arc type classification (A), regulatory structure (Sigma), environmental exchange (Theta), dependency taxonomy (Delta), heterogeneous transition types (tau), and biochemical formula tracking (rho). We formalize signal token consumption semantics through two-phase execution (enabling vs. consumption) and prove weak independence correctness for continuous dynamics. Application to Vibrio fischeri quorum sensing demonstrates how energy metabolism (ENERGY signals) orchestrates binary ON/OFF decisions through hierarchical constraint propagation to regulatory signals (LuxR-AHL complex), with 133-fold difference separating states. Analysis reveals signal saturation timing as the orchestrator forcing threshold-crossing, analogous to bacteriophage lambda lysogeny-lysis decisions. This work establishes formal foundations for modeling biological information flow in Petri nets, with implications for systems biology, synthetic circuit design, and parallel biochemical simulation.

2025-12-30T17:43:26Z 9 pages, 3 figures Eugenio Simao

http://arxiv.org/abs/2512.23784v1 Sheaf-theoretic representation of the proteolipid code 2025-12-29T16:25:09Z

Membrane particles such as proteins and lipids organize into zones that perform unique functions. Here, I introduce a topological and category-theoretic framework to represent particle and zone intra-scale interactions and inter-scale coupling. This involves carefully demarcating between different presheaf- or sheaf-assigned data levels to preserve functorial structure and account for particle and zone generalized poses. The framework can accommodate Hamiltonian mechanics, enabling dynamical modeling. This amounts to a versatile mathematical formalism for membrane structure and multiscale coupling.

2025-12-29T16:25:09Z 16 pages, 3 figures Troy A. Kervin