https://arxiv.org/api/hziLUEezEVZG1R0Zw3uGbICXZ8E 2026-03-28T12:53:41Z 4112 150 15 http://arxiv.org/abs/2508.15273v2 Stoichiometric recipes for periodic oscillations in reaction networks 2025-10-09T12:33:10Z

Oscillatory chemical reactions are functional components in a variety of biological contexts. In chemistry, the construction and identification of even rudimentary oscillators remain elusive and lack a general framework. Using parameter-rich kinetics - a methodology enabling the disentanglement of parametric dependencies from structural analysis - we investigate the stoichiometry of chemical oscillators. We introduce the concept of oscillatory cores: minimal subnetworks that guarantee the potential for oscillations in any reaction network containing them. These cores fall into two classes, depending on whether they involve positive or negative feedback. In particular, the latter class unveils a family of oscillators - yet to be synthesized - that require a minimum number of reaction steps to exhibit oscillations, a phenomenon we refer to as the principle of length. We identify several mechanisms through which catalysis promotes oscillations: (I) furnishing instability (e.g. autocatalysis), (II) lifting dependencies, (III) lowering length thresholds. Notwithstanding this mechanistic ubiquity, we show that oscillators can also be realized without employing any catalysis. Our results highlight branches of chemistry where oscillators are likely to arise by chance, suggest new strategies for their design, and point to novel classes of oscillators yet to be realized experimentally.

2025-08-21T06:06:16Z 29 pages (Main) + 50 pages (Supplementary Material) Alexander Blokhuis Peter F. Stadler Nicola Vassena http://arxiv.org/abs/2510.07797v1 Non-Kramers State Transitions in a Synthetic Toggle Switch Biosystem 2025-10-09T05:18:11Z

State transitions are fundamental in biological systems but challenging to observe directly. Here, we present the first single-cell observation of state transitions in a synthetic bacterial genetic circuit. Using a mother machine, we tracked over 1007 cells for 27 hours. First-passage analysis and dynamical reconstruction reveal that transitions occur outside the small-noise regime, challenging the applicability of classical Kramers' theory. The process lacks a single characteristic rate, questioning the paradigm of transitions between discrete cell states. We observe significant multiplicative noise that distorts the effective potential landscape yet increases transition times. These findings necessitate theoretical frameworks for biological state transitions beyond the small-noise assumption.

2025-10-09T05:18:11Z 5 pages, 3 figures in main text; 17 pages, 15 figures in supplemental information Jianzhe Wei Jingwen Zhu Pan Chu Liang Luo Xiongfei Fu http://arxiv.org/abs/2403.17202v3 The generic temperature response of large biochemical networks 2025-10-08T12:20:29Z

Biological systems are remarkably susceptible to relatively small temperature changes. The most obvious example is fever, when a modest rise in body temperature of only few Kelvin has strong effects on our immune system and how it fights pathogens. Another very important example is climate change, when even smaller temperature changes lead to dramatic shifts in ecosystems. Although it is generally accepted that the main effect of an increase in temperature is the acceleration of biochemical reactions according to the Arrhenius equation, it is not clear how it affects large biochemical networks with complicated architectures. For developmental systems like fly and frog, it has been shown that the system response to temperature deviates in a characteristic manner from the linear Arrhenius plot of single reactions, but a rigorous explanation has not been given yet. Here we use a graph-theoretical interpretation of the mean first-passage times of a biochemical master equation to give a statistical description. We find that in the limit of large system size and if the network has a bias towards a target state, then the Arrhenius plot is generically quadratic, in excellent agreement with numerical simulations for large networks as well as with experimental data for developmental times in fly and frog. We also discuss under which conditions this generic response can be violated, for example for linear chains, which have only one spanning tree.

2024-03-25T21:27:38Z 25 pages, 24 figures Julian B. Voits Heidelberg University Ulrich S. Schwarz Heidelberg University http://arxiv.org/abs/2510.07337v1 Decoding the dark proteome: Deep learning-enabled discovery of druggable enzymes in Wuchereria bancrofti 2025-10-07T08:20:11Z

Wuchereria bancrofti, the parasitic roundworm responsible for lymphatic filariasis, permanently disables over 36 million people and places 657 million at risk across 39 countries. A major bottleneck for drug discovery is the lack of functional annotation for more than 90 percent of the W. bancrofti dark proteome, leaving many potential targets unidentified. In this work, we present a novel computational pipeline that converts W. bancrofti's unannotated amino acid sequence data into precise four-level Enzyme Commission (EC) numbers and drug candidates. We utilized a DEtection TRansformer to estimate the probability of enzymatic function, fine-tuned a hierarchical nearest neighbor EC predictor on 4,476 labeled parasite proteins, and applied rejection sampling to retain only four-level EC classifications at 100 percent confidence. This pipeline assigned precise EC numbers to 14,772 previously uncharacterized proteins and discovered 543 EC classes not previously known in W. bancrofti. A qualitative triage emphasizing parasite-specific targets, chemical tractability, biochemical importance, and biological plausibility prioritized six enzymes across five separate strategies: anti-Wolbachia cell-wall inhibition, proteolysis blockade, transmission disruption, purinergic immune interference, and cGMP-signaling destabilization. We curated a 43-compound library from ChEMBL and BindingDB and co-folded across multiple protein conformers with Boltz-2. All six targets exhibited at least moderately strong predicted binding affinities below 1 micromolar, with moenomycin analogs against peptidoglycan glycosyltransferase and NTPase inhibitors showing promising nanomolar hits and well-defined binding pockets. While experimental validation remains essential, our results provide the first large-scale functional map of the W. bancrofti dark proteome and accelerate early-stage drug development for the species.

2025-10-07T08:20:11Z Accepted for peer-reviewed publication at the STEM Fellowship Journal Shawnak Shivakumar Jefferson Hernandez http://arxiv.org/abs/2507.03720v2 Fast decisions with biophysically constrained gene promoter architectures 2025-10-07T08:05:46Z

Cells integrate signals and make decisions about their future state in short amounts of time. A lot of theoretical effort has gone into asking how to best design gene regulatory circuits that fulfill a given function, yet little is known about the constraints that performing that function in a small amount of time imposes on circuit architectures. Using an optimization framework, we explore the properties of a class of promoter architectures that distinguish small differences in transcription factor concentrations under time constraints. We show that the full temporal trajectory of gene activity allows for faster decisions than its integrated activity represented by the total number of transcribed mRNA. The topology of promoter architectures that allow for rapidly distinguishing low transcription factor concentrations result in a low, shallow, and non cooperative response, while at high concentrations, the response is high and cooperative. In the presence of non-cognate ligands, networks with fast and accurate decision times need not be optimally selective, especially if discrimination is difficult. While optimal networks are generically out of equilibrium, the energy associated with that irreversibility is only modest, and negligible at small concentrations. Instead, our results highlight the crucial role of rate-limiting steps imposed by biophysical constraints.

2025-07-04T17:16:59Z Tarek Tohme Massimo Vergassola Thierry Mora Aleksandra M. Walczak http://arxiv.org/abs/2510.05083v1 Robust multicellular programs dissect the complex tumor microenvironment and track disease progression in colorectal adenocarcinomas 2025-10-06T17:51:26Z

Colorectal cancer (CRC) is highly heterogeneous, with five-year survival rates dropping from $\sim$90% in localized disease to $\sim$15% with distant metastases. Disease progression is shaped not only by tumor-intrinsic alterations but also by the reorganization of the tumor microenvironment (TME). Metabolic, compositional, and spatial changes contribute to this progression, but considered individually they lack context and often fail as therapeutic targets. Understanding their coordination could reveal processes to alter the disease course. Here, we combined multiplexed ion beam imaging (MIBI) with machine learning to profile metabolic, functional and spatial states of 522 colorectal lesions with single-cell resolution. We observed recurrent stage-specific remodeling marked by a lymphoid-to-myeloid shift, stromal-cancer cooperation, and malignant metabolic shifts. Spatial organization of epithelial, stromal, and immune compartments provided stronger stratification of disease stage than tumor-intrinsic changes or bulk immune infiltration alone. To systematically model these coordinated changes, we condensed multimodal features into 10 latent factors of TME organization. These factors tracked disease progression, were conserved across cohorts, and revealed frequent multicellular metabolic niches and distinct, non-exclusive TME trajectories. Our framework MuVIcell exposes the elements that together drive CRC progression by grouping co-occurring changes across cell types and feature classes into coordinated multicellular programs. This creates a rational basis to therapeutically target TME reorganization. Importantly, the framework is scalable and flexible, offering a resource for studying multicellular organization in other solid tumors.

2025-10-06T17:51:26Z Loan Vulliard Teresa Glauner Sven Truxa Miray Cetin Yu-Le Wu Ronald Simon Laura Behm Jovan Tanevski Julio Saez-Rodriguez Guido Sauter Felix J. Hartmann http://arxiv.org/abs/2510.04176v1 Relief of EGFR/FOS-downregulated miR-103a by loganin alleviates NF-kappaB-triggered inflammation and gut barrier disruption in colitis 2025-10-05T12:36:31Z

Due to the ever-rising global incidence rate of inflammatory bowel disease (IBD) and the lack of effective clinical treatment drugs, elucidating the detailed pathogenesis, seeking novel targets, and developing promising drugs are the top priority for IBD treatment. Here, we demonstrate that the levels of microRNA (miR)-103a were significantly downregulated in the inflamed mucosa of ulcerative colitis (UC) patients, along with elevated inflammatory cytokines (IL-1beta/TNF-alpha) and reduced tight junction protein (Occludin/ZO-1) levels, as compared with healthy control objects. Consistently, miR-103a deficient intestinal epithelial cells Caco-2 showed serious inflammatory responses and increased permeability, and DSS induced more severe colitis in miR-103a-/- mice than wild-type ones. Mechanistic studies unraveled that c-FOS suppressed miR-103a transcription via binding to its promoter, then miR-103a-targeted NF-kappaB activation contributes to inflammatory responses and barrier disruption by targeting TAB2 and TAK1. Notably, the traditional Chinese medicine Cornus officinalis (CO) and its core active ingredient loganin potently mitigated inflammation and barrier disruption in UC by specifically blocking the EGFR/RAS/ERK/c-FOS signaling axis, these effects mainly attributed to modulated miR-103a levels as the therapeutic activities of them were almost completely shielded in miR-103a KO mice. Taken together, this work reveals that loganin relieves EGFR/c-FOS axis-suppressed epithelial miR-103a expression, thereby inhibiting NF-kappaB pathway activation, suppressing inflammatory responses, and preserving tight junction integrity in UC. Thus, our data enrich mechanistic insights and promising targets for UC treatment.

2025-10-05T12:36:31Z Yan Li Teng Hui Xinhui Zhang Zihan Cao Ping Wang Shirong Chen Ke Zhao Yiran Liu Yue Yuan Dou Niu Xiaobo Yu Gan Wang Changli Wang Yan Lin Fan Zhang Hefang Wu Guodong Feng Yan Liu Jiefang Kang Yaping Yan Hai Zhang Xiaochang Xue Xun Jiang http://arxiv.org/abs/2510.00512v1 Adaptive Data-Knowledge Alignment in Genetic Perturbation Prediction 2025-10-01T04:48:43Z

The transcriptional response to genetic perturbation reveals fundamental insights into complex cellular systems. While current approaches have made progress in predicting genetic perturbation responses, they provide limited biological understanding and cannot systematically refine existing knowledge. Overcoming these limitations requires an end-to-end integration of data-driven learning and existing knowledge. However, this integration is challenging due to inconsistencies between data and knowledge bases, such as noise, misannotation, and incompleteness. To address this challenge, we propose ALIGNED (Adaptive aLignment for Inconsistent Genetic kNowledgE and Data), a neuro-symbolic framework based on the Abductive Learning (ABL) paradigm. This end-to-end framework aligns neural and symbolic components and performs systematic knowledge refinement. We introduce a balanced consistency metric to evaluate the predictions' consistency against both data and knowledge. Our results show that ALIGNED outperforms state-of-the-art methods by achieving the highest balanced consistency, while also re-discovering biologically meaningful knowledge. Our work advances beyond existing methods to enable both the transparency and the evolution of mechanistic biological understanding.

2025-10-01T04:48:43Z Yuanfang Xiang Lun Ai http://arxiv.org/abs/2509.25417v1 Computational Drug Repurposing for Alzheimer's Disease via Sheaf Theoretic Population-Scale Analysis of snRNA-seq Data 2025-09-29T19:21:30Z

Single-cell and single-nucleus RNA sequencing (scRNA-seq /snRNA-seq) are widely used to reveal heterogeneity in cells, showing a growing potential for precision and personalized medicine. Nonetheless, sustainable drug discovery must be based on a population-level understanding of molecular mechanisms, which calls for the population-scale analysis of scRNA-seq/snRNA-seq data. This work introduces a sequential target-drug selection model for drug repurposing against Alzheimer's Disease (AD) targets inferred from population-level snRNA-seq studies of AD progression in microglia cells as well as different cell types taken from an AD affected brain vascular tissue atlas, involving hundreds of thousands of nuclei from multi-patient and multi-regional studies. We utilize Persistent Sheaf Laplacians (PSL) to facilitate a Protein-Protein Interaction (PPI) analysis of AD targets inferred from differential gene expression (DEG), and then use machine learning models to predict repurpose-able DrugBank compounds for molecular targeting. We screen the efficacy of different DrugBank small compounds and further examine their central nervous system (CNS)-relevant ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity), resulting in a list of lead candidates for AD treatment. The list of significant genes establishes a target domain for effective machine learning based AD drug repurposing analysis of DrugBank small compounds to treat AD related molecular targets.

2025-09-29T19:21:30Z Sean Cottrell Seungmin Yoon Xiaoqi Wei Alex Dickson Guo-Wei Wei http://arxiv.org/abs/2503.09605v4 QWENDY: Gene Regulatory Network Inference by Quadruple Covariance Matrices 2025-09-29T16:13:11Z

Knowing gene regulatory networks (GRNs) is important for understanding various biological mechanisms. In this paper, we present a method, QWENDY, that uses single-cell gene expression data measured at four time points to infer GRNs. Based on a linear gene expression model, it solves the transformation of the covariance matrices. Unlike its predecessor WENDY, QWENDY avoids solving a non-convex optimization problem and produces a unique solution. We test the performance of QWENDY on three experimental data sets and two synthetic data sets. Compared to previously tested methods on the same data sets, QWENDY ranks the first on experimental data, although it does not perform well on synthetic data.

2025-02-22T16:22:52Z Yue Wang Xueying Tian http://arxiv.org/abs/2509.01504v2 Rule-Based Gillespie Simulation of Chemical Systems 2025-09-29T12:42:03Z

The MØD computational framework implements rule-based generative chemistries as explicit transformations of graphs representing chemical structural formulae. Here, we expand MØD by a stochastic simulation module that simulates the time evolution of species concentrations using Gillespie's well-known stochastic simulation algorithm (SSA). This module distinguishes itself among competing implementations of rule-based stochastic simulation engines by its flexible network expansion mechanism and its functionality for defining custom reaction rate functions. It enables direct sampling from actual reactions instead of rules. We present methodology and implementation details followed by examples which demonstrate the capabilities of the stochastic simulation engine.

2025-09-01T14:23:59Z 20 pages, 7 figures Erika M. Herrera Machado Jakob L. Andersen Rolf Fagerberg Christoph Flamm Daniel Merkle Peter F. Stadler http://arxiv.org/abs/2407.01760v4 Understanding Multistationarity of Fully Open Reaction Networks 2025-09-29T09:16:38Z

This work addresses multistationarity of fully open reaction networks equipped with mass action kinetics. We improve upon the existing results relating existence of positive feedback loops in a reaction network and multistationarity; and we provide a novel deterministic operation to generate new non-multistationary networks. This is interesting because while there were many operations to create infinitely many new multistationary networks from a multistationary example, this is the first such operation for the non-multistationary counterpart. Such tools for the generation of example networks have a use-case in the application of data science to reaction network theory. We demonstrate this by using the new data, along with a novel graph representation of reaction networks that is unique up to a permutation on the name of species of the network, to train a graph attention neural network model to predict multistationarity of reaction networks. This is the first time machine learning tools are used for studying classification problems of reaction networks.

2024-07-01T19:50:38Z 36 pages, 4 Figures, 2 Tables, the dataset and code related to this manuscript is available at the Zenodo link given inside the paper Shenghao Yao AmirHosein Sadeghimanesh Matthew England 10.1007/s11538-025-01537-8 http://arxiv.org/abs/2509.23543v1 Contrastive Learning Enhances Language Model Based Cell Embeddings for Low-Sample Single Cell Transcriptomics 2025-09-28T00:45:39Z

Large language models (LLMs) have shown strong ability in generating rich representations across domains such as natural language processing and generation, computer vision, and multimodal learning. However, their application in biomedical data analysis remains nascent. Single-cell transcriptomic profiling is essential for dissecting cell subtype diversity in development and disease, but rare subtypes pose challenges for scaling laws. We present a computational framework that integrates single-cell RNA sequencing (scRNA-seq) with LLMs to derive knowledge-informed gene embeddings. Highly expressed genes for each cell are mapped to NCBI Gene descriptions and embedded using models such as text-embedding-ada-002, BioBERT, and SciBERT. Applied to retinal ganglion cells (RGCs), which differ in vulnerability to glaucoma-related neurodegeneration, this strategy improves subtype classification, highlights biologically significant features, and reveals pathways underlying selective neuronal vulnerability. More broadly, it illustrates how LLM-derived embeddings can augment biological analysis under data-limited conditions and lay the groundwork for future foundation models in single-cell biology.

2025-09-28T00:45:39Z 14 pages, 4 figures, 2 tables Luxuan Zhang Douglas Jiang Qinglong Wang Haoqi Sun Feng Tian http://arxiv.org/abs/2501.12233v2 When algebra twinks system biology: a conjecture on the structure of Gröbner bases in complex chemical reaction networks 2025-09-26T12:16:55Z

We address the challenge of identifying all real positive steady states in chemical reaction networks (CRNs) governed by mass-action kinetics. Traditional numerical methods often require specific initial guesses and may fail to find all the solutions in systems exhibiting multistability. Gröbner bases offer an algebraic framework that systematically transforms polynomial equations into simpler forms, facilitating comprehensive solution enumeration. In this work, we propose a conjecture that CRNs with at most pairwise interactions yield Gröbner bases possessing a near-"triangular" structure, under appropriate assumptions. We illustrate this phenomenon using examples from a gene regulatory network and the Wnt signaling pathway, where the Gröbner basis approach reliably captures all real positive solutions. Our computational experiments reveal the potential of Gröbner bases to overcome limitations of local numerical methods for finding the steady states of complex biological systems, making them a powerful tool for understanding dynamical processes across diverse biochemical models.

2025-01-21T15:56:01Z Paola Ferrari Sara Sommariva Michele Piana Federico Benvenuto Matteo Varbaro http://arxiv.org/abs/2509.20693v1 Learning to Align Molecules and Proteins: A Geometry-Aware Approach to Binding Affinity 2025-09-25T02:55:24Z

Accurate prediction of drug-target binding affinity can accelerate drug discovery by prioritizing promising compounds before costly wet-lab screening. While deep learning has advanced this task, most models fuse ligand and protein representations via simple concatenation and lack explicit geometric regularization, resulting in poor generalization across chemical space and time. We introduce FIRM-DTI, a lightweight framework that conditions molecular embeddings on protein embeddings through a feature-wise linear modulation (FiLM) layer and enforces metric structure with a triplet loss. An RBF regression head operating on embedding distances yields smooth, interpretable affinity predictions. Despite its modest size, FIRM-DTI achieves state-of-the-art performance on the Therapeutics Data Commons DTI-DG benchmark, as demonstrated by an extensive ablation study and out-of-domain evaluation. Our results underscore the value of conditioning and metric learning for robust drug-target affinity prediction.

2025-09-25T02:55:24Z 10pages,2 figures Mohammadsaleh Refahi Bahrad A. Sokhansanj James R. Brown Gail Rosen