https://arxiv.org/api/VUhFD4WKW3IhTqtU1tM7LnpUNSY2026-03-26T12:56:58Z242516515http://arxiv.org/abs/2505.11712v1Mathematical models for the EP2 and EP4 signaling pathways and their crosstalk2025-05-16T21:46:40ZThe G-protein coupled receptors EP2 and EP4 both transduce the signal of the lipid messenger Prostaglandin E2 (PGE2). Changes in the cell response to PGE2 can have important effects on immunity and development of diseases, however, a thorough understanding of the EP2-EP4 receptors' signaling pathways is lacking. Experimental data show that receptor activity (indicated by cAMP expression) has a different kinetics depending on which receptor is triggered by PGE2 and that crosstalk exists between EP2 and EP4. To better understand the underlying mechanisms and be able to predict cell responses to PGE2, we develop novel mathematical models for the cAMP signaling pathways of EP2 and EP4 and their crosstalk. Ligand binding dynamics plays a crucial role for both, the single receptor activity and their crosstalk. The mathematical models can predict the qualitative cAMP levels observed experimentally and provide possible explanations for the differences and commonalities in the signaling behavior of EP2 and EP4. As inhibition of PGE2 signaling is gaining increasing attention in tumor immunology, these mathematical models could contribute to design effective anti-tumor therapies targeting EP2 and EP4.2025-05-16T21:46:40ZAlessandra CambiDiane S LidkeMariya PtashnykWillemijn SmitStefanie Sonnerhttp://arxiv.org/abs/2505.08443v1A nonlocal-to-local approach to aggregation-diffusion equations2025-05-13T11:09:43ZOver the past decades, nonlocal models have been widely used to describe aggregation phenomena in biology, physics, engineering, and the social sciences. These are often derived as mean-field limits of attraction-repulsion agent-based models, and consist of systems of nonlocal partial differential equations. Using differential adhesion between cells as a biological case study, we introduce a novel local model of aggregation-diffusion phenomena. This system of local aggregation-diffusion equations is fourth-order, resembling thin-film or Cahn-Hilliard type equations. In this framework, cell sorting phenomena are explained through relative surface tensions between distinct cell types. The local model emerges as a limiting case of short-range interactions, providing a significant simplification of earlier nonlocal models, while preserving the same phenomenology. This simplification makes the model easier to implement numerically and more amenable to calibration to quantitative data. Additionally, we discuss recent analytical results based on the gradient-flow structure of the model, along with open problems and future research directions.2025-05-13T11:09:43ZFalco, C., Baker, R. E., & Carrillo, J. A. (2025). A Nonlocal-to-Local Approach to Aggregation-Diffusion Equations. SIAM Review, 67(2), 353-372Carles FalcóRuth E. BakerJosé A. Carrillo10.1137/25M1726248http://arxiv.org/abs/2505.06067v1Oncolytic mechanisms and immunotherapeutic potential of Newcastle disease virus in cancer therapy2025-05-09T14:03:41ZNewcastle Disease Virus (NDV), classified as Avian orthoavulavirus 1 (avian paramyxovirus type 1), is a promising oncolytic agent that selectively targets and destroys cancer cells while sparing normal tissues. Its oncoselectivity exploits cancer-specific defects in antiviral defenses, particularly impaired Type I interferon signaling, and dysregulated apoptotic pathways, enabling robust viral replication and cytotoxicity in malignancies such as breast, colorectal, and melanoma. NDV induces intrinsic and extrinsic apoptosis through caspase activation and triggers immunogenic cell death via damage-associated molecular patterns, stimulating potent antitumours immune responses. Additionally, NDVs potential as a vaccine vector, expressing tumours-associated antigens, offers prospects for prophylactic and therapeutic cancer applications. This review provides a comprehensive analysis of NDVs morphology, classification, and molecular biology, focusing on its viral entry and replication mechanisms in host cells. It explores NDVs interactions with cancer cells, emphasizing its ability to induce cytotoxicity and immune activation. Understanding these mechanisms is critical for optimizing NDVs oncolytic potential and advancing its clinical translation. Future directions include enhancing NDV through genetic engineering, combining it with therapies like immune checkpoint inhibitors, and developing personalized medicine approaches tailored to tumours genomic profiles. These advancements position NDV as a versatile therapeutic agent in oncolytic virotherapy.2025-05-09T14:03:41ZUmar AhmadSurializa HarunMoussa Moise DiagneSyahril AbdullahKhatijah YusoffAbhi Veerakumarasivamhttp://arxiv.org/abs/2505.07865v1CellVerse: Do Large Language Models Really Understand Cell Biology?2025-05-09T06:47:23ZRecent studies have demonstrated the feasibility of modeling single-cell data as natural languages and the potential of leveraging powerful large language models (LLMs) for understanding cell biology. However, a comprehensive evaluation of LLMs' performance on language-driven single-cell analysis tasks still remains unexplored. Motivated by this challenge, we introduce CellVerse, a unified language-centric question-answering benchmark that integrates four types of single-cell multi-omics data and encompasses three hierarchical levels of single-cell analysis tasks: cell type annotation (cell-level), drug response prediction (drug-level), and perturbation analysis (gene-level). Going beyond this, we systematically evaluate the performance across 14 open-source and closed-source LLMs ranging from 160M to 671B on CellVerse. Remarkably, the experimental results reveal: (1) Existing specialist models (C2S-Pythia) fail to make reasonable decisions across all sub-tasks within CellVerse, while generalist models such as Qwen, Llama, GPT, and DeepSeek family models exhibit preliminary understanding capabilities within the realm of cell biology. (2) The performance of current LLMs falls short of expectations and has substantial room for improvement. Notably, in the widely studied drug response prediction task, none of the evaluated LLMs demonstrate significant performance improvement over random guessing. CellVerse offers the first large-scale empirical demonstration that significant challenges still remain in applying LLMs to cell biology. By introducing CellVerse, we lay the foundation for advancing cell biology through natural languages and hope this paradigm could facilitate next-generation single-cell analysis.2025-05-09T06:47:23ZFan ZhangTianyu LiuZhihong ZhuHao WuHaixin WangDonghao ZhouYefeng ZhengKun WangXian WuPheng-Ann Henghttp://arxiv.org/abs/2405.12258v3Scientific Hypothesis Generation by a Large Language Model: Laboratory Validation in Breast Cancer Treatment2025-05-08T09:15:15ZLarge language models LLMs have transformed AI and achieved breakthrough performance on a wide range of tasks In science the most interesting application of LLMs is for hypothesis formation A feature of LLMs which results from their probabilistic structure is that the output text is not necessarily a valid inference from the training text These are termed hallucinations and are harmful in many applications In science some hallucinations may be useful novel hypotheses whose validity may be tested by laboratory experiments Here we experimentally test the application of LLMs as a source of scientific hypotheses using the domain of breast cancer treatment We applied the LLM GPT4 to hypothesize novel synergistic pairs of FDA-approved noncancer drugs that target the MCF7 breast cancer cell line relative to the nontumorigenic breast cell line MCF10A In the first round of laboratory experiments GPT4 succeeded in discovering three drug combinations out of twelve tested with synergy scores above the positive controls GPT4 then generated new combinations based on its initial results this generated three more combinations with positive synergy scores out of four tested We conclude that LLMs are a valuable source of scientific hypotheses.2024-05-20T11:40:23Z12 pages, 6 tables, 1 figure. Supplementary information availableAbbi Abdel-RehimHector ZenilOghenejokpeme OrhoborMarie FisherRoss J. CollinsElizabeth BourneGareth W. FearnleyEmma TateHolly X. SmithLarisa N. SoldatovaRoss D. Kinghttp://arxiv.org/abs/2212.12049v2How different are self and nonself?2025-04-25T22:26:44ZBiological and artificial networks routinely make reliable distinctions between similar inputs, and the rules for making these distinctions are learned. In some ways, self/nonself discrimination in the immune system is similar, being both reliable and (partly) learned through thymic selection. In contrast to other examples, we show that the distributions of self and nonself peptides are nearly identical but strongly inhomogeneous. Reliable discrimination is possible only because self-peptides are a particular finite sample drawn out of this distribution, and T cells can target the spaces in between these samples. In conventional learning problems, this would constitute overfitting and lead to disaster. Here, the strong inhomogeneities imply instead that the immune system gains by targeting peptides which are similar to self, with maximum sensitivity for sequences just one or two substitutions away. This prediction from the structure of the underlying distribution in sequence space agrees, for example, with the observed responses to mutation derived cancer neoantigens.2022-12-22T21:54:54ZAndreas MayerJonathan A. LevineChristopher J. RussoQuentin MarcouWilliam BialekBenjamin D. Greenbaumhttp://arxiv.org/abs/2504.14724v2Parametrization of microbial survival models under UVC exposure2025-04-24T22:35:13ZThis study aims to identify and parameterize the optimal survival curves for 33 fundamental microorganisms subject to UVC exposure through experimental measurements. We compile published data on UVC doses and corresponding survival fractions for these microorganisms to estimate parameters for four prominent survival models: Single-target (ST), Multi-target (MT), Linear Quadratic (LQ), and Two-Stage Decay (TSD). The best-fitting model for each microorganism is determined by selecting the one with the lowest mean squared error (MSE) compared to the experimental data. Our analysis indicates that the MT model is the most frequently appropriate, accurately fitting 21 of the 33 microorganisms. The TSD model is the best fit for only three, while the LQ model, though occasionally suitable at lower doses, is often excluded due to unreliable performance at higher doses. The assessed models, particularly the MT model, demonstrate strong predictive capabilities for UVC surface sterilization of microorganisms. However, caution is warranted with the LQ model at higher doses due to its potential limitations.2025-04-20T19:37:11ZAikaterini A. TsantariKonstantinos K. DelibasisHarilaos G. SandalidisNestor D. Chatzidiamantishttp://arxiv.org/abs/2504.17694v1Using mathematical models of heart cells to assess the safety of new pharmaceutical drugs2025-04-24T16:03:06ZMany drugs have been withdrawn from the market worldwide, at a cost of billions of dollars, because of patient fatalities due to them unexpectedly disturbing heart rhythm. Even drugs for ailments as mild as hay fever have been withdrawn due to an unacceptable increase in risk of these heart rhythm disturbances. Consequently, the whole pharmaceutical industry expends a huge effort in checking all new drugs for any unwanted side effects on the heart. The predominant root cause has been identified as drug molecules blocking ionic current flows in the heart. Block of individual types of ionic currents can now be measured experimentally at an early stage of drug development, and this is the standard screening approach for a number of ion currents in many large pharmaceutical companies. However, clinical risk is a complex function of the degree of block of many different types of cardiac ion currents, and this is difficult to understand by looking at results of these screens independently. By using ordinary differential equation models for the electrical activity of heart cells (electrophysiology models) we can integrate information from different types of currents, to predict the effect on whole heart cells and subsequent risk of side effects. The resulting simulations can provide a more accurate summary of the risk of a drug earlier in development and hence more cheaply than the pre-existing approaches.2025-04-24T16:03:06Z7 pages, 2 figuresMirams, G.R. (2025). Using Mathematical Models of Heart Cells to Assess the Safety of New Pharmaceutical Drugs. In: Aston, P.J. (eds) More UK Success Stories in Industrial Mathematics. Mathematics in Industry, vol 42. Springer, ChamGary R. Mirams10.1007/978-3-031-48683-8_22http://arxiv.org/abs/2504.14887v1Quantitative Analysis of Cell Membrane Tension in Time-Series Imaging and A Minimal Lattice Model of Single Cell Motion2025-04-21T06:30:29ZCell membrane tension directly influences various cellular functions. In this study, we developed a method to estimate surface tension from time-series data. We obtained the curvature-velocity relationship from time-series of binarized cell shape images, and the effective surface tension term was calculated from linear regression.
During the process, we observed an S-shaped pattern in the curvature-velocity relationship. To understand the dynamics, we constructed a minimal lattice model describing single-cell motion. The model consists of surface tension and protrusion formation, and the characteristic parameters are obtained from experimental observations. We found that similar patterns emerged in the curvature-velocity relationship.2025-04-21T06:30:29ZHiroki NishitaniTakashi Miurahttp://arxiv.org/abs/2410.03395v2Local Clustering and Global Spreading of Receptors for Optimal Spatial Gradient Sensing2025-04-16T08:50:18ZSpatial information from cell-surface receptors is crucial for processes that require signal processing and sensing of the environment. Here, we investigate the optimal placement of such receptors through a theoretical model that minimizes uncertainty in gradient estimation. Without requiring a priori knowledge of the physical limits of sensing or biochemical processes, we reproduce the emergence of clusters that closely resemble those observed in real cells. On perfect spherical surfaces, optimally placed receptors spread uniformly. When perturbations break their symmetry, receptors cluster in regions of high curvature, massively reducing estimation uncertainty. This agrees with mechanistic models that minimize elastic preference discrepancies between receptors and cell membranes. We further extend our model to motile receptors responding to cell-shape changes and external fluid flow, demonstrating the relevance of our model in realistic scenarios. Our findings provide a simple and utilitarian explanation for receptor clustering at high-curvature regions when high sensing accuracy is paramount.2024-10-04T12:58:31ZThis version has been accepted for publication in Physical Review Letters. The final version is available at https://doi.org/10.1103/PhysRevLett.134.158401. Title has changedPhys. Rev. Lett. 134, 158401 (2025)Albert AlonsoRobert G. EndresJulius B. Kirkegaard10.1103/PhysRevLett.134.158401http://arxiv.org/abs/2504.08328v1Towards generalizable single-cell perturbation modeling via the Conditional Monge Gap2025-04-11T07:51:33ZLearning the response of single-cells to various treatments offers great potential to enable targeted therapies. In this context, neural optimal transport (OT) has emerged as a principled methodological framework because it inherently accommodates the challenges of unpaired data induced by cell destruction during data acquisition. However, most existing OT approaches are incapable of conditioning on different treatment contexts (e.g., time, drug treatment, drug dosage, or cell type) and we still lack methods that unanimously show promising generalization performance to unseen treatments. Here, we propose the Conditional Monge Gap which learns OT maps conditionally on arbitrary covariates. We demonstrate its value in predicting single-cell perturbation responses conditional to one or multiple drugs, a drug dosage, or combinations thereof. We find that our conditional models achieve results comparable and sometimes even superior to the condition-specific state-of-the-art on scRNA-seq as well as multiplexed protein imaging data. Notably, by aggregating data across conditions we perform cross-task learning which unlocks remarkable generalization abilities to unseen drugs or drug dosages, widely outperforming other conditional models in capturing heterogeneity (i.e., higher moments) in the perturbed population. Finally, by scaling to hundreds of conditions and testing on unseen drugs, we narrow the gap between structure-based and effect-based drug representations, suggesting a promising path to the successful prediction of perturbation effects for unseen treatments.2025-04-11T07:51:33ZMain text, 15 pages, 5 figures, 2 tablesAlice DriessenBenedek HarsanyiMarianna RapsomanikiJannis Bornhttp://arxiv.org/abs/2504.08164v1Information bounds on the accuracy of cell polarization2025-04-10T23:21:26ZHere we characterized an information measure for cell polarity that applies to non-motile cells responding to a chemical gradient. The central idea is that polarization represents information about the direction of the gradient. We applied a theory of optimal gradient sensing and response in the presence of external noise based on the information capacity of a Gaussian channel. First, we formulated an information framework that describes spatial gradient sensing and polarization response. As part of this section, we modeled ligand diffusion and receptor-binding dynamics as a mixed Poisson distribution, confirming the single receptor accuracy limits derived by ten Wolde and colleagues. Second, we performed numerical calculations of stochastic ligand levels at the cell surface to estimate the information provided about the directional component of the gradient vector, which was close to the Gaussian channel bound for low signal-to-noise ratios. Third, we used the information framework to evaluate the noise-robustness of three generic models of cell polarity, demonstrating that a filter-amplifier architecture and time integration can attenuate the detrimental impact of noise on polarity so that the model can approach the theoretical limits. Fourth, we compared the theory to published experimental data on yeast mating projection growth in a pheromone gradient, identifying the ligand association rate and integration time as two key parameters affecting directional accuracy. By varying these parameters, we showed that for certain ranges the theory is roughly in agreement with the data, and that the slow binding rate constant is a key limiting factor. We concluded that temporal averaging can help overcome the slow binding rate to achieve greater accuracy, but with the drawback of a slow mating response.2025-04-10T23:21:26ZTau-Mu Yihttp://arxiv.org/abs/2410.13629v2Phenotype structuring in collective cell migration:a tutorial of mathematical models and methods2025-04-10T12:47:06ZPopulations are heterogeneous, deviating in numerous ways. Phenotypic diversity refers to the range of traits or characteristics across a population, where for cells this could be the levels of signalling, movement and growth activity, etc. Clearly, the phenotypic distribution -- and how this changes over time and space -- could be a major determinant of population-level dynamics. For instance, across a cancerous population, variations in movement, growth, and ability to evade death may determine its growth trajectory and response to therapy. In this review, we discuss how classical partial differential equation (PDE) approaches for modelling cellular systems and collective cell migration can be extended to include phenotypic structuring. The resulting non-local models -- which we refer to as phenotype-structured partial integro-differential equations (PS-PIDEs) -- form a sophisticated class of models with rich dynamics. We set the scene through a brief history of structured population modelling, and then review the extension of several classic movement models -- including the Fisher-KPP and Keller-Segel equations -- into a PS-PIDE form. We proceed with a tutorial-style section on derivation, analysis, and simulation techniques. First, we show a method to formally derive these models from underlying agent-based models. Second, we recount travelling waves in PDE models of spatial spread dynamics and concentration phenomena in non-local PDE models of evolutionary dynamics, and combine the two to deduce phenotypic structuring across travelling waves in PS-PIDE models. Third, we discuss numerical methods to simulate PS-PIDEs, illustrating with a simple scheme based on the method of lines and noting the finer points of consideration. We conclude with a discussion of future modelling and mathematical challenges.2024-10-17T15:00:40ZTommaso LorenziKevin J PainterChiara Villahttp://arxiv.org/abs/2411.08327v3Cell size distributions in lineages2025-04-10T06:22:04ZCells actively regulate their size during the cell cycle to maintain volume homeostasis across generations. While various mathematical models of cell size regulation have been proposed to explain how this is achieved, relating these models to experimentally observed cell size distributions has proved challenging. In this paper we present a simple formula for the cell size distribution in lineages as observed in e.g. a mother machine, and provide a new derivation for the corresponding result in populations, assuming exponential cell growth. Our results are independent of the underlying cell size control mechanism and explain the characteristic shape underlying experimentally observed cell size distributions. We furthermore derive universal moment identities for these distributions, and show that our predictions agree well with experimental measurements of E. coli cells, both on the distribution and the moment level.2024-11-13T04:20:31Z8 pages, 2 figuresKaan ÖcalMichael P. H. Stumpfhttp://arxiv.org/abs/2411.12123v8Optimisation of neoadjuvant pembrolizumab therapy for locally advanced MSI-H/dMMR colorectal cancer using data-driven delay integro-differential equations2025-04-08T05:34:41ZColorectal cancer (CRC) poses a major public health challenge due to its increasing prevalence, particularly among younger populations. Microsatellite instability-high (MSI-H) CRC and deficient mismatch repair (dMMR) CRC constitute 15% of all CRC and exhibit remarkable responsiveness to immunotherapy, especially with PD-1 inhibitors. Despite this, there is a significant need to optimise immunotherapeutic regimens to maximise clinical efficacy and patient quality of life whilst minimising monetary costs. To address this, we employ a novel framework driven by delay integro-differential equations to model the interactions among cancer cells, immune cells, and immune checkpoints. Several of these components are being modelled deterministically for the first time in cancer, paving the way for a deeper understanding of the complex underlying immune dynamics. We consider two compartments: the tumour site and the tumour-draining lymph node, incorporating phenomena such as dendritic cell (DC) migration, T cell proliferation, and CD8+ T cell exhaustion and reinvigoration. Parameter values and initial conditions are derived from experimental data, integrating various pharmacokinetic, bioanalytical, and radiographic studies, along with deconvolution of bulk RNA-sequencing data from the TCGA COADREAD and GSE26571 datasets. We finally optimised neoadjuvant treatment with pembrolizumab, a widely used PD-1 inhibitor, to balance efficacy, efficiency, and toxicity in locally advanced MSI-H/dMMR CRC patients. We mechanistically analysed factors influencing treatment success and improved upon currently FDA-approved therapeutic regimens for metastatic MSI-H/dMMR CRC, demonstrating that a single medium-to-high dose of pembrolizumab may be sufficient for effective tumour eradication while being efficient, safe and practical.2024-11-18T23:26:35Z94 pages in total with 55 pages for the main body and 38 pages for the supporting information, 4 figures in the main body, 13 tables in the main body, 10 tables in the supporting information. Major edits and simplifications have been madeGeorgio HawiPeter S. KimPeter P. Lee10.1016/j.jtbi.2025.112231