https://arxiv.org/api/8BpMNFT+HA/KejU8J7JC61Tg8882026-04-01T10:12:05Z1283922515http://arxiv.org/abs/2601.19681v1Long-term evolution of regulatory DNA sequences. Part 1: Simulations on global, biophysically-realistic genotype-phenotype maps2026-01-27T14:59:49ZPromoters and enhancers are cis-regulatory elements (CREs), DNA sequences that bind transcription factor (TF) proteins to up- or down-regulate target genes. Decades-long efforts yielded TF-DNA interaction models that predict how strongly an individual TF binds arbitrary DNA sequences and how individual binding events on the CRE combine to affect gene expression. These insights can be synthesized into a global, biophysically-realistic, and quantitative genotype-phenotype (GP) map for gene regulation, a "holy grail" for the application of evolutionary theory. A global map provides a rare opportunity to simulate long-term evolution of regulatory sequences and pose several fundamental questions: How long does it take to evolve CREs de novo? How many non-trivial regulatory functions exist in sequence space? How connected are they? For which regulatory architecture is CRE evolution most rapid and evolvable? In this article, the first of a two-part series, we briefly review the pertinent modeling and simulation efforts for a unique system that enables close, quantitative, and mechanistic links between biophysics, as well as systems, synthetic, and evolutionary biology.2026-01-27T14:59:49ZInvited review (Part I of a two-part series), submitted to Current Opinion in Genetics & DevelopmentElia MascoloRéka BorbélySantiago Herrera-ÁlvarezCalin C GuetJustin CrockerGašper Tkačikhttp://arxiv.org/abs/2601.22177v1Emergent spatial organization of competing species under environmental stress and cooperation2026-01-27T12:02:51ZUnderstanding how species persist under interacting stressors is a central challenge in ecology. We develop a spatially explicit reaction-diffusion framework to investigate competing species in landscapes shaped by climate variability, pollution, resource heterogeneity, and cooperation. Here, temperature follows low-frequency oscillations, while pollution and resources diffuse from localized sources. Growth is governed by a dynamic carrying capacity integrating abiotic stress with an endogenous, pollution-sensitive cooperation field.
Numerical simulations reveal the spontaneous emergence of persistent spatial organization, including dominance segregation and stable competitive boundaries. Quantitative analyses-using boundary geometry, fractal dimension, and spatial entropy-demonstrate a transition from intermixed initial states to low-complexity, quasi-stationary configurations. Coexistence occurs through distinct strategies: one species occupies more area, while the other maintains higher local densities. Cooperation enhances resilience but collapses in polluted zones, creating heterogeneous "social buffering."
We further introduce a hybrid inverse modeling framework using a Swin Transformer to infer high-dimensional parameters from only two temporal snapshots. Trained on synthetic data, the model accurately recovers demographic, diffusive, and environmental-sensitivity parameters. While it achieves reliable short-term spatial predictions, long-term forecasts diverge due to the intrinsic sensitivity of nonlinear systems. This unified framework links sparse observations to mechanistic dynamics, advancing biodiversity forecasting under accelerating global change.2026-01-27T12:02:51ZTon Viet Tahttp://arxiv.org/abs/2601.19064v1LvD: A New Algorithm for Computing the Likelihood of a Phylogeny2026-01-27T01:02:09ZThere are few, if any, algorithms in statistical phylogenetics which are used more heavily than Felsenstein's 1973 pruning method for computing the likelihood of a tree. We present LvD, (Likelihood via Decomposition), an alternative to Felsenstein's algorithm based on a different decomposition of the underlying phylogeny. It works for all standard nucleotide models. The new algorithm allows updates of the likelihood calculation in worst case $O(\log n)$ time with $n$ taxa, as opposed to worst case $O(n)$ time for existing methods. In practice this leads to appreciable improvements in likelihood calculations, the extent of speed-up depending on how balanced or unbalanced the trees are. We explore implications for parallel computing, and show that the approach allows likelihoods to be computed in $O(\log n)$ parallel time per site, compared to (worst case) $O(n)$ time. We implemented and applied the algorithm to large numbers of simulated and empirical data sets and showed that these theoretical advances lead to a significant practical speed-up, although the extent of the improvement depends on how balanced the phylogenies already are.2026-01-27T01:02:09ZDavid BryantCeline ScornavaccaDavid Swoffordhttp://arxiv.org/abs/2601.13349v2Conservation priority mapping to prevent zoonotic spillovers2026-01-26T19:12:46ZDiseases originating from wildlife pose a significant threat to global health, causing human and economic losses each year. The transmission of disease from animals to humans occurs at the interface between humans, livestock, and wildlife reservoirs, influenced by abiotic factors and ecological mechanisms. Although evidence suggests that intact ecosystems can reduce transmission, disease prevention has largely been neglected in conservation efforts and remains underfunded compared to mitigation. A major constraint is the lack of reliable, spatially explicit information to guide efforts effectively. Given the increasing rate of new disease emergence, accelerated by climate change and biodiversity loss, identifying priority areas for mitigating the risk of disease transmission is more crucial than ever. We present new high-resolution (1 km) maps of priority areas for targeted ecological countermeasures aimed at reducing the likelihood of zoonotic spillover, along with a methodology adaptable to local contexts. Our study compiles data on well-documented risk factors, protection status, forest restoration potential, and opportunity cost of the land to map areas with high potential for cost-effective interventions. We identify low-cost priority areas across 50 countries, including 277,000 km2 where environmental restoration could mitigate the risk of zoonotic spillover and 198,000 km2 where preventing deforestation could do the same, 95% of which are not currently under protection. The resulting layers, covering tropical regions globally, are freely available alongside an interactive no-code platform that allows users to adjust parameters and identify priority areas at multiple scales. Ecological countermeasures can be a cost-effective strategy for reducing the emergence of new pathogens; however, our study highlights the extent to which current conservation efforts fall short of this goal.2026-01-19T19:38:00ZLeonardo ViottiLuis Diego HerreraGaro BatmanianFranck BertheRachael Kramphttp://arxiv.org/abs/2601.18703v1Chemotaxis-inspired PDE models of airborne infectious disease transmission: epidemiologically-motivated mathematical and numerical analyses2026-01-26T17:25:09ZPartial differential equation (PDE) models for infectious diseases, while less common than their ordinary differential equation (ODE) counterparts, have found successful applications for many years. Such models are typically of reaction-diffusion type, and model spatial propagation as a diffusive process. However, given the complex nature of human mobility, such models are limited in their ability to describe airborne infectious diseases in human populations. Recent work has advocated for the inclusion of an additional chemotaxis-type term as an alternative; spatial propagation of infection fronts is assumed additionally to flow from low-to-high concentrations of susceptible populations. The present work extends the study of such models by providing an epidemiologically interpretable analysis, directly connecting model behavior to information readily available to policymakers. In particular, we derive a spatially-aware basic reproduction number, which accounts for spatial heterogeneity in population density. Furthermore, we discuss several important aspects concerning the numerical solution of the model, including the introduction of a stabilization scheme. Finally, we perform a series of simulation studies in the Italian region of Lombardy (severely affected by the COVID-19 outbreak in 2020) and in the US state of Georgia, in which we demonstrate the model's potential to better capture important spatiotemporal dynamics observed in real-world data compared to pure reaction-diffusion models.2026-01-26T17:25:09ZAlex ViguerieMalú GraveAlvaro L. G. A. CoutinhoAlessandro VenezianiThomas J. R. Hugheshttp://arxiv.org/abs/2601.18214v1A model for a population of trees structured by phenological traits2026-01-26T07:00:41ZIn the context of global warming, tree populations rely on two primary mechanisms of adaptation: phenotypic plasticity, which enables individuals to adjust their behavior in response to environmental stress, and genetic evolution, driven by natural selection and genetic diversity within the population. Understanding the interplay between these mechanisms is crucial for assessing the impacts of climate change on forest ecosystems and for informing sustainable management strategies. In this manuscript, we focus on a specific phenological adaptation: the ability of trees to enter summer dormancy once a critical temperature threshold is exceeded. Individuals are characterized by this threshold temperature and by their seed production capacity. We first establish a detailed mathematical model describing the population dynamics under these traits, and progressively reduce it to a system of two coupled ordinary differential equations. This simpler macroscopic model is then analyzed numerically, to investigate how the population reacts to a shift in its environment: an temperature increase, a drop in precipitation levels, or a combination of the two. Our results highlight contrasting effects of water stress and temperature stress on population dynamics, as well as the ambivalent effect of the plasticity.2026-01-26T07:00:41ZSirine BoucennaUMR ISEMVasilis DakosUMR ISEMGaël RaoulCMAP, MERGEhttp://arxiv.org/abs/2601.15219v2A height-based metaconcept for rooted tree balance and its implications for the $B_1$ index2026-01-25T18:06:20ZTree balance has received considerable attention in recent years, both in phylogenetics and in other areas. Numerous (im)balance indices have been proposed to quantify the (im)balance of rooted trees. A recent comprehensive survey summarized this literature and showed that many existing indices are based on similar underlying principles. To unify these approaches, three general metaconcepts were introduced, providing a framework to classify, analyze, and extend imbalance indices. In this context, a metaconcept is a function $Φ_f$ that depends on another function $f$ capturing some aspect of tree shape. In this manuscript, we extend this line of research by introducing a new metaconcept based on the heights of the pending subtrees of all inner vertices. We provide a thorough analysis of this metaconcept and use it to answer open questions concerning the well-known $B_1$ balance index. In particular, we characterize the tree shapes that maximize the $B_1$ index in two cases: (i) arbitrary rooted trees and (ii) binary rooted trees. For both cases, we also determine the corresponding maximum values of the index.
Finally, while the $B_1$ index is induced by a so-called third-order metaconcept, we explicitly introduce three new (im)balance indices derived from the first- and second-order height metaconcepts, respectively, thereby demonstrating that pending subtree heights give rise to a variety of novel (im)balance indices.2026-01-21T17:49:51ZMareike FischerTom Niklas HamannKristina Wickehttp://arxiv.org/abs/2601.17763v1Tracking dynamics of superspreading through contacts, exposures, and transmissions in edge-based network epidemics2026-01-25T09:35:52ZInfectious disease superspreading caused by heterogeneity in contact behavior has been observed to be an important determinant of epidemic dynamics and size in both empirical and theoretical settings. However, it has also been observed that the importance of this type of superspreading changes throughout an epidemic, generally in a decreasing manner as infections cascade from individuals with many contacts to those with fewer contacts. We provide an exact mathematical formulation of this phenomenon in strongly-immunizing (SIR) epidemics on static contact networks. Building on the edge-based modeling framework, we construct three metrics to track how superspreading changes through the course of an epidemic, respectively measuring infected nodes' contacts, exposures, and transmissions: (1) the mean degree of infected nodes, (2) the mean number of susceptible neighbors of infected nodes, and (3) the mean number of secondary cases that will be caused by newly infected nodes. We prove results about the behaviors of these metrics, highlighting the fact that their peak times all occur at less than half the time it takes for population-level infection prevalence to peak. This suggests that the importance of superspreading will be low when an epidemic is already near its peak, so contact-based control strategies are best employed as early in an outbreak as possible. We discuss implications for accurately measuring epidemiological parameters from incidence, mobility, contact tracing, and transmission data.2026-01-25T09:35:52ZAri S. FreedmanBjarke F. NielsenMaximillian M. NguyenLaurent Hébert-DufresneSimon A. Levinhttp://arxiv.org/abs/2601.17590v1Travelling Waves in Wolbachia Spread Dynamics2026-01-24T21:01:43ZWolbachia, a maternally transmitted endosymbiont, offers a powerful biological control strategy for mosquito-borne diseases such as dengue, Zika, and malaria. We develop an integro-difference equation (IDE) model that integrates Wolbachia's nonlinear growth with spatially explicit mosquito dispersal kernels to study invasion dynamics in heterogeneous landscapes. Analytical results establish the existence and uniqueness of monotone traveling waves and provide explicit estimates of invasion speeds as functions of dispersal and growth parameters. Four kernels: Gaussian, Laplace, exponential square-root, and Cauchy, represent a continuum from short- to long-range movement. Fat-tailed kernels generate faster, broader wavefronts, while compact ones limit spread. We also identify a critical bubble, the minimal localized profile required for sustained invasion. Numerical simulations in one- and two-dimensional domains confirm theoretical predictions and reveal parameter regimes governing invasion success. This framework quantifies how dispersal mechanisms shape Wolbachia's spread, thus informing targeted and efficient vector-control strategies.2026-01-24T21:01:43ZZhuolin QuTong WuEddy Kwessihttp://arxiv.org/abs/2601.06272v3Crossing the Functional Desert: Cascade-Driven Assembly and Feasibility Transitions in Early Life2026-01-24T19:50:52ZThe origin of life poses a problem of combinatorial feasibility: How can temporally supported functional organization arise in exponentially branching assembly spaces when unguided exploration behaves as a memoryless random walk? We show that nonlinear threshold-cascade dynamics in connected interaction networks provide a minimal, substrate-agnostic mechanism that can soften this obstruction. Below a critical connectivity threshold, cascades die out locally and structured input-output response mappings remain sparse and transient-a "functional desert" in which accumulation is dynamically unsupported. Near the critical percolation threshold, system-spanning cascades emerge, enabling discriminative functional responses. We illustrate this transition using a minimal toy model and generalize the argument to arbitrary networked systems. Also near criticality, cascades introduce finite-timescale structural and functional coherence, directional bias, and weak dynamical path-dependence into otherwise memoryless exploration, allowing biased accumulation. This connectivity-driven transition-functional percolation-requires only generic ingredients: interacting units, nonlinear thresholds, influence transmission, and non-zero coherence times. The mechanism does not explain specific biochemical pathways, but it identifies a necessary dynamical regime in which structured functional organization can emerge and be temporarily supported, providing a physical foundation for how combinatorial feasibility barriers can be crossed through network dynamics alone.2026-01-09T19:37:36Z11 pages, 2 figuresGalen J. Wilkersonhttp://arxiv.org/abs/2601.17466v1$β$-diversity and Graph Sheaf Laplacians2026-01-24T13:46:31ZWe suggest a new approach to $β$-diversity in ecological systems, based on the energy of the graph sheaf Laplacian associated with the sample data. This scalar quantity is easily computable using methods of linear algebra. We show using simple examples that the energy is much more informative than the generally accepted definitions of $β$-diversity2026-01-24T13:46:31Z7 pagesPeter DavidsonMichael Grinfeldhttp://arxiv.org/abs/2502.19063v4Global population crisis scenarios predicted by the most general dynamic model2026-01-24T10:10:31ZWe show that a simple nonlinear differential equation (originally studied in the physics of disordered systems) is able to mathematically describe the global population growth over the past 12000 years. Different regimes of population growth since the early Neolithic until today are shown to be all solutions to the same nonlinear differential equation in its various limits. These also include the well-known Malthus (exponential) and Verhulst (logistic) growth regimes, as well as von Foerster's ``doomsday'' formula. All these limits correspond to neglecting higher-order terms in a more general nonlinear dynamic model described by the proposed nonlinear differential equation. While the older models may provide valid fittings to limited time intervals in the global population growth curve in time, their clearly approximate nature prevents them from being predictive over longer periods of time. The proposed comprehensive solution of the proposed model is instead well suited to provide predictions for future scenarios. These include halving of the global population as early as 2064 due to resource depletion, if the effect of the Earth's limited carrying capacity were to set in today.2025-02-26T11:41:45ZAlessio ZacconeKostya Trachenkohttp://arxiv.org/abs/2106.07292v4Ultrafast topological data analysis reveals pandemic-scale dynamics of convergent evolution2026-01-23T18:33:54ZGenome variants which re-occur independently across evolutionary lineages are key molecular signatures of adaptation. Inferring the dynamics of such genetic changes from pandemic-scale genomic datasets is now possible, which opens up unprecedented insight into evolutionary processes. However, existing approaches depend on the construction of accurate phylogenetic trees, which remains challenging at scale. Here we present EVOtRec, an organism-agnostic, fast and scalable Topological Data Analysis approach that enables the inference of convergently evolving genomic variants over time directly from topological patterns in the dataset, without requiring the construction of a phylogenetic tree. Using data from both simulations and published experiments, we show that EVOtRec can robustly identify variants under positive selection and performs orders of magnitude faster than state-of-the-art phylogeny-based approaches, with comparable results. We apply EVOtRec to three large viral genome datasets: SARS-CoV-2, influenza virus A subtype H5N1 and HIV-1. We identify key convergent genome variants and demonstrate how EVOtRec facilitates the real-time tracking of high fitness variants in large datasets with millions of genomes, including effects modulated by varying genomic backgrounds. We envision our Topological Data Analysis approach as a new framework for efficient comparative genomics.2021-06-14T10:38:40Zsubstantial revisionMichael BleherLukas HahnMaximilian NeumannZachary ArdernJuan Angel Patino-GalindoMathieu CarriereUlrich BauerRaul RabadanAndreas Otthttp://arxiv.org/abs/2601.18818v1LabelKAN -- Kolmogorov-Arnold Networks for Inter-Label Learning: Avian Community Learning2026-01-23T15:50:50ZGlobal biodiversity loss is accelerating, prompting international efforts such as the Kunming-Montreal Global Biodiversity Framework (GBF) and the United Nations Sustainable Development Goals to direct resources toward halting species declines. A key challenge in achieving this goal is having access to robust methodologies to understand where species occur and how they relate to each other within broader ecological communities. Recent deep learning-based advances in joint species distribution modeling have shown improved predictive performance, but effectively incorporating community-level learning, taking into account species-species relationships in addition to species-environment relationships, remains an outstanding challenge. We introduce LabelKAN, a novel framework based on Kolmogorov-Arnold Networks (KANs) to learn inter-label connections from predictions of each label. When modeling avian species distributions, LabelKAN achieves substantial gains in predictive performance across the vast majority of species. In particular, our method demonstrates strong improvements for rare and difficult-to-predict species, which are often the most important when setting biodiversity targets under frameworks like GBF. These performance gains also translate to more confident predictions of the species spatial patterns as well as more confident predictions of community structure. We illustrate how the LabelKAN leads to qualitative and quantitative improvements with a focused application on the Great Blue Heron, an emblematic species in freshwater ecosystems that has experienced significant population declines across the United States in recent years. Using the LabelKAN framework, we are able to identify communities and species in New York that will be most sensitive to further declines in Great Blue Heron populations.2026-01-23T15:50:50ZMarc GrimsonJoshua FanCourtney L. DavisDylan van BramerDaniel FinkCarla P. Gomeshttp://arxiv.org/abs/2508.09871v2Inference of germinal center evolutionary dynamics via simulation-based deep learning2026-01-23T12:56:18ZB cells and the antibodies they produce are vital to health and survival, motivating research on the details of the mutational and evolutionary processes in the germinal centers (GC) from which mature B cells arise. It is known that B cells with higher affinity for their cognate antigen (Ag) will, on average, tend to have more offspring. However the exact form of this relationship between affinity and fecundity, which we call the ``affinity-fitness response function'', is not known. Here we use deep learning and simulation-based inference to learn this function from a unique experiment that replays a particular combination of GC conditions many times. All code is freely available at https://github.com/matsengrp/gcdyn, while datasets and inference results can be found at https://doi.org/10.5281/zenodo.15022130.2025-08-13T15:09:45ZDuncan K RalphAthanasios G BakisJared GallowayAshni A VoraTatsuya ArakiGabriel D VictoraYun S SongWilliam S DeWittFrederick A Matsen10.7554/eLife.108880.1