https://arxiv.org/api/lcgQD1RWNLx1ZuYAsIC6c8zebW8
2026-06-13T15:04:36Z
130164515

http://arxiv.org/abs/2605.30662v1
Spatio-temporal stochastic graph-based learning for infectious disease forecasting
2026-05-28T23:43:39Z
Spatio-temporal graph-based models have typically been used to forecast new cases of infectious diseases such as COVID-19 and chickenpox outbreaks. However, the integration of stochastic modelling into their learning process has been surprisingly under-investigated, and studies have rarely considered entire data sets of large countries. As a result, it is unknown whether these models would provide accurate forecasts in real-world disease spread scenarios. In this work, we propose a spatio-temporal stochastic graph-based architecture that integrates a stochastic formulation and uncertainty approximation process to forecast new infectious disease cases. We find that our approach can adapt to encode large and small population geographical networks within a single model architecture. Using two real-world data sets, COVID-19 in the US and chickenpox in Hungary, we report an enhanced effect of the proposed architecture across predictions of the 2022 first wave for COVID-19 in the US and comparative results of chickenpox waves during 2012-2014 in Hungary. By benchmarking with four spatio-temporal graph-based models, quantitative results show competitive overall weekly performance of the proposed approach on forecasting new cases for all 3,218 US counties and all 20 Hungary counties. The proposed approach can represent overall epidemic progression relative to baselines, though with a one-step delay, while exhibiting reduced sensitivity to high-frequency, low-amplitude variability.
2026-05-28T23:43:39Z
Preprint under review
Luz Stefani Sotomayor Valenzuela, Susanna Cramb, Darren Wraith

http://arxiv.org/abs/2605.30566v1
Participation Costs Narrow Democratic Cooperation
2026-05-28T20:59:21Z
Collective action often requires institutions that make cooperation individually worthwhile.
We ask whether democratic allocation of public-good return can transform a repeated public good into a self-sustaining cooperative institution, and how participation costs reshape that process. A simple evolutionary model shows that voted redistribution can support a prosocial allocation order, but can also sustain an antisocial allocation order or democratic free riding, in which individuals benefit from an institution maintained by others while avoiding the cost of participation. The model predicts competing effects of voting cost. Cost can suppress use of the institution to reward low contributors under strong selection, but can also thin the active electorate and erode contributor-rewarding support. We test these predictions in a preregistered online experiment with \NIncludedGroupsVone{} five-person groups. Endogenous democratic redistribution increased contributions relative to an equal-share public-goods control, with zero-cost voting producing the strongest temporal improvement. Voting costs did not mainly turn active voters toward low-contributor-rewarding allocation. Instead, they shifted behavior toward abstention and democratic free riding, made abstention locally rewarding, and widened the gap between post-task perceptions of democratic participation and the behavioral record. Democratic allocation can therefore stabilize cooperation, but participation costs can reduce the number of people actively sustaining the institution and can make that erosion less visible to participants themselves.
2026-05-28T20:59:21Z
32 pages, 6 figures
Mohammad Salahshour, Fjolle Shabani, Urs Fischbacher, Iain D. Couzin

http://arxiv.org/abs/2605.30109v1
Training Ecosystems: A Computational Approach to Uncovering Learning Behavior in Unconventional Contexts
2026-05-28T15:47:20Z
Recent progress in diverse intelligence has shown simple learning capacities below the organism level - single cells and even molecular networks.
However, there are still many knowledge gaps around learning capacity above the organism level, and about memory implemented purely by dynamical interactions without explicit memory media. We demonstrate that minimal ecological dynamics (in silico) are sufficient for several kinds of learning, assayed as changes in both the magnitude of the response and the recovery time. Systematic exploration of over 220,000 parameter combinations in a simulated classic predator-prey model revealed that, when perturbed by stimuli, recovery time exhibits habituation, sensitization, and a form of discrete number learning in a scale-invariant manner. Robustness analysis revealed that habituation and sensitization persist under stochastic perturbations, while discrete number learning is disrupted even at low noise levels. Dimensionality reduction revealed that the incidence of learning capacity is primarily determined by ecological interaction strengths. Clear, unique clustering patterns in parameter space allow high prediction accuracy for novel parameter combinations that enable learning. Response magnitude revealed a striking asymmetry: 90.6% of parameter combinations exhibited recovery time sensitization paired with habituation of response magnitude, while the opposite pattern was extremely rare. These findings highlight a set of phenomena at the intersection of ecology, basal cognition, and mathematics with many implications for a wide range of systems describable by similar kinds of equations.
These properties provide numerous efforts in biology and engineering with a substrate that has a considerable, pre-patterned propensity for learning, which ultimately arises from mathematics and does not depend on the details of physics or biology.
2026-05-28T15:47:20Z
26 pages, 14 figures
Adrita Samanta, Hananel Hazan, Michael Levin

http://arxiv.org/abs/2605.29958v1
Lattice Brownian bees with cooperative reproduction: steady states, collapse, and spreading
2026-05-28T14:01:10Z
We extend the ``Brownian bees'' model of Berestycki et al. (2021, 2022) to cooperative reproduction, $kA\to(k{+}1)A$, of a population of $N$ symmetric random walkers with removal, at each birth event, of the particle farthest from the origin. Working in the limit $N\to\infty$, we formulate a hydrodynamic free-boundary problem for this model. Using this formalism, we determine steady state population densities for all~$k$ and prove their linear stability for $k\le 2$ and instability for $k\ge 4$. In the marginal case $k=3$, there is a whole continuous family of steady states at a single, critical ratio of the reproduction and diffusion rates. Above criticality the population undergoes an asymptotically self-similar finite-time collapse to the origin. Below criticality the population spreads diffusively, but the reproduction remains quantitatively relevant. For $k\ge 4$, the unstable steady state separates regimes of a finite-time collapse and a diffusive spreading. Here the collapse dynamics is asymptotically self-similar, and the population density exhibits a scale separation requiring a matched-asymptotic description.
Our analytical predictions are confirmed by numerical solutions of the hydrodynamic free-boundary problem and by Monte Carlo simulations of the original microscopic model.
2026-05-28T14:01:10Z
23 one-column pages, 6 figures
Ohad Vilk, Baruch Meerson

http://arxiv.org/abs/2512.18652v2
Impact of temporary lockdown on disease extinction in assortative networks
2026-05-28T10:30:04Z
Changing environmental conditions can significantly affect the dynamics of disease spread. These changes may arise naturally or result from human interventions; in the latter case, lockdown measures that lead to abrupt but temporary reductions in transmission rates are used to combat disease spread. Yet, the impact of these measures on rare events in heterogeneous populations remains understudied. Here, we analyze the susceptible-infected-susceptible (SIS) model in a stochastic setting where disease extinction -- a sudden clearance of the infection -- occurs via a rare, large fluctuation. We use a semiclassical approximation and numerical simulations on heterogeneous assortative networks, with degree-degree correlations between neighboring nodes, to show how the extinction risk of the disease depends on the lockdown's duration and magnitude, and on the network topology.
2025-12-21T09:00:45Z
10 pages, 7 figures; to appear in Phys. Rev. E (2026)
Elad Korngut, Michael Assaf

http://arxiv.org/abs/2605.29736v1
Phylogenetic dynamics of MRCA ages and empirical moments of a Brownian trait
2026-05-28T10:29:21Z
We study the temporal dynamics of the first two empirical moments of Brownian traits on phylogenetic trees. For a fixed tree, we characterize the distributions of their empirical mean and empirical variance across all lineages extant at any given time. In particular, we show that the variance of the empirical mean and the expected empirical variance are piecewise linear between diversification events.
For lineage-homogeneous random trees, both the variance of the empirical mean and the expected empirical variance can be expressed in terms of the expected age of the most recent common ancestor (MRCA) of a uniformly sampled pair of extant lineages. In this representation, the expected MRCA age enters the two quantities with opposite signs, pointing to a structural opposition between the variance of the empirical mean and the expected empirical variance.
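The opposite signs described above can be checked by hand in the simplest setting. The following is a sketch under stated assumptions, not the paper's own derivation: a fixed ultrametric tree with $n$ lineages extant at time $t$ after the root, a fixed root trait value, Brownian variance rate $\sigma^2$, pairwise MRCA ages $\tau_{ij}$ measured backwards from time $t$, their average $\bar\tau$ over the $n(n-1)/2$ pairs, and the unbiased empirical variance $S^2$ (normalization $1/(n-1)$). All of these symbols are introduced here for illustration, and the paper's conventions may differ.

```latex
% Tip covariances under Brownian motion on an ultrametric tree of height t
\operatorname{Var}(X_i) = \sigma^2 t, \qquad
\operatorname{Cov}(X_i, X_j) = \sigma^2 \,(t - \tau_{ij}) \quad (i \neq j).

% Variance of the empirical mean: n diagonal terms plus n(n-1) off-diagonal terms
\operatorname{Var}(\bar X)
  = \frac{1}{n^2} \sum_{i,j} \operatorname{Cov}(X_i, X_j)
  = \frac{1}{n^2}\Bigl[\, n\,\sigma^2 t + n(n-1)\,\sigma^2\,(t - \bar\tau) \Bigr]
  = \sigma^2 \Bigl( t - \frac{n-1}{n}\,\bar\tau \Bigr).

% Expected empirical variance
\mathbb{E}\bigl[S^2\bigr]
  = \frac{n}{n-1}\Bigl( \sigma^2 t - \operatorname{Var}(\bar X) \Bigr)
  = \sigma^2\,\bar\tau .
```

In this setting the mean pairwise MRCA age enters $\operatorname{Var}(\bar X)$ with a negative sign and $\mathbb{E}[S^2]$ with a positive sign, which matches the structural opposition stated in the abstract.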
For generalized birth-death processes with time-dependent speciation and extinction rates, we derive an explicit formula for the distribution of the MRCA age of a uniformly sampled pair of extant lineages. This yields integral expressions, at any time, for both the variance of the empirical mean and the expected empirical variance. In the constant-rate birth-death case, we further obtain closed-form expressions for the expected empirical variance and describe its asymptotic behavior in the supercritical, critical and subcritical regimes.
2026-05-28T10:29:21Z
Gilles Didier

http://arxiv.org/abs/2605.30382v1
On the Connection Between Differential Population Growth Rate and Epidemic Reproduction Numbers
2026-05-28T02:08:57Z
During pandemics, public health agencies need to rapidly assess whether a new viral variant is more transmissible than existing lineages. For co-circulating variants, relative fitness can be expressed as a selective coefficient, as the differential population growth rate (DPGR) estimated from genomic surveillance, or, with additional assumptions, as a contrast in epidemic reproduction numbers $R_t$. We show that DPGR estimates a pairwise growth-rate difference. Under a specified generation-interval model, this difference can be transformed into reproduction-number space; in the equal-generation-time SIR special case, it reduces to a scaled difference in variant-specific $R_t$. Related growth-rate contrasts also appear in multinomial logistic and growth-advantage random-walk models, although those methods differ from DPGR in likelihood, smoothing, priors, and data inputs. We evaluate the theory across five SARS-CoV-2 and influenza analyses totaling more than 2,200 matched data points. SIR simulation recovers the expected mapping when the true $R_t$ is known, and retrospective SARS-CoV-2 analyses show sustained DPGR signals 43 to 65 days before variant dominance, with 95\% sign accuracy in our analysis.
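The equal-generation-time SIR special case mentioned above can be illustrated with a small sketch. It rests on the textbook SIR relation $r_i = \gamma (R_{t,i} - 1)$ for each variant, so that a growth-rate difference equals $\gamma$ times the difference in $R_t$. The least-squares slope of the log count ratio used here is a generic stand-in for a pairwise growth-rate difference and is not claimed to be the DPGR estimator from the paper. The function names, the synthetic counts, and the 5-day generation time are all assumptions made for illustration.

```python
import math

def log_ratio_slope(times, counts_a, counts_b):
    """Least-squares slope of log(counts_a / counts_b) against time.

    For two exponentially growing variants this slope equals the
    difference of their growth rates, r_a - r_b.
    """
    y = [math.log(a / b) for a, b in zip(counts_a, counts_b)]
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(y) / n
    sxy = sum((t - t_mean) * (v - y_mean) for t, v in zip(times, y))
    sxx = sum((t - t_mean) ** 2 for t in times)
    return sxy / sxx

def rt_difference_sir(growth_rate_diff, generation_time):
    """Map a growth-rate difference to an R_t difference.

    Assumes SIR dynamics with the same mean generation time 1/gamma for
    both variants, where r_i = gamma * (R_i - 1), hence
    R_a - R_b = (r_a - r_b) * generation_time.
    """
    return growth_rate_diff * generation_time

# Synthetic example: variant A grows at 0.10/day, variant B at 0.04/day.
days = list(range(10))
variant_a = [100.0 * math.exp(0.10 * t) for t in days]
variant_b = [100.0 * math.exp(0.04 * t) for t in days]

slope = log_ratio_slope(days, variant_a, variant_b)   # 0.06 per day by construction
delta_rt = rt_difference_sir(slope, generation_time=5.0)
print(f"growth-rate difference: {slope:.4f} per day, R_t difference: {delta_rt:.3f}")
```

With unequal generation intervals the mapping is no longer a simple rescaling, which is why the abstract restricts this form to the equal-generation-time case.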
DPGR is approximately transitive across lineage triplets, near zero for selected functionally similar sublineages, and directionally consistent across countries. These results connect sequence-count-based fitness estimates to reproduction-number contrasts through an assumption-explicit growth-rate bridge.
2026-05-28T02:08:57Z
23 pages, 5 figures
Hong Qin

http://arxiv.org/abs/2604.01187v3
Competition at the front of expanding populations
2026-05-28T01:23:53Z
When competing species grow into new territory, the population is dominated by descendants of successful ancestors at the expansion front. Successful ancestry depends on both reproductive advantage (fitness) and the ability and opportunity to colonize new domains. We present a model that integrates both elements by coupling the classic description of one-dimensional competition (Fisher equation) to the minimal model of front shape (KPZ equation). Macroscopic manifestations of these equations are distinct growth morphologies controlled by expansion rates, competitive abilities, or spatial anisotropy. In some cases the ability to expand in space may overcome reproductive advantage in colonizing new territory. When new traits appear with accumulating mutations, we find that variations in fitness in range expansion may be described by the Tracy--Widom distribution.
2026-04-01T17:35:04Z
17 pages, 8 figures
Sergio Eraso, Mehran Kardar

http://arxiv.org/abs/2605.28976v1
On a phenotype-structured Shigesada--Kawasaki--Teramoto model: Turing instability and pattern selection under fast phenotype switching
2026-05-27T18:28:00Z
The Shigesada-Kawasaki-Teramoto (SKT) model has become a classical modelling framework for studying spatial segregation and cross-diffusion-driven pattern formation in competing populations. This model assumes phenotypic homogeneity, but phenotypic variability persists within any population and can strongly influence both ecological and evolutionary dynamics.
In this paper, we present a generalised phenotype-structured formulation of the SKT model that accounts for phenotypic variability. In this formulation, the competing populations are continuously structured across some phenotype state spaces. Population members move and compete in phenotype-dependent ways, and can also switch between different phenotype states. First we show how a form of the classical SKT model, wherein parameters are written in terms of continuous weighted averages of the phenotype-dependent functions of the generalised structured model, with weights given by the phenotype distributions of the two populations, can be obtained in the quasi-invariant regime of fast phenotype switching. Then, still assuming fast phenotype switching and extending classical Turing-like linear and weakly nonlinear analyses, we explore the conditions for the emergence of spatial patterns, identify a Turing-type bifurcation threshold leading to pattern formation, and investigate the nature of such a bifurcation (super- or sub-critical) as well as the stability of the patterned state. The results obtained make it possible to draw connections between phenotype-dependent model functions and the emergence of population-scale aggregate spatial dynamics, showing in particular how phenotype distributions can act as effective control parameters for Turing instability and pattern selection. 
These findings are complemented by numerical simulations, which validate the formal asymptotics and confirm the predictions of the pattern formation analyses.
2026-05-27T18:28:00Z
24 pages, 6 figures
Davide Cusseddu, Gaetana Gambino, Tommaso Lorenzi

http://arxiv.org/abs/2605.28652v1
Widespread quasi-steady state assumption in biological interaction modeling mischaracterizes system transitions
2026-05-27T15:53:40Z
From molecular and cellular to ecological systems, the modeling of biological processes often rests on the assumption that fast components immediately reach equilibrium at each moment (quasi-steady state) and only slow components govern the relevant system dynamics. This quasi-steady state approximation (QSSA) simplifies the modeling but discards the effects of the relaxation towards each quasi-steady state. The QSSA's suitability remains unclear around the transition point, a specific condition where the system changes to a qualitatively different state. In this regard, we derived a theoretical framework for the near-transition dynamics of biological systems, explicitly considering the relaxation processes overlooked by the QSSA. Numerical simulations verify our predictions for cellular decision-making, metabolic oscillations, and ecological cycles. Despite the extreme slowdown near the transition point, the QSSA alone misestimates the duration of the transition from one state to another. Moreover, the QSSA erroneously predicts the transition point itself for the onset of oscillations, while the relaxation dynamics facilitates or suppresses the oscillation onset with a counterintuitive time-delay effect. Common feedback interactions between biological components are pivotal to those relaxation effects.
Our study provides an analytical foundation to understand the rich transient or rhythmic dynamics of interacting biological components near the transitions.
2026-05-27T15:53:40Z
Main manuscript and supplementary information provided
Pan-Jun Kim

http://arxiv.org/abs/2605.28545v1
PhyloFrame: A DataFrame-based Library for Fast, Flexible Phylogenetic Computation
2026-05-27T14:36:33Z
PhyloFrame is a Python library for phylogenetic computation targeting the gap between specialist, compiler-optimized operations and flexible, script-based workflows -- with emphasis on fast, memory-efficient operations for very large tree sizes (e.g., $\geq$ 300,000 taxa). PhyloFrame is built around a DataFrame-based tree representation, where each row corresponds to a node and columns record ancestor relationships, branch lengths, taxon labels, and any user-defined attributes. Crucial for scalability, such array-backed storage allows both library and end-user code alike to seamlessly harness Just-in-Time (JIT) compilation (e.g., Numba) and vectorized execution (e.g., NumPy, Polars). At large tree sizes, performance generally matches or exceeds Python libraries backed by native code -- notably, achieving strong performance in topological-order traversals and Newick I/O.
DataFrame-based representation affords several additional conveniences, including:
- succinct bulk operations (e.g., NumPy);
- powerful queries and transformations (e.g., Polars expressions, Pandas indexing, SQL-style joins and merges);
- compatibility with modern tabular data formats that are compression-friendly, type-aware, nullable, and highly portable (e.g., Parquet); and
- broad interoperation with table-oriented data science tools (e.g., Seaborn, Plotly, Vega-Altair, tidyverse, Excel).
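The row-per-node layout described above can be sketched without any library. The example below is a minimal illustration and does not use or reproduce PhyloFrame's API: the table is a plain dict of equal-length lists standing in for DataFrame columns, and the column names (`id`, `ancestor_id`, `branch_length`, `taxon_label`), the convention that the root lists itself as its own ancestor, and the helper functions are all assumptions made for this sketch.

```python
# Columnar tree table: position k in every list describes node k.
# Hypothetical layout: ids equal row positions, ancestors appear before
# their descendants, and the root lists itself as its own ancestor.
tree = {
    "id":            [0, 1, 2, 3, 4],
    "ancestor_id":   [0, 0, 0, 1, 1],
    "branch_length": [0.0, 1.0, 2.5, 0.5, 0.7],
    "taxon_label":   [None, None, "C", "A", "B"],
}

def root_distances(table):
    """Root-to-node path length for every node, in one forward pass.

    Works because each ancestor's row precedes its descendants' rows.
    """
    dist = [0.0] * len(table["id"])
    for node, anc in zip(table["id"], table["ancestor_id"]):
        if anc != node:  # the root has no incoming branch
            dist[node] = dist[anc] + table["branch_length"][node]
    return dist

def leaf_counts(table):
    """Number of leaves below (or at) every node, in one reverse pass."""
    n = len(table["id"])
    has_child = [False] * n
    for node, anc in zip(table["id"], table["ancestor_id"]):
        if anc != node:
            has_child[anc] = True
    counts = [0 if has_child[k] else 1 for k in range(n)]
    for node in reversed(table["id"]):
        anc = table["ancestor_id"][node]
        if anc != node:
            counts[anc] += counts[node]
    return counts

distances = root_distances(tree)
leaves = leaf_counts(tree)
print(distances)
print(leaves)
```

Because every quantity lives in a flat column, the same passes could be written over NumPy arrays or DataFrame columns, which is the kind of array-backed access the abstract credits for compatibility with JIT compilation and vectorized execution.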
Current library features include tree input/output, synthetic tree generation, taxon-based queries, tree traversals, tree metrics, tree manipulation, tree downsampling, and tree comparison. Most functionality supports both Pandas and Polars DataFrames, and is available through programmatic and CLI-based interfaces.
2026-05-27T14:36:33Z
Matthew Andres Moreno, Jeet Sukumaran, Luis Zaman, Emily Dolson

http://arxiv.org/abs/2603.17754v2
Slow evolution towards generalism in a model of variable dietary range
2026-05-27T07:31:37Z
Species sharing a habitat will co-evolve to make use of the available resources, as consumption is modulated by competition and negative feedback loops between consumers and resources. The dietary range of a given species determines the resources it has access to and thus the other species with which it competes. A narrow dietary range avoids competition at the cost of over-reliance on a small selection of resources; conversely a wide dietary range provides more alternatives but also more chance of competition with other species. Here, we investigate the evolution of dietary range within a mathematical model of niche formation. We find highly path dependent co-evolution dynamics characterised by long-lived quasi-stable states. Ultimately, stochastic effects drive the evolution of generalist diets, as we uncover in our analysis and simulations.
2026-03-18T14:16:49Z
Elliot M. Butterworth, Tim Rogers

http://arxiv.org/abs/2605.27677v1
ESL-PSC Toolkit: a graphical software environment for linking shared genetic changes to convergent phenotypes
2026-05-26T20:52:29Z
Convergent evolution provides a useful framework for testing whether independent origins of similar traits share common genetic mechanisms. Evolutionary Sparse Learning with Paired Species Contrast (ESL-PSC) is an approach to identify genes and sites associated with convergent traits from aligned sequences by fitting sparse predictive models to phylogenetically informed species contrasts.
However, practical use of ESL-PSC currently requires substantial command-line fluency for data assembly, species-pair design, execution, and output interpretation. Here we present an integrated ESL-PSC analysis environment (ESL-PSC Toolkit) centered on a graphical user interface (GUI). ESL-PSC Toolkit is designed to assist users from experimental design through data interpretation without requiring extensive technical expertise. It supports guided input validation, interactive tree-based pair selection, command preview, live execution, post-run exploration of ranked genes and aligned sites, a complementary substitution-counting method, and analysis of continuous quantitative convergent traits. The computational backend has been reimplemented in Rust with many performance optimizations and parallelism, greatly reducing runtime for most analyses and enabling cross-platform packaged distributions. Downloadable GUI and CLI toolkit software packages for Mac, Windows, and Linux are available at https://github.com/John-Allard/ESL-PSC/releases/latest.
2026-05-26T20:52:29Z
John B. Allard, Sudhir Kumar

http://arxiv.org/abs/2602.18982v4
Conditionally Site-Independent Neural Evolution of Antibody Sequences
2026-05-26T18:29:18Z
Common deep learning approaches for antibody engineering focus on modeling the marginal distribution of sequences. By treating sequences as independent samples, however, these methods overlook affinity maturation as a rich and largely untapped source of information about the evolutionary process by which antibodies explore the underlying fitness landscape. In contrast, classical phylogenetic models explicitly represent evolutionary dynamics but lack the expressivity to capture complex epistatic interactions. We bridge this gap with CoSiNE, a continuous-time Markov chain parameterized by a deep neural network.
Mathematically, we prove that CoSiNE provides a first-order approximation to the intractable sequential point mutation process, capturing epistatic effects with an error bound that is quadratic in branch length. Empirically, CoSiNE outperforms state-of-the-art language models in zero-shot variant effect prediction by explicitly disentangling selection from context-dependent somatic hypermutation. Finally, we introduce Guided Gillespie, a classifier-guided sampling scheme that steers CoSiNE at inference time, enabling efficient optimization of antibody binding affinity toward specific antigens.
2026-02-21T23:23:30Z
28 pages, 15 figures. Accepted as a poster at ICML 2026
Stephen Zhewen Lu, Aakarsh Vermani, Kohei Sanno, Jiarui Lu, Frederick A Matsen, Milind Jagota, Yun S. Song

http://arxiv.org/abs/2502.19063v5
Global population crisis scenarios predicted by a general nonlinear dynamical model
2026-05-26T17:39:39Z
We show that a simple nonlinear differential equation (originally studied in the physics of disordered systems) is able to mathematically describe the global population growth over the past 12000 years. Different regimes of population growth since the early Neolithic until today are shown to be all solutions to the same nonlinear differential equation in its various limits. These also include the well-known Malthus (exponential) and Verhulst (logistic) growth regimes, as well as von Foerster's ``doomsday'' formula. All these limits correspond to neglecting higher-order terms in a more general nonlinear dynamic model described by the proposed nonlinear differential equation. While the older models may provide valid fits over limited time intervals of the global population growth curve, their clearly approximate nature prevents them from being predictive over longer periods of time. The comprehensive solution of the proposed model is instead well suited to provide predictions for future scenarios.
These include a scenario where the global population could halve as early as 2064 under a deliberately conservative, worst-case assumption that carrying-capacity constraints become abruptly active today.
2025-02-26T11:41:45Z
Chaos, Solitons & Fractals 209, 118542 (2026)
Alessio Zaccone, Kostya Trachenko
10.1016/j.chaos.2026.118542
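The three classical growth regimes named in the last abstract above have standard closed-form solutions, sketched here for reference. This is not the paper's general nonlinear equation, which the abstract does not state. The hyperbolic law is written in its pure $1/(t_d - t)$ form as a simplified stand-in for a von Foerster-type "doomsday" formula, and every function name and parameter value is an illustrative assumption.

```python
import math

def malthus(t, n0, r):
    """Exponential growth: solution of dN/dt = r * N with N(0) = n0."""
    return n0 * math.exp(r * t)

def verhulst(t, n0, r, k):
    """Logistic growth: solution of dN/dt = r * N * (1 - N / k) with N(0) = n0."""
    return k / (1.0 + (k / n0 - 1.0) * math.exp(-r * t))

def hyperbolic(t, n0, t_d):
    """Hyperbolic growth: solution of dN/dt = N**2 / (n0 * t_d) with N(0) = n0.

    The solution diverges as t approaches the finite "doomsday" time t_d.
    """
    if t >= t_d:
        raise ValueError("hyperbolic solution is only defined for t < t_d")
    return n0 * t_d / (t_d - t)

# Illustrative parameters only (arbitrary units, not fitted to any data).
for t in (0.0, 25.0, 50.0, 75.0):
    print(
        f"t={t:5.1f}  "
        f"malthus={malthus(t, 1.0, 0.03):8.3f}  "
        f"verhulst={verhulst(t, 1.0, 0.03, 5.0):8.3f}  "
        f"hyperbolic={hyperbolic(t, 1.0, 100.0):8.3f}"
    )
```

The three laws differ mainly in their long-time behavior: unbounded exponential growth, saturation at the carrying capacity `k`, and divergence at the finite time `t_d`.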