https://arxiv.org/api/0qoX7qESNtd554pM7YW9m5cNp+w
2026-06-20T11:33:09Z
1302964515

http://arxiv.org/abs/2512.02312v1
Fast and Accurate Node-Age Estimation Under Fossil Calibration Uncertainty Using the Adjusted Pairwise Likelihood
2025-12-02T01:12:50Z
Estimating divergence times from molecular sequence data is central to reconstructing the evolutionary history of lineages. Although Bayesian relaxed-clock methods provide a principled framework for incorporating fossil information, their dependence on repeated evaluations of the full phylogenetic likelihood makes them computationally demanding for large genomic datasets. Furthermore, because disagreements in divergence-time estimates often arise from uncertainty or error in fossil placement and prior specification, there is a need for methods that are both computationally efficient and robust to fossil-calibration uncertainty. In this study, we introduce fast and accurate alternatives based on the phylogenetic pairwise composite likelihood, presenting two adjusted pairwise likelihood (APW) formulations that employ asymptotic moment-matching weights to better approximate the behavior of the full likelihood within a Bayesian MCMC framework. Extensive simulations across diverse fossil-calibration scenarios show that APW methods produce node-age estimates comparable to those obtained from the full likelihood while offering greater robustness to fossil misplacement and prior misspecification, due to the reduced sensitivity of composite likelihoods to local calibration errors. Applied to a genome-scale dataset of modern birds, APW methods recover divergence time patterns consistent with recent studies, while reducing computational cost by more than an order of magnitude.
Overall, our results demonstrate that adjusted pairwise likelihoods provide a calibration-robust and computationally efficient framework for Bayesian node dating, especially suited for large phylogenomic datasets and analyses in which fossil priors may be uncertain or imperfectly placed.
2025-12-02T01:12:50Z
32 pages, 11 figures
Gregory M Ellison, Liang Liu

http://arxiv.org/abs/2512.02223v1
On the Approximation of Phylogenetic Distance Functions by Artificial Neural Networks
2025-12-01T21:42:01Z
Inferring the phylogenetic relationships among a sample of organisms is a fundamental problem in modern biology. While distance-based hierarchical clustering algorithms achieved early success on this task, these have been supplanted by Bayesian and maximum likelihood search procedures based on complex models of molecular evolution. In this work we describe minimal neural network architectures that can approximate classic phylogenetic distance functions and the properties required to learn distances under a variety of molecular evolutionary models. In contrast to model-based inference (and recently proposed model-free convolutional and transformer networks), these architectures have a small computational footprint and are scalable to large numbers of taxa and molecular characters. The learned distance functions generalize well and, given an appropriate training dataset, achieve results comparable to state-of-the-art inference methods.
2025-12-01T21:42:01Z
10 pages
Benjamin K. Rosenzweig, Matthew W. Hahn

http://arxiv.org/abs/2512.02204v1
MoRSAIK: Sequence Motif Reactor Simulation, Analysis and Inference Kit in Python
2025-12-01T20:52:23Z
Origins of life research investigates how life could emerge from prebiotic chemistry alone. One possible explanation is provided by the RNA world hypothesis. It states that life could emerge from RNA strands alone, storing and transferring biological information, as well as catalyzing reactions as ribozymes.
Before this state could have emerged, however, the prebiotic world was probably a purely chemical pool of short RNA strands with random sequences and no biological function, undergoing hybridization and dehybridization as well as ligation and cleavage. In this context, the relevant questions are: what conditions allow longer RNA strands to be built, and how can information-carrying RNA sequences emerge?
In order to investigate such RNA reactors, efficient simulations are needed because the space of possible RNA sequences increases exponentially with the length of the strands, as does the number of reactions between two strands. In addition, simulations have to be compared to experimental data for validation and parameter calibration. Here, we present the MoRSAIK Python package for sequence motif (or k-mer) reactor simulation, analysis and inference. It enables users to simulate RNA sequence motif dynamics in the mean-field approximation, to infer the reaction parameters from data with Bayesian methods, and to analyze results by computing observables and plotting. MoRSAIK simulates an RNA reactor by following the reactions and the concentrations of all strands up to a certain length (four nucleotides by default). Longer strands are followed indirectly, by tracking the concentrations of the sequence motifs of that maximum length that they contain.
2025-12-01T20:52:23Z
5 pages, 1 figure
Johannes Harth-Kitzerow, Ulrich Gerland, Torsten A. Enßlin

http://arxiv.org/abs/2512.01435v1
Existence of two thresholds in a bistable equation with nonlocal competition
2025-12-01T09:21:38Z
We consider a nonlocal bistable reaction-diffusion equation, which serves as a model for a population structured by a phenotypic trait, subject to mutation, trait-dependent fitness, and nonlocal competition. Within this replicator-mutator framework, we further incorporate a "pseudo-Allee effect" so that the long-time behavior (extinction vs. survival) depends on the size of the initial data. After proving the well-posedness of the associated Cauchy problem, we investigate its long-time behavior. We first show that small initial data lead to extinction. More surprisingly, we then prove that extinction may also occur for initial data that are too large, in particular when selection is not strong enough.
Finally, we exhibit situations where intermediate initial data lead to persistence, thereby revealing the existence of (at least) two thresholds. These results stand in sharp contrast with the behavior observed in local bistable equations.
2025-12-01T09:21:38Z
Matthieu Alfaro (LMRS, LPP), Cédric Chane Ki Chune (BioSP), Lionel Roques (BioSP)

http://arxiv.org/abs/2508.14740v2
Modeling the impact of temperature and bird migration on the spread of West Nile virus
2025-11-30T18:53:24Z
West Nile virus (WNV) is a climate-sensitive mosquito-borne arbovirus circulating between mosquitoes of the genus Culex and birds, with a potential spillover to humans and other mammals. Recent trends in climatic change, characterized by early and/or prolonged summer seasons, increased temperatures, and above-average rainfall, have probably facilitated the spread of WNV in Europe, including Germany. In this work, we formulate a spatial WNV model consisting of a system of parabolic partial differential equations (PDEs), using the concept of diffusion and advection in combination with temperature-dependent parameters, i.e., mosquito biting rate, extrinsic incubation, and mortality rate. Diffusion represents the random movement of both mosquitoes and hosts across space, while advection captures the directed movement of migratory birds. The model is first studied mathematically, and we show that it has non-negative, unique, and bounded solutions in time and space. Numerical simulations of the PDE model are performed using temperature data for Germany (2019-2024). The simulation results showed high agreement with the reported WNV cases among birds and equids in Germany. The observed spreading patterns from the year 2018 to 2022 and the year 2024 were mainly driven by temperature in combination with diffusion processes of hosts and vectors.
Only for the year 2023 was the additional inclusion of advection for migratory birds important to correctly predict new hotspots in new locations in Germany.
2025-08-20T14:39:50Z
1 supplementary pdf file, 6 videos
Pride Duve, Felix Sauer, Renke Lühken
10.1016/j.onehlt.2026.101386

http://arxiv.org/abs/2512.01007v1
Adaptation to time-varying environments in a reaction-diffusion model
2025-11-30T18:01:12Z
We present a spatially-extended system of chemical reactions exhibiting adaptation to time-dependent influxes of reactants. Here adaptation is defined as improved reproductive success, namely the ability of one of the many locally stable states available to the system to expand in space at the expense of other states. We find that adaptation can arise simply by environmental exposure to sequences of varying influxes. This adaptation is specific to the temporal sequence yet flexible enough to generalize to related sequences. It is enhanced through repeated exposure to the same environmental sequence, representing a form of learning, and through spatial interactions, enabling natural selection to act and representing a form of collective learning. Finally, adaptation benefits from a nearby adapted state, representing a form of teacher-guided learning. By combining environmental drives and reproduction within a stochastic reaction-diffusion dynamics framework, our model lays a foundation for a theory of adaptation grounded in physical principles.
2025-11-30T18:01:12Z
Olivier Rivoire, Guy Bunin

http://arxiv.org/abs/2512.00467v1
A Theoretical Framework for the Formation of Large Animal Groups: Topological Coordination, Subgroup Merging, and Velocity Inheritance
2025-11-29T12:39:55Z
Large animal groups -- bird flocks, fish schools, insect swarms -- are often assumed to form by gradual aggregation of sparsely distributed individuals. Using a mathematically precise framework based on time-varying directed interaction networks, we show that this widely held view is incomplete.
The theory demonstrates that large moving groups do not arise by slow accumulation; instead, they emerge through the rapid merging of multiple pre-existing subgroups that are simultaneously activated under high-density conditions. The key mechanism is topological: the long-term interaction structure of any moving group contains a single dominant strongly connected component (SCC). This dominant SCC determines the collective velocity -- both speed and direction -- of the entire group.
When two subgroups encounter one another, the trailing subgroup aligns with -- and ultimately inherits -- the velocity of the dominant SCC of the leading subgroup. Repeated merging events naturally generate large groups whose speed is predicted to be lower than the mean speed of the original subgroups. The same dynamics explain several universal empirical features: broad neighbour-distance distributions, directional asymmetry in neighbour selection, and the characteristic narrow-front, wide-rear geometry of real flocks.
The framework yields testable predictions for STARFLAG-style 3D datasets, offering a unified explanation for the formation, maintenance, and geometry of coordinated animal groups.
2025-11-29T12:39:55Z
26 pages, 5 figures
Jidong Jin

http://arxiv.org/abs/2511.23209v1
Stochastic fluctuations in an eco-evolutionary game dynamics with environmental feedbacks
2025-11-28T14:16:45Z
Building upon the eco-evolutionary game dynamics framework established by Tilman et al., we investigate stochastic fluctuations in a two-strategy system incorporating environmental feedback mechanisms, where the payoff matrix exhibits population size dependence. We adopt a systematic approach, the so-called $\Omega$-expansion. When the stochastic factor is integrated, it is shown that the population size for each strategy fluctuates around the interior equilibrium of the macroscopic equations (corresponding to the deterministic model of the eco-evolutionary game) and its variance converges to a constant that is proportional to the environmental carrying capacity if the interior equilibrium is asymptotically stable. The simulation results demonstrate that the $\Omega$-expansion provides a valid approximation, and the reliability of the aforementioned conclusions is verified. Therefore, analogous to Fudenberg and Harris's stochastic replicator dynamics for infinite populations under external noise (\emph{J. Econ. Theory 57, 420-441}), the dynamic stability of the eco-evolutionary game can be extended to the stochastic regime when the environmental carrying capacity is sufficiently large.
2025-11-28T14:16:45Z
Chao Wang, Minlan Li, Chang Liu

http://arxiv.org/abs/2508.18038v2
A homoclinic route to chaos in omnivore communities
2025-11-28T05:48:51Z
Omnivory, where species feed across multiple trophic levels, is a widespread feature of ecological networks. A key mechanism underlying such complexity is intraguild predation (IGP), in which a top predator consumes both an intermediate predator and a shared resource.
Here, we show that Shilnikov homoclinic orbits emerge in a minimal intraguild predation model, triggering a cascade of homoclinic bifurcations near a saddle-focus equilibrium that culminates in chaos. Numerical simulations and Lyapunov spectrum analysis reveal multiple coexistence modes, ranging from regular oscillations to Shilnikov homoclinic orbits and chaos. Our model quantitatively reproduces patterns observed in natural omnivore networks, providing mechanistic insights into complex population fluctuations in ecological systems.
2025-08-25T13:56:36Z
Main text: 8 pages, 4 figures; SM: 4 pages, 1 figure
Yiyuan Niu, Ju Kang, Wei Tao, Xin Wang

http://arxiv.org/abs/2511.22841v1
A novel approach to profile global circulation pathway of SARS-CoV-2 variants by site-based mutation dynamics
2025-11-28T02:18:43Z
The genetic evolution of SARS-CoV-2 has caused recurring epidemic waves; understanding its global dispersal patterns is critical for effective surveillance. We developed the Site-based mutation dynamics - Equal Power Sampling (S-EPS) framework, a phylogenetic-free, bias-correcting framework for profiling viral source-sink dynamics. Applying S-EPS to 6.6 million SARS-CoV-2 genomes (March 2020 - June 2024) from 13 regions worldwide, we identified Africa and the Indian subcontinent as the predominant sources of key mutations. Southeast Asia served as an early transmission hub, while Russia and South America mainly acted as sinks. Key mutations took longer to establish fitness in source regions than externally. Once an amino acid substitution on the receptor-binding domain reached 1% prevalence in major sources, there was an 80% probability that it would spread elsewhere, with a 2-month median lead time (IQR: 1-4).
Our findings underscore the importance of genetic surveillance, with S-EPS offering enhanced capability for monitoring emerging viral threats.
2025-11-28T02:18:43Z
Hong Zheng, Shimin Su, Caiqi Liu, Jingzhi Lou, Lirong Cao, Yexian Zhang, Zhihui Zhang, Marc Ka Chun Chong, Benny Chung-Ying Zee, Peter Pak-Hang Cheung, Haogao Gu, Juan Pu, Leo Lit Man Poon, Hui-Ling Yen, Maggie Haitian Wang

http://arxiv.org/abs/2508.00363v2
Bayesian tit-for-tat fosters cooperation in evolutionary stochastic games
2025-11-27T05:26:18Z
Learning from experience is a key feature of decision-making in cognitively complex organisms. Strategic interactions involving Bayesian inferential strategies can enable us to better understand how evolving individual choices to be altruistic or selfish can affect collective outcomes in social dilemmas. Bayesian strategies are distinguished from their reactive opponents by their ability to modulate their actions in the light of new evidence. We investigate whether such strategies can be resilient against reactive strategies when actions not only determine the immediate payoff but can affect future payoffs by changing the state of the environment. We use stochastic games to mimic the change in environment in a manner that is conditioned on the players' actions. By considering three distinct rules governing transitions between a resource-rich and a resource-poor state, we ascertain the conditions under which the Bayesian tit-for-tat strategy can resist being invaded by reactive strategies. We find that the Bayesian strategy is resilient against a large class of reactive strategies and is more effective in fostering cooperation leading to sustenance of the resource-rich state.
However, the extent of success of the Bayesian strategies depends on the other strategies in the pool and the rule governing transition between the two different resource states.
2025-08-01T06:47:38Z
Arunava Patra, Supratim Sengupta, Sagar Chakraborty

http://arxiv.org/abs/2511.22005v1
Assessing the Validity of the Fixed Tree Topology Assumption in Phylodynamic Inference
2025-11-27T01:10:57Z
Fixed tree topologies are widely used in phylodynamic analyses to reduce computational burden, yet the consequences of this assumption remain insufficiently understood. Here, we systematically assess the impact of various fixed-topology strategies on phylogenetic and phylodynamic parameter estimates across a diverse set of viral datasets. We compare fully Bayesian joint inference with fixed-topology strategies, including conditioning on maximum likelihood trees subsequently dated with LSD or TreeTime. Our analyses show that global parameters of the substitution and site models are largely robust to the fixed-topology assumption, whereas parameters that depend on the temporal structure of the tree, such as molecular clock rates, node ages, and demographic histories, can exhibit substantial biases. We do treat unconstrained Bayesian analyses as the reference, although we recognize that these too are model-based approximations. Nevertheless, our results highlight serious discordance associated with fixing the topology and underscore the need for faster, time-aware methods that simultaneously integrate topology and parameter estimation.
These findings raise important questions about the balance between computational efficiency and inferential accuracy in phylodynamic studies.
2025-11-27T01:10:57Z
Mathieu Fourment, Jiansi Gao, Marc A Suchard, Frederick A Matsen

http://arxiv.org/abs/2511.21587v1
Approximate Bayesian Computation Made Easy: A Practical Guide to ABC-SMC for Dynamical Systems with \texttt{pymc}
2025-11-26T17:05:27Z
Mechanistic models are essential tools across ecology, epidemiology, and the life sciences, but parameter inference remains challenging when likelihood functions are intractable. Approximate Bayesian Computation with Sequential Monte Carlo (ABC-SMC) offers a powerful likelihood-free alternative that requires only the ability to simulate data from mechanistic models. Despite its potential, many researchers remain hesitant to adopt these methods due to perceived complexity. This tutorial bridges that gap by providing a practical, example-driven introduction to ABC-SMC using Python. From predator-prey dynamics to hierarchical epidemic models, we illustrate by example how to implement, diagnose, and interpret ABC-SMC analyses. Each example builds intuition about when and why ABC-SMC works, how partial observability affects parameter identifiability, and how hierarchical structures naturally emerge in Bayesian frameworks. All code leverages PyMC's modern probabilistic programming interface, ensuring reproducibility and easy adaptation to new problems. The code is fully available for download at \href{https://github.com/mariocastro73/ABCSMC_pymc_by_example}{mariocastro73/ABCSMC\_pymc\_by\_example}.
2025-11-26T17:05:27Z
17 pages, 10 figures. Link to github repository
Mario Castro

http://arxiv.org/abs/2507.01046v4
A Compartmental Model for Epidemiology with Human Behavior and Stochastic Effects
2025-11-26T14:43:12Z
We propose a compartmental model for epidemiology wherein the population is split into groups which either comply or refuse to comply with protocols designed to slow the spread of a disease.
Parallel to the disease spread, we assume that noncompliance with protocols spreads as a social contagion. We begin by deriving the reproductive ratio for a deterministic version of the model, and use this to fully characterize the local stability of disease free equilibrium points. We then append the deterministic model with stochastic effects, specifically assuming that the transmission rate of the disease and the transmission rate of the social contagion are uncertain. We prove global existence and nonnegativity for our stochastic model. Then using suitably constructed stochastic Lyapunov functions, we analyze the behavior of the stochastic system with respect to certain disease free states. We demonstrate all of our results with numerical simulations.
2025-06-24T20:51:47Z
30 pages, 7 figures
Christian Parkinson, Weinan Wang

http://arxiv.org/abs/2502.04160v3
Lotka-Volterra-type kinetic equations for interacting species
2025-11-26T14:05:59Z
In this work, we examine a kinetic framework for modeling the time evolution of size distribution densities of two populations governed by predator-prey interactions. The model builds upon the classical Boltzmann-type equations, where the dynamics arise from elementary binary interactions between the populations. The model uniquely incorporates a linear redistribution operator to quantify the birth rates in both populations, inspired by wealth redistribution operators. We prove that, under a suitable scaling regime, the Boltzmann formulation transitions to a system of coupled Fokker-Planck-type equations. These equations describe the evolution of the distribution densities and link the macroscopic dynamics of their mean values to a Lotka-Volterra system of ordinary differential equations, with parameters explicitly derived from the microscopic interaction rules.
We then determine the local equilibria of the Fokker-Planck system, which are Gamma-type densities, and investigate the problem of relaxation of its solutions toward these kinetic equilibria, in terms of their moments' dynamics. The results establish a bridge between kinetic modeling and classical population dynamics, offering a multiscale perspective on predator-prey systems.
2025-02-06T15:44:54Z
Andrea Bondesan, Marco Menale, Giuseppe Toscani, Mattia Zanella