http://arxiv.org/abs/2504.16302v1 Enumerative combinatorics of unlabeled and labeled time-consistent galled trees 2025-04-22T22:42:24Z In mathematical phylogenetics, the time-consistent galled trees provide a simple class of rooted binary network structures that can be used to represent a variety of different biological phenomena. We study the enumerative combinatorics of unlabeled and labeled time-consistent galled trees. We present a new derivation via the symbolic method of the number of unlabeled time-consistent galled trees with a fixed number of leaves and a fixed number of galls. We also derive new generating functions and asymptotics for labeled time-consistent galled trees. 2025-04-22T22:42:24Z Lily Agranat-Tamir Michael Fuchs Bernhard Gittenberger Noah A. Rosenberg http://arxiv.org/abs/2504.16301v1 SLiM-Gym: Reinforcement Learning for Population Genetics 2025-04-22T22:41:35Z We introduce SLiM-Gym, a Python package for integrating reinforcement learning (RL) with forward-time population genetic simulations. Wright-Fisher evolutionary dynamics offer a tractable framework for modeling populations across discrete generations, yet applying RL to these systems requires a compatible training environment. SLiM-Gym connects the standardized RL interface provided by Gymnasium with the high-fidelity evolutionary simulations of SLiM, allowing agents to interact with evolving populations in real time. This framework enables the development and evaluation of RL-based strategies for understanding evolutionary processes. 2025-04-22T22:41:35Z 7 pages, 2 figures Niko Zuppas Bryan C. Carstens http://arxiv.org/abs/2504.15855v1 Generating heterogeneous data on gene trees 2025-04-22T12:52:10Z We introduce GenPhylo, a Python module that simulates nucleotide sequence data along a phylogeny avoiding the restriction of continuous-time Markov processes.
GenPhylo directly uses a general Markov model and therefore naturally incorporates heterogeneity across lineages. We solve the challenge of generating transition matrices with a pre-given expected number of substitutions (the branch length information) by providing an algorithm that can be incorporated in other simulation software. 2025-04-22T12:52:10Z to appear in Journal of Computational Biology Martí Cortada Garcia Adrià Diéguez Moscardó Marta Casanellas http://arxiv.org/abs/1912.00518v3 Beyond classical Hamilton's Rule. State distribution asymmetry and the dynamics of altruism 2025-04-21T14:06:30Z This paper analyzes the relationships between demographic and state-based evolutionary games and Hamilton's rule. It is shown that the classical Hamilton's rule (counterfactual method), combined with demographic payoffs, leads to easily testable models. It works well when the roles of donor and receiver are randomly drawn during each interaction event. This is illustrated by the alarm call example. However, we can imagine situations in which role-switching results from an external mechanism, such as fluxes of individuals between the border and the interior of the habitat, when only border individuals may spot the threat and warn their neighbors. To cover these cases, a new model is extended to the case with explicit dynamics of the role distributions among carriers of different strategies, driven by some general mechanisms. It is shown that even in the case when fluxes between roles are driven by neutral mechanisms (acting in the same way on all strategies), differences in mortality in the focal interaction lead to different distributions of roles for different strategies. This leads to a more complex rule for cooperation than the classical Hamilton's rule. In addition to the cost and benefit components, the new rule contains a third component weighted by the difference in proportions of the donors among carriers of both strategies.
Depending on the sign, this component can be termed the "survival surplus", when the donors have greater survival than the receivers, or the "sacrifice cost" (when it decreases the benefit), when the receiver's survival exceeds that of the helping donor. When we allow different role-switching rates for different strategies, cooperators can win even in the case when the assortment mechanism is inefficient (i.e., the probability of receiving help for noncooperators is slightly greater than for cooperators), which is impossible in classical Hamilton's rule. 2019-12-01T23:18:21Z Krzysztof Argasinski Ryszard Rudnicki http://arxiv.org/abs/2504.14691v1 Quantifying scale-free behaviors in Rock-Paper-Scissors Models as a function of Mobility 2025-04-20T17:43:54Z We investigate the scale-free behavior of the spatial rock-paper-scissors model with May-Leonard dynamics, analyzing specific quantifiers that engender the power-law feature. The main results show that an important parameter that drives the scale-free behavior is the mobility, which can be used to quantitatively describe several scale-free aspects of the model, such as the number of clusters, the characteristic length, the individuals' lifespan and its corresponding mean traveled distance. All of these are novel quantifiers of current practical interest for the study of biodiversity. 2025-04-20T17:43:54Z 7 pages, 6 figures, Version to appear in Physica A: Statistical Mechanics and its Applications Physica A 670 (2025) 130612 D. Bazeia M. Bongestab M. J. B. Ferreira B. F. de Oliveira J. E. B. Santos 10.1016/j.physa.2025.130612 http://arxiv.org/abs/2501.01809v2 Estimating invasive rodent abundance using removal data and hierarchical models 2025-04-19T14:25:04Z Invasive rodents pose significant ecological, economic, and public health challenges. Robust methods are needed for estimating population abundance to guide effective management.
Traditional methods such as capture-recapture are often impractical for invasive species due to ethical, legal and logistical constraints. Here, I showcase the application of hierarchical multinomial N-mixture models for estimating the abundance of invasive rodents using removal data. First, I perform a simulation study which demonstrates minimal bias, as well as good precision and reliable coverage of confidence intervals across a range of sampling scenarios. I also illustrate the consequences of violating the population closure assumption, showing how between-occasion dynamics can bias inference. Second, I analyze removal data for two invasive rodent species, namely coypus (Myocastor coypus) in France and muskrats (Ondatra zibethicus) in the Netherlands. Using hierarchical multinomial N-mixture models, I examine the effects of temperature on abundance while accounting for imperfect and time-varying capture probabilities. I also show how to accommodate spatial variability using random effects, quantify uncertainty in parameter estimates, and account for violations of closure by fitting an open-population model to multi-year data. Overall, I hope to demonstrate the flexibility and utility of hierarchical models in invasive species management. 2025-01-03T13:47:33Z 20 pages, 5 figures, 1 table Olivier Gimenez http://arxiv.org/abs/2504.14296v1 Analysis of Discrete Stochastic Population Models with Normal Distribution 2025-04-19T13:45:51Z This paper analyzes a stochastic logistic difference equation under the assumption that the population distribution follows a normal distribution. Our focus is on the mathematical relationship between the average growth rate and a newly introduced concept, the uniform structural growth rate, which captures how growth is influenced by the internal distributional structure of the population. 
We derive explicit relationships linking the uniform structural growth rate to the parameters of the normal distribution and the variance of a small stochastic perturbation. The analysis reveals the existence of two distinct branches of the uniform structural growth rate, corresponding to alternative population states characterized by higher and lower growth rates. This duality provides deeper insights into the dynamics of population growth under stochastic influences. A sufficient condition for the existence of two uniform structural growth rates is established and rigorously proved, demonstrating that there exist infeasible intervals where no uniform structural growth rate can be defined. We also explore the biological significance of these findings, emphasizing the role of stochastic perturbations and the distribution in shaping population dynamics. 2025-04-19T13:45:51Z Haiyan Wang http://arxiv.org/abs/2504.13706v1 Modelling Immunity in Agent-based Models 2025-04-18T14:09:39Z Vaccination policies play a central role in public health interventions and models are often used to assess the effectiveness of these policies. Many vaccines are leaky, in which case the observed vaccine effectiveness depends on the force of infection. Within models, the immunity parameters required for agent-based models to achieve observed vaccine effectiveness values are further influenced by model features such as its transmission algorithm, contact network structure, and approach to simulating vaccination. We present a method for determining parameters in agent-based models such that a set of target immunity values is achieved. We construct a dataset of desired population-level immunity values against various disease outcomes considering both vaccination and prior infection from COVID-19. This dataset incorporates immunological data, data collection methodologies, immunity models, and biological insights. 
We then describe how we choose minimal parameters for continuous waning immunity curves that result in those target values being realized in simulations. We use simulations of the household secondary attack rates to establish a relationship between the protection per infection attempt and overall immunity, thus accounting for the dependence of protection from acquisition on model features and the force of infection. 2025-04-18T14:09:39Z 29 pages, 5 figures Gray Manicom Emily Harvey Joshua Looker David Wu Oliver Maclaren Dion O'Neale http://arxiv.org/abs/2504.13556v1 On a stochastic epidemic SIR model with non homogenous population: a toy model for HIV 2025-04-18T08:49:41Z In this paper we generalise a simple discrete time stochastic SIR type model defined by Tuckwell and Williams. The SIR model by Tuckwell and Williams assumes a homogeneous population, a fixed infectious period, and a strict transition from susceptible to infected to recovered. In contrast, our model introduces two groups, $A$ and $B$, where group $B$ has a higher risk of infection due to increased contact rates. Additionally, the duration in the infected class follows a probability distribution rather than being fixed. Finally, individuals in group $B$ can transition directly to the recovered class R, allowing us to analyze the impact of this preventive measure on disease spread. Finally, we apply this model to the spread of HIV, analyzing how risk behaviors, rapid testing, and PrEP-like therapies influence the epidemic dynamics. 2025-04-18T08:49:41Z Carles Rovira http://arxiv.org/abs/2406.15449v4 Exponential rate of epidemic spreading on complex networks 2025-04-18T07:31:11Z The initial phase of an epidemic is often characterized by an exponential increase in the number of infected individuals. In this paper, we predict the exponential spreading rate of an epidemic on a complex network.
We first find an expression of the reproduction number for a network, based on the degree distribution, the network assortativity, and the level of clustering. We then connect this reproduction number and the disease infectiousness to the spreading rate. Our result holds for a broad range of networks, apart from networks with very broad degree distribution, where no clear exponential regime is present. Our theory bridges the gap between classic epidemiology and the theory of complex networks, with broad implications for model inference and policy making. 2024-06-05T08:11:41Z 15 pages, 13 figures, accepted version Phys. Rev. E 111, 044311 (2025) Samuel Cure Florian G. Pflug Simone Pigolotti 10.1103/PhysRevE.111.044311 http://arxiv.org/abs/2303.02529v3 The Critical Beta-splitting Random Tree II: Overview and Open Problems 2025-04-18T01:37:22Z In the critical beta-splitting model of a random $n$-leaf rooted tree, clades are recursively (from the root) split into sub-clades, and a clade of $m$ leaves is split into sub-clades containing $i$ and $m-i$ leaves with probabilities $\propto 1/(i(m-i))$. Study of structure theory and explicit quantitative aspects of this model (in discrete or continuous versions) is an active research topic. For many results there are different proofs, probabilistic or analytic, so the model provides a testbed for a ``compare and contrast" discussion of techniques. This article provides an overview of results proved in the sequence of similarly-titled articles I, III, IV and related articles. We mostly do not repeat proofs given elsewhere: instead we seek to paint a ``Big Picture" via graphics and heuristics, and emphasize open problems. Our discussion is centered around three categories of results. (i) There is a CLT for leaf heights, and the analytic proofs can be extended to provide surprisingly precise analysis of other height-related aspects. 
(ii) There is an explicit description of the limit {\em fringe distribution} relative to a random leaf, whose graphical representation is essentially the format of the cladogram representation of biological phylogenies. (iii) There is a canonical embedding of the discrete model into a continuous-time model, that is a random tree CTCS(n) on $n$ leaves with real-valued edge lengths, and this model turns out more convenient to study. The family (CTCS(n), $n \ge 2$) is consistent under a ``delete random leaf and prune" operation. That leads to an explicit inductive construction of (CTCS(n), $n \ge 2$) as $n$ increases, and then to a limit structure CTCS($\infty$) formalized via exchangeable partitions. Many open problems remain, in particular to elucidate a relation between CTCS($\infty$) and the $\beta(2,1)$ coalescent. 2023-03-04T23:46:05Z Expansion and revision of version 2 to give current overview of active topic, complementing and partly overlapping technical journal articles arXiv:2302.05066 and arXiv:2412.09655 and arXiv:2412.12319. Not intended for journal publication in this format David J. Aldous Svante Janson http://arxiv.org/abs/2308.00354v2 Multidimensional scaling informed by $F$-statistic: Visualizing grouped microbiome data with inference 2025-04-17T14:20:44Z Multidimensional scaling (MDS) is a dimensionality reduction technique for microbial ecology data analysis that represents the multivariate structure while preserving pairwise distances between samples. While its improvement has enhanced the ability to reveal data patterns by sample groups, these MDS-based methods require prior assumptions for inference, limiting their application in general microbiome analysis. In this study, we introduce a new MDS-based ordination, $F$-informed MDS, which configures the data distribution based on the $F$-statistic, the ratio of dispersion between groups sharing common and different characteristics.
Using simulated compositional datasets, we demonstrate that the proposed method is robust to hyperparameter selection while maintaining statistical significance throughout the ordination process. Various quality metrics for evaluating dimensionality reduction confirm that $F$-informed MDS is comparable to state-of-the-art methods in preserving both local and global data structures. Its application to a diatom-associated bacterial community suggests the role of this new method in interpreting the community response to the host. Our approach offers a well-founded refinement of MDS that aligns with statistical test results, which can be beneficial for broader compositional data analyses in microbiology and ecology. This new visualization tool can be incorporated into standard microbiome data analyses. 2023-08-01T07:51:01Z Hyungseok Kim Soobin Kim Jeffrey A. Kimbrel Megan M. Morris Xavier Mayali Cullen R. Buie http://arxiv.org/abs/2504.12895v1 Optimum Contribution Selection for Honeybees 2025-04-17T12:37:58Z In 1997, T. H. E. Meuwissen published a groundbreaking article titled 'Maximizing the response of selection with a predefined rate of inbreeding', in which he provided an optimized solution for the trade-off between genetic response and inbreeding avoidance in animal breeding. Evidently, this issue is highly relevant for the honeybee with its small breeding population sizes. However, the genetic peculiarities of bees have thus far prevented an application of the theory to this species. The present manuscript intends to fill this desideratum. It develops the necessary bee-specific theory and introduces a small R script that implements Optimum Contribution Selection (OCS) for honeybees. While researching for this manuscript, we found it rather cumbersome that even though Meuwissen's theory is 28 years old and has sparked research in many new directions, to our knowledge, there is still no comprehensive textbook on the topic. 
Instead, all relevant information had to be extracted from several articles, leading to a steep learning curve. We anticipate that many honeybee breeding scientists with a putative interest in OCS for honeybees have little to no experience with classical OCS. Thus, we decided to embed our new derivations into a general introduction to OCS that then specializes more and more to the honeybee case. The result is these 121 pages, of which we hope that at least the first sections can also be of use to breeding theorists concerned with species other than honeybees. 2025-04-17T12:37:58Z 121 pages, 48 figures Manuel Du Richard Bernstein Andreas Hoppe http://arxiv.org/abs/2504.12888v1 Anemia, weight, and height among children under five in Peru from 2007 to 2022: A Panel Data analysis 2025-04-17T12:27:06Z Econometrics in general, and Panel Data methods in particular, are becoming crucial in Public Health Economics and Social Policy analysis. In this discussion paper, we employ a helpful approach of Feasible Generalized Least Squares (FGLS) to assess if there are statistically relevant relationships between hemoglobin (adjusted to sea-level), weight, and height from 2007 to 2022 in children up to five years of age in Peru. By using this method, we may find a tool that allows us to confirm if the relationships considered between the target variables by the Peruvian agencies and authorities are in the right direction to fight against chronic malnutrition and stunting. 2025-04-17T12:27:06Z Original research that employs advanced econometrics methods, such as Panel Data with Feasible Generalized Least Squares in biostatistics and Public Health evaluation Studies in Health Sciences, ISSN 2764-0884 year 2025 Luis-Felipe Arizmendi Carlos De la Torre-Domingo Erick W.
Rengifo 10.54022/shsv6n2-005 http://arxiv.org/abs/2504.13215v1 Use of Topological Data Analysis for the Detection of Phenomenological Bifurcations in Stochastic Epidemiological Models 2025-04-16T23:28:31Z We investigate predictions of stochastic compartmental models on the severity of disease outbreaks. The models we consider are the Susceptible-Infected-Susceptible (SIS) for bacterial infections, and the Susceptible-Infected-Removed (SIR) for airborne diseases. Stochasticity enters the compartmental models as random fluctuations of the contact rate, to account for uncertainties in the disease spread. We consider three types of noise to model the random fluctuations: the Gaussian white and Ornstein-Uhlenbeck noises, and the logarithmic Ornstein-Uhlenbeck (logOU). The advantages of logOU noise are its positivity and its ability to model the presence of superspreaders. We utilize homological bifurcation plots from Topological Data Analysis to automatically determine the shape of the long-time distributions of the number of infected for the SIS, and removed for the SIR model, over a range of basic reproduction numbers and relative noise intensities. LogOU noise results in distributions that stay close to the endemic deterministic equilibrium even for high noise intensities. For low reproduction rates and increasing intensity, the distribution peak shifts towards zero, that is, disease eradication, for all three noises; for logOU noise the shift is the slowest. Our study underlines the sensitivity of model predictions to the type of noise considered in contact rate. 2025-04-16T23:28:31Z 27 pages, 20 figures Sunia Tanweer Konstantinos Mamis Firas A. Khasawneh
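
Illustrative note on the last entry (arXiv:2504.13215v1): the abstract states that logOU noise keeps the contact rate positive. The sketch below shows one way this can be seen in a toy SIS model. It is not the authors' code. The parameterization (`beta0`, `gamma`, `sigma`, `tau`), the Euler-Maruyama time stepping, and the function name `simulate_sis_logou` are all assumptions made for illustration only.

```python
import math
import random


def simulate_sis_logou(beta0=2.0, gamma=1.0, sigma=0.5, tau=1.0,
                       i0=0.01, t_end=50.0, dt=0.01, seed=0):
    """Euler-Maruyama sketch of an SIS model with log-OU contact-rate noise.

    Assumed (hypothetical) model, not taken from the paper:
      dX     = -(X / tau) dt + sigma * sqrt(2 / tau) dW   (OU process)
      beta   = beta0 * exp(X)                             (always positive)
      dI/dt  = beta * I * (1 - I) - gamma * I             (I = infected fraction)
    """
    rng = random.Random(seed)
    steps = int(round(t_end / dt))
    x, i = 0.0, i0
    betas, infected = [], [i0]
    for _ in range(steps):
        # Exponentiating the OU state guarantees a positive contact rate.
        beta = beta0 * math.exp(x)
        betas.append(beta)
        # Explicit Euler step for the infected fraction, clamped to [0, 1].
        i += dt * (beta * i * (1.0 - i) - gamma * i)
        i = min(1.0, max(0.0, i))
        infected.append(i)
        # Euler-Maruyama step for the OU process driving the contact rate.
        x += -(x / tau) * dt + sigma * math.sqrt(2.0 * dt / tau) * rng.gauss(0.0, 1.0)
    return betas, infected
```

With `sigma=0.0` the noise vanishes and the infected fraction settles at the deterministic endemic equilibrium `1 - gamma / beta0` of this toy model, which gives a quick sanity check before turning the noise on.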