https://arxiv.org/api/Pnc+i/DaqLR0Gnbf4+LTGV1E5ic
2026-06-21T14:03:30Z
1302999015

http://arxiv.org/abs/2508.00150v1
Information and fitness in two-state systems: self-replicating individuals in a fluctuating environment
2025-07-31T20:22:41Z
A population of individuals with the same genes can present heterogeneous traits (phenotypes). The prevalence of this heterogeneity can be explained as a bet-hedging strategy that improves the population proliferation rate (fitness) in fluctuating environments. The phenotype distribution is influenced by factors such as competition between phenotypes, the duration of environmental states, and the rate of phenotype-switching. We illustrate these effects in a system where both the environment and the phenotype can adopt two states. This system includes scenarios such as symmetric bet-hedging and dormant-proliferating phenotypes. We examine how environmental and phenotypic states share mutual information, measured in bits, and explore the relationship between this information and population fitness. We propose that when fitness is measured relative to the case where phenotype and environment are independent, information and fitness can be treated as equivalent measures. We investigate strategies that individuals can use to improve this information, such as adjusting the rates of proliferation and phenotype-switching relative to the environmental fluctuation rate. Through these strategies, with fixed marginal distributions, an increase in information implies an increase in population fitness. We also identify limits to the maximum achievable fitness and information and discuss the value of the information in terms of this new normalized fitness.
Our framework offers new insights into how organisms adapt to fluctuating environmental conditions.
2025-07-31T20:22:41Z
Poulami Chatterjee, Cesar Nieto, Juan Manuel Pedraza, Abhyudai Singh

http://arxiv.org/abs/2507.23636v1
Household scale Wolbachia release strategies for effective dengue control
2025-07-31T15:13:05Z
The release of Wolbachia-infected mosquitoes into Aedes aegypti infested areas is a promising strategy for localised eradication of dengue infection. Ae. aegypti mosquitoes favour urban environments as breeding habitats, so are often found in and around houses. Therefore, it is likely that they will infect members of the households that they reside around. Since population groupings within households are small, stochastic effects become important. Despite this, little work has been carried out to investigate the outcome of releasing Wolbachia-infected mosquitoes at a household scale, from either an empirical or a theoretical standpoint. In previous work, we developed and analysed a stochastic (continuous time Markov chain) model for the invasion of Wolbachia-infected mosquitoes into a single household containing a population of wildtype mosquitoes. In the present study, we extend our framework to a connected community of households coupled by the movement of mosquitoes. We use numerical results obtained via Gillespie's stochastic simulation algorithm to investigate optimal strategies for the release of Wolbachia-infected mosquitoes carried out at either the community or the household scale. We find that household scale releases can facilitate rapid and successful invasion of the Wolbachia-infected mosquitoes into the household population and then into the wider community. We further explore the impact of regular household scale releases of Wolbachia-infected mosquitoes for a range of compositions for the release population, time intervals between releases and proportion of households participating in the releases.
We find that a single release household can provide sufficient protection to the entire community of households if releases are carried out frequently for a number of years and a sufficient number of females are released on each occasion.
2025-07-31T15:13:05Z
Abby Barlow, Ben Adams

http://arxiv.org/abs/2507.23537v1
Global, Regional, and National Burden of Chronic Kidney Disease Attributable to High Body Mass Index (BMI) among Individuals Aged 20-54 Years from 1990 to 2021: An Analysis of the Global Burden of Disease Study
2025-07-31T13:24:41Z
Background: Chronic kidney disease is one of the most prevalent non-communicable health issues globally, and high body mass index plays a significant role in the onset and progression of chronic kidney disease. Methods: Data on the disease burden attributable to high body mass index were retrieved from the 2021 Global Burden of Disease, Injuries, and Risk Factors Study. The global cases, age-standardized mortality rate, and age-standardized disability-adjusted life years attributable to high body mass index were estimated based on age, sex, geographic location, and the Social-demographic Index (SDI). The estimated annual percentage change was calculated to quantify trends in ASMR and ASDR from 1990 to 2019. Decomposition and frontier analyses were conducted to understand the drivers behind changes in burden and to identify top-performing countries. Inequality analysis was performed to assess disparities in burden across different SDI levels. The Bayesian age-period-cohort model was used to predict the disease burden up to 2035. Results: In 2021, there were 4,643.41 global deaths and 2,514,227.16 DALYs attributable to high body mass index-related CKD, more than triple the figures from 1990. Additionally, from 1990 to 2021, the ASMR and ASDR accelerated, with EAPCs of 2.25 (95% CI: 2.13 to 2.37) and 1.98 (95% CI: 1.89 to 2.08), respectively, particularly among males, in High-income North America, and in Low-middle SDI regions.
In terms of SDI, the Low-middle SDI region had the highest ASMR and ASDR related to CKD in 2021. Conclusion: From 1990 to 2021, there was a significant increase in global deaths and DALYs attributable to high body mass index-related CKD. As a major public health issue for CKD patients, high BMI urgently requires targeted measures to address it.
2025-07-31T13:24:41Z
Yu Chen, Guangxi Wu

http://arxiv.org/abs/2507.23481v1
Factors controlling protein evolvability, at the molecular scale
2025-07-31T12:03:19Z
This piece serves two purposes. First, it aims at elucidating the role of epistasis in shaping, at a molecular level, the evolutionary paths of proteins, as well as the extent to which these epistatic effects are the outcome of an as-yet-unidentified epistatic force. Second, it seeks to ascertain the extent to which the principle of least action will enable us to identify which of all potential trajectories has the highest evolutionary efficiency, as well as how variations in factors such as protein robustness and folding rates, resulting from the unavoidability of destabilizing mutations, might influence this critical evolutionary process. The initial findings suggest that protein evolution, at a molecular level, may be more predictable than previously thought, as epistasis and the principle of least action collectively impose constraints on evolutionary paths and trajectories, and consequently, on protein evolvability. Thus, this work should advance our understanding of the main molecular mechanisms that underlie the evolution of mutation-driven proteins and also provide grounds to answer a fundamental evolutionary question: how does Darwinian selection regard all potential trajectories available?
2025-07-31T12:03:19Z
Jorge A. Vila

http://arxiv.org/abs/2507.23056v1
Phylogenetic network models as graphical models
2025-07-30T19:34:56Z
The displayed tree phylogenetic network model is shown to sit as a natural submodel of the graphical model associated to a directed acyclic graph (DAG). This representation allows us to derive a number of results about the displayed tree model. In particular, the concept of a local modification to a DAG model is developed and applied to the displayed tree model. As an application, some nonidentifiability issues related to the displayed tree models are highlighted as they relate to reticulation edges and stacked reticulations in the networks. We also derive rank conditions on flattenings of probability tensors for the displayed tree model, generalizing classic results for phylogenetic tree models.
2025-07-30T19:34:56Z
21 pages, 7 figures
Seth Sullivant

http://arxiv.org/abs/2506.08614v3
Metaconcepts of rooted tree balance
2025-07-30T07:28:39Z
Measures of tree balance play an important role in many different research areas such as mathematical phylogenetics or theoretical computer science. Typically, tree balance is quantified by a single number which is assigned to the tree by a balance or imbalance index, of which several exist in the literature. Most of these indices are based on structural aspects of tree shape, such as clade sizes or leaf depths. For instance, indices like the Sackin index, total cophenetic index, and $\widehat{s}$-shape statistic all quantify tree balance through clade sizes, albeit with different definitions and properties.
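As a small illustration of how one such index depends on clade sizes and leaf depths, the sketch below computes the Sackin index in two equivalent ways: as the sum of leaf depths, and as the sum of clade sizes over all inner vertices. The nested-tuple tree encoding and the function names are assumptions made for this example and are not taken from the paper.

```python
def sackin_index(tree, depth=0):
    """Sackin index as the sum of leaf depths.

    Assumed encoding: an inner vertex is a tuple of children,
    a leaf is any non-tuple value (for example a string label).
    """
    if not isinstance(tree, tuple):
        return depth  # a leaf contributes its own depth
    return sum(sackin_index(child, depth + 1) for child in tree)


def clade_sizes(tree):
    """Return (number of leaves, list of clade sizes of all inner vertices)."""
    if not isinstance(tree, tuple):
        return 1, []  # a leaf has one leaf below it and no inner vertices
    total, sizes = 0, []
    for child in tree:
        n_leaves, child_sizes = clade_sizes(child)
        total += n_leaves
        sizes.extend(child_sizes)
    sizes.append(total)  # clade size of this inner vertex
    return total, sizes


# Two rooted binary trees on four leaves.
caterpillar = ((("a", "b"), "c"), "d")
balanced = (("a", "b"), ("c", "d"))

print(sackin_index(caterpillar), sum(clade_sizes(caterpillar)[1]))
print(sackin_index(balanced), sum(clade_sizes(balanced)[1]))
```

Both computations agree on each tree, and the caterpillar tree receives the larger value, which matches the usual reading of the Sackin index as an imbalance index.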
In this paper, we formalize the idea that many tree (im)balance indices are functions of similar underlying tree shape characteristics by introducing metaconcepts of tree balance. A metaconcept is a function $Φ_f$ that depends on a function $f$ capturing some aspect of tree shape, such as balance values, clade sizes, or leaf depths. These metaconcepts encompass existing indices but also provide new means of measuring tree balance. The versatility and generality of metaconcepts allow for the systematic study of entire families of (im)balance indices, providing deeper insights that extend beyond index-by-index analysis.
2025-06-10T09:23:10Z
Mareike Fischer, Tom Niklas Hamann, Kristina Wicke

http://arxiv.org/abs/2411.08083v2
An age-structured diffusive model for epidemic modelling: Lie symmetries and exact solutions
2025-07-30T05:55:21Z
A new age-structured diffusive model for the mathematical modelling of epidemics is suggested. The model can be considered as a generalization of two models suggested earlier for the same purposes. The Lie symmetry classification of the model is derived. It is shown that the model admits an infinite-dimensional Lie algebra of invariance. Using the Lie symmetries, exact solutions, in particular those of the travelling wave types and in terms of special functions, are constructed. An example of application of the correctly-specified exact solution for calculation of total numbers of infected individuals during an epidemic is presented.
2024-11-12T12:59:23Z
Qual. Theory Dyn. Syst. 24 (2025) 181
Roman Cherniha, Vasyl' Davydovych
10.1007/s12346-025-01340-9

http://arxiv.org/abs/2405.13239v3
A hybrid framework for compartmental models enabling simulation-based inference
2025-07-30T01:42:09Z
Multi-scale systems often exhibit a combination of stochastic and deterministic dynamics. In compartmental models, low occupancy compartments tend to exhibit stochastic dynamics while high occupancy compartments tend to follow deterministic dynamics.
Representing both dynamics with existing methods is challenging. Failing to account for stochasticity in small populations can produce ``atto-foxes'', for example in the Lotka-Volterra ordinary differential equation (ODE) model. This limitation becomes problematic when studying the extinction of species or the clearance of infection, but it can be overcome by using discrete stochastic models, such as continuous time Markov chains (CTMCs). Unfortunately, simulating CTMCs is impractical for many realistic models, where discrete events have very high frequencies.
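To make the contrast with the ODE model concrete, the following is a minimal sketch of an exact CTMC simulation (Gillespie's direct method) of a stochastic Lotka-Volterra model. It is not the JSF method described in this abstract; the function name, rate constants, and event cap are illustrative assumptions. Because populations are integers, extinction is a reachable absorbing state, and because every event is simulated individually, the cost grows with the event frequency.

```python
import random


def gillespie_lotka_volterra(prey, predators, birth=1.0, predation=0.01,
                             death=0.5, t_max=5.0, max_events=10_000, seed=0):
    """Simulate a stochastic Lotka-Volterra CTMC with Gillespie's direct method.

    All parameter values are illustrative assumptions. Returns a list of
    (time, prey, predators) tuples, one per simulated event plus the start.
    """
    rng = random.Random(seed)
    t = 0.0
    history = [(t, prey, predators)]
    for _ in range(max_events):  # cap guards against very high event rates
        rates = [
            birth * prey,                    # prey birth
            predation * prey * predators,    # predation: prey -1, predator +1
            death * predators,               # predator death
        ]
        total = sum(rates)
        if total == 0.0:
            break  # absorbing state: both populations are extinct
        t += rng.expovariate(total)  # exponential waiting time to next event
        if t > t_max:
            break
        r = rng.random() * total  # choose the event proportionally to its rate
        if r < rates[0]:
            prey += 1
        elif r < rates[0] + rates[1]:
            prey -= 1
            predators += 1
        else:
            predators -= 1
        history.append((t, prey, predators))
    return history


history = gillespie_lotka_volterra(prey=20, predators=10)
final_time, final_prey, final_predators = history[-1]
print(len(history) - 1, "events; final state:", final_prey, final_predators)
```

Raising the initial populations increases the total event rate, so the same time horizon requires many more iterations, which is the practical limitation the abstract points to.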
In this work, we develop a novel mathematical framework to couple continuous ODEs and discrete CTMCs: ``Jump-Switch-Flow'' (JSF). In this framework, compartments can reach extinct states (``absorbing states''), thereby resolving atto-fox-type problems. JSF has the desired behaviours of exact CTMC simulation, but is substantially computationally faster than existing alternatives, by at least one order of magnitude, and can even obtain constant scaling, irrespective of compartment occupancy.
We demonstrate JSF's utility for simulation-based inference, particularly multi-scale problems, with several case-studies. In a simulation study, we demonstrate how JSF can enable a more nuanced analysis of the efficacy of public health interventions. We also carry out a novel analysis of longitudinal within-host data from SARS-CoV-2 infections to quantify the timing of viral clearance. In this work, we show how JSF offers a novel approach to compartmental model simulation.
2024-05-21T22:54:00Z
Domenic P. J. Germano, Alexander E. Zarebski, Sophie Hautphenne, Robert Moss, Jennifer A. Flegg, Mark B. Flegg

http://arxiv.org/abs/2507.22287v1
Self-organized biodiversity and species abundance distribution patterns in ecosystems with higher-order interactions
2025-07-29T23:42:04Z
Explaining the emergence of self-organized biodiversity and species abundance distribution patterns remains a fundamental challenge in ecology. While classical frameworks, such as neutral theory and models based on pairwise species interactions, have provided valuable insights, they often neglect higher-order interactions (HOIs), whose role in stabilizing ecological communities is increasingly recognized. Here, we extend the Generalized Lotka-Volterra framework to incorporate HOIs and demonstrate that these interactions can enhance ecosystem stability and prevent collapse. Our model exhibits a diverse range of emergent dynamics, including self-sustained oscillations, quasi-periodic (torus) trajectories, and intermittent chaos. Remarkably, it also reproduces empirical species abundance distributions observed across diverse natural communities.
These results underscore the critical role of HOIs in structuring biodiversity and offer a broadly applicable theoretical framework for capturing complexity in ecological systems.
2025-07-29T23:42:04Z
Main: 10 pages, 3 figures; SM: 17 pages, 15 figures
Chaos, Solitons and Fractals 202 (2026) 117442
Ju Kang, Yiyuan Niu, Yuanzhi Li, Chengjin Chu
10.1016/j.chaos.2025.117442

http://arxiv.org/abs/2507.22256v1
Spatiodynamic inference using vision-based generative modelling
2025-07-29T22:10:50Z
Biological systems commonly exhibit complex spatiotemporal patterns whose underlying generative mechanisms pose a significant analytical challenge. Traditional approaches to spatiodynamic inference rely on dimensionality reduction through summary statistics, which sacrifice complexity and interdependent structure intrinsic to these data in favor of parameter identifiability. This imposes a fundamental constraint on reliably extracting mechanistic insights from spatiotemporal data, highlighting the need for analytical frameworks that preserve the full richness of these dynamical systems. To address this, we developed a simulation-based inference framework that employs vision transformer-driven variational encoding to generate compact representations of the data, exploiting the inherent contextual dependencies. These representations are subsequently integrated into a likelihood-free Bayesian approach for parameter inference. The central idea is to construct a fine-grained, structured mesh of latent representations from simulated dynamics through systematic exploration of the parameter space. This encoded mesh of latent embeddings then serves as a reference map for retrieving parameter values that correspond to observed data.
By integrating generative modeling with Bayesian principles, our approach provides a unified inference framework to identify both spatial and temporal patterns that manifest in multivariate dynamical systems.
2025-07-29T22:10:50Z
Jun Won Park, Kangyu Zhao, Sanket Rane

http://arxiv.org/abs/2412.05107v3
Metrics for classes of semi-binary phylogenetic networks using $μ$-representations
2025-07-29T11:29:54Z
Phylogenetic networks are useful in representing the evolutionary history of taxa. In certain scenarios, one requires a way to compare different networks. In practice, this can be rather difficult, except within specific classes of networks. In this paper, we derive metrics for the class of \emph{orchard networks} and the class of \emph{strongly reticulation-visible} networks, from variants of so-called \emph{$μ$-representations}, which are vector representations of networks. For both network classes, we impose degree constraints on the vertices, by considering \emph{semi-binary} networks.
2024-12-06T15:09:15Z
31 pages, 16 figures
Christopher Reichling, Leo van Iersel, Yukihiro Murakami

http://arxiv.org/abs/2507.20644v1
Deep Generative Models of Evolution: SNP-level Population Adaptation by Genomic Linkage Incorporation
2025-07-28T09:03:09Z
The investigation of allele frequency trajectories in populations evolving under controlled environmental pressures has become a popular approach to study evolutionary processes on the molecular level. Statistical models based on well-defined evolutionary concepts can be used to validate different hypotheses about empirical observations. Despite their popularity, classic statistical models like the Wright-Fisher model suffer from simplified assumptions such as the independence of selected loci along a chromosome and uncertainty about the parameters. Deep generative neural networks offer a powerful alternative known for the integration of multivariate dependencies and noise reduction.
Due to their high data demands and challenging interpretability they have, so far, not been widely considered in the area of population genomics. To address the challenges in the area of Evolve and Resequencing experiments (E&R) based on pooled sequencing (Pool-Seq) data, we introduce a deep generative neural network that aims to model a concept of evolution based on empirical observations over time. The proposed model estimates the distribution of allele frequency trajectories by embedding the observations from single nucleotide polymorphisms (SNPs) with information from neighboring loci. Evaluation on simulated E&R experiments demonstrates the model's ability to capture the distribution of allele frequency trajectories and illustrates the representational power of deep generative models on the example of linkage disequilibrium (LD) estimation. Inspecting the internally learned representations enables estimating pairwise LD, which is typically inaccessible in Pool-Seq data. Our model provides competitive LD estimation in Pool-Seq data with a high degree of LD when compared to existing methods.
2025-07-28T09:03:09Z
10 pages, 5 figures
Julia Siekiera, Christian Schlötterer, Stefan Kramer

http://arxiv.org/abs/2507.19711v1
Pre-exposure prophylaxis and syphilis in men who have sex with men: a network analysis
2025-07-25T23:09:51Z
Pre-exposure prophylaxis (PrEP) has been established as an effective tool for preventing HIV infection among men who have sex with men (MSM). However, there is the possibility of PrEP usage leading to increased sexual partners and increased transmission of non-HIV sexually transmitted infections such as syphilis. We take here a network perspective to examine this possibility using data on sexual partnerships, demographic data, PrEP usage, and syphilis among MSM in Columbus, Ohio.
We use a recently developed community detection algorithm, an adaptation of the community detection algorithm InfoMap to absorbing random walks, to identify clusters of people (`communities') that may drive syphilis transmission. Our community detection approach takes into account both sexual partnerships as well as syphilis treatment rates when detecting communities. We apply this algorithm to sexual networks fitted to empirical data from the Network Epidemiology of Syphilis Transmission (NEST) study in Columbus, Ohio. We assume that PrEP usage is associated with regular visits to a sexual health provider, and thus is correlated with syphilis detection and treatment rates. We examine how PrEP usage can affect community structure in the sexual networks fitted to the NEST data. We identify two types of PrEP users, those belonging to a large, highly connected community and tending to have a large number of sexual partners, versus those with a small number of sexual partners and belonging to smaller communities. A stochastic syphilis model indicates that PrEP users in the large community may play an important role in sustaining syphilis transmission.
2025-07-25T23:09:51Z
Esteban Vargas Bernal, Morgan Spahnie, William Miller, Abigail Turner, Joseph Tien

http://arxiv.org/abs/2507.18595v2
Investigating Mobility in Spatial Biodiversity Models through Recurrence Quantification Analysis
2025-07-25T21:07:42Z
Recurrence plots and their associated quantifiers provide a robust framework for detecting and characterising complex patterns in non-linear time-series. In this paper, we employ recurrence quantification analysis to investigate the dynamics of the cyclic, non-hierarchical May-Leonard model, also referred to as rock--paper--scissors systems, that describes competitive interactions among three species. A crucial control parameter in these systems is the species' mobility $m$, which governs the spatial displacement of individuals and profoundly influences the resulting dynamics.
By systematically varying $m$ and constructing suitable recurrence plots from numerical simulations, we explore how recurrence quantifiers reflect distinct dynamical features associated with different ecological states. We then introduce an ensemble-based approach that leverages statistical distributions of recurrence quantifiers, computed from numerous independent realisations, allowing us to identify dynamical outliers as significant deviations from typical system behaviour. Through detailed numerical analyses, we demonstrate that these outliers correspond to divergent ecological regimes associated with specific mobility values, also providing a robust way to infer the mobility parameter from observed numerical data. Our results highlight the potential of recurrence-based methods as diagnostic tools for analysing spatial ecological systems and extracting ecologically relevant information from their non-linear dynamical patterns.
2025-07-24T17:26:30Z
M. S. Palmero, M. Bongestab, N. Marwan

http://arxiv.org/abs/2507.19659v1
Posterior bounds on divergence time of two sequences under dependent-site evolutionary models
2025-07-25T20:09:20Z
Let x and y be two length n DNA sequences, and suppose we would like to estimate the divergence time T. A well known simple but crude estimate of T is p := d(x,y)/n, the fraction of mutated sites (the p-distance). We establish a posterior concentration bound on T, showing that the posterior distribution of T concentrates within a logarithmic factor of p when d(x,y)log(n)/n = o(1). Our bounds hold under a large class of evolutionary models, including many standard models that incorporate site dependence. As a special case, we show that T exceeds p with vanishingly small posterior probability as n increases under models with constant mutation rates, complementing the result of Mihaescu and Steel (Appl Math Lett 23(9):975--979, 2010).
Our approach is based on bounding sequence transition probabilities in various convergence regimes of the underlying evolutionary process. Our result may be useful for improving the efficiency of iterative optimization and sampling schemes for estimating divergence times in phylogenetic inference.
2025-07-25T20:09:20Z
Joseph Mathews, Scott C. Schmidler
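
The p-distance defined in the last abstract, p := d(x,y)/n, is simple enough to sketch directly. The snippet below assumes d(x,y) is the Hamming distance between two aligned sequences of equal length; the function names and the example sequences are made up for illustration. The second helper only evaluates the quantity d(x,y)log(n)/n that appears in the abstract's asymptotic condition for given inputs and does not implement the posterior bound itself.

```python
import math


def hamming_distance(x, y):
    """Number of sites at which two equal-length sequences differ."""
    if len(x) != len(y):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(x, y))


def p_distance(x, y):
    """Fraction of mutated sites, p := d(x, y) / n."""
    if not x:
        raise ValueError("sequences must be non-empty")
    return hamming_distance(x, y) / len(x)


def regime_statistic(x, y):
    """Evaluate d(x, y) * log(n) / n for the given pair (natural logarithm)."""
    n = len(x)
    return hamming_distance(x, y) * math.log(n) / n


# Made-up example sequences of length 10 that differ at two sites.
x = "ACGTACGTAC"
y = "ACGTTCGTAA"
print(p_distance(x, y))        # 0.2
print(regime_statistic(x, y))
```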