RDA-PSO: A computational method to quantify the diffusive dispersal of insects

2026-01-16T19:58:52Z

This article introduces a computational method, called "Recapture of Diffusive Agents & Particle Swarm Optimization" (RDA-PSO), designed to estimate the dispersal parameter of diffusive insects in mark-release-recapture (MRR) field experiments. In addition to describing the method, its properties are discussed, with particular focus on robustness in estimating the observed diffusion coefficient in the presence of uncertainty. It is shown that RDA-PSO provides a simple and reliable approach to quantify insect dispersal that can handle low recapture rates and uneven capture site distributions without the need for area corrections. Tests on synthetic data, for which the actual diffusion coefficient is known, show the method outperforms three techniques based on the solution of the diffusion equation, which are also introduced in this work. Examples of application to real field data for the yellow fever mosquito are provided.
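As an illustration of the kind of diffusion-equation-based estimate the abstract alludes to (this is not RDA-PSO itself), the following minimal sketch assumes ideal two-dimensional diffusion from a point release, for which the mean squared displacement after time t equals 4Dt. The function names, the synthetic data, and the parameter values are hypothetical, and the sketch ignores trap placement and recapture rates entirely.

```python
import math
import random


def simulate_displacements(d_true, t, n_agents, rng):
    """Draw final (x, y) positions of agents released at the origin.

    Assumption: pure 2D diffusion, so each coordinate is Gaussian with
    variance 2 * D * t.
    """
    sigma = math.sqrt(2.0 * d_true * t)
    return [(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma)) for _ in range(n_agents)]


def estimate_diffusion_msd(positions, t):
    """Estimate D from the mean squared displacement: <r^2> = 4 * D * t in 2D."""
    msd = sum(x * x + y * y for x, y in positions) / len(positions)
    return msd / (4.0 * t)


if __name__ == "__main__":
    rng = random.Random(1)  # fixed seed so the run is repeatable
    positions = simulate_displacements(d_true=50.0, t=3.0, n_agents=20000, rng=rng)
    print("estimated D:", estimate_diffusion_msd(positions, t=3.0))
```

With real MRR data only a small, spatially biased subset of agents is recaptured, which is exactly the situation in which such a naive estimate can be expected to degrade.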

Integrating Household Dynamics in Stochastic Epidemic Modeling: An SDE Approach to the SIR Framework

2026-01-16T02:08:16Z

Understanding infectious disease spread remains a critical public health challenge, particularly given the interplay between household dynamics and community transmission patterns. Traditional epidemiological models often oversimplify these dynamics by treating populations as homogeneous, failing to capture crucial household-level interactions that can significantly impact disease spread. This paper introduces a new stochastic differential equation model extending the SIR framework by capturing the randomness in disease spread and incorporating household structure and heterogeneous mixing patterns. The model divides the population into groups based on age and household size, includes subpopulation-targeted lockdown parameters, and constructs detailed contact matrices accounting for both public and within-household interactions. Through the approximation of Markov jump processes by branching processes near the disease-free equilibrium, we derive the basic reproduction number of our model and conduct global sensitivity analysis using Sobol indices to identify influential factors. Our simulations reveal that incorporating household structure leads to substantially different predictions compared to traditional models, particularly in epidemic timing and peak intensity. The stochastic framework captures important variations in outbreak trajectories overlooked by deterministic approaches, especially during early and peak phases. This work contributes to both mathematical epidemiology and practical public health planning by providing a sophisticated mathematical understanding of how population structure and randomness influence disease dynamics, offering insights for intervention strategies where household transmission plays a significant role.
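To make the idea of an SDE version of the SIR framework concrete, here is a minimal sketch of an Euler-Maruyama scheme for a homogeneous stochastic SIR model, using the standard diffusion approximation of the underlying Markov jump process. It is not the household-structured model of the abstract: there are no age groups, household sizes, contact matrices, or lockdown parameters, and the function name and parameter values are hypothetical.

```python
import math
import random


def simulate_sir_sde(beta, gamma, n, i0, t_end, dt, seed=0):
    """Euler-Maruyama path of a homogeneous stochastic SIR model.

    Assumption: infections and recoveries over a step dt are approximated by
    Gaussian increments with mean and variance both equal to rate * dt.
    Flows are clamped so compartments stay non-negative and S + I + R = n.
    """
    rng = random.Random(seed)
    s, i, r = float(n - i0), float(i0), 0.0
    path = [(0.0, s, i, r)]
    steps = int(round(t_end / dt))
    for k in range(1, steps + 1):
        inf_rate = beta * s * i / n   # rate of new infections
        rec_rate = gamma * i          # rate of recoveries
        new_inf = inf_rate * dt + math.sqrt(inf_rate * dt) * rng.gauss(0.0, 1.0)
        new_rec = rec_rate * dt + math.sqrt(rec_rate * dt) * rng.gauss(0.0, 1.0)
        new_inf = min(max(new_inf, 0.0), s)  # cannot infect more than S
        new_rec = min(max(new_rec, 0.0), i)  # cannot recover more than I
        s -= new_inf
        i += new_inf - new_rec
        r += new_rec
        path.append((k * dt, s, i, r))
    return path


if __name__ == "__main__":
    path = simulate_sir_sde(beta=0.3, gamma=0.1, n=1000, i0=10, t_end=50.0, dt=0.1, seed=1)
    peak_time, _, peak_infected, _ = max(path, key=lambda row: row[2])
    print("peak infected:", peak_infected, "at time", peak_time)
```

Running the function with different seeds gives a rough feel for the variation in epidemic timing and peak intensity that a single deterministic trajectory cannot show.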

The genetic and developmental enigma of rhizomes: crucial traits with limited understanding

2026-01-15T20:28:29Z

Rhizomes play fundamental roles in plant evolution, persistence, and environmental adaptation by enabling clonal propagation, resource storage, and stress resilience. Despite their ecological and agronomic importance across diverse plant lineages, the genetic and developmental regulation of rhizomes remains poorly characterized. Here, we synthesize findings from in vitro induction studies, in vivo physiological and developmental analyses, quantitative trait loci (QTL) mapping, comparative transcriptomics, and limited functional studies to evaluate current knowledge and highlight outstanding questions in rhizome biology. Results show that phytohormones are central regulators of rhizome initiation and growth, with effects mediated in a context-dependent manner through interactions with environmental and developmental cues. Across rhizomatous species, traits such as rhizome initiation, branching, and elongation are typically under polygenic control, although comparatively simpler genetic architectures have been documented in emerging model systems like Mimulus. Transcriptomic analyses further highlight hormone signaling, stress-response, and carbohydrate metabolism pathways as key regulatory components. However, few genes have been functionally validated, underscoring the need for experimentally tractable systems for genetic dissection. Perennial Mimulus species are proposed as promising models for rhizome research due to their experimental accessibility, ecological relevance, and established genomic resources. Integrated approaches leveraging fine-mapping, near-isogenic lines, multi-omics network reconstruction, and genome editing are poised to accelerate the discovery of causal loci and regulatory networks underlying rhizome development, thereby illuminating key processes involved in plant adaptation and perenniality, with direct implications for evolutionary biology and crop improvement.

Gene genealogies in diploid populations evolving according to sweepstakes reproduction

2026-01-15T13:11:26Z

Recruitment dynamics, or the distribution of the number of offspring among individuals, is central for understanding ecology and evolution. Sweepstakes reproduction (heavy right-tailed offspring number distribution) is central for understanding the ecology and evolution of highly fecund natural populations. Sweepstakes reproduction can induce jumps in type frequencies and multiple mergers in gene genealogies of sampled gene copies. We take sweepstakes reproduction to be skewed offspring number distribution due to mechanisms not involving natural selection, such as in chance matching of broadcast spawning with favourable environmental conditions. Here, we consider population genetic models of sweepstakes reproduction in diploid panmictic populations absent selfing and evolving in a random environment. Our main results are (i) continuous-time Beta and Poisson-Dirichlet coalescents; when combining the results, the skewness parameter $α$ of the Beta-coalescent ranges from $0$ to $2$, and the Beta-coalescents may be incomplete due to an upper bound on the number of potential offspring produced by any pair of parents; (ii) in large populations time is measured in units proportional to either $N/\log N$ or $N$ generations (where $2N$ is the population size when constant); (iii) it follows that incorporating population size changes leads to time-changed coalescents with the time-change independent of $α$; (iv) using simulations we show that the ancestral process is not well approximated by the corresponding coalescent (as measured through certain functionals of the processes); (v) whenever the skewness of the offspring number distribution is increased the conditional (conditioned on the population ancestry) and the unconditional ancestral processes are not in good agreement.

The multi-allelic Moran process as a multi-zealot voter model: exact results and consequences for diversity thresholds

2026-01-14T19:24:08Z

The Moran process is a foundational model of genetic drift and mutation in finite populations. In its standard two-allele form with population size $n$, allele counts, and hence allele frequencies, change through stochastic replacement and mutation, yet their distribution converges to a stationary distribution. This distribution undergoes a qualitative transition at the critical mutation rate $μ_c=1/(2n)$: at $μ=μ_c$ it is exactly uniform, so that the probability of observing $k$ copies of allele 1 (and $n-k$ of allele 2) is $π(k)=1/(n+1)$ for $k=0,\dots,n$. For $μ<μ_c$ diversity is low: the stationary distribution places most of its mass near $k=0$ and $k=n$, and the population is therefore typically dominated by one allele. For $μ>μ_c$, on the other hand, diversity is high: the distribution concentrates around intermediate values, so that both alleles are commonly present at comparable frequencies. Recently, the two-allele Moran process was shown to be exactly equivalent to the voter model with two candidates and $α_1$ and $α_2$ committed voters (zealots) in a population of $n+α_1+α_2$, where the role of mutation is played by zealot influence. Here we extend this equivalence to multiple alleles and multiple candidates. Using the mapping, we derive the exact stationary distribution of allele counts for well-mixed populations with an arbitrary number $m$ of alleles, and obtain the critical mutation rate $μ_c = 1/(m+2n-2)$, which depends explicitly on $m$. We then analyze the Moran process on randomly connected populations and show that both the stationary distribution and $μ_c$ are invariant to network structure and coincide with the well-mixed results. Finally, simulations on general network topologies show that structural heterogeneity can substantially reshape the stationary allele distribution and, consequently, the level of genetic diversity.
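The uniform stationary distribution $π(k)=1/(n+1)$ can be checked on the voter-model side of the equivalence with a short exact computation. The sketch below assumes a specific update rule that the abstract does not spell out: at each step one of the $n$ free voters is chosen uniformly and copies the opinion of a uniformly chosen other individual (free voter or zealot). Under that assumption the count $k$ of free voters holding opinion 1 is a birth-death chain, and detailed balance gives its stationary distribution. The function name is hypothetical, and the mapping from the mutation rate $μ$ to the zealot numbers is not reproduced here.

```python
from fractions import Fraction


def zealot_voter_stationary(n, a1, a2):
    """Exact stationary distribution of k = number of free voters with opinion 1.

    Assumed update rule (not taken from the source): a uniformly chosen free
    voter copies a uniformly chosen other individual among the remaining
    n - 1 free voters and the a1 + a2 zealots. Requires integers a1, a2 >= 1.

    Transition probabilities:
      k -> k+1 : (n - k)/n * (k + a1)/(n - 1 + a1 + a2)
      k+1 -> k : (k + 1)/n * (n - k - 1 + a2)/(n - 1 + a1 + a2)
    """
    weights = [Fraction(1)]
    for k in range(n):
        # Detailed balance: pi(k+1) / pi(k) = P(k -> k+1) / P(k+1 -> k).
        ratio = Fraction((n - k) * (k + a1), (k + 1) * (n - k - 1 + a2))
        weights.append(weights[-1] * ratio)
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    # One zealot per candidate: every ratio equals 1, so the law is uniform.
    print(zealot_voter_stationary(10, 1, 1))
    # More zealots per candidate push the mass toward intermediate k.
    print([float(p) for p in zealot_voter_stationary(10, 3, 3)])
```

Using `Fraction` keeps the arithmetic exact, so the uniform case can be compared against $1/(n+1)$ without any floating-point tolerance.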

An agent-based modelling approach to investigate the impact of gender on tuberculosis transmission in Uganda

2026-01-14T19:20:52Z

Tuberculosis (TB) is an airborne disease caused by the pathogen Mycobacterium tuberculosis. In 2023, it returned to being the leading cause of death from an infectious agent globally, replacing COVID-19; in the nineteenth century, one in seven of all humans died of tuberculosis. More than 10 million people are diagnosed with TB every year. The majority of cases in adults occur in males (62.5% of all global adult cases in 2023, compared to 37.5% in females). The higher burden of global TB cases in males, compared to females, may be due in large part to population-scale factors, such as employment type, the quantity and type of social contacts they make, and their health-seeking behaviours (e.g. differences in diagnostic and treatment delays between genders). To investigate which population-scale factors are most important in determining this higher TB burden in males, we have developed an age- and gender-stratified, spatially heterogeneous epidemiological agent-based model. We have focused specifically on Kampala, the capital of Uganda, which is a high-burden TB country. We considered counterfactual scenarios to elucidate the impact of gender on the epidemiology of TB. Setting disease progression parameters equal between the genders leads to a reduction in both male-to-female case ratio and total case numbers.

Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic Response

2026-01-14T15:26:51Z

During the COVID-19 crisis, mechanistic models have guided evidence-based decision making. However, time-critical decisions in a dynamical environment limit the time available to gather supporting evidence. We address this bottleneck by developing a graph neural network (GNN) surrogate of an age-structured and spatially resolved mechanistic metapopulation simulation model. This combined approach complements classical modeling approaches, which are mostly mechanistic, and purely data-driven machine learning approaches, which are often black box. Our design of experiments spans outbreak and persistent-threat regimes, up to three contact change points, and age-structured contact matrices on a spatial graph with 400 nodes representing German counties. We benchmark multiple GNN layers and identify an ARMAConv-based architecture that offers a strong accuracy-runtime trade-off. Across horizons of 30-90 day simulation and prediction, allowing up to three contact change points, the surrogate model attains 10-27% mean absolute percentage error (MAPE) while delivering (near) constant runtime with respect to the forecast horizon. Our approach accelerates evaluation by up to 28,670 times compared with the mechanistic model, allowing responsive decision support in time-critical scenarios and straightforward web integration. These results show how GNN surrogates can translate complex metapopulation models into immediate, reliable tools for pandemic response.

Gene genealogies in haploid populations evolving according to sweepstakes reproduction

2026-01-14T14:57:44Z

Sweepstakes reproduction may be generated by chance matching of reproduction with favorable environmental conditions. Gene genealogies generated by sweepstakes reproduction are in the domain of attraction of multiple-merger coalescents where a random number of lineages merges at such times. We consider population genetic models of sweepstakes reproduction for haploid panmictic populations of both constant ($N$) and varying population size, evolving in a random environment. We construct our models so that we can recover the observed number of new mutations in a given sample without requiring strong assumptions regarding the population size or the mutation rate. Our main results are (i) continuous-time coalescents that are either the Kingman coalescent or specific families of Beta- or Poisson-Dirichlet coalescents; when combining the results, the parameter $α$ of the Beta-coalescent ranges from 0 to 2, and the Beta-coalescents may be incomplete due to an upper bound on the number of potential offspring an arbitrary individual may produce; (ii) in large populations we measure time in units proportional to either $N/\log N$ or $N$ generations; (iii) incorporating fluctuations in population size leads to time-changed multiple-merger coalescents where the time-change does not depend on $α$; (iv) using simulations we show that in some cases approximations of functionals of a given coalescent do not match the ones of the ancestral process in the domain of attraction of the given coalescent; (v) approximations of functionals obtained by conditioning on the population ancestry (the ancestral relations of all gene copies at all times) are broadly similar (for the models considered here) to the approximations obtained without conditioning on the population ancestry.

geohabnet: An R package for mapping habitat connectivity for biosecurity and conservation

2026-01-14T02:59:42Z

Mapping habitat suitability, based on factors like host availability and environmental suitability, is a common approach to determining which locations are important for the spread of a species. Mapping habitat connectivity takes geographic analyses a step further, evaluating the potential roles of locations in biological invasions, pandemics, or species conservation. Locations with high habitat suitability may play a minor role in species spread if they are geographically isolated. Yet, a location with lower habitat suitability may play a major role in a species' spread if it acts as a bridge between regions that would otherwise be physically fragmented. Here we introduce the geohabnet R package, which evaluates the potential importance of locations for the spread of species through habitat landscapes. geohabnet incorporates key factors such as dispersal probabilities and habitat suitability in a network framework, for better understanding habitat connectivity for host-dependent species, such as pathogens, arthropod pests, or pollinators. geohabnet uses publicly available or user-provided datasets, six network centrality metrics, and a user-selected geographic scale. We provide examples using geohabnet for surveillance prioritization of emerging plant pests in Africa and the Americas. These examples illustrate how users can apply geohabnet for their species of interest and generate maps of the estimated importance of geographic locations for species spread. geohabnet provides a quick, open-source, and reproducible baseline to quantify a species' habitat connectivity across a wide range of geographic scales and evaluates potential scenarios for the expansion of a species through habitat landscapes. geohabnet supports biosecurity programs, invasion science, and conservation biology when prioritizing management efforts for transboundary pathogens, pests, or endangered species.

Beta-coalescents when sample size is large

2026-01-13T13:24:26Z

Sweepstakes reproduction refers to a highly skewed individual recruitment success without involving natural selection and may apply to individuals in broadcast spawning populations characterised by Type III survivorship. We consider an extension of the model of sweepstakes reproduction for a haploid panmictic population of constant size $N$; the extension also works as an alternative to the Wright-Fisher model. Our model incorporates an upper bound on the random number of potential offspring (juveniles) produced by a given individual. Depending on how the bound behaves relative to the total population size, we obtain the Kingman coalescent, an incomplete Beta-coalescent, or the (complete) Beta-coalescent. We argue that applying such an upper bound is biologically reasonable. Moreover, we estimate the error of the coalescent approximation. The error estimates reveal that convergence can be slow, and small sample size can be sufficient to invalidate convergence, for example if the stated bound is of the form $N/\log N$. We use simulations to investigate the effect of increasing sample size on the site-frequency spectrum. When the limit is a Beta-coalescent, the site frequency spectrum will be as predicted by the limiting tree even though the full coalescent tree may deviate from the limiting one. When in the domain of attraction of the Kingman coalescent the effect of increasing sample size depends on the effective population size as has been noted in the case of the Wright-Fisher model. Conditioning on the population ancestry (the random ancestral relations of the entire population at all times) may have little effect on the site-frequency spectrum for the models considered here (as evidenced by simulation results).

Tara Polaris expeditions: Sustained decadal observations of the coupled Arctic system in rapid transition

2026-01-13T09:33:11Z

The coupled Arctic system is in rapid transition and is set to undergo further dramatic changes over the coming decades. These changes will most likely lead to an ice-free ocean in summer, expected before mid-century. The Arctic will become more strongly influenced by atmospheric and oceanographic processes characteristic of mid-latitudes, increasing the prevalence of contaminants and new biological species. This ongoing transition of the Arctic to a new state necessitates systematic monitoring of all sentinels (variables that make an essential contribution to characterizing the Earth's state) to improve our understanding of the system, enhance forecasting and support knowledge-based decisions. Here, we describe a sustained multi-decadal observation program to be implemented on the Tara Polar Station between 2026 and 2046. The monitoring program is designed as a series of year-long drift expeditions, called Tara Polaris, in the central Arctic Ocean, covering all seasons. The multidisciplinary data will bridge ecological, geochemical, biological, and physical parameters and processes in the atmosphere, sea ice and ocean. In addition, data collected with consistent methodologies over a 20-year period will make it possible to distinguish long-term trends from seasonal and interannual variability. In this paper, we discuss specific measurement challenges in each compartment (i.e., atmosphere, sea ice and ocean) along key sentinels and the most pressing scientific questions to be addressed. The expected outcomes of the Tara Polaris program will enable us to understand and quantify the main feedbacks of the coupled Arctic system, with their seasonal and interannual trends and spatial variability.

Combinatorial comparison of general galled trees, time-consistent galled trees, and simplex time-consistent galled trees

2026-01-12T23:08:03Z

Rooted binary phylogenetic networks are extensions of rooted binary trees, adding reticulation nodes that are designed to represent evolutionary processes that involve hybridization events. Enumerative combinatorics studies have counted leaf-labeled phylogenetic networks in a variety of classes, finding that when the number of reticulations is fixed, the time-consistent galled trees are asymptotically less numerous than each of several network classes that had been previously examined. Here we provide enumerative results on two additional network classes: general galled trees and simplex time-consistent galled trees. We show that for a fixed number of galls, as the number of leaves goes to infinity, the asymptotic count of general galled trees is identical to that of time-consistent galled trees, whereas the count of simplex time-consistent galled trees is smaller. If the number of galls is not restricted, then the asymptotic approximations all differ: simplex time-consistent galled trees are less numerous than time-consistent galled trees, which are in turn less numerous than general galled trees. We also report a variety of additional results: recursions to count the studied networks with small numbers of leaves and a fixed number of galls, as well as enumerative results for unlabeled networks in the classes that we investigate.

Modeling and analysis of a novel two-strain dengue epidemics model considering secondary infections with increased mortality

2026-01-12T10:39:19Z

In this study, we develop and analyze a deterministic two-strain host-vector model for dengue transmission that incorporates key immuno-epidemiological mechanisms, including temporary cross-immunity, antibody-dependent enhancement (ADE), disease-induced mortality during secondary infections, and explicit vector co-infection. The human population is divided into compartments for primary and secondary infections, while the mosquito population includes single- and co-infected classes. ADE is modeled through distinct primary ($α$) and secondary ($σ$) transmission rates. Using the next-generation matrix method, we derive the basic reproduction number $R_0$ and establish the local stability of the disease-free equilibrium for $R_0 < 1$. Analytical results show that one-strain endemic equilibria lose stability under ADE conditions ($σ > α$), allowing invasion by a heterologous strain. Employing center-manifold theory and numerical continuation (COCO), we demonstrate the occurrence of backward bifurcation, bistability between disease-free and endemic states, and Hopf-induced oscillations. Numerical simulations confirm transitions among disease-free, endemic, and periodic regimes as key parameters vary. The model highlights how ADE, waning cross-immunity, and vector co-infection interact to generate complex dengue dynamics and provides insights useful for designing effective control and vaccination strategies in dengue-endemic regions.

When is local search both effective and efficient?

2026-01-10T00:34:51Z

Combinatorial optimization problems implicitly define fitness landscapes that combine the numeric structure of the 'fitness' function to be maximized with the combinatorial structure of which assignments are 'adjacent'. Local search starts at an assignment in this landscape and successively moves to fitter adjacent assignments until no further improvement is possible among the adjacent assignments. Classic analyses of local search algorithms have focused more on the question of effectiveness ("did we find a good solution?") and often implicitly assumed that there are no doubts about their efficiency ("did we find it quickly?"). But there are many reasons to doubt the efficiency of local search. Even if we focus on fitness landscapes on the hypercube that are single peaked on every subcube (i.e., semismooth fitness landscapes) where effectiveness is obvious, many local search algorithms are known to be inefficient. Since fitness landscapes are unwieldy, exponentially large objects, we focus on their polynomial-sized representations by instances of valued constraint satisfaction problems (VCSP). We define a "direction" for valued constraints such that directed VCSPs generate semismooth fitness landscapes. We call VCSPs oriented if they do not have any pair of variables with arcs in both directions. Since recognizing if a VCSP-instance is directed or oriented is coNP-complete, we generalize oriented VCSPs as conditionally-smooth fitness landscapes that are recognizable in polynomial time for a VCSP-instance. We prove that many popular local search algorithms like random ascent, simulated annealing, history-based rules, jumping rules, and the Kernighan-Lin heuristic are very efficient on conditionally-smooth landscapes. But conditionally-smooth landscapes are still expressive enough so that algorithms like steepest ascent and random facet require a super-polynomial number of steps to find the fitness peak.
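The local search loop described above can be sketched in a few lines. The example below is a minimal illustration of random ascent on a hypercube landscape, assuming adjacency means flipping a single bit and that fitness is a sum of unary and binary valued constraints. The constraint weights, function names, and starting assignment are hypothetical, and the landscape is not claimed to be semismooth or conditionally-smooth.

```python
import random


def fitness(x, unary, binary):
    """Fitness of a 0/1 assignment as a sum of unary and binary constraints."""
    total = sum(u * xi for u, xi in zip(unary, x))
    total += sum(w * x[i] * x[j] for (i, j), w in binary.items())
    return total


def improving_flips(x, unary, binary):
    """Return all adjacent assignments (one bit flipped) with higher fitness."""
    base = fitness(x, unary, binary)
    better = []
    for i in range(len(x)):
        y = x[:i] + (1 - x[i],) + x[i + 1:]
        if fitness(y, unary, binary) > base:
            better.append(y)
    return better


def random_ascent(x, unary, binary, rng):
    """Move to a uniformly chosen fitter neighbour until none exists."""
    steps = 0
    while True:
        better = improving_flips(x, unary, binary)
        if not better:
            return x, steps  # local peak: no adjacent assignment is fitter
        x = rng.choice(better)
        steps += 1


if __name__ == "__main__":
    unary = [1, -2, 3, -1]                        # hypothetical weights
    binary = {(0, 1): 3, (1, 2): -1, (2, 3): 2}   # hypothetical weights
    peak, steps = random_ascent((0, 0, 0, 0), unary, binary, random.Random(0))
    print("peak:", peak, "fitness:", fitness(peak, unary, binary), "steps:", steps)
```

The loop always terminates because fitness strictly increases on a finite set of assignments; the efficiency question raised in the abstract is how the step count grows with the number of variables. Replacing `rng.choice(better)` with the fittest neighbour would turn this into steepest ascent.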

Designing a Resilient Allee-Ornstein-Uhlenbeck model

2026-01-09T23:19:20Z

In stochastic population dynamics, stochastic wandering can produce a transition to an absorbing state. In particular, under Allee effects, low densities amplify the possibility of population collapse. We investigate this in an Allee-Ornstein-Uhlenbeck (Allee-OU) model that couples a bistable Allee growth equation with demographic noise and environmental fluctuations modeled as an Ornstein-Uhlenbeck process. This process replaces the bifurcation parameter of the deterministic Allee effect equation. In the model, small noise may induce escape from the safe basin around the positive equilibrium toward extinction. We construct a stochastic control, altering the process to have a stationary distribution. We enable tractable control design, approximating the process by one with a stationary distribution. Two controlled models are developed, one acting directly on population size and another also modulating the environment. A threshold-based implementation minimizes the frequency of interventions while maximizing safe time. Simulations demonstrate that the control stabilizes fluctuations around the equilibrium.