http://arxiv.org/abs/2506.13568v1 Digging deeper: deep joint species distribution modeling reveals environmental drivers of Earthworm Communities 2025-06-16T14:52:02Z

Earthworms are key drivers of soil function, influencing organic matter turnover, nutrient cycling, and soil structure. Understanding the environmental controls on their distribution is essential for predicting the impacts of land use and climate change on soil ecosystems. While local studies have identified abiotic drivers of earthworm communities, broad-scale spatial patterns remain underexplored. We developed a multi-species, multi-task deep learning model to jointly predict the distribution of 77 earthworm species across metropolitan France, using historical (1960-1970) and contemporary (1990-2020) records. The model integrates climate, soil, and land cover variables to estimate habitat suitability. We applied SHapley Additive exPlanations (SHAP) to identify key environmental drivers and used species clustering to reveal ecological response groups. The joint model achieved high predictive performance (TSS >= 0.7) and improved predictions for rare species compared to traditional species distribution models. Shared feature extraction across species allowed for more robust identification of common and contrasting environmental responses. Precipitation variability, temperature seasonality, and land cover emerged as dominant predictors of earthworm distribution. Species clustering revealed distinct ecological strategies tied to climatic and land use gradients. Our study advances both the methodological and ecological understanding of soil biodiversity. We demonstrate the utility of interpretable deep learning approaches for large-scale soil fauna modeling and provide new insights into earthworm habitat specialization. These findings support improved soil biodiversity monitoring and conservation planning in the face of global environmental change.
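
The TSS threshold quoted above refers to the true skill statistic, which is defined as sensitivity plus specificity minus one. The following minimal sketch shows the computation for a single species; the presence/absence vectors are made-up illustrative values, not data from the study.

```python
def true_skill_statistic(observed, predicted):
    """TSS = sensitivity + specificity - 1 for binary presence (1) / absence (0) labels."""
    pairs = list(zip(observed, predicted))
    tp = sum(1 for o, p in pairs if o == 1 and p == 1)  # presences correctly predicted
    fn = sum(1 for o, p in pairs if o == 1 and p == 0)  # presences missed
    tn = sum(1 for o, p in pairs if o == 0 and p == 0)  # absences correctly predicted
    fp = sum(1 for o, p in pairs if o == 0 and p == 1)  # false presences
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("TSS needs at least one observed presence and one observed absence")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity + specificity - 1.0


# Hypothetical observations and thresholded predictions for one species.
observed = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
tss = true_skill_statistic(observed, predicted)
print(f"TSS = {tss:.3f}")
```

TSS ranges from -1 to 1, with 0 corresponding to predictions no better than random and 1 to perfect discrimination.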

2025-06-16T14:52:02Z Sara Si-moussi Wilfried Thuiller Esther Galbrun Thibaud Decaëns Sylvain Gérard Daniel F. Marchán Claire Marsden Yvan Capowiez Mickaël Hedde 10.1016/j.soilbio.2025.110021 http://arxiv.org/abs/2506.13837v1 Recent trends in socio-epidemic modelling: behaviours and their determinants 2025-06-16T09:01:15Z

The spreading dynamics of infectious diseases is influenced by individual behaviours, which are in turn affected by the level of awareness about the epidemic. Modelling the co-evolution of disease transmission and behavioural changes within a population enables better understanding, prediction and control of epidemics. Here, our primary goal is to provide an overview of the most popular modelling approaches, ranging from compartmental mean-field to agent-based models, with a particular focus on how behavioural factors are incorporated into epidemic dynamics. We classify modelling approaches based on the fundamental conceptual distinction between models of behaviours and models of behavioural determinants (such as awareness, beliefs, opinions, or trust); in particular, we observe that most studies model and interpret the variables related to individual responses either as behaviours or as determinants, with the implicit assumption that they correlate linearly. Based on preliminary empirical observations, we then challenge this assumption by analysing a recent dataset about time series of social indicators, collected during the COVID-19 pandemic. We examine the case study of Italian regions and we discover that behavioural responses are poorly explained by awareness, beliefs or trust, thereby calling for a careful interpretation of the modelling assumptions and for the development of further models, which fully account for the inherent complexity of individual responses and human behaviours.

2025-06-16T09:01:15Z Daniele Proverbio Riccardo Tessarin Giulia Giordano 10.1007/s40574-025-00490-7 http://arxiv.org/abs/2409.03935v4 Galled Perfect Transfer Networks 2025-06-15T14:37:47Z

Predicting horizontal gene transfers often requires comparative sequence data, but recent work has shown that character-based approaches could also be useful for this task. Notably, perfect transfer networks (PTN) explain the character diversity of a set of taxa for traits that are gained once, rarely lost, but that can be transferred laterally. Characterizing the structure of such characters is an important step towards understanding more complex characters. Although efficient algorithms can infer such networks from character data, they can sometimes predict overly complicated transfer histories. With the goal of recovering the simplest possible scenarios in this model, we introduce galled perfect transfer networks, which are PTNs that are galled trees. Such networks are useful for characters that are incompatible in terms of tree-like evolution, but that do fit in an almost-tree scenario. We provide polynomial-time algorithms for two problems: deciding whether one can add transfer edges to a tree to transform it into a galled PTN, and deciding whether a set of characters is galled-compatible, that is, whether it can be explained by some galled PTN. We also analyze a real dataset comprising a bacterial species tree and KEGG functions as characters, and derive several conclusions on the difficulty of explaining characters in a galled tree, which provide several directions for future research.

2024-09-05T23:01:31Z This is an extended version of the RECOMB-CG 2024 proceedings article. This new version includes the reviews made by PCI and fixes some typos. Alitzel López Sánchez Manuel Lafond http://arxiv.org/abs/2506.13809v1 Analysis and Optimization of Probabilities of Beneficial Mutation and Crossover Recombination in a Hamming Space 2025-06-13T22:24:41Z

Inspired by Fisher's geometric approach to study beneficial mutations, we analyse probabilities of beneficial mutation and crossover recombination of strings in a general Hamming space with arbitrary finite alphabet. Mutations and recombinations that reduce the distance to an optimum are considered as beneficial. Geometric and combinatorial analysis is used to derive closed-form expressions for transition probabilities between spheres around an optimum giving a complete description of Markov evolution of distances from an optimum over multiple generations. This paves the way for optimization of parameters of mutation and recombination operators. Here we derive optimality conditions for mutation and recombination radii maximizing the probabilities of mutation and crossover into the optimum. The analysis highlights important differences between these evolutionary operators. While mutation can potentially reach any part of the search space, the probability of beneficial mutation decreases with distance to an optimum, and the optimal mutation radius or rate should also decrease resulting in a slow-down of evolution near the optimum. Crossover recombination, on the other hand, acts in a subspace of the search space defined by the current population of strings. However, probabilities of beneficial and deleterious crossover are balanced, and their characteristics, such as variance, are translation invariant in a Hamming space, suggesting that recombination may complement mutation and boost the rate of evolution near the optimum.
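
The slow-down near the optimum can be illustrated with the simplest special case, a single-letter substitution (mutation radius 1). This is not the paper's general closed-form result. For a string of length l over an alphabet of size a at Hamming distance n from the optimum, a uniformly chosen substitution moves closer with probability n / (l (a - 1)), which shrinks as n approaches zero. The sketch below checks this elementary formula against brute-force enumeration; the example strings and function names are illustrative assumptions.

```python
from fractions import Fraction


def hamming(x, y):
    """Number of positions at which two equal-length strings differ."""
    return sum(1 for a, b in zip(x, y) if a != b)


def one_point_probs(length, alphabet_size, distance):
    """(closer, same, farther) probabilities for one uniform single-letter substitution."""
    l, a, n = length, alphabet_size, distance
    closer = Fraction(n, l) * Fraction(1, a - 1)       # hit a wrong site, pick the right letter
    same = Fraction(n, l) * Fraction(a - 2, a - 1)     # hit a wrong site, pick another wrong letter
    farther = Fraction(l - n, l)                       # hit a site that was already correct
    return closer, same, farther


def brute_force_probs(optimum, current, alphabet):
    """Enumerate every single-letter mutant of `current` and tally the distance change."""
    d0 = hamming(optimum, current)
    counts = {-1: 0, 0: 0, 1: 0}
    total = 0
    for i, old in enumerate(current):
        for letter in alphabet:
            if letter == old:
                continue
            mutant = current[:i] + letter + current[i + 1:]
            counts[hamming(optimum, mutant) - d0] += 1
            total += 1
    return tuple(Fraction(counts[step], total) for step in (-1, 0, 1))


optimum, current, alphabet = "ACGTAC", "ACGTTT", "ACGT"  # illustrative strings, distance 2
formula = one_point_probs(len(current), len(alphabet), hamming(optimum, current))
brute = brute_force_probs(optimum, current, alphabet)
print(formula, brute)
```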

2025-06-13T22:24:41Z 42 pages Roman V. Belavkin http://arxiv.org/abs/2506.10856v1 The space of multifurcating ranked tree shapes: enumeration, lattice structure, and Markov chains 2025-06-12T16:16:49Z

Coalescent models of bifurcating genealogies are used to infer evolutionary parameters from molecular data. However, there are many situations where bifurcating genealogies do not accurately reflect the true underlying ancestral history of samples, and a multifurcating genealogy is required. The space of multifurcating genealogical trees, where nodes can have more than two descendants, is largely underexplored in the setting of coalescent inference. In this paper, we examine the space of rooted, ranked, and unlabeled multifurcating trees. We recursively enumerate the space and then construct a partial ordering which induces a lattice on the space of multifurcating ranked tree shapes. The lattice structure lends itself naturally to defining Markov chains that permit exploration on the space of multifurcating ranked tree shapes. Finally, we prove theoretical bounds for the mixing time of two Markov chains defined on the lattice, and we present simulation results comparing the distribution of trees and tree statistics under various coalescent models to the uniform distribution on this tree space.

2025-06-12T16:16:49Z 42 pages, 17 figures Julie Zhang Noah A. Rosenberg Julia A. Palacios http://arxiv.org/abs/2409.02884v2 How much regulation do we need from genomes to society? 2025-06-11T23:12:53Z

Regulatory functions are essential in both socioeconomic and biological systems, from corporate managers to regulatory genes. Regulatory functions come with substantial costs and benefits, and the balance of the two is often taken for granted. A fundamental question for all complex systems is how much regulatory function they need for their size and function. Here, we present empirical evidence that regulatory functions scale systematically across diverse systems: biological organisms (bacterial and eukaryotic genomes), human organizations (companies, federal agencies, universities), and decentralized entities (Wikipedia, cities). We combine an analysis of large data sets from each of these domains with a simple conceptual model. The model predicts that the scaling of regulatory costs shifts with system structure. Well-mixed small systems exhibit superlinear scaling between size and regulatory function, while modular large ones show sublinear or linear scaling, both in agreement with data. Finally, we find that socioeconomic systems that contain more diverse occupational functions tend to have more regulatory costs than expected from the scaling relationships, confirming the hypothesis that the type and complexity of interactions also play a role in regulatory costs. Our cross-system comparison offers a mechanistic framework for understanding regulatory function and can potentially guide efforts to analyze the costs and benefits of regulatory function in diverse systems.

2024-09-04T17:14:01Z 15 pages, 4 figures Vicky Chuqiao Yang Christopher P. Kempes S. Redner José Ignacio Arroyo Geoffrey B. West Hyejin Youn http://arxiv.org/abs/2408.08069v2 Integrated population model reveals human and environment driven changes in Baltic ringed seal (Pusa hispida botnica) demography and behavior 2025-06-11T16:37:10Z

Integrated population models (IPMs) are a promising approach to test ecological theories and assess wildlife populations in dynamic and uncertain conditions. By combining multiple data sources into a unified model, they enable the parametrization of versatile, mechanistic models that can predict population dynamics in novel circumstances. Here, we present a Bayesian IPM for the ringed seal (Pusa hispida botnica) population inhabiting the Bothnian Bay in the Baltic Sea. Despite the availability of long-term monitoring data, traditional assessment methods have faltered due to dynamic environmental conditions, varying reproductive rates, and the recent re-introduction of hunting, thus limiting the quality of information available to managers. We fit our model to census and various demographic, reproductive, and harvest data from 1988 to 2023 to provide a comprehensive assessment of past population trends, and predict population response to alternative hunting scenarios. We estimated that 20,000 to 36,000 ringed seals inhabited the Bothnian Bay in 2024, increasing at a rate of 3% to 6% per year. Reproductive rates have increased since 1988, leading to a substantial increase in the growth rate up until 2015. However, the re-introduction of hunting has since reduced the growth rate, and even minor quota increases are likely to reduce it further. Our results also support the hypothesis that a greater proportion of the population hauls out when ice cover is lower, leading to higher aerial survey results in such years. In general, our study demonstrates the value of IPMs for monitoring wildlife populations under changing environments, and supporting science-based management decisions.

2024-08-15T10:34:23Z Murat Ersalman Mervi Kunnasranta Markus Ahola Anja M. Carlsson Sara Persson Britt-Marie Bäcklin Inari Helle Linnea Cervin Jarno Vanhatalo 10.3354/meps14886 http://arxiv.org/abs/2506.10040v1 Evaluating interventions for Plasmodium vivax forest malaria using a three-scale mathematical model 2025-06-10T23:56:26Z

The rising proportion of Plasmodium vivax cases concentrated in forest-fringe areas across the Greater Mekong Subregion highlights the importance of pharmaceutical and mosquito control techniques specifically targeted towards forest-going populations. To mathematically assess best-possible antimalarial interventions in the context of hypnozoite reactivation and seasonal forest migration, we extend a previously developed three-scale integro-differential equations model of P. vivax transmission. In particular, we fit the model to data gathered over a four-year period in Vietnam to gain insight into local P. vivax dynamics and validate the model's ability to capture epidemiological trends. The calibrated model is then used to generate optimal schedules for mass-drug administration (MDA) in forest-goers and gauge the efficacy of vector control techniques (such as long-lasting insecticide nets and indoor residual spraying) in forest-adjacent areas. Our results highlight the dependence of optimal MDA timing on the demographics of the human population, the importance of interventions targeting the mosquito bite rate, and the need for efficacy in hypnozoite-targeting antimalarial drugs.

2025-06-10T23:56:26Z 40 pages, 8 figures Shoshana Elgart Mark B. Flegg Jennifer A. Flegg http://arxiv.org/abs/2501.13195v3 Reducing Size Bias in Epidemic Network Modelling 2025-06-10T11:22:37Z

Epidemiological models help policymakers mitigate disease spread by predicting transmission metrics based on disease dynamics and contact networks. Calibrating these models requires representative network sampling. We investigate the Random Walk (RW) and Metropolis-Hastings Random Walk (MHRW) algorithms for three network types: Erdős-Rényi (ER), Small-world (SW), and Scale-free (SF). Disease transmission is simulated using a stochastic susceptible-infected-recovered (SIR) framework. For ER and SW networks, RW overestimates infected individuals and secondary infections by $25\%$ due to size bias, favouring highly connected nodes. MHRW, though more computationally intensive, reduces size bias and provides more representative samples. For time-to-infection, both algorithms provide representative estimates. However, neither algorithm samples SF networks representatively, exhibiting significant variability. Furthermore, removing duplicate sample nodes reduces MHRW's accuracy across all three network types. We apply both algorithms to a cattle movement network of 46,512 farms combining ER, SW, and SF features. RW overestimates infected farms by about $100\%$ and secondary infections by over $900\%$, reflecting significant size bias, while MHRW estimates align within $1\%$ of the cattle network values. RW underestimates time-to-infection by about $40\%$, while MHRW overestimates it by $10\%$. Accuracy, again, deteriorates when duplicate nodes are removed. Our findings guide algorithm selection and intervention strategies based on network structure and disease severity; RW's conservative estimates suit high-mortality, fast-spreading epidemics, while MHRW enables more precise interventions for slower epidemics.
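
The size bias described above can be reproduced on a toy graph. The sketch below is not the authors' code: the wheel graph, the seed, and the walk length are arbitrary choices for illustration. It compares a plain random walk, whose long-run visit frequency is proportional to node degree, with the standard Metropolis-Hastings correction, which accepts a proposed move from u to v with probability min(1, deg(u)/deg(v)) and therefore targets the uniform distribution over nodes.

```python
import random


def wheel_graph(n_rim):
    """Adjacency lists for a hub (node 0) joined to every node of a ring of n_rim nodes."""
    adj = {0: list(range(1, n_rim + 1))}
    for i in range(1, n_rim + 1):
        left = n_rim if i == 1 else i - 1
        right = 1 if i == n_rim else i + 1
        adj[i] = [0, left, right]
    return adj


def random_walk(adj, start, steps, rng):
    """Plain random walk: always move to a uniformly chosen neighbour."""
    node, sample = start, []
    for _ in range(steps):
        node = rng.choice(adj[node])
        sample.append(node)
    return sample


def mh_random_walk(adj, start, steps, rng):
    """Metropolis-Hastings random walk targeting the uniform distribution over nodes."""
    node, sample = start, []
    for _ in range(steps):
        proposal = rng.choice(adj[node])
        # Accept with probability min(1, deg(node) / deg(proposal)).
        if rng.random() < len(adj[node]) / len(adj[proposal]):
            node = proposal
        # A rejected proposal repeats the current node; the repeat is kept in the sample.
        sample.append(node)
    return sample


def mean_degree(adj, sample):
    return sum(len(adj[v]) for v in sample) / len(sample)


rng = random.Random(0)
adj = wheel_graph(8)
true_mean = sum(len(nbrs) for nbrs in adj.values()) / len(adj)
rw_mean = mean_degree(adj, random_walk(adj, 1, 50_000, rng))
mh_mean = mean_degree(adj, mh_random_walk(adj, 1, 50_000, rng))
print(f"true {true_mean:.2f}  RW {rw_mean:.2f}  MHRW {mh_mean:.2f}")
```

The repeated nodes produced by rejections are part of the Metropolis-Hastings sample, so dropping them changes the distribution being sampled, which fits the reported loss of accuracy when duplicates are removed.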

2025-01-22T19:56:47Z Neha Bansal Katerina Kaouri Thomas E. Woolley http://arxiv.org/abs/2408.07011v2 A complete characterization of pairs of binary phylogenetic trees with identical $A_k$-alignments 2025-06-10T09:03:37Z

Phylogenetic trees play a key role in the reconstruction of evolutionary relationships. Typically, they are derived from aligned sequence data (like DNA, RNA, or proteins) by using optimization criteria such as maximum parsimony (MP). It is believed that the latter is able to reconstruct the "true" tree, i.e., the tree that generated the data, whenever the number of substitutions required to explain the data with that tree is relatively small compared to the size of the tree (measured in the number $n$ of leaves of the tree, which represent the species under investigation). However, reconstructing the correct tree from any alignment first and foremost requires the given alignment to perform differently on the "correct" tree than on others. A special type of alignment, namely the so-called $A_k$-alignments, has gained considerable interest in recent literature. These alignments consist of all binary characters ("sites") which require precisely $k$ substitutions on a given tree. It has been found that whenever $k$ is small enough (in comparison to $n$), $A_k$-alignments uniquely characterize the trees that generated them. However, recent literature has left a significant gap between $n\leq 2k+2$ -- namely the cases in which no such characterization is possible -- and $n\geq 4k$ -- namely the cases in which this characterization works. It is the main aim of the present manuscript to close this gap, i.e., to present a full characterization of all pairs of trees that share the same $A_k$-alignment. In particular, we show that indeed every binary phylogenetic tree with $n$ leaves is uniquely defined by its $A_k$-alignments if $n\geq 2k+3$. By closing said gap, we also ensure that our result is optimal.
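
To make the definition of an $A_k$-alignment concrete, the sketch below enumerates every binary character on a small tree and keeps those whose parsimony score is exactly $k$. The score is computed with the Fitch algorithm, a standard method for this count. The nested-tuple tree encoding, the function names, and the choice to list both 0/1 labelings of each split are assumptions made for illustration, not conventions taken from the paper.

```python
from itertools import product


def leaves(tree):
    """Leaf names of a rooted binary tree given as nested 2-tuples of strings."""
    if isinstance(tree, str):
        return [tree]
    left, right = tree
    return leaves(left) + leaves(right)


def fitch_score(tree, states):
    """Minimum number of substitutions needed to explain `states` on `tree` (Fitch)."""
    def visit(node):
        if isinstance(node, str):
            return {states[node]}, 0
        left_set, left_cost = visit(node[0])
        right_set, right_cost = visit(node[1])
        common = left_set & right_set
        if common:
            return common, left_cost + right_cost
        # Disjoint child sets force one substitution at this node.
        return left_set | right_set, left_cost + right_cost + 1
    return visit(tree)[1]


def a_k_alignment(tree, k):
    """All binary characters (as 0/1 tuples in leaf order) needing exactly k substitutions."""
    names = leaves(tree)
    sites = []
    for values in product((0, 1), repeat=len(names)):
        if fitch_score(tree, dict(zip(names, values))) == k:
            sites.append(values)
    return sites


tree = (("a", "b"), ("c", "d"))
print(len(a_k_alignment(tree, 1)))  # 10
print(a_k_alignment(tree, 2))       # [(0, 1, 0, 1), (0, 1, 1, 0), (1, 0, 0, 1), (1, 0, 1, 0)]
```

The enumeration is exponential in the number of leaves, so it is only suitable for checking small examples by hand.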

2024-08-13T16:18:00Z Mirko Wilde Mareike Fischer http://arxiv.org/abs/2506.08304v1 Long-range dispersal promotes spatial synchrony but reduces the length and time scales of synchronous fluctuations 2025-06-10T00:27:39Z

Synchronous oscillations of spatially disjunct populations are widely observed in ecology. Even in the absence of spatially synchronized exogenous forces, metapopulations may synchronize via dispersal. For many species, most dispersal is local, but rare long-distance dispersal events also occur. While even small amounts of long-range dispersal are known to be important for processes like invasion and spatial spread rates, their potential influence on population synchrony is often overlooked, since local dispersal on its own can be strongly synchronizing. In this work, we investigate the effect of random, rare, long-range dispersal on the spatial synchrony of a metapopulation and find profound effects not only on synchrony but also on properties of the resulting spatial patterns. While controlling for the overall amount of emigration from each local subpopulation, we vary the fraction of dispersal that occurs locally (to nearest neighbors) versus globally (to random locations, irrespective of distance). Using a metric that measures the instantaneous level of global synchrony, we show that this form of long-range dispersal significantly favors the spatially synchronous state and homogenizes the population by decreasing the size of clusters of subpopulations that are out of phase with the rest of the metapopulation. Moreover, the addition of non-local dispersal significantly decreases the equilibration time of the metapopulation.

2025-06-10T00:27:39Z Davi Arrais Nobre Karen C. Abbott Jonathan Machta Alan Hastings http://arxiv.org/abs/2401.11686v3 Evolutionary dynamics of any multiplayer game on regular graphs 2025-06-09T18:24:12Z

Multiplayer games on graphs are at the heart of theoretical descriptions of key evolutionary processes that govern vital social and natural systems. However, a comprehensive theoretical framework for solving multiplayer games with an arbitrary number of strategies on graphs is still missing. Here, we solve this by drawing an analogy with the Balls-and-Boxes problem, based on which we show that the local configuration of multiplayer games on graphs is equivalent to distributing $k$ identical co-players among $n$ distinct strategies. We use this to derive the replicator equation for any $n$-strategy multiplayer game under weak selection, which can be solved in polynomial time. As an example, we revisit the second-order free-riding problem, where costly punishment cannot truly resolve social dilemmas in a well-mixed population. Yet, in structured populations, we derive an accurate threshold for the punishment strength, beyond which punishment can either lead to the extinction of defection or transform the system into a rock-paper-scissors-like cycle. The analytical solution also qualitatively agrees with the phase diagrams that were previously obtained for non-marginal selection strengths. Our framework thus allows an exploration of any multi-strategy multiplayer game on regular graphs.
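
The balls-and-boxes analogy can be made concrete with a short enumeration. Distributing $k$ identical co-players among $n$ distinct strategies yields $\binom{k+n-1}{n-1}$ local configurations, the usual stars-and-bars count. The sketch below only lists and counts these configurations under that reading of the abstract. It does not attempt the replicator equation, and the function name and example values are illustrative assumptions.

```python
from math import comb


def configurations(k, n):
    """Yield every tuple (k_1, ..., k_n) of non-negative integers summing to k."""
    if n == 1:
        yield (k,)
        return
    for first in range(k + 1):
        for rest in configurations(k - first, n - 1):
            yield (first,) + rest


k, n = 4, 3  # e.g. 4 co-players choosing among 3 strategies
configs = list(configurations(k, n))
print(len(configs), comb(k + n - 1, n - 1))  # 15 15
```

For fixed $n$ the count grows polynomially in $k$, so the number of local configurations stays manageable as the group size grows.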

2024-01-22T04:52:22Z 69 pages, 12 figures Nat. Commun. 15, 5349 (2024) Chaoqian Wang Matjaž Perc Attila Szolnoki 10.1038/s41467-024-49505-5 http://arxiv.org/abs/2506.04508v2 Mechanistic models for panel data: Analysis of ecological experiments with four interacting species 2025-06-08T20:52:51Z

In an ecological context, panel data arise when time series measurements are made on a collection of ecological processes. Each process may correspond to a spatial location for field data, or to an experimental ecosystem in a designed experiment. Statistical models for ecological panel data should capture the high levels of nonlinearity, stochasticity, and measurement uncertainty inherent in ecological systems. Furthermore, the system dynamics may depend on unobservable variables. This study applies iterated particle filtering techniques to explore new possibilities for likelihood-based statistical analysis of these complex systems. We analyze data from a mesocosm experiment in which two species of the freshwater planktonic crustacean genus, Daphnia, coexist with an alga and a fungal parasite. Time series data were collected on replicated mesocosms under six treatment conditions. Iterated filtering enables maximization of the likelihood for scientifically motivated nonlinear partially observed Markov process models, providing access to standard likelihood-based methods for parameter estimation, confidence intervals, hypothesis testing, model selection and diagnostics. This toolbox allows scientists to propose and evaluate scientifically motivated stochastic dynamic models for panel data, constrained only by the requirement to write code to simulate from the model and to specify a measurement distribution describing how the system state is observed.

2025-06-04T23:11:21Z 73 pages, 31 figures Bo Yang Jesse Wheeler Meghan A. Duffy Aaron A. King Edward L. Ionides http://arxiv.org/abs/2411.13228v2 A general relationship between extinction risk and carrying capacity 2025-06-06T12:37:37Z

Understanding the relationship between a population's probability of extinction and its carrying capacity frames conservation status assessments and guides efforts to understand and mitigate the ongoing biodiversity crisis. Despite this, our understanding of the mathematical form of this relationship remains limited. We conducted ~5 billion population viability assessments that jointly converge on a modified Gompertz curve. This pattern is consistent across >1700 distinct model populations, representing different breeding systems and widely varying rates of population growth, levels of environmental stochasticity, adult survival rate, age at first breeding, and initial population size. Analytical treatment of the underlying dynamics shows that few assumptions suffice to show that the relationship holds for any extant population subject to density-dependent growth. Finally, we discuss the implications of these results and consider the practical use of our findings by conservationists.

2024-11-20T11:42:48Z Thomas S Ball Ben Balmford Andrew Balmford Daniele Rinaldo Piero Visconti Rhys Green http://arxiv.org/abs/2409.10588v8 ADIOS: Antibody Development via Opponent Shaping 2025-06-06T11:07:55Z

Anti-viral therapies are typically designed to target only the current strains of a virus, a myopic response. However, therapy-induced selective pressures drive the emergence of new viral strains, against which the original myopic therapies are no longer effective. This evolutionary response presents an opportunity: our therapies could both defend against and actively influence viral evolution. This motivates our method ADIOS: Antibody Development vIa Opponent Shaping. ADIOS is a meta-learning framework where the process of antibody therapy design, the outer loop, accounts for the virus's adaptive response, the inner loop. With ADIOS, antibodies are not only robust against potential future variants, they also influence, i.e., shape, which future variants emerge. In line with the opponent shaping literature, we refer to our optimised antibodies as shapers. To demonstrate the value of ADIOS, we build a viral evolution simulator using the Absolut! framework, in which shapers successfully target both current and future viral variants, outperforming myopic antibodies. Furthermore, we show that shapers modify the distribution over viral evolutionary trajectories to result in weaker variants. We believe that our ADIOS paradigm will facilitate the discovery of long-lived vaccines and antibody therapies while also generalising to other domains with evolutionarily adaptive opponents, such as antimicrobial resistance and cancer treatment. Our code is available at https://github.com/olakalisz/adios.

2024-09-16T14:56:27Z Accepted at ICML 2025 Proceedings of the 42nd International Conference on Machine Learning (ICML 2025), PMLR 267 Sebastian Towers Aleksandra Kalisz Philippe A. Robert Alicia Higueruelo Francesca Vianello Ming-Han Chloe Tsai Harrison Steel Jakob N. Foerster