https://arxiv.org/api/VNgbczNED8fn5Q9dal73Kwj+PDo2026-06-21T03:17:22Z1302985515http://arxiv.org/abs/2503.09076v2Bounding the SNPR distance between two tree-child networks using generalised agreement forests2025-09-19T21:57:49ZAgreement forests continue to play a central role in the comparison of phylogenetic trees since their introduction more than 25 years ago. More specifically, they are used to characterise several distances that are based on tree rearrangement operations and related quantifiers of dissimilarity between phylogenetic trees. In addition, the concept of agreement forests continues to underlie most advancements in the development of algorithms that exactly compute the aforementioned measures. In this paper, we introduce agreement digraphs, a concept that generalises agreement forests for two phylogenetic trees to two phylogenetic networks. Analogous to the way in which agreement forests compute the subtree prune and regraft distance between two phylogenetic trees but inherently more complex, we then use agreement digraphs to bound the subnet prune and regraft distance between two tree-child networks from above and below and show that our bounds are tight.2025-03-12T05:18:47ZThe Electronic Journal of Combinatorics, 32, P3.46, 2025Steven KelkSimone LinzCharles Semple10.37236/13976http://arxiv.org/abs/2508.06835v3Expand or better manage protected areas: a framework for minimizing extinction risk when threats are concentrated near edges2025-09-19T21:00:36ZSeveral international agreements have called for the rapid expansion of protected areas to halt biodiversity declines. However, recent research has shown that expanding protected areas may be less cost-effective than redirecting resources towards threat management in existing reserves. These findings often assume that threats are homogeneously distributed in the landscape. In some cases, threats are more concentrated near the edge of protected areas. As protected areas expand, core habitat in the centre expands more rapidly than its edge, potentially creating a refuge from threats. In this paper, we present a framework linking protected area expansion and threat management to extinction risk, via their impact on population carrying capacity and growth rate within core and edge habitats. We demonstrate the framework using a simple population model where individuals are uniformly distributed in a circular protected area threatened by poachers who penetrate the protected area to a fixed distance. We parameterise the model for Peter's Duiker (Cephalophus callipygus) harvested for food in the dense undergrowth of African forests using snares. Expanding protected areas can reduce extinction risk more effectively compared to an equivalent investment in snare removal for larger protected areas that already sustain core unhunted habitat. Our results demonstrate the importance of protected area expansion in buffering susceptible populations from fixed hunting pressure restricted to protected area edges. However, for cases where threats, wildlife, and managers respond to each other strategically in space, the relative importance of expansion versus increased management remains a significant open problem.2025-08-09T05:24:36ZBiological Conservation, 311, 111469 (2025)Brendan G DillonHugh P PossinghamMatthew H Holden10.1016/j.biocon.2025.111469http://arxiv.org/abs/2509.16405v1Ordered Leaf Attachment (OLA) Vectors can Identify Reticulation Events even in Multifurcated Trees2025-09-19T20:28:48ZRecently, a new vector encoding, Ordered Leaf Attachment (OLA), was introduced that represents $n$-leaf phylogenetic trees as $n-1$ length integer vectors by recording the placement location of each leaf. Both encoding and decoding of trees run in linear time and depend on a fixed ordering of the leaves. Here, we investigate the connection between OLA vectors and the maximum acyclic agreement forest (MAAF) problem. A MAAF represents an optimal breakdown of $k$ trees into reticulation-free subtrees, with the roots of these subtrees representing reticulation events. We introduce a corrected OLA distance index over OLA vectors of $k$ trees, which is easily computable in linear time. We prove that the corrected OLA distance corresponds to the size of a MAAF, given an optimal leaf ordering that minimizes that distance. Additionally, a MAAF can be easily reconstructed from optimal OLA vectors. We expand these results to multifurcated trees: we introduce an $O(kn \cdot m\log m)$ algorithm that optimally resolves a set of multifurcated trees given a leaf-ordering, where $m$ is the size of a largest multifurcation, and show that trees resolved via this algorithm also minimize the size of a MAAF. These results suggest a new approach to fast computation of phylogenetic networks and identification of reticulation events via random permutations of leaves. Additionally, in the case of microbial evolution, a natural ordering of leaves is often given by the sample collection date, which means that under mild assumptions, reticulation events can be identified in polynomial time on such datasets.2025-09-19T20:28:48Z18 pages, 4 figuresAlexey MarkinTavis K. Andersonhttp://arxiv.org/abs/2509.16385v1Parameter variability can produce heavy tails in a model for the spatial distribution of settling organisms2025-09-19T19:58:58ZWe show that a simple mechanistic model of spatial dispersal for settling organisms, subject to parameter variability, can generate heavy-tailed radial probability density functions. The movement of organisms in the model consists of a two-dimensional diffusion that ceases after a random time, where the parameters that characterize each of these stages have been randomized. Our findings show that these minimal assumptions can yield heavy-tailed dispersal patterns, providing a simplified framework that increases the understanding of long-distance dispersal events in movement ecology.2025-09-19T19:58:58ZLuis F. GordilloPriscilla E. Greenwoodhttp://arxiv.org/abs/2509.08578v3Multi-modal Adaptive Estimation for Temporal Respiratory Disease Outbreak2025-09-19T17:05:44ZTimely and robust influenza incidence forecasting is critical for public health decision-making. This paper presents MAESTRO (Multi-modal Adaptive Estimation for Temporal Respiratory Disease Outbreak), a novel, unified framework that synergistically integrates advanced spectro-temporal modeling with multi-modal data fusion, including surveillance, web search trends, and meteorological data. By adaptively weighting heterogeneous data sources and decomposing complex time series patterns, the model achieves robust and accurate forecasts. Evaluated on over 11 years of Hong Kong influenza data (excluding the COVID-19 period), MAESTRO demonstrates state-of-the-art performance, achieving a superior model fit with an R-square of 0.956. Extensive ablations confirm the significant contributions of its multi-modal and spectro-temporal components. The modular and reproducible pipeline is made publicly available to facilitate deployment and extension to other regions and pathogens, presenting a powerful tool for epidemiological forecasting.2025-09-10T13:27:40ZHong LiuKerui CenYanxing ChenZige LiuDong ChenZifeng YangChitin Honhttp://arxiv.org/abs/2509.16133v1A Unified and Predictive Measure of Functional Diversity2025-09-19T16:31:51ZDespite the critical role of functional diversity (FD) in understanding ecological systems and processes, its robust quantification remains a significant challenge. A long-held view in the field is that it is not possible to capture its three facets -- functional richness, functional divergence, and functional evenness -- in a single index. This perspective has prompted recent proposals for FD measurement to use three separate indices, one for each aspect. Here, we challenge this paradigm by demonstrating that the probability-weighted Vendi Score (pVS), first introduced by Friedman and Dieng (2023), can serve as a powerful functional diversity index that can capture its three facets. We adapt pVS to functional ecology by defining it as the exponential of the Rényi entropy of the eigenvalues of the abundance-weighted trait similarity matrix. This formulation allows pVS to be applicable at any biological level. It can be defined at the species level, at which most existing FD metrics are defined, and at the individual level to naturally incorporate intraspecific trait variation (ITV) when detailed data are available. We theoretically and empirically demonstrate the robustness of pVS. We first mathematically prove it satisfies several essential desiderata for FD metrics, including invariance to functional redundancy, set monotonicity, distance monotonicity, and concavity. We then show that pVS consistently exhibits the expected ground-truth behavior on simulated ecosystem scenarios under which many FD metrics fail. By integrating abundances and trait similarities within a single, theoretically sound framework, pVS provides a generally applicable index for ecology.2025-09-19T16:31:51ZA single index to accurately measure functional diversity at different biological levelsAdji Bousso DiengAmey Pasarkarhttp://arxiv.org/abs/2509.15911v1The evolution of asymmetrical regulation of physiology is central to aging2025-09-19T12:08:14ZThe evolutionary biology of aging is fundamental to understanding the mechanisms of aging and how to develop anti-aging treatments. Thus far most evolutionary theory concerns the genetics of aging with limited physiological integration. Here we present an intuitive evolutionary framework built on how physiology is regulated and how this regulation itself is then predicted to age. Life has evolved to secure reproduction and avoid system failure in early life, and it is the physiological regulation that evolves in response to those early life selection pressures that leads to the emergence of aging. Importantly, asymmetrical regulation of physiology will evolve as the Darwinian fitness costs of loss of regulation will not be symmetrical. When asymmetrical regulatory systems break during aging, they cause physiological function to drift towards the physiological range where costs of dysregulation are lowest, rendering aging directional. Our model explains many puzzling aspects of the biology of aging. These include why aging appears (but is not) programmed, why aging is gradual yet heterogeneous, why cellular and hormonal signaling are closely related to aging, the compensation law of mortality, why trade-offs between reproduction and aging remain elusive, why longer-lived organisms show more signs of aging during their natural lifespans, and why longer-lived organisms can be less responsive to treatments of aging that work well in short-lived organisms. We provide predictions of our theory that are empirically testable. By incorporating physiological regulation into evolutionary models of aging, we provide a novel perspective to guide empirical research in this still growing field.2025-09-19T12:08:14Z22 pages, 5 figuresMirre J P SimonsMarc Tatarhttp://arxiv.org/abs/2509.15787v1A Visual Discrete Event-based Simulator for Protection of Plants against Herbivores Employed as Computational Optimization Game2025-09-19T09:18:05ZPlants come with sophisticated strategies to survive within a highly competing environment. In addition, they need to resist frequent attacks from a variety of herbivores acting alone, in small groups, or in swarms. Since the amount of energy a plant might invest in defense and reproduction is limited, a complex optimization problem emerges. In a shared habitat, plants fight herbivores by shape and camouflage, by the release of specific toxins, or by attracting predators of herbivores. Furthermore, plants alert their surrounding field by signaling substances in the event of an assault. Transported by air or through a network of roots, signaling substances reach neighbors to trigger their defense. The offsprings of a plant commonly grow within a certain distance to benefit from symbiotic protection. We introduce a grid-based visual simulation software for detailed configuration and subsequent processing of the behavior of the resulting system in time and space. In terms of solution to a computational optimization problem inspired by nature, settings with low energy need and long life able to cope with different patterns of attack can be figured out and analyzed. Applications include novel techniques for efficient construction and secure operation of sensor networks.2025-09-19T09:18:05ZLucas DietrichBenjamin FörsterPeter LangendörferThomas Hinzehttp://arxiv.org/abs/2509.16284v1Temporally staggered cropping co-benefits beneficial insects and pest control globally2025-09-19T03:54:14ZReconciling increasing food production with biodiversity conservation is critical yet challenging, particularly given global declines in beneficial insects driven by monoculture intensification. Intercropping, the simultaneous or sequential cultivation of multiple crops, has been proposed as a viable strategy to enhance beneficial insect services and suppress pests, yet global evidence regarding optimal spatiotemporal intercropping configurations remains fragmented. Here, we synthesize results from 7,584 field experiments spanning six continents and 22 Koppen climate regions, evaluating effects of spatial (row, strip, mixed, agroforestry) and temporal (additive, replacement, relay) intercropping configuations on beneficial insect (predators, parasitoids, pollinators) abundance and pest suppression using the Management Efficiency Ratio (MER; log ratio of abundance in intercropping versus monoculture). Relay intercropping, characterized by temporally staggered planting, emerged as the universally optimal temporal configuration, substantially increasing predator (MER = 0.473) and parasitoid populations (MER = 0.512) and effectively suppressing pests (MER = -0.611) globally. At regional scales, identical spatiotemporal configurations simultaneously optimized beneficial insect predator abundance and pest suppression in 57% of regions, while other regions required distinct, insect-specific approaches. Our findings highlight relay intercropping as a globally generalizable solution, but underscore regional variation that calls for targeted policies to simultaneously secure food production and biodiversity conservation.2025-09-19T03:54:14ZAdrija DattaDepartment of Earth Sciences, Indian Institute of Technology, Gandhinagar, Gujarat, IndiaSubramanian SankaranarayananDepartment of Biological Sciences and Engineering, Indian Institute of Technology, Gandhinagar, Gujarat, IndiaUdit BhatiaDepartment of Computer Science and Engineering, Indian Institute of Technology, Gandhinagar, Gujarat, India, Department of Civil Engineering, Indian Institute of Technology, Gandhinagar, Gujarat, Indiahttp://arxiv.org/abs/2509.15338v1Epidemic amplification by correlated superspreading2025-09-18T18:28:25ZInfectious pathogens often propagate by superspreading, which focusses onward transmission on disproportionately few infected individuals. At the same time, infector-infectee pairs tend to have more similar transmission potentials than expected by chance, as risk factors assort among individuals who frequently interact. A key problem for infectious disease epidemiology, and in the dynamics of complex systems, is to understand how structured variation in individual transmission will scale to impact epidemic dynamics. Here we introduce a framework that reveals how population structure shapes epidemic thresholds, through autocorrelation of individual reproductive numbers along chains of transmission. We show that chains of superspreading can sustain epidemics even when the average transmission rate in the host population is below one, and derive a mathematical threshold beyond which correlated superspreading allows epidemics in otherwise subcritical systems. Empirical analysis of 47 transmission trees for 13 human pathogens indicate self-organizing bursts of superspreading are common and that many trees are near the critical boundary. Vaccination campaigns that proceed up assortative hierarchies of transmission are predicted to sustain the force of infection until herd immunity is reached, providing a mechanistic basis for threshold dynamics observed in real-world settings. Conversely, modulating correlations in transmission, rather than mean or variance, could enable cities and other complex systems to develop immune-like capacities that suppress contagion while preserving core functions.2025-09-18T18:28:25ZNoah Silva de LeonardiBenjamin D. Dalzielhttp://arxiv.org/abs/2509.10987v2A High-Order Cumulant Extension of Quasi-Linkage Equilibrium2025-09-18T14:08:11ZA central question in evolutionary biology is how to quantitatively understand the dynamics of genetically diverse populations. Modeling the genotype distribution is challenging, as it ultimately requires tracking all correlations (or cumulants) among alleles at different loci. The quasi-linkage equilibrium (QLE) approximation simplifies this by assuming that correlations between alleles at different loci are weak -- i.e., low linkage disequilibrium -- allowing their dynamics to be modeled perturbatively. However, QLE breaks down under strong selection, significant epistatic interactions, or weak recombination. We extend the multilocus QLE framework to allow cumulants up to order $K$ to evolve dynamically, while higher-order cumulants ($>K$) are assumed to equilibrate rapidly. This extended QLE (exQLE) framework yields a general equation of motion for cumulants up to order $K$, which parallels the standard QLE dynamics (recovered when $K = 1$). In this formulation, cumulant dynamics are driven by the gradient of average fitness, mediated by a geometrically interpretable matrix that stems from competition among genotypes. Our analysis shows that the exQLE with $K=2$ accurately captures cumulant dynamics even when the fitness function includes higher-order (e.g., third- or fourth-order) epistatic interactions, capabilities that standard QLE lacks. We also applied the exQLE framework to infer fitness parameters from temporal sequence data. Overall, exQLE provides a systematic and interpretable approximation scheme, leveraging analytical cumulant dynamics and reducing complexity by progressively truncating higher-order cumulants.2025-09-13T21:33:48ZKai S. ShimagakiJorge Fernandez-de-Cossio-DiazMauro PastoreRémi MonassonSimona CoccoJohn P. Bartonhttp://arxiv.org/abs/2509.14862v1Modelling species distributions using remote sensing predictors: Comparing Dynamic Habitat Index and LULC2025-09-18T11:29:46ZThis study compares the predictive capacity of the Dynamic Habitat Index (DHI) - a remote sensing (RS)-based measure of habitat productivity and variability - against traditional land-use/land-cover (LULC) metrics in species distribution modelling (SDM) applications. RS and LULC-based SDMs were built using distribution data for eleven bird, amphibian, and mammal species in Île-de-France. Predictor variables were derived from Sentinel-2 RS data and LULC classifications, with the latter incorporating Euclidean distance to habitat types. Ensemble SDMs were built using nine algorithms and evaluated with the Continuous Boyce Index (CBI) and a calibrated AUC. Habitat suitability scores and their binary transformations were assessed using niche overlap indices (Schoener, Warren, and Spearman rank correlation coefficient). Both RS and LULC approaches exhibited similar predictive accuracy overall. After binarisation however, the resulting niche maps diverged significantly. While LULC-based models exhibited spatial constraints (habitat suitability decreased as distance from recorded occurrences increased), RS-based models, which used continuous data, were not affected by geographic bias or distance effects. These results underscore the need to account for spatial biases in LULC-based SDMs. The DHI may offer a more spatially neutral alternative, making it a promising predictor for modelling species niches at regional scales.2025-09-18T11:29:46Z27 pages, 7 figures + AppendixEcological Indicators 179, 114198 (2025)Maïri Souza OliveiraClémentine PréauSamuel AlleaumeMaxime LenormandSandra Luque10.1016/j.ecolind.2025.114198http://arxiv.org/abs/2509.14468v1A generative model of function growth explains hidden self-similarities across biological and social systems2025-09-17T22:48:39ZFrom genomes and ecosystems to bureaucracies and cities, the growth of complex systems occurs by adding new types of functions and expanding existing ones. We present a simple generative model that generalizes the Yule-Simon process by including: (i) a size-dependent probability of introducing new functions, and (ii) a generalized preferential attachment mechanism for expanding existing ones. We uncover a shared underlying structure that helps explain how function diversity evolves in empirical observations, such as prokaryotic proteomes, U.S. federal agencies, and urban economies. We show that real systems are often best represented as having non-Zipfian rank-frequency distributions, driven by sublinear preferential attachment, whilst still maintaining power-law scaling in their abundance distributions. Furthermore, our analytics explain five distinct phases of the organization of functional elements across complex systems. The model integrates empirical findings regarding the logarithmic growth of diversity in cities and the self-similarity of their rank-frequency distributions. Self-similarity previously observed in the rank-frequency distributions of cities is not observed in cells and federal agencies -- however, under a rescaling relative to the total diversity, all systems admit self-similar structures predicted by our theory.2025-09-17T22:48:39Z11 pages main text, 7 main text figs, 9 pages of SI, 3 SI figsJames HolehouseS. RednerVicky Chuqiao YangP. L. KrapivskyJose Ignacio ArroyoGeoffrey B WestChris KempesHyejin Younhttp://arxiv.org/abs/2503.09841v2Maintaining diversity in structured populations2025-09-16T22:07:05ZWe examine population structures for their ability to maintain diversity in neutral evolution. We use the general framework of evolutionary graph theory and consider birth-death (bd) and death-birth (db) updating. The population is of size $N$. Initially all individuals represent different types. The basic question is: what is the time $T_N$ until one type takes over the population? This time is known as consensus time in computer science and as total coalescent time in evolutionary biology. For the complete graph, it is known that $T_N$ is quadratic in $N$ for db and bd. For the cycle, we prove that $T_N$ is cubic in $N$ for db and bd. For the star, we prove that $T_N$ is cubic for bd and quasilinear ($N\log N$) for db. For the double star, we show that $T_N$ is quartic for bd. We derive upper and lower bounds for all undirected graphs for bd and db. We also show the Pareto front of graphs (of size $N=8$) that maintain diversity the longest for bd and db. Further, we show that some graphs that quickly homogenize can maintain high levels of diversity longer than graphs that slowly homogenize. For directed graphs, we give simple contracting star-like structures that have superexponential time scales for maintaining diversity.2025-03-12T21:01:41Z43 pagesPNAS Nexus, Volume 4, Issue 8, August 2025, pgaf252David A. BrewsterJakub SvobodaDylan RoscowKrishnendu ChatterjeeJosef TkadlecMartin A. Nowak10.1093/pnasnexus/pgaf252http://arxiv.org/abs/2509.13428v1Autonomous Reporting of Normal Chest X-rays by Artificial Intelligence in the United Kingdom. Can We Take the Human Out of the Loop?2025-09-16T18:07:00ZChest X-rays (CXRs) are the most commonly performed imaging investigation. In the UK, many centres experience reporting delays due to radiologist workforce shortages. Artificial intelligence (AI) tools capable of distinguishing normal from abnormal CXRs have emerged as a potential solution. If normal CXRs could be safely identified and reported without human input, a substantial portion of radiology workload could be reduced.
This article examines the feasibility and implications of autonomous AI reporting of normal CXRs. Key issues include defining normal, ensuring generalisability across populations, and managing the sensitivity-specificity trade-off. It also addresses legal and regulatory challenges, such as compliance with IR(ME)R and GDPR, and the lack accountability frameworks for errors. Further considerations include the impact on radiologists practice, the need for robust post-market surveillance, and incorporation of patient perspectives. While the benefits are clear, adoption must be cautious.2025-09-16T18:07:00ZKatrina NashJames VazAhmed MaiterChristopher JohnsNicholas WoznitzaAditya KaleAbdala Espinosa MorgadoRhidian BramleyMark HallDavid LoweAlex NovakSarim Ather