https://arxiv.org/api/N7ghAoeCnwL++UWxKdodlgnfifw2026-06-21T15:23:09Z13029100515http://arxiv.org/abs/2507.19615v1Population dynamics under random switching2025-07-25T18:30:18ZPopulations interact non-linearly and are influenced by environmental fluctuations. In order to have realistic mathematical models, one needs to take into account that the environmental fluctuations are inherently stochastic. Often, environmental stochasticity is modeled by systems of stochastic differential equations. However, this type of stochasticity is not always the best suited for ecological modeling. Instead, biological systems can be modeled using piecewise deterministic Markov processes (PDMP). For a PDMP the process follows the flow of a system of ordinary differential equations for a random time, after which the environment switches to a different state, where the dynamics is given by a different system of differential equations. Then this is repeated. The current paper is devoted to the study of the dynamics of $n$ populations described by $n$-dimensional Kolmogorov PDMP. We provide sharp conditions for persistence and extinction, based on the invasion rates (Lyapunov exponents) of the ergodic probability measures supported on the boundary of the positive orthant. In order to showcase the applicability of our results, we apply the theory in some interesting ecological examples.2025-07-25T18:30:18Z72 pages, 3 figuresAlexandru HeningSiddharth Sabharwalhttp://arxiv.org/abs/2006.12926v3A self-supervised neural-analytic method to predict the evolution of COVID-19 in Romania2025-07-25T08:32:46ZAnalysing and understanding the transmission and evolution of the COVID-19 pandemic is mandatory to be able to design the best social and medical policies, foresee their outcomes and deal with all the subsequent socio-economic effects. We address this important problem from a computational and machine learning perspective. More specifically, we want to statistically estimate all the relevant parameters for the new coronavirus COVID-19, such as the reproduction number, fatality rate or length of infectiousness period, based on Romanian patients, as well as be able to predict future outcomes. This endeavor is important, since it is well known that these factors vary across the globe, and might be dependent on many causes, including social, medical, age and genetic factors. We use a recently published improved version of SEIR, which is the classic, established model for infectious diseases. We want to infer all the parameters of the model, which govern the evolution of the pandemic in Romania, based on the only reliable, true measurement, which is the number of deaths. Once the model parameters are estimated, we are able to predict all the other relevant measures, such as the number of exposed and infectious people. To this end, we propose a self-supervised approach to train a deep convolutional network to guess the correct set of Modified-SEIR model parameters, given the observed number of daily fatalities. Then, we refine the solution with a stochastic coordinate descent approach. We compare our deep learning optimization scheme with the classic grid search approach and show great improvement in both computational time and prediction accuracy. We find an optimistic result in the case fatality rate for Romania which may be around 0.3% and we also demonstrate that our model is able to correctly predict the number of daily fatalities for up to three weeks in the future.2020-06-23T12:00:04ZRadu D. StochiţoiuMarian PetricaTraian RebedeaIonel PopescuMarius Leordeanuhttp://arxiv.org/abs/2505.21519v2Stationary and Non-Stationary Transition Probabilities in Decision Making: Modeling COVID-19 Dynamics2025-07-25T01:17:52ZThis study introduces a comparative modeling framework using stationary and non-stationary transition probabilities within a Markov Decision Process (MDP) to assess COVID-19 disease dynamics. Stationary transition probabilities assume constant transition rates, while non-stationary transitions reflect time-dependent behaviors including policy interventions or behavioral changes. We develop a comprehensive compartmental model with transitions based on binomial and multinomial processes. Mathematical models for both stationary and non-stationary transition frameworks are developed and simulated over a 365-day period to emphasize dynamic variations in epidemic outcomes. Our findings highlight the significance of non-stationary modeling in accurately representing the dynamic characteristics of pandemic situations and provide recommendations for optimizing public health interventions under uncertainty. This comparative analysis offers useful information for epidemiological modeling and decision-making in dynamic risk environments.2025-05-22T12:08:55Z26 pages, 4 figuresRomario Gildas Foko TiomelaSamson Adekola AlagbeOlawale Nasiru LawalSerges Love Teutu TallaIsabella Kemajou-Brownhttp://arxiv.org/abs/2507.18380v1ARTreeFormer: A Faster Attention-based Autoregressive Model for Phylogenetic Inference2025-07-24T12:56:16ZProbabilistic modeling over the combinatorially large space of tree topologies remains a central challenge in phylogenetic inference. Previous approaches often necessitate pre-sampled tree topologies, limiting their modeling capability to a subset of the entire tree space. A recent advancement is ARTree, a deep autoregressive model that offers unrestricted distributions for tree topologies. However, its reliance on repetitive tree traversals and inefficient local message passing for computing topological node representations may hamper the scalability to large datasets. This paper proposes ARTreeFormer, a novel approach that harnesses fixed-point iteration and attention mechanisms to accelerate ARTree. By introducing a fixed-point iteration algorithm for computing the topological node embeddings, ARTreeFormer allows fast vectorized computation, especially on CUDA devices. This, together with an attention-based global message passing scheme, significantly improves the computation speed of ARTree while maintaining great approximation performance. We demonstrate the effectiveness and efficiency of our method on a benchmark of challenging real data phylogenetic inference problems.2025-07-24T12:56:16Z29 pages, 7 figuresTianyu XieYicong MaoCheng Zhanghttp://arxiv.org/abs/2507.17923v1Altruism and energy flow in dynamic beehive models2025-07-23T20:39:34ZThis work explores the relationship between altruism and the genetic system of arrhenotoky through an evolutionary game theory (EGT)-inspired lens, using a dynamic model of beehive populations consisting of three castes: workers, drones, and the queen. Arrhenotoky is a form of asexual reproduction in which unfertilized eggs become males while fertilized eggs develop into females, leading to unusual patterns of genetic relatedness between family members. This mode of reproduction occurs in insects such as the Hymenoptera, including bees. In the hive environment, bees often display altruistic behavior, or actions taken by an organism that reduce its own fitness to increase the fitness of others. Eusociality, an elaborate form of social organization characterized by complex and altruistic social behaviors, is also observed in the Hymenoptera. To explore the interplay between altruism and the reproductive patterns of arrhenotoky, we employ a population dynamics model to simulate beehive populations over a range of parameters, controlling for altruism in workers and the queen. Our results show that altruistic behaviors are essential for beehive success, with optimal worker altruism corresponding to the division of labor observed in eusocial species. Furthermore, we find that modest altruism from the queen is also vital for hive survival, emphasizing the delicate balance that can exist in these complex social systems. Overall, our findings shed light on the co-evolution of altruism, arrhenotoky, and eusociality in the natural world.2025-07-23T20:39:34Z29 pages, 7 figures, for associated code see https://github.com/ZackNathan/arrhenotoky_simulationZachary NathanDaniel DiPietroOlivia J. Chuhttp://arxiv.org/abs/2411.01691v2Closing the complexity gap of the double distance problem2025-07-22T11:25:26ZGenome rearrangement has been an active area of research in computational comparative genomics for the last three decades. While initially mostly an interesting algorithmic endeavor, now the practical application by applying rearrangement distance methods and more advanced phylogenetic tasks is becoming common practice, given the availability of many completely sequenced genomes. Several genome rearrangement models have been developed over time, sometimes with surprising computational properties. A prominent example is the fact that computing the reversal distance of two signed permutations is possible in linear time, while for two unsigned permutations it is NP-hard. Therefore one has always to be careful about the precise problem formulation and complexity analysis of rearrangement problems in order not to be fooled. The double distance is the minimum number of genomic rearrangements between a singular and a duplicated genome that, in addition to rearrangements, are separated by a whole genome duplication. At the same time it allows to assign the genes of the duplicated genome to the two paralogous chromosome copies that existed right after the duplication event. Computing the double distance is another example of a tricky hardness landscape: If the distance measure underlying the double distance is the simple breakpoint distance, the problem can be solved in linear time, while with the more elaborate DCJ distance it is NP-hard. Indeed, there is a family of distance measures, parameterized by an even number k, between the breakpoint distance (k=2) and the DCJ distance (k=\infty). Little was known about the hardness border between these extremes; the problem complexity was known only for k=4 and k=6. In this paper, we close the gap, providing a full picture of the hardness landscape when computing the double distance.2024-11-03T21:34:08ZLuís CunhaThiago LopesUéverton SouzaLeonard BohnenkämperMarília D. V. BragaJens Stoyehttp://arxiv.org/abs/2410.23729v2Basic offspring number and robust feedback design for the biological control of vectors by sterile insect release technique2025-07-21T08:38:41ZSterile Insect Technique (SIT) is a promising control method against insect pests and insect vectors. It consists in releasing males previously sterilized in laboratory, in order to reduce or eliminate a specific wild population. We study in this paper the implementation by feedback control of SIT-based elimination campaign of Aedes mosquitoes. We provide state-feedback and output-feedback control laws and establish their convergence, as well as their robustness properties. In this design procedure, a pivotal role is played by the average number of secondary female insects produced by a single female insect, called basic offspring number, and by the use of properties of monotone systems. Illustrative simulations are provided.2024-10-31T08:33:22ZThis paper has received the 2025 European Control Conference Best Paper Award23rd European Control Conference, Jun 2025, Thessaloniki, GreecePierre-Alexandre BlimanMUSCLEES, LJLLhttp://arxiv.org/abs/2408.09305v2Constructing the Molecular Tree of Life using Assembly Theory and Mass Spectrometry2025-07-20T12:16:29ZHere we demonstrate the first biochemistry-agnostic approach to map evolutionary relationships at the molecular scale, allowing the construction of phylogenetic models using mass spectrometry (MS) and Assembly Theory (AT) without elucidating molecular identities. AT allows us to estimate the complexity of molecules by deducing the amount of shared information stored within them when . By examining 74 samples from a diverse range of biotic and abiotic sources, we used tandem MS data to detect 24102 analytes (9262 unique) and 59518 molecular fragments (6755 unique). Using this MS dataset, together with AT, we were able to infer the joint assembly spaces (JAS) of samples from molecular analytes. We show how JAS allows agnostic annotation of samples without fingerprinting exact analyte identities, facilitating accurate determination of their biogenicity and taxonomical grouping. Furthermore, we developed an AT-based framework to construct a biochemistry-agnostic phylogenetic tree which is consistent with genome-based models and outperforms other similarity-based algorithms. Finally, we were able to use AT to track colony lineages of a single bacterial species based on phenotypic variation in their molecular composition with high accuracy, which would be challenging to track with genomic data. Our results demonstrate how AT can expand causal molecular inference to non-sequence information without requiring exact molecular identities, thereby opening the possibility to study previously inaccessible biological domains.2024-08-17T22:39:34Z26 pages, 8 figures, 49 referencesAmit KahanaAlasdair MacLeodHessam MehrAbhishek SharmaEmma CarrickMichael JirasekSara WalkerLeroy Croninhttp://arxiv.org/abs/2507.14829v1Partitioning of Eddy Covariance Footprint Evapotranspiration Using Field Data, UAS Observations and GeoAI in the U.S. Chihuahuan Desert2025-07-20T05:56:16ZThis study proposes a new method for computing transpiration across an eddy covariance footprint using field observations of plant sap flow, phytomorphology sampling, uncrewed aerial system (UAS), deep learning-based digital image processing, and eddy covariance micrometeorological measurements. The method is applied to the Jornada Experimental Range, New Mexico, where we address three key questions: (1) What are the daily summer transpiration rates of Mesquite (Prosopis glandulosa) and Creosote (Larrea tridentata) individuals, and how do these species contribute to footprint-scale evapotranspiration? (2) How can the plant-level measurements be integrated for terrain-wide transpiration estimates? (3) What is the contribution of transpiration to total evapotranspiration within the eddy covariance footprint? Data collected from June to October 2022, during the North American Monsoon season, include hourly evapotranspiration and precipitation rates from the Ameriflux eddy covariance system (US Jo-1 Bajada site) and sap flux rates from heat-balance sensors. We used plant biometric measurements and supervised classification of multispectral imagery to upscale from the patch to footprint-scale estimations. A proportional relationship between the plant's horizontal projected area and the estimated number of water flow conduits was extended to the eddy covariance footprint via UAS data. Our results show that Mesquite's average daily summer transpiration is 2.84 mm/d, while Creosote's is 1.78 mm/d (a ratio of 1.6:1). The summer footprint integrated transpiration to evapotranspiration ratio (T/ET) was 0.50, decreasing to 0.44 during dry spells and increasing to 0.63 following significant precipitation. Further testing of this method is needed in different regions to validate its applicability. With appropriate adjustments, it could be relevant for other areas with similar ecological conditions.2025-07-20T05:56:16ZHabibur R. HowliderHernan A. MorenoMarguerite E. MauritzStephanie N. Marquezhttp://arxiv.org/abs/2503.12331v2Carbon removal capacity estimation of taiga reforestation and afforestation at the western boreal edge using spatially explicit carbon budget modeling2025-07-20T03:51:18ZCanada's northern boreal forest edge offers considerable potential for climate change mitigation through large-scale tree planting. Afforestation in these sparsely forested regions could assist the natural northward migration of forests while capitalizing on their carbon removal capacity. However, the sequestration potential is uncertain due to a lack of spatially explicit models. This study uses Monte Carlo estimates and a carbon budget model to quantify the carbon removal capacity of afforestation at the northwestern boreal edge from 2025 to 2100. We combined satellite inventory data and probabilistic fire regime representations to simulate total ecosystem carbon under scenarios considering fire return intervals, land classes, planting mortality, and climate variables. Our results indicate that afforesting ~6.4-32 million hectares could sequester ~3.88-19.4 Gigatonnes of $CO_{2}$e over the next 75 years, with the Taiga Shield West ecozone showing the most potential. Even the conservative estimate (3.88 Gt$CO_{2}$e) is over five times Canada's total annual greenhouse gas emissions, making it a substantial contribution toward the nation's 2050 net-zero goal. Further research is needed to refine these estimates, assess economic viability, and investigate impacts on regional processes like permafrost thaw and surface albedo.2025-03-16T02:59:58ZKevin Bradley DsouzaEnoch OfosuRichard BoudreaultJuan Moreno-CruzYuri Leonenkohttp://arxiv.org/abs/2507.14614v1Knowing when to stop: insights from ecology for building catalogues, collections, and corpora2025-07-19T13:25:08ZA major locus of musicological activity-increasingly in the digital domain-is the cataloguing of sources, which requires large-scale and long-lasting research collaborations. Yet, the databases aiming at covering and representing musical repertoires are never quite complete, and scholars must contend with the question: how much are we still missing? This question structurally resembles the 'unseen species' problem in ecology, where the true number of species must be estimated from limited observations. In this case study, we apply for the first time the common Chao1 estimator to music, specifically to Gregorian chant. We find that, overall, upper bounds for repertoire coverage of the major chant genres range between 50 and 80 %. As expected, we find that Mass Propers are covered better than the Divine Office, though not overwhelmingly so. However, the accumulation curve suggests that those bounds are not tight: a stable ~5% of chants in sources indexed between 1993 and 2020 was new, so diminishing returns in terms of repertoire diversity are not yet to be expected. Our study demonstrates that these questions can be addressed empirically to inform musicological data-gathering, showing the potential of unseen species models in musicology.2025-07-19T13:25:08Z12th International Conference on Digital Libraries for Musicology, Sogang University, Seoul, South Korea, 26 September 2025Jan HajičFabian Moss10.1145/3748336.3748347http://arxiv.org/abs/2507.14062v1Directionality measures in evolutionary ecological networks: Insights from the Tangled Nature model2025-07-18T16:41:45ZThe myriad microscopic interactions among the individual organisms that constitute an ecological system collectively give rise, at the macroscopic scale, to evolutionary trends. The ability to detect the directionality of such trends is crucial for understanding and managing the dynamics of natural systems. Nevertheless, identifying the key observable quantities that capture such directional behaviour poses a major challenge. In this study, we propose that translating ecological data into a network framework is a valuable strategy to measure system stability and evolution. We examine the Tangled Nature model as a test case, evaluating network entropy, species diversity, and the clustering coefficient as metrics of network stability and directionality.2025-07-18T16:41:45ZTwo Supplementary Information files providedAndrea MarchettiHenrik Jeldtoft Jensenhttp://arxiv.org/abs/2507.13310v2Modelling the spillover from online engagement to offline protest: stochastic dynamics and mean-field approximations on networks2025-07-18T03:14:04ZSocial media is transforming various aspects of offline life, from everyday decisions such as dining choices to the progression of conflicts. In this study, we propose a coupled modelling framework with an online social network layer to analyse how engagement on a specific topic spills over into offline protest activities. We develop a stochastic model and derive several mean-field models of varying complexity. These models allow us to estimate the reproductive number and anticipate when surges in activity are likely to occur. A key factor is the transmission rate between the online and offline domains; for offline outbursts to emerge, this rate must fall within a critical range, neither too low nor too high. Additionally, using synthetic networks, we examine how network structure influences the accuracy of these approximations. Our findings indicate that low-density networks need more complex approximations, whereas simpler models can effectively represent higher-density networks. When tested on two real-world networks, however, increased complexity did not enhance accuracy.2025-07-17T17:30:13Z44 pages, 33 figuresJournal of Complex Networks, Volume 14, Issue 2, April 2026, cnaf057Moyi TianP. Jeffrey BrantinghamNancy Rodríguez10.1093/comnet/cnaf057http://arxiv.org/abs/2507.13531v1Methodological considerations for semialgebraic hypothesis testing with incomplete U-statistics2025-07-17T20:46:11ZRecently, Sturma, Drton, and Leung proposed a general-purpose stochastic method for hypothesis testing in models defined by polynomial equality and inequality constraints. Notably, the method remains theoretically valid even near irregular points, such as singularities and boundaries, where traditional testing approaches often break down. In this paper, we evaluate its practical performance on a collection of biologically motivated models from phylogenetics. While the method performs remarkably well across different settings, we catalogue a number of issues that should be considered for effective application.2025-07-17T20:46:11Z26 pages + 11 pages Supplementary MaterialsDavid BarnhillMarina Garrote-LópezElizabeth GrossMax HillBryson KagyJohn A. RhodesJoy Z. Zhanghttp://arxiv.org/abs/2501.05365v2Control of Overpopulated Tails in Kinetic Epidemic Models2025-07-17T17:23:19ZWe introduce model-based transition rates for controlled compartmental models in mathematical epidemiology, with a focus on the effects of control strategies applied to interacting multi-agent systems describing contact formation dynamics. In the framework of kinetic control problems, we compare two prototypical control protocols: one additive control directly influencing the dynamics and another targeting the interaction strength between agents. The emerging controlled macroscopic models are derived for an SIR compartmentalization to illustrate their impact on epidemic progression and contact interaction dynamics. Numerical results show the effectiveness of this approach in steering the dynamics and controlling epidemic trends, even in scenarios where contact distributions exhibit an overpopulated tail.2025-01-09T16:48:14ZMattia ZanellaAndrea Medaglia