https://arxiv.org/api/cutKVGhS1W9r+jQ+zQps3C1SMj02026-06-14T06:47:57Z1301627015http://arxiv.org/abs/2603.28200v1A Deep Reinforcement Learning Framework for Closed-loop Guidance of Fish Schools via Virtual Agents2026-03-30T09:10:02ZGuiding collective motion in biological groups is a fundamental challenge in understanding social interaction rules and developing automated systems for animal management. In this study, we propose a deep reinforcement learning (RL) framework for the closed-loop guidance of fish schools using virtual agents. These agents are controlled by policies trained via Proximal Policy Optimization (PPO) in simulation and deployed in physical experiments with rummy-nose tetras (Petitella bleheri), enabling real-time interaction between artificial agents and live individuals. To cope with the stochastic behavior of live individuals, we design a composite reward function to balance directional guidance with social cohesion. Our systematic evaluation of visual parameters shows that a white background and larger stimulus sizes maximize guidance efficacy in physical trials. Furthermore, evaluation across group sizes revealed that while the system demonstrates effective guidance for groups of five individuals, this capability markedly degrades as group size increases to eight. This study highlights the potential of deep RL for automated guidance of biological collectives and identifies challenges in maintaining artificial influence in larger groups.2026-03-30T09:10:02Z18 pages, 8 figuresTakato ShibayamaHiroaki Kawashimahttp://arxiv.org/abs/2603.27255v1When can fitness epistasis be ignored in a polygenic trait at equilibrium?2026-03-28T12:13:51ZAlthough many phenotypic traits are determined by a large number of genetic variants, the behavior of allele frequencies in a polygenic trait is not completely understood. The problem is especially challenging when the quantitative trait of interest is under epistatic selection as the allele frequency at a locus is affected by those at other loci. Here, we consider a panmictic, diploid finite population evolving under stabilizing selection and symmetric mutations when the population is in linkage equilibrium. In the stationary state, using a diffusion theory, we calculate the marginal distribution of allele frequency, and find parameter regimes where fitness epistasis can not be ignored for an accurate description of the frequency distribution. For such parameters, the mean deviation in the phenotypic optimum and genic variance are, however, found to be well captured even when epistatic interactions are neglected. Thus, while the presence of epistasis may not be evident in phenotypic quantities, it can strongly affect the allele frequency distribution.We also find that the allele frequency distribution at a locus is unimodal if its effect size is below a threshold effect and bimodal otherwise; these results are the stochastic analog of the deterministic ones where the stable allele frequency becomes bistable when the effect size exceeds a threshold. Our analytical results are verified against Monte Carlo simulations and numerical integration of a Langevin equation.2026-03-28T12:13:51ZSignificantly revised version of bioRxiv 2023.01.25.525607: no change in previous results, new results added, focus on stationary stateArchana DeviKavita Jainhttp://arxiv.org/abs/2405.16885v3Hidden Markov modelling of spatio-temporal dynamics of measles in 1750-1850 Finland2026-03-27T10:43:54ZReal world spatio-temporal datasets, and phenomena related to them, are often challenging to visualise or gain a general overview of. In order to summarise information encompassed in such data, we combine two well known statistical modelling methods. To account for the spatial dimension, we use the intrinsic modification of the conditional autoregression, and incorporate it with the hidden Markov model, allowing the spatial patterns to vary over time. We apply our method to parish register data considering deaths caused by measles in Finland in 1750-1850, and gain novel insight of previously undiscovered infection dynamics. Five distinctive, reoccurring states, describing spatially and temporally differing infection burden and potential routes of spread, are identified. We also find that there is a change in the occurrences of the most typical spatial patterns circa 1812, possibly due to changes in communication networks after major administrative transformations in Finland.2024-05-27T07:08:14ZJournal of Applied Statistics, 1-25. (2026)Tiia-Maria PasanenJouni HelskeTarmo Ketola10.1080/02664763.2026.2634794http://arxiv.org/abs/2603.26226v1Braess's paradox in tandem-running ants: When shortest path is not the quickest2026-03-27T09:49:43ZBraess's paradox -- where adding network capacity increases travel time -- is typically attributed to selfish agents. Although eusocial colonies maximize collective fitness, we find experimentally that \emph{Diacamma indicum} ants exhibit this paradox: Leaders favour the shortest path even when it slows the colony. We present a quantitative model of the exploration-exploitation trade-off, demonstrating that evolutionary forces selecting for shortest-path identification can force suboptimal global states. This proves the paradox can emerge in highly cooperative systems without individual selfishness.2026-03-27T09:49:43ZJoy Das BairagyaUdipta ChakrabortiSumana AnnagiriSagar Chakrabortyhttp://arxiv.org/abs/2507.03005v2Beyond cognacy2026-03-27T08:44:27ZComputational phylogenetics has become an established tool in historical linguistics, with many language families now analyzed using likelihood-based inference. However, standard approaches rely on expert-annotated cognate sets, which are sparse, labor-intensive to produce, and limited to individual language families. This paper explores alternatives by comparing the established method to two fully automated methods that extract phylogenetic signal directly from lexical data. One uses automatic cognate clustering with unigram/concept features; the other applies multiple sequence alignment (MSA) derived from a pair-hidden Markov model. Both are evaluated against expert classifications from Glottolog and typological data from Grambank. Also, the intrinsic strengths of the phylogenetic signal in the characters are compared. Results show that MSA-based inference yields trees more consistent with linguistic classifications, better predicts typological variation, and provides a clearer phylogenetic signal, suggesting it as a promising, scalable alternative to traditional cognate-based methods. This opens new avenues for global-scale language phylogenies beyond expert annotation bottlenecks.2025-07-02T06:47:34Z9 pages, 2 figuresGerhard Jäger0.18653/v1/2025.sigtyp-1.6http://arxiv.org/abs/2603.25986v1Evaluating Phylogenetic Comparative Methods under Reticulate Evolutionary Scenarios2026-03-27T00:21:54ZPhylogenetic comparative methods (PCMs) are widely used to study trait evolution. However, many evolutionary histories involve reticulate evolutionary scenarios, such as hybridization, that violate core assumptions of these methods. In this study, we evaluate how such violations affect the performance of PCMs. In particular, we focus on the ancestral character estimation, evolutionary rate estimation, and model selection. We simulate continuous trait evolution on various phylogenetic network topologies and assess the performance of PCMs that assume a bifurcating tree (i.e., major tree of the network) as the underlying model of evolution. We found that the performance of the tested PCMs was suboptimal. Using random forest, generalized linear models, and model-based clustering, we identified key factors contributing to these inaccuracies. Our results show that frequent and/or recent hybridization accompanied by one ore more transgressive events and rapidly evolving traits (i.e., high evolutionary rate) lead to significant estimation error, especially with respect to rate estimation and model choice. These factors substantially shift trait values away from tree-based model expectations, leading to overall increased error in parameter estimates. Our study demonstrates cases in which PCMs that rely on trees are likely to misinterpret biological histories and offers recommendations for researchers studying systems with complex evolutionary histories.2026-03-27T00:21:54Z28 pages, 10 figures, 4 tablesLydia MorleyEmma LehmbergSungsik Konghttp://arxiv.org/abs/2603.26822v1Modularity, asymmetry, and polarization shape consensus speed in the voter model2026-03-26T23:59:38ZIn populations with community structure, the formation of consensus requires both alignment within and diffusion of beliefs across groups, processes that evolve on distinct time scales. How do modularity, asymmetry, and polarization shape this process? We study a variant of the voter model in which a population is divided into two cliques of sizes $N_1$ and $N_2$. At each time step, a pair of nodes is selected; if their binary opinions differ, each agent adopts the opinion of the other with probability $p$. With probability $α$, the pairing occurs with a single clique, and with probability $1-α$, across cliques. We analyze how this coupling strength, population imbalance, and initial polarization jointly determine the time to consensus. Formation of consensus generally starts with inter-clique interactions rapidly synchronizing the two cliques' opinion fractions, after which consensus is reached through a slower diffusion along the synchronized manifold; this slow stage is largely insensitive to $α$ except when the cliques are nearly disconnected. To analyze these dynamics, we derive stochastic differential equations and Fokker-Planck approximations in the large-population limit, and assess their accuracy against the discrete model. While $α$ primarily affects the fast alignment stage, initially polarized and asymmetric populations exhibit nontrivial effects, including regimes in which an intermediate level coupling minimizes consensus time. A small-clique scaling analysis reveals that this optimum arises from a competition between fast alignment drift and noise amplification in the smaller group, and provides an approximate decomposition of consensus time into fast and slow contributions.2026-03-26T23:59:38Z23 pages, 9 figuresMadi YerlanovZachary KilpatrickNancy Rodriguezhttp://arxiv.org/abs/2603.25628v1Modeling the mutational dynamics of very short tandem repeats2026-03-26T16:40:51ZShort tandem repeats (STRs) are low-entropy regions in the genome, consisting of a short (1-6 bp) unit that is consecutively repeated multiple times. They are known for high mutational instability, due to so-called stutter-mutations, in which the number of units in the run increases or descreases. In particular, STRs with repeat unit length of 1-2 bp are prone to mutate even within several cell divisions. The extremely rapid accumulation of variation makes them interesting phylogenetic markers for retrospective single-cell lineage reconstruction. Here we model their mutational dynamics at the level of individual repeat unit type and then aggregate length variations over many STR loci with the aim of obtaining a very fast ``molecular clock''. We calibrate our model based on several datasets with known lineage structure prepared from cultured cells. We find that the mutational dynamics of STRs are reasonably consistent for a given cell line, but vary among different ones. This suggests that the dynamics are not entirely explained by mutations in caretaker genes, rather, various other factors play a role -- possibly tissue origin and differentiation state. Further data and research is necessary to asses their relative effects.2026-03-26T16:40:51Z13 pages, 4 figures. To be published in RECOMB-CG 2026 (Comparative Genomics). Conceptualization, A.O. and P.F.S.; formal analysis and software, A.O.; wet-lab methodology, single-cell isolation, and sample preparation, L.T., T.M. and T.B.; funding acquistion, E.S. and C.A.K.; wet-lab supervision, E.S.; supervision, C.A.K and P.F.SAmos OnnChair of Experimental Medicine and Therapy Research, University of RegensburgBioinformatics Group, Faculty of Mathematics and Computer Science, and Interdisciplinary Center for Bioinformatics, University of LeipzigTzipy MarxDepartment of Computer Science and Applied Mathematics, Weizmann Institute of ScienceLiming TaoCellular Tissue Genomics, GenentechTamir BiezunerDepartment of Computer Science and Applied Mathematics, Weizmann Institute of ScienceEhud ShapiroDepartment of Computer Science and Applied Mathematics, Weizmann Institute of ScienceChristoph A. KleinChair of Experimental Medicine and Therapy Research, University of RegensburgFraunhofer Institute for Toxicology and Experimental Medicine RegensburgPeter F. StadlerBioinformatics Group, Faculty of Mathematics and Computer Science, and Interdisciplinary Center for Bioinformatics, University of LeipzigMax Planck Institute for Mathematics in the SciencesInstitute for Theoretical Chemistry, University of ViennaFacultad de Ciencias, Universidad Nacional de ColombiaCenter for non-coding RNA in Technology and Health, University of CopenhagenSanta Fe Institutehttp://arxiv.org/abs/2603.25276v1Global Stability Analysis of the Age-Structured Chemostat With Substrate Dynamics2026-03-26T10:14:50ZIn this paper we study the stability properties of the equilibrium point for an age-structured chemostat model with renewal boundary condition and coupled substrate dynamics under constant dilution rate. This is a complex infinite-dimensional feedback system. It has two feedback loops, both nonlinear. A positive static loop due to reproduction at the age-zero boundary of the PDE, counteracted and dominated by a negative dynamic loop with the substrate dynamics. The derivation of explicit sufficient conditions that guarantee global stability estimates is carried out by using an appropriate Lyapunov functional. The constructed Lyapunov functional guarantees global exponential decay estimates and uniform global asymptotic stability with respect to a measure related to the Lyapunov functional. From a biological perspective, stability arises because reproduction is constrained by substrate availability, while dilution, mortality, and substrate depletion suppress transient increases in biomass before age-structure effects can amplify them. The obtained results are applied to a chemostat model from the literature, where the derived stability condition is compared with existing results that are based on (necessarily local) linearization methods.2026-03-26T10:14:50Z46 pagesIasson KarafyllisDionysios TheodosisMiroslav Krstichttp://arxiv.org/abs/2603.25239v1The Self-Replication Phase Diagram: Mapping Where Life Becomes Possible in Cellular Automata Rule Space2026-03-26T09:44:44ZWhat substrate features allow life? We exhaustively classify all 262,144 outer-totalistic binary cellular automata rules with Moore neighbourhood for self-replication and produce phase diagrams in the $(λ, F)$ plane, where $λ$ is Langton's rule density and $F$ is a background-stability parameter. Of these rules, 20,152 (7.69%) support pattern proliferation, concentrated at low rule density ($λ\approx 0.15$--$0.25$) and low-to-moderate background stability ($F \approx 0.2$--$0.3$), in the weakly supercritical regime (Derrida coefficient $μ= 1.81$ for replicators vs. $1.39$ for non-replicators). Self-replicating rules are more approximately mass-conserving (mass-balance 0.21 vs. 0.34), and this generalises to $k{=}3$ Moore rules. A three-tier detection hierarchy (pattern proliferation, extended-length confirmation, and causal perturbation) yields an estimated 1.56% causal self-replication rate. Self-replication rate increases monotonically with neighbourhood size under equalised detection: von Neumann 4.79%, Moore 7.69%, extended Moore 16.69%. These results identify background stability and approximate mass conservation as the primary axes of the self-replication phase boundary.2026-03-26T09:44:44Z20 pages, 9 figures, 1 table. Submitted to J. R. Soc. InterfaceDon Yinhttp://arxiv.org/abs/2506.21498v2Evolution of noisy learning in games2026-03-25T14:23:40ZPeople make strategic decisions many times a day - during negotiations, when coordinating actions with others, or when choosing partners for cooperation. The resulting dynamics can be studied with learning theory and evolutionary game theory. These frameworks explore how people adapt their decisions over time, in light of how effective their strategies have been. The outcomes of such learning processes depend on how sensitive individuals are to the performance of their strategies. When they are more sensitive, they systematically favor strategies they deem more successful. When they are less sensitive, their learning process is noisier and more erratic. Traditionally, most models treat this sensitivity as a fixed parameter - like the "selection strength" parameter in evolutionary models. Instead, we study how strategies and sensitivities co-evolve. We find that the co-evolutionary endpoints depend on both the type of strategic interaction and the learning rule employed. In prisoner's dilemmas, we often observe sensitivities to increase indefinitely. But in snowdrift and stag-hunt games, sensitivities often converge to a finite value, or we observe evolutionary branching altogether. These results shed light on how evolution might shape learning mechanisms for social behavior. They suggest that noisy learning does not need to be a by-product of cognitive constraints. Instead, it can serve as a means to gain strategic advantages.2025-06-26T17:26:56ZMarta C. CoutoFernando P. SantosChristian Hilbehttp://arxiv.org/abs/2603.24000v1Self-organized pattern synchronization modulated by stochasticity in coupled plankton ecosystems2026-03-25T07:00:10ZSpatial patterning and synchronization are pervasive features of plankton communities, yet the mechanisms that allow such patterns to persist coherently under environmental noise remain unresolved. In vertically structured aquatic ecosystems, plankton populations are often organized into distinct layers, raising the question of how interactions between layers shape both spatial self-organization and robustness. Here, we develop a spatiotemporal ecosystem model of a two-layer plankton community to examine the role of passive diffusive coupling under stochastic environmental fluctuations. We show that interlayer diffusion induces a sharp transition from independent, layer-specific Turing patterns to fully synchronized spatial patterns once the coupling strength exceeds a critical threshold. Importantly, the same coupling mechanism markedly enhances the stability of spatial patterns against environmental noise, extending their persistence far beyond that of non-coupled layers. Moreover, we uncover a trophic hierarchy in noise sensitivity, with zooplankton exhibiting substantially greater vulnerability than phytoplankton. Together, these results identify passive diffusive coupling as a unifying mechanism that simultaneously promotes spatial synchronization and robustness, providing a mechanistic explanation for the persistence of coherent plankton patterns in fluctuating aquatic environments.2026-03-25T07:00:10Zmain: 13 pages, 7 figures; SM: 7 pages, 5 figuresJu KangYiyuan NiuYuanzhi LiQuan-Xing LiuChengjin Chuhttp://arxiv.org/abs/2603.18385v2Evolutionarily Stable Stackelberg Equilibrium2026-03-25T05:09:43ZWe present a new solution concept called evolutionarily stable Stackelberg equilibrium (SESS). We study the Stackelberg evolutionary game setting in which there is a single leading player and a symmetric population of followers. The leader selects an optimal mixed strategy, anticipating that the follower population plays an evolutionarily stable strategy (ESS) in the induced subgame and may satisfy additional ecological conditions. We consider both leader-optimal and follower-optimal selection among ESSs, which arise as special cases of our framework. Prior approaches to Stackelberg evolutionary games either define the follower response via evolutionary dynamics or assume rational best-response behavior, without explicitly enforcing stability against invasion by mutations. We present algorithms for computing SESS in discrete and continuous games, and validate the latter empirically. Our model applies naturally to biological settings; for example, in cancer treatment the leader represents the physician and the followers correspond to competing cancer cell phenotypes.2026-03-19T01:06:10ZSam Ganzfriedhttp://arxiv.org/abs/2511.18000v2Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning2026-03-24T23:32:08ZWe present ContagionRL, a Gymnasium-compatible reinforcement learning platform specifically designed for systematic reward engineering in spatial epidemic simulations. Unlike traditional agent-based models that rely on fixed behavioral rules, our platform enables rigorous evaluation of how reward function design affects learned survival strategies across diverse epidemic scenarios. ContagionRL integrates a spatial SIRS+D epidemiological model with configurable environmental parameters, allowing researchers to stress-test reward functions under varying conditions including limited observability, different movement patterns, and heterogeneous population dynamics. We evaluate five distinct reward designs, ranging from sparse survival bonuses to a novel potential field approach, across multiple RL algorithms (PPO, SAC, A2C). Through systematic ablation studies, we identify that directional guidance and explicit adherence incentives are critical components for robust policy learning. Our comprehensive evaluation across varying infection rates, grid sizes, visibility constraints, and movement patterns reveals that reward function choice dramatically impacts agent behavior and survival outcomes. Agents trained with our potential field reward consistently achieve superior performance, learning maximal adherence to non-pharmaceutical interventions while developing sophisticated spatial avoidance strategies. The platform's modular design enables systematic exploration of reward-behavior relationships, addressing a knowledge gap in models of this type where reward engineering has received limited attention. ContagionRL is an effective platform for studying adaptive behavioral responses in epidemic contexts and highlight the importance of reward design, information structure, and environmental predictability in learning. Our code is publicly available at https://github.com/redradman/ContagionRL2025-11-22T10:02:37Z38 pages, 15 figures and 18 tables; Accepted to TMLR. OpenReview: https://openreview.net/forum?id=yPEASsx3hkTransactions on Machine Learning Research, 2026Radman RakhshandehrooDaniel Coombshttp://arxiv.org/abs/2503.22784v3Geometry and stability of species complexes: larger species speciate less often2026-03-24T20:09:24ZSpecies complexes are groups of closely related populations exchanging genes through dispersal. We study the dynamics of the structure of species complexes in a class of metapopulation models where demes can exchange genetic material through migration and diverge through the accumulation of new mutations. Importantly, we model the ecological feedback of differentiation on gene flow by assuming that the success of migrations decreases with genetic distance, through a specific function $h$. We investigate the effects of metapopulation size on the coherence of species structures, depending on some mathematical characteristics of the feedback function $h$. Our results suggest that with larger metapopulation sizes, species form increasingly coherent, transitive, and uniform entities. We conclude that the initiation of speciation events in large species requires the existence of idiosyncratic geographic or selective restrictions on gene flow.2025-03-28T17:50:16ZAmaury LambertEmmanuel SchertzerYannic Wenzel