https://arxiv.org/api/6vdg0NcxWCTFdZ5kT58MzzraQyA 2026-06-14T05:51:26Z 13016 255 15 http://arxiv.org/abs/2509.07013v3 Generalized Machine Learning for Fast Calibration of Agent-Based Epidemic Models 2026-04-02T16:43:50Z Agent-based models (ABMs) are widely used to study infectious disease dynamics, but their calibration is often computationally intensive, limiting their applicability in time-sensitive public health settings. We propose DeepIMC (Deep Inverse Mapping Calibration), a machine learning-based calibration framework that directly learns the inverse mapping from epidemic time series to epidemiological parameters. DeepIMC trains a bidirectional Long Short-Term Memory (BiLSTM) neural network on synthetic epidemic trajectories generated from agent-based models such as the Susceptible-Infected-Recovered (SIR) model, enabling rapid parameter estimation without repeated simulation at inference time. We evaluate DeepIMC through an extensive simulation study comprising 5,000 heterogeneous epidemic scenarios and benchmark its performance against Approximate Bayesian Computation (ABC) using likelihood-free Markov Chain Monte Carlo. The results show that DeepIMC substantially improves parameter recovery accuracy, produces sharp and well-calibrated predictive intervals, and reduces computational time by more than an order of magnitude relative to ABC. Although structural parameter identifiability constraints limit the precise recovery of all model parameters simultaneously, the calibrated models reliably reproduce epidemic trajectories and support accurate forward prediction with their estimated parameters. DeepIMC is implemented in the open-source R package epiworldRCalibrate, facilitating practical adoption for real-time epidemic modeling and policy analysis. Overall, our findings demonstrate that DeepIMC provides a scalable, operationally effective alternative to traditional simulation-based calibration methods for agent-based epidemic models. 2025-09-06T18:28:00Z Sima Najafzadehkhoei George Vega Yon Derek S. Meyer Bernardo Modenesi http://arxiv.org/abs/2410.11548v3 Bayesian inference of mixed Gaussian phylogenetic models 2026-04-02T11:40:09Z Background: Continuous traits evolution of a group of taxa that are correlated through a phylogenetic tree is commonly modelled using parametric stochastic differential equations to represent deterministic change of trait through time, while incorporating noises that represent different unobservable evolutionary pressures. Often times, a heterogeneous Gaussian process that consists of multiple parametric sub-processes is often used when the observed data come from a very diverse set of taxa. In the maximum likelihood setting, challenges can be found when exploring the involved likelihood surface and when interpreting the uncertainty around the parameters. Results: We extend the methods to tackle inference problems for mixed Gaussian phylogenetic models (MGPMs) by implementing a Bayesian scheme that can take into account biologically relevant priors. The posterior inference method is based on the Population Monte Carlo (PMC) algorithm that are easily parallelized, and using an efficient algorithm to calculate the likelihood of phylogenetically correlated observations. A model evaluation method that is based on the proximity of the posterior predictive distribution to the observed data is also implemented. Simulation study is done to test the inference and evaluation capability of the method. Finally, we test our method on a real-world dataset. Conclusion: We implement the method in the R package bgphy, available at github.com/bayubeta/bgphy. Simulation study demonstrates that the method is able to infer parameters and evaluate models properly, while its implementation on the real-world dataset indicates that a carefully selected model of evolution based on naturally occurring classifications results in a better fit to the observed data. 2024-10-15T12:32:49Z BMC Bioinformatics 27 (Suppl 1), 77 (2026) Bayu Brahmantio Krzysztof Bartoszek Etka Yapar 10.1186/s12859-026-06399-y http://arxiv.org/abs/2604.00602v1 The fitness landscape of overlapping genes 2026-04-01T08:07:31Z Natural genomes sometimes encode two different proteins in staggered reading frames of the same DNA sequence. Despite the prevalence of these 'overlapping genes' across the tree of life, it remains unknown whether arbitrary protein pairs can overlap, to what extent such overlaps are feasible, or what design principles govern them. Here, we study compatibility, frustration, and connectivity in the fitness landscape of overlapping genes. We computationally design sequences de novo that satisfy the dual functional constraints of two distinct protein families. The joint fitness landscape, inferred via Potts models from multiple sequence alignments, reveals a fundamental trade-off between the two proteins and provides a simple criterion for when overlap is feasible. We find widespread compatibility between protein families, with one class of reading frames markedly more permissible than others. By exploring alternative genetic codes, we find that the natural genetic code is uniquely well-suited to support overlapping genes. Constructing mutational paths between sequences, we find that sequence-diverse overlapped genes can be connected via a network of near-neutral mutations. Overall, our results suggest that protein fitness landscapes are sufficiently flexible so as to accommodate the stringent, orthogonal requirements of overlapping genes. 2026-04-01T08:07:31Z Orson Kirsch Nicole Wood Steven A Redford Kabir Husain http://arxiv.org/abs/2604.00393v1 How to Forage for a Mate? 2026-04-01T02:22:25Z Foraging is a central decision-making behavior performed by all animals, essential to garnishing enough energy for an organism to survive. Similarly, mating is crucial for evolutionary continuity and offspring production. Mate choice is one of the central tenets of sexual selection, driving major evolutionary processes, and can be regarded as a decision-making process between potential mating partners. Often researchers have used coarse-grained models to describe macroscopic phenomenology pertaining to mate choice without detailed quantitative mechanisms of how animals use individual and environmental signals to guide their mating decisions. In this letter, we show that mate choice can be cast as a foraging problem, and we present an analytically tractable optimal foraging-inspired mechanistic theory of decision-making underlying mate choice. We begin from the premise that deciding upon which partner with which to mate is at its core a stochastic decision-making process. Agents adopt a variety of decision strategies, tuned by decision thresholds for leaving or committing to a mate. We find that sensitive leaving thresholds are favored independently of signal availability in the population. By contrast, optimal thresholds for committing to a mate depend upon signal availability in the population, with signal-rich populations generally favoring less eager strategies compared to signal-poor populations. 2026-04-01T02:22:25Z 7 pages, 4 figures Daniel T Bernstein Ahmed El Hady http://arxiv.org/abs/2604.00153v1 Macroscopic Signatures of Gauge-Mediated Contagion: Deriving Behavioral Shielding from Stochastic Field Theory 2026-03-31T18:59:42Z We present a unified theoretical model relating stochastic microscopic epidemic dynamics with macroscopic non-linear population behavior. Utilizing the Doi-Peliti formalism, we model the pathogen as a gauge mediator field coupled to susceptible and infected host populations, and introduce a Reactive Immunity Field capable of spontaneous symmetry breaking. We demonstrate that the naive epidemic vacuum is destabilized by radiative loop corrections via the Coleman-Weinberg mechanism, generating a dynamic herd immunity threshold. By extracting the classical saddle-point limit of the Effective Action, we derive the macroscopic reaction-diffusion equations governing the host population. We show that integrating out the gauge mediator inherently generates a thermodynamic Free Energy dependent on the square of the susceptible density. This non-linearity produces a macroscopic spatial ``Fear Drift'' proportional to the magnitude of the immunity field, and a cubic shielding penalty in the effective reproductive number ($R_{eff}$). In this work, we establish a mapping between fundamental field-theoretic mechanisms and specific terms in the macroscopic behavioral equations. We demonstrate that Debye screening is physically executed by the spatial cross-diffusion fluxes driving host evacuation. Simultaneously, vacuum polarization manifests as a non-linear cubic penalty ($-S^3 I$) in the dressed reaction rate that dynamically suppresses the effective reproductive number. As a validation of our model, we apply the formalism to high-resolution spatiotemporal COVID-19 data from Germany. 2026-03-31T18:59:42Z 14 pages, 3 figures Jose de Jesus Bernal-Alvarado David Delepine http://arxiv.org/abs/2602.13913v2 Gauge-Mediated Contagion: A Quantum Electrodynamics-Inspired Framework for Non-Local Epidemic Dynamics and Superdiffusion 2026-03-31T18:51:42Z In this paper, we introduce a gauge-mediated Epidemiological Model inspired by Quantum Electrodynamics (QED). In this model, the ``direct contact'' paradigm of classical SIR models is replaced by a gauge-mediated interaction where the environment, represented by a pathogen field $\varphi$, plays a fundamental role in the epidemic dynamics. In this model, the non-local characteristics of epidemics appear naturally by integrating out the pathogen field. Utilizing the Doi-Peliti formalism, we derive the effective action of the system and the standard Feynman rules that can be used to compute perturbatively any observables. The standard deterministic SIR equations emerge as the mean-field saddle-point approximation of this formalism. Going beyond this classical limit, we utilize 1-loop fluctuation computations to analytically derive spatial shielding effects that are inaccessible to standard compartmental models. Using standard QED techniques, we show how to relate renormalized pathogen mass, Debye screening, to epidemiological concepts and we compute at first order the effective reproductive number,$R_{eff}$, and how the condition to have an epidemic is related to a phase transition in the pathogen mass. We show that the superspreading hosts can be included easily in this formalism. We applied our model using high-resolution spatial data from the COVID-19 pandemic across 400 districts in Germany. Our analysis reveals that the gauge field provides a early warning signal, consistently anticipating surges in reported cases with a predictive lead time of approximately one week. Furthermore, the data analysis confirms a density-driven non-linear scaling in the correlation length. By linking out of equilibrium statistical physics to epidemiology, this model shows to be a predictive tool that anticipates outbreaks based on the structural instability of the network. 2026-02-14T22:31:57Z 12, 6 figures, we include an application of the model to study the COVID-19 epidemic in Germany from 2020-2023 Jose de Jesus Bernal-Alvarado David Delepine http://arxiv.org/abs/2603.29916v1 Growth-rate distributions at stationarity 2026-03-31T15:57:12Z We propose new analytical tools for describing growth-rate distributions generated by stationary time-series. Our analysis shows how deviations from normality are not pathological behaviour, as suggested by some traditional views, but instead can be accounted for by clean and general statistical considerations. In contrast, strict normality is the effect of specific modelling choices. Systems characterized by stationary Gamma or heavy-tailed abundance distributions produce log-growth-rate distributions well described by a generalized logistic distribution, which can describe tent-shaped or nearly normal datasets and serves as a useful null model for these observables. These results prove that, for large enough time lags, in practice, growth-rate distributions cease to be time-dependent and exhibit finite variance. Based on this analysis, we identify some key stylized macroecological patterns and specific stochastic differential equations capable of reproducing them. A pragmatic workflow for heuristic selection between these models is then introduced. This approach is particularly useful for systems with limited data-tracking quality, where applying sophisticated inference methods is challenging. 2026-03-31T15:57:12Z 9 pages, 3 figures Edgardo Brigatti http://arxiv.org/abs/2603.29398v1 Pathogen diversity emerging from coevolutionary dynamics in interconnected systems 2026-03-31T08:01:39Z The spread of infectious disease and the evolution of antigenically distinct strains are often modeled separately, despite strong feedbacks mediated by host immune memory and heterogeneous contacts. To tackle this challenging problem, we introduce a coevolutionary framework in which transmission occurs on a metapopulation network while mutational exploration of strain space follows a mutation network. In this multiscale model, cross-immunity is encoded by similarity in the latent diffusion geometry of the strain network, so that nearby strains confer partial immune protection. We first identify an effective critical region that controls the transition between extinction, recurrent outbreak episodes, and long-lived endemic persistence, thus characterizing the resulting strain-turnover dynamics. We then derive a replicator-mutator-like equation for strain composition and an explicit dynamical evolutionary landscape induced by the coupling of mutation and transmission. Finally, allowing host heterogeneity to modulate the local mutation structure, we show that spreading across demes can effectively connect otherwise disconnected components of strain space, increasing long-term endemic diversity while producing a non-monotonic change in overall prevalence. Together, our results isolate minimal mechanisms by which immune-mediated competition and network structure can shape antigenic diversification. 2026-03-31T08:01:39Z Davide Zanchetta Vittoria Bettio Sandro Azaele Manlio De Domenico http://arxiv.org/abs/2603.29116v1 Disentangling the interactive effects of anthropogenic disturbances on biodiversity 2026-03-31T01:08:10Z Anthropogenic activity threatens biodiversity through climate change, habitat fragmentation, and increasing frequency and scale of disturbance. Various theoretical studies have sought to shed light on how these factors could promote or hinder the coexistence of species. However, our understanding of the relative importance of, and interactions between, these factors remains limited. In this study, we employ a theoretical approach integrating three commonly cited coexistence mechanisms -- the competition-colonisation trade-off, the intermediate disturbance hypothesis, and spatial heterogeneity -- into a unified model. We implement a novel method to integrate habitat autocorrelation into a system of differential equations, to create a simple and flexible model that can be used to investigate coexistence of multiple species arranged in a competitive hierarchy under different disturbance and habitat structure scenarios. Using this model, we find that considering interactions between different mechanisms is crucial for explaining the coexistence of species. Biodiversity patterns alternative to the uni-peak curve predicted by the intermediate disturbance hypothesis (e.g., bimodal) emerge along disturbance gradients as habitat fragmentation increases. Furthermore, habitat loss outweighs habitat autocorrelation effects in highly disturbed scenarios, yet autocorrelation can shape species coexistence under low disturbance. These findings underscore the need to integrate spatial and temporal mechanisms in biodiversity management. 2026-03-31T01:08:10Z Length: 36 pages (including main manuscript and supplementary material). Main manuscript contains 6 figures and 1 table Isaac Planas-Sitjà Ryosuke Iritani Adam L. Cronin http://arxiv.org/abs/2511.03849v4 Which Similarity-Sensitive Entropy (Sentropy)? 2026-03-30T19:43:21Z Shannon entropy is not the only entropy that is relevant to machine-learning datasets, nor possibly even the most important one. Traditional entropies such as Shannon entropy capture information represented by elements' frequencies but not the richer information encoded by their similarities and differences. Capturing the latter requires similarity-sensitive entropy (``sentropy''). Sentropy can be measured using either the recently developed Leinster-Cobbold-Reeve framework (LCR) or the newer Vendi score (VS). This raises the practical question of which one to use: LCR or VS. Here we address this question theoretically and numerically, using 53 large and well-known imaging and tabular datasets. We find that LCR and VS values can differ by orders of magnitude and are complementary, except in limiting cases. We show that both LCR and VS results depend on how similarities are scaled, and introduce the notion of ``half-distance'' to parameterize this dependence. We prove the VS provides an upper bound on LCR for all non-negative values of the Rényi-Hill order parameter, as well as for negative values in the special case that the similarity matrix is full rank. We conclude that VS is preferable only when a dataset's elements can be usefully interpreted as linear combinations of a more fundamental set of ``ur-elements'' or when the system that the dataset describes has a quantum-mechanical character. In the broader case where one simply wishes to capture the rich information encoded by elements' similarities and differences as well as their frequencies, we propose that LCR should be favored; nevertheless, for certain half-distances the two methods can complement each other. 2025-11-05T20:39:29Z 17 pages, two columns, 9 figures Phuc Nguyen Josiah Couch Rahul Bansal Alexandra Morgan Chris Tam Miao Li Rima Arnaout Ramy Arnaout http://arxiv.org/abs/2512.11164v2 Mixed updating in structured populations 2026-03-30T18:38:45Z Evolutionary graph theory (EGT) studies the effect of population structure on evolutionary dynamics. The vertices of the graph represent the $N$ individuals. The edges denote interactions for competitive replacement. Two standard update rules are death-Birth (dB) and Birth-death (Bd). Under dB, an individual is chosen uniformly at random to die, and its neighbors compete to fill the vacancy proportional to their fitness. Under Bd, an individual is chosen for reproduction proportional to fitness, and its offspring replaces a randomly chosen neighbor. Here we study mixed updating between those two scenarios. In each time step, with probability $δ$ the update is dB and with remaining probability it is Bd. We study fixation probabilities and times as functions of $δ$ under neutral evolution and constant selection. Despite the fact that fixation probabilities and times can be increasing, decreasing, or non-monotonic in $δ$, we prove nearly all unweighted undirected graphs have short fixation times and provide an efficient algorithm to estimate their fixation probabilities. Finally, we prove exact formulas for fixation probabilities on cycles, stars, and more complex structures and classify their sensitivities to $δ$. 2025-12-11T22:58:24Z 35 pages, 7 figures. Clearer presentation. This article is a distinct manuscript by the same authors and differs in content from the conference version, available as arXiv:2511.18252. Compared to arXiv:2511.18252 (ITCS '26), we focused on different quantities, changed much of the wording, added new results for weighted and directed graphs, proved sensitivity results, and incorporated 7 figures David A. Brewster Yichen Huang Michael Mitzenmacher Martin A. Nowak http://arxiv.org/abs/2603.22498v2 Modelling SARS-CoV-2 epidemics via compartmental and cellular automaton SEIRS model with temporal immunity and vaccination 2026-03-30T16:28:06Z We consider the SEIRS epidemiology model with such features of the COVID-19 outbreak as: abundance of unidentified infected individuals, limited time of immunity and a possibility of vaccination. The control of the pandemic dynamics is possible by restricting the transmission rate, increasing identification and isolation rate of infected individuals, and via vaccination. For the compartmental version of this model, we found stable disease-free and endemic stationary states. The basic reproductive number is analysed with respect to balancing quarantine and vaccination measures. The positions and heights of the first peak of outbreak are obtained numerically and fitted to simple in usage algebraic forms. Lattice-based realization of this model is studied by means of the asynchronous cellular automaton algorithm. This permitted to study the effect of social distancing by varying the neighbourhood size of the model. The attempt is made to match the quarantine and vaccination effects. 2026-03-23T19:03:30Z 20 pages, 11 figures. arXiv admin note: substantial text overlap with arXiv:2112.02661 Condens. Matter Phys., vol. 29, no. 1, p. 13501, Mar. 2026 J. Ilnytskyi T. Patsahan 10.5488/CMP.29.13501 http://arxiv.org/abs/1904.03236v5 Log-normal Superstatistics Reveals Statistical Resilience in the Panic Response of Confined Ants 2026-03-30T15:41:46Z We report the emergence of Log-normal Superstatistics in the collective motion of ants confined in a quasi-2D arena and exposed to a panic-inducing stimulus. A data-driven superstatistical Langevin model accurately reproduces the transition from stationary behavior to an organized escape response, characterized by non-Gaussian velocity distributions and a stochastic diffusion coefficient. Our findings show that danger information propagates via a memory-limited, cascade-like mechanism, resulting in a stable cluster formation despite individual memory constraints. These results indicate that a slowly varying diffusivity arises from the multiplicative combination of interaction-mediated processes under confinement, leading naturally to Log-normal fluctuations. The persistence of this statistical structure under panic reveals a form of collective resilience, establishing a mechanistic bridge between Superstatistics and living active matter in confined environments. 2019-04-05T18:57:31Z 8 pages, 8 figures A. Reyes M. Curbelo F. Tejera A. Rivera M. S. Turner O. Ramos E. Altshuler http://arxiv.org/abs/2603.28464v1 Will a time-varying complex system be stable? 2026-03-30T14:05:19Z Randomly-assembled dynamical systems are theoretically predicted to be unstable upon crossing a critical threshold of complexity, as first shown by May. Yet, empirical complex systems exhibit remarkable stability, indicating the presence of additional mechanisms playing a stabilizing role. The relation between complexity and stability is typically assessed by assuming fixed interactions, whereas real systems often evolve in intrinsically time-dependent states. To understand how this affects stability, we linearize a general non-autonomous dynamics around a reference operating state and model the resulting parameters as stochastic processes, which represent the minimal extension of static random interactions to time-varying ones. We derive exact stability bounds that generalize complexity-stability theory to dynamically varying systems. Notably, we find that temporal variability allows systems to remain stable even when their instantaneous Jacobian would predict instability. We compare our results against a non-linear neural network model, where our theory applies exactly, and the generalized Lotka-Volterra equations, where we numerically find that time-varying interactions systematically postpone the onset of replica-symmetry breaking. Overall, our results indicate that temporal variability systematically improves stability, demonstrating a general mechanism by which complex systems can violate classical complexity-stability bounds. 2026-03-30T14:05:19Z 8+4 pages, 3+3 figures Francesco Ferraro Christian Grilletta Amos Maritan Samir Suweis Sandro Azaele http://arxiv.org/abs/2603.28285v1 Global stability and uniform persistence in an epidemic model with saturating fomite-mediated transmission 2026-03-30T11:06:16Z We analyse the global dynamics of a Susceptible--Vaccinated--Exposed--Infected--Recovered (SVEIR) epidemic model with demographic turnover, imperfect vaccination, and two transmission routes: direct host-to-host contagion and indirect transmission via contaminated fomites. Indirect transmission is described through an environmental pathogen concentration and a Holling-type dose--response function, accounting for nonlinear incidence at high contamination levels. Threshold conditions separating disease elimination from long-term persistence are expressed in terms of the control reproduction number $\mathcal R_c$, and the classical threshold condition $\mathcal R_c<1$ is derived for the local asymptotic stability of the disease-free equilibrium. For the Holling type~II case, we further obtain an explicit closed-form sufficient condition for the global asymptotic stability of the disease-free equilibrium by applying the Kamgang--Sallet approach for monotone systems with a Metzler infected subsystem. In the absence of vaccination, this criterion recovers the sharp threshold $\mathcal R_0\le 1$ for the global asymptotic stability of the disease-free equilibrium, where $\mathcal R_0$ denotes the basic reproduction number. Conversely, when $\mathcal R_c>1$, we establish uniform persistence of the infection and the existence of at least one endemic equilibrium using persistence theory for semiflows and an acyclicity analysis of the boundary dynamics. Overall, our results quantify the combined impact of vaccination and saturating fomite-mediated transmission on the global behaviour of the model. 2026-03-30T11:06:16Z Emanuela Penitente Urszula Foryś Burcu Gürbüz