https://arxiv.org/api/ZoOxu04IHS3504NoRBr7XoYzZd42026-06-26T04:09:16Z13050178515http://arxiv.org/abs/2403.19852v4A Review of Graph Neural Networks in Epidemic Modeling2024-09-09T12:54:05ZSince the onset of the COVID-19 pandemic, there has been a growing interest in studying epidemiological models. Traditional mechanistic models mathematically describe the transmission mechanisms of infectious diseases. However, they often suffer from limitations of oversimplified or fixed assumptions, which could cause sub-optimal predictive power and inefficiency in capturing complex relation information. Consequently, Graph Neural Networks(GNNs) have emerged as a progressively popular tool in epidemic research. In this paper, we endeavor to furnish a comprehensive review of GNNs in epidemic tasks and highlight potential future directions. To accomplish this objective, we introduce hierarchical taxonomies for both epidemic tasks and methodologies, offering a trajectory of development within this domain. For epidemic tasks, we establish a taxonomy akin to those typically employed within the epidemic domain. For methodology, we categorize existing work into Neural Models and Hybrid Models. Following this, we perform an exhaustive and systematic examination of the methodologies, encompassing both the tasks and their technical details. Furthermore, we discuss the limitations of existing methods from diverse perspectives and systematically propose future research directions. This survey aims to bridge literature gaps and promote the progression of this promising field, with a list of relevant papers at https://github.com/Emory-Melody/awesome-epidemic-modeling-papers. We hope that it will facilitate synergies between the communities of GNNs and epidemiology, and contribute to their collective progress.2024-03-28T21:54:48ZZewen LiuGuancheng WanB. Aditya PrakashMax S. Y. LauWei Jinhttp://arxiv.org/abs/2408.11524v2What you saw is what you got? -- Correcting reported incidence data for testing intensity2024-09-09T05:53:52ZDuring the COVID-19 pandemic, different types of non-pharmaceutical interventions played an important role in the efforts to control outbreaks and to limit the spread of the SARS-CoV-2 virus. In certain countries, large-scale voluntary testing of non-symptomatic individuals was done, with the aim of identifying asymptomatic and pre-symptomatic infections as well as gauging the prevalence in the general population. In this work, we present a mathematical model, used to investigate the dynamics of both observed and unobserved infections as a function of the rate of voluntary testing. The model indicate that increasing the rate of testing causes the observed prevalence to increase, despite a decrease in the true prevalence. For large testing rates, the observed prevalence also decrease. The non-monotonicity of observed prevalence explains some of the discrepancies seen when comparing uncorrected case-counts between countries. An example of such discrepancy is the COVID-19 epidemics observed in Denmark and Hungary during winter 2020/2021, for which the reported case-counts were comparable but the true prevalence were very different. The model provides a quantitative measure for the ascertainment rate between observed and true incidence, allowing for test-intensity correction of incidence data. By comparing the model to the country-wide epidemic of the Omicron variant (BA.1 and BA.2) in Denmark during the winter 2021/2022, we find a good agreement between the cumulative incidence as estimated by the model and as suggested by serology-studies. While the model does not capture the full complexity of epidemic outbreaks and the effect of different interventions, it provides a simple way to correct raw case-counts for differences in voluntary testing, making comparison across international borders and testing behaviour possible.2024-08-21T10:59:56ZRasmus Kristoffer PedersenChristian BerrigTamás TekeliGergely RöstViggo Andreasenhttp://arxiv.org/abs/2409.05282v1Improving Tree Probability Estimation with Stochastic Optimization and Variance Reduction2024-09-09T02:22:52ZProbability estimation of tree topologies is one of the fundamental tasks in phylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs) provide a powerful probabilistic graphical model for tree topology probability estimation by properly leveraging the hierarchical structure of phylogenetic trees. However, the expectation maximization (EM) method currently used for learning SBN parameters does not scale up to large data sets. In this paper, we introduce several computationally efficient methods for training SBNs and show that variance reduction could be the key for better performance. Furthermore, we also introduce the variance reduction technique to improve the optimization of SBN parameters for variational Bayesian phylogenetic inference (VBPI). Extensive synthetic and real data experiments demonstrate that our methods outperform previous baseline methods on the tasks of tree topology probability estimation as well as Bayesian phylogenetic inference using SBNs.2024-09-09T02:22:52Z23 pages, 6 figures, 7 tablesTianyu XieMusu YuanMinghua DengCheng Zhanghttp://arxiv.org/abs/2409.05245v1Bayesian estimation of transmission networks for infectious diseases2024-09-08T23:36:15ZReconstructing transmission networks is essential for identifying key factors like superspreaders and high-risk locations, which are critical for developing effective pandemic prevention strategies. In this study, we developed a Bayesian framework that integrates genomic and temporal data to reconstruct transmission networks for infectious diseases. The Bayesian transmission model accounts for the latent period and differentiates between symptom onset and actual infection time, enhancing the accuracy of transmission dynamics and epidemiological models. Additionally, the model allows for the transmission of multiple pathogen lineages, reflecting the complexity of real-world transmission events more accurately than models that assume a single lineage transmission. Simulation results show that the Bayesian model reliably estimates both the model parameters and the transmission network. Moreover, hypothesis testing effectively identifies direct transmission events. This approach highlights the crucial role of genetic data in reconstructing transmission networks and understanding the origins and transmission dynamics of infectious diseases.2024-09-08T23:36:15ZJianing XuHuimin HuGregory EllisonLili YuChristopher WhalenLiang Liuhttp://arxiv.org/abs/2409.05237v1The Stochastic Gause predator-prey model: noise-induced extinctions and invariance2024-09-08T22:41:28ZThis paper explores a stochastic Gause predator-prey model with bounded or sub-linear functional response. The model, described by a system of stochastic differential equations, captures the influence of stochastic fluctuations on predator-prey dynamics, with particular focus on the stability, extinction, and persistence of populations. We provide sufficient conditions for the existence and boundedness of solutions, analyze noise-induced extinction events, and investigate the existence of unique stationary distributions for the case of Holing Type I functional response. Our analysis highlights the critical role of noise in determining long-term ecological outcomes, demonstrating that even in cases where deterministic models predict stable coexistence, stochastic noise can drive populations to extinction or alter the system's dynamics significantly.2024-09-08T22:41:28ZLeon Alexander ValenciaPh. DJorge Mario Ramirez OsorioJorge Andres Sanchezhttp://arxiv.org/abs/2409.15333v1Fractional and fractal extensions of epidemiological models2024-09-08T00:01:24ZOne way to study the spread of disease is through mathematical models. The most successful models compartmentalize the host population according to their infectious stage, e.g., susceptible (S), infected (I), exposed (E), and recovered (R). The composition of these compartments leads to the SI, SIS, SIR, and SEIR models. In this Chapter, we present and compare three formulations of SI, SIS, SIR, and SEIR models in the framework of standard (integer operators), fractional (Caputo sense), and fractal derivatives (Hausdorff sense). As an application of the SI model, we study the evolution of AIDS cases in Bangladesh from 2001 to 2021. For this case, our simulations suggest that fractal formulation describes the data well. For the SIS model, we consider syphilis data from Brazil from 2006 to 2017. In this case, the three frameworks describe the data with good accuracy. We used data from Influenza A to adjust the SIR model in previous approaches and observed that the fractional formulation was better. The last application considers the COVID-19 data from India in the range 2020-04-10 to 2020-12-31 to adjust the parameters of the SEIR model. The standard formulation fits the data better than the other approaches. As a common result, all models exhibit steady solutions in the different formulations. The time to reach a steady solution is correlated to the considered approach. The standard and fractal formulations reach the steady state earlier when compared with the fractional formulation.2024-09-08T00:01:24ZEnrique C. GabrickErvin K. LenziAntonio M. Batistahttp://arxiv.org/abs/2310.15729v3Phenotype selection due to mutational robustness2024-09-07T06:08:04ZThe mutation-selection mechanism of Darwinian evolution gives rise not only to adaptation to environmental conditions but also to the enhancement of robustness against mutations. When two or more phenotypes have the same fitness value, the robustness distribution for different phenotypes can vary. Thus, we expect that some phenotypes are favored in evolution and that some are hardly selected because of a selection bias for mutational robustness. In this study, we investigated this selection bias for phenotypes in a model of gene regulatory networks (GRNs) using numerical simulations. The model had one input gene accepting a signal from the outside and one output gene producing a target protein, and the fitness was high if the output for the full signal was much higher than that for no signal. The model exhibited three types of responses to changes in the input signal: monostable, toggle switch, and one-way switch. We regarded these three response types as three distinguishable phenotypes. We constructed a randomly generated set of GRNs using the multicanonical Monte Carlo method originally developed in statistical physics and compared it to the outcomes of evolutionary simulations. One-way switches were strongly suppressed during evolution because of their lack of mutational robustness. By examining one-way switch GRNs in detail, we found that mutationally robust GRNs obtained by evolutionary simulations and non-robust GRNs obtained by McMC have different network structures. While robust GRNs have a common core motif, non-robust GRNs lack this motif. The bistability of non-robust GRNs is considered to be realized cooperatively by many genes, and these cooperative genotypes have been suppressed by evolution.2023-10-24T11:05:14Z15 pages, 14 figuresPLoS ONE 19 (2024): e0311058Macoto Kikuchi10.1371/journal.pone.0311058http://arxiv.org/abs/2401.15046v2Lane formation and aggregation spots in a model of ants2024-09-06T20:42:33ZWe investigate an interacting particle model to simulate a foraging colony of ants, where each ant is represented as an active Brownian particle. The interactions among ants are mediated through chemotaxis, aligning their orientations with the upward gradient of the pheromone field. Unlike conventional models, our study introduces a parameter that enables the reproduction of two distinctive behaviors: the well-known Keller--Segel aggregation into spots and the formation of traveling clusters, without relying on external constraints such as food sources or nests. We consider the associated mean-field limit partial differential equation (PDE) of this system and establish the analytical and numerical foundations for understanding these particle behaviors. Remarkably, the mean-field PDE not only supports aggregation spots and lane formation but also unveils a bistable region where these two behaviors compete. The patterns associated with these phenomena are elucidated by the shape of the growing eigenfunctions derived from linear stability analysis. This study not only contributes to our understanding of complex ant colony dynamics but also introduces a novel parameter-dependent perspective on pattern formation in collective systems.2024-01-26T18:18:51ZMaria BrunaMartin BurgerOscar de Withttp://arxiv.org/abs/2202.00234v2Periodic Traveling Waves in an Integro-Difference Equation With Non-Monotonic Growth and Strong Allee Effect2024-09-06T20:16:44ZWe derive sufficient conditions for the existence of a periodic traveling wave solution to an integro-difference equation with a piecewise constant growth function exhibiting a stable period2 cycle and strong Allee effect. The mean traveling wave speed is shown to be the asymptotic spreading speed of solutions with compactly supported initial data under appropriate conditions. We then conduct case studies for the Laplace kernel and uniform kernel.2022-02-01T06:03:16ZMichael NestorBingtuan Lihttp://arxiv.org/abs/2406.12002v2Modeling, Inference, and Prediction in Mobility-Based Compartmental Models for Epidemiology2024-09-06T17:57:52ZClassical compartmental models in epidemiology often assume a homogeneous population for simplicity, which neglects the inherent heterogeneity among individuals. This assumption frequently leads to inaccurate predictions when applied to real-world data. For example, evidence has shown that classical models overestimate the final pandemic size in the H1N1-2009 and COVID-19 outbreaks. To address this issue, we introduce individual mobility as a key factor in disease transmission and control. We characterize disease dynamics using mobility distribution functions for each compartment and propose a mobility-based compartmental model that incorporates population heterogeneity. Our results demonstrate that, for the same basic reproduction number, our mobility-based model predicts a smaller final pandemic size compared to the classical models, effectively addressing the common overestimation problem. Additionally, we infer mobility distributions from the time series of the infected population. We provide sufficient conditions for uniquely identifying the mobility distribution from a dataset and propose a machine-learning-based approach to learn mobility from both synthesized and real-world data.2024-06-17T18:13:57Z19 pages, 8 figuresNing JiangWeiqi ChuYao Lihttp://arxiv.org/abs/2409.02086v2Noise-free comparison of stochastic agent-based simulations using common random numbers2024-09-05T22:21:32ZRandom numbers are at the heart of every agent-based model (ABM) of health and disease. By representing each individual in a synthetic population, agent-based models enable detailed analysis of intervention impact and parameter sensitivity. Yet agent-based modeling has a fundamental signal-to-noise problem, in which small changes between simulations cannot be reliably differentiated from stochastic noise resulting from misaligned random number realizations. We introduce a novel methodology that eliminates noise due to misaligned random numbers, a first for agent-based modeling. Our approach enables meaningful individual-level analysis between ABM scenarios because all differences are driven by mechanistic effects rather than random number noise. We demonstrate the benefits of our approach on three disparate examples. Results consistently show reductions in the number of simulations required to achieve a given standard error with levels exceeding 10-fold for some applications.2024-09-03T17:39:38ZUpdated title, improved abstract, and changed formattingDaniel J. KleinRomesh G. AbeysuriyaRobyn M. StuartCliff C. Kerrhttp://arxiv.org/abs/2305.19890v4Element-wise and Recursive Solutions for the Power Spectral Density of Biological Stochastic Dynamical Systems at Fixed Points2024-09-05T18:59:40ZStochasticity plays a central role in nearly every biological process, and the noise power spectral density (PSD) is a critical tool for understanding variability and information processing in living systems. In steady-state, many such processes can be described by stochastic linear time-invariant (LTI) systems driven by Gaussian white noise, whose PSD is a complex rational function of the frequency that can be concisely expressed in terms of their Jacobian, dispersion, and diffusion matrices, fully defining the statistical properties of the system's dynamics at steady-state. Here, we arrive at compact element-wise solutions of the rational function coefficients for the auto- and cross-spectrum that enable the explicit analytical computation of the PSD in dimensions n=2,3,4. We further present a recursive Leverrier-Faddeev-type algorithm for the exact computation of the rational function coefficients. Crucially, both solutions are free of matrix inverses. We illustrate our element-wise and recursive solutions by considering the stochastic dynamics of neural systems models, namely Fitzhugh-Nagumo (n=2), Hindmarsh-Rose (n=3), Wilson-Cowan (n=4), and the Stabilized Supralinear Network (n=22), as well as an evolutionary game-theoretic model with mutations (n=5, 31). We extend our approach to derive a recursive method for calculating the coefficients in the power series expansion of the integrated covariance matrix for interacting spiking neurons modeled as Hawkes processes on arbitrary directed graphs.2023-05-31T14:25:02ZShivang RawatStefano Martiniani10.1103/PhysRevResearch.6.043179http://arxiv.org/abs/2409.03372v1Simple measures to capture the robustness and the plasticity of soil microbial communities2024-09-05T09:20:38ZSoil microbial communities are known to be robust against perturbations such as nutrition inputs, which appears as an obstacle for the soil improvement. On the other hand, its adaptable aspect has been also reported. Here we propose simple measures for these seemingly contradicting features of soil microbial communities, robustness and plasticity, based on the distribution of the populations. The first measure is the similarity in the population balance, i.e. the shape of the distribution function, which is found to show resilience against the nutrition inputs. The other is the similarity in the composition of the species measured by the rank order of the population, which shows an adaptable response during the population balance is recovering. These results clearly show that the soil microbial system is robust (or, homeostatic) in its population balance, while the composition of the species is rather plastic and adaptable.2024-09-05T09:20:38Z9 pages, 3 figures, with supplementary informationTakashi ShimadaKazumori MiseKai MorinoShigeto Otsukahttp://arxiv.org/abs/2409.03353v1Modelling the age distribution of longevity leaders2024-09-05T08:55:39ZHuman longevity leaders with remarkably long lifespan play a crucial role in the advancement of longevity research. In this paper, we propose a stochastic model to describe the evolution of the age of the oldest person in the world by a Markov process, in which we assume that the births of the individuals follow a Poisson process with increasing intensity, lifespans of individuals are independent and can be characterized by a gamma-Gompertz distribution with time-dependent parameters. We utilize a dataset of the world's oldest person title holders since 1955, and we compute the maximum likelihood estimate for the parameters iteratively by numerical integration. Based on our preliminary estimates, the model provides a good fit to the data and shows that the age of the oldest person alive increases over time in the future. The estimated parameters enable us to describe the distribution of the age of the record holder process at a future time point.2024-09-05T08:55:39Z14 pages, 5 figuresSci. Rep., 14 (2024), no. 20592Csaba KissLászló NémethBálint Vető10.1038/s41598-024-71444-whttp://arxiv.org/abs/2405.00333v2Ecosystem knowledge should replace coexistence and stability assumptions in ecological network modelling2024-09-05T05:45:27ZQuantitative population modelling is an invaluable tool for identifying the cascading effects of ecosystem management and interventions. Ecosystem models are often constructed by assuming stability and coexistence in ecological communities as a proxy for abundance data when monitoring programs are not available. However, a growing body of literature suggests that these assumptions are inappropriate for modelling conservation outcomes. In this work, we develop an alternative for dataless population modelling that instead relies on expert-elicited knowledge of species abundances. While time series abundance data is often not available for ecosystems of interest, these systems may still be highly studied or observed in an informal capacity. In particular, limits on population sizes and their capacity to rapidly change during an observation period can be reasonably elicited for many species. We propose a robust framework for generating an ensemble of ecosystem models whose population predictions match the expected population dynamics, as defined by experts. Our new Bayesian algorithm systematically removes model parameters that lead to unreasonable population predictions without incurring excessive computational costs. Our results demonstrate that models constructed using expert-elicited information, rather than stability and coexistence assumptions, can dramatically impact population predictions, expected responses to management, conservation decision-making, and long-term ecosystem behaviour. In the absence of data, we argue that field observations and expert knowledge are preferred for representing ecosystems observed in nature instead of theoretical assumptions of coexistence and stability.2024-05-01T05:52:15ZSarah A. VollertChristopher DrovandiMatthew P. Adams