https://arxiv.org/api/Mnb/NOw3aZFqakWxzc/akCnVnwA 2026-06-21T15:00:31Z 1596 660 15 http://arxiv.org/abs/2011.00485v1 Comparing Machine Learning Algorithms with or without Feature Extraction for DNA Classification 2020-11-01T12:04:54Z The classification of DNA sequences is a key research area in bioinformatics as it enables researchers to conduct genomic analysis and detect possible diseases. In this paper, three state-of-the-art algorithms, namely Convolutional Neural Networks, Deep Neural Networks, and N-gram Probabilistic Models, are used for the task of DNA classification. Furthermore, we introduce a novel feature extraction method based on the Levenshtein distance and randomly generated DNA sub-sequences to compute information-rich features from the DNA sequences. We also use an existing feature extraction method based on 3-grams to represent amino acids and combine both feature extraction methods with a multitude of machine learning algorithms. Four different data sets, each concerning viral diseases such as Covid-19, AIDS, Influenza, and Hepatitis C, are used for evaluating the different approaches. The results of the experiments show that all methods obtain high accuracies on the different DNA datasets. Furthermore, the domain-specific 3-gram feature extraction method leads in general to the best results in the experiments, while the newly proposed technique outperforms all other methods on the smallest Covid-19 dataset 2020-11-01T12:04:54Z 17 pages Xiangxie Zhang Ben Beinke Berlian Al Kindhi Marco Wiering http://arxiv.org/abs/2011.01877v1 Exploring the Synchrony Between Body Temperature and HR, RR, and Aortic Blood Pressure in Viral/Bacterial Disease Onsets with Signal Dynamics 2020-10-30T22:32:04Z Signal-based early detection of illnesses has been a key topic in research and hospital settings; it reduces technological costs and paves the way for quick and effective patient-care operations. Elementary machine learning and signal processing algorithms have proven to be sufficient in classifying the onset of viral and bacterial conditions before clinical symptoms are shown. Inspired by these recent developments, this project employs signal dynamics analysis to infer changes in vital signs (temperature, respiration, and heart rate). The results demonstrate that the trends of one vital function can be predicted from that of another. In particular, it is shown that heart rate and respiration typically change shortly after body temperature, and aortic blood pressure follows. This is not an etiologically specific approach, but if advanced further, it can enable patients and wearable system users to tame these changes and prevent immediate symptoms. 2020-10-30T22:32:04Z Camille Dunning http://arxiv.org/abs/2011.00002v1 Molecular Communications in Viral Infections Research: Modelling, Experimental Data and Future Directions 2020-10-30T15:18:35Z Hundreds of millions of people worldwide are affected by viral infections each year, and yet, several of them neither have vaccines nor effective treatment during and post-infection. This challenge has been highlighted by the COVID-19 pandemic, showing how viruses can quickly spread and how they can impact society as a whole. Novel techniques that bring in different disciplines must emerge to provide forward-looking strategies to combat viral infections, as well as possible future pandemics. In the past decade, an interdisciplinary area involving bioengineering, nanotechnology and information and communication technology (ICT) has been developing, known as Molecular Communications. This new emerging area uses elements of classical communication systems and maps it to molecular signalling and communication found inside and outside the body, where the aim is to develop new tools that can serve future medicine. In this paper, we provide an extensive and detailed discussion on how Molecular Communications can be integrated into the research on viral infectious diseases modelling, and how possible treatment and vaccines can be developed considering molecules as information carriers. We provide a literature review on the existing models of Molecular Communications for viral infection (in-body and out-body), a deep analysis on their effects on the host and subsequent communication process for other systems within the body (e.g., immune response), sources of experimental data on known viral infections and how it can be used by the Molecular Communications community, as well as open issues and future directions. Since the development of therapeutics/vaccines needs an interdisciplinary approach centred around ICT, we are confident that Molecular Communications can play a central role here by providing a detail characterisation and manipulation of the propagation of molecules in different media. 2020-10-30T15:18:35Z Submitted for journal publication Michael Taynnan Barros Mladen Veletić Masamitsu Kanada Massimiliano Pierobon Seppo Vainio Ilangko Balasingham Sasitharan Balasubramaniam http://arxiv.org/abs/2010.16154v1 The Human Cell Atlas & Equity: Lessons Learned 2020-10-30T09:50:29Z The Human Cell Atlas has been undergoing a massive effort to support global scientific equity. The co-leaders of its Equity Working Group share some lessons learned in the process. 2020-10-30T09:50:29Z Nature Medicine 26 (2020) 1509-1511; Partha P. Majumder Musa M. Mhlanga Alex K. Shalek 10.1038/s41591-020-1100-4 http://arxiv.org/abs/2011.01873v1 Self-Organized Networks: Darwinian Evolution of Myosin-1 2020-10-28T19:06:54Z Cytoskeletons are self-organized networks based on polymerized proteins: actin, tubulin, and driven by motor proteins, such as myosin, kinesin and dynein. Their positive Darwinian evolution enables them to approach optimized functionality (self-organized criticality). The principal features of the eukaryotic evolution of the cytoskeleton motor protein myosin-1 parallel those of actin and tubulin, but also show striking differences connected to its dynamical function. Optimized (long) hydropathic waves characterize the molecular level Darwinian evolution towards optimized functionality (self-organized criticality). The N-terminal and central domains of myosin-1 have evolved in eukaryotes at different rates, with the central domain hydropathic extrema being optimally active in humans. A test shows that hydropathic scaling can yield accuracies of better than 1% near optimized functionality. Evolution towards synchronized level extrema is connected to a special function of Mys-1 in humans involving Golgi complexes. 2020-10-28T19:06:54Z 20 pages, 9 figures J. C. Phillips http://arxiv.org/abs/1911.12363v3 Optimizing Energetic cost of Uncertainty in a Driven System With and Without Feedback 2020-10-27T16:25:31Z Many biological functions require the dynamics to be necessarily driven out-of-equilibrium. In contrast, in various contexts, a nonequilibrium dynamics at fast timescales can be described by an effective equilibrium dynamics at a slower timescale. In this work we study the two different aspects, (i) the energy-efficiency tradeoff for a specific nonequilibrium linear dynamics of two variables with feedback, and (ii) the cost of effective parameters in a coarse-grained theory as given by the "hidden" dissipation and entropy production rate in the effective equilibrium limit of the dynamics. To meaningfully discuss the tradeoff between energy consumption and the efficiency of the desired function, a one-to-one mapping between function(s) and energy input is required. The function considered in this work is the variance of one of the variables. We get a one-to-one mapping by considering the minimum variance obtained for a fixed entropy production rate and vice-versa. We find that this minimum achievable variance is a monotonically decreasing function of the given entropy production rate. When there is a timescale separation, in the effective equilibrium limit, the cost of the effective potential and temperature is the associated "hidden" entropy production rate. 2019-11-27T18:08:53Z 8 pages, 4 figures Phys. Rev. E 102, 052405 (2020) Amit Singh Vishen 10.1103/PhysRevE.102.052405 http://arxiv.org/abs/2010.12332v1 Old Drugs for JAK-STAT Pathway Inhibition in COVID-19 2020-10-23T13:09:16Z The pandemic threat of COVID-19 with more than 37 million cases in which about 5 percent entering critical stage characterized by cytokine storm and hyperinflammatory condition, the state more often leads to admission to intensive care unit with rapid mortality. Janus kinase enzymes of Jak-1, Jak-2, Jak-3, and Tyk2 seem to be good targets for inhibition by medications to control cytokine storm in this context. In the present work, the inhibitory properties of different analgesic drugs on these targets are studied to assess their ability for clinical application from different points of view. Our docking results indicated that naproxen, methadone, and amitriptyline considering their higher binding energy, lower energy variance, and higher hydrophobicity, seem to express more inhibitory effects on Janus kinase enzymes than thats for approved inhibitors i.e. baricitinib and ruxolitinib. Accordingly, we suggest our wide list of candidate drugs including indomethacin, etodolac, buprenorphine, rofecoxib, duloxetine, valdecoxib, naproxen, methadone, and amitriptilin for clinical assessments for their usefulness in COVID-19 treatment, especially taking into account that up to now, there is no approved cure for this disease. 2020-10-23T13:09:16Z Mohammad Reza Dayer 10.13140/RG.2.2.33735.73122 http://arxiv.org/abs/2010.10107v1 A Global View of Standards for Open Image Data Formats and Repositories 2020-10-20T08:00:19Z Biological and biomedical imaging datasets record the constitution, architecture and dynamics of living organisms across several orders of magnitude of space and time. Imaging technologies are now used throughout the life and biomedical sciences to achieve discovery and understanding of biological mechanisms in the basic sciences as well as assessment, diagnosis and therapeutic intervention in clinical trials and animal and human medicine. The universal application and use of imaging raises an important question and opportunity: what is the value and ultimate destination of biological and medical imaging data? In the last few years, several informatics and data science technologies have matured sufficiently so that routine publication of these datasets is now possible. Participants in Global BioImaging from 15 countries and all populated continents have agreed on the need for recommendations and guidelines for the establishment of image data repositories and the formats they use for delivering data to the global scientific community. This white paper presents a shared, global view of criteria for these common, globally applicable guidelines and provisional proposals for open tools and resources that are available now and can provide a foundation for future development. 2020-10-20T08:00:19Z Jason R. Swedlow Pasi Kankaanpää Ugis Sarkans Wojtek Goscinski Graham Galloway Ryan P. Sullivan Claire M. Brown Chris Wood Antje Keppler Ben Loos Sara Zullino Dario Livio Longo Silvio Aime Shuichi Onami http://arxiv.org/abs/2010.12067v1 A Calculation Model for Estimating Effect of COVID-19 Contact-Confirming Application (COCOA) on Decreasing Infectors 2020-10-17T09:32:39Z As of 2020, COVID-19 is spreading in the world. In Japan, the Ministry of Health, Labor and Welfare developed COVID-19 Contact-Confirming Application (COCOA). The researches to examine the effect of COCOA are still not sufficient. We develop a mathematical model to examine the effect of COCOA and show examined result. 2020-10-17T09:32:39Z 4 pages, 3 figures Mathematical Biosciences and Engineering, 2021, Volume 18, Issue 5, pp.6506-6526 Yuto Omae Jun Toyotani Kazuyuki Hara Yasuhiro Gon Hirotaka Takahashi 10.3934/mbe.2021323 http://arxiv.org/abs/2010.00957v1 Estimands in Hematologic Oncology Trials 2020-10-01T15:38:48Z The estimand framework included in the addendum to the ICH E9 guideline facilitates discussions to ensure alignment between the key question of interest, the analysis, and interpretation. Therapeutic knowledge and drug mechanism play a crucial role in determining the strategy and defining the estimand for clinical trial designs. Clinical trials in patients with hematological malignancies often present unique challenges for trial design due to complexity of treatment options and existence of potential curative but highly risky procedures, e.g. stem cell transplant or treatment sequence across different phases (induction, consolidation, maintenance). Here, we illustrate how to apply the estimand framework in hematological clinical trials and how the estimand framework can address potential difficulties in trial result interpretation. This paper is a result of a cross-industry collaboration to connect the International Conference on Harmonisation (ICH) E9 addendum concepts to applications. Three randomized phase 3 trials will be used to consider common challenges including intercurrent events in hematologic oncology trials to illustrate different scientific questions and the consequences of the estimand choice for trial design, data collection, analysis, and interpretation. Template language for describing estimand in both study protocols and statistical analysis plans is suggested for statisticians' reference. 2020-10-01T15:38:48Z 5 tables, 1 figure Pharm. Stat., 2021, 20, 793-805 Steven Sun Hans-Jochen Weber Emily Butler Kaspar Rufibach Satrajit Roychoudhury 10.1002/pst.2108 http://arxiv.org/abs/2009.13830v1 Unboxing mutations: Connecting mutation types with evolutionary consequences 2020-09-29T07:34:24Z Mutations are typically classified by their effects on the nucleotide sequence and by their size. Here, we argue that if our main aim is to understand the effect of mutations on evolutionary outcomes (such as adaptation or speciation), we need to instead consider their population genetic and genomic effects, from altering recombination rate to modifying chromatin. We start by reviewing known population genetic and genomic effects of different mutation types and connect these to the major evolutionary processes of drift and selection. We illustrate how mutation type can thus be linked with evolutionary outcomes and provide suggestions for further exploring and quantifying these relationships. This reframing lays a foundation for determining the evolutionary significance of different mutation types. 2020-09-29T07:34:24Z Emma L. Berdan Alexandre Blanckaert Tanja Slotte Alexander Suh Anja M. Westram Inês Fragata http://arxiv.org/abs/2009.09702v1 On the verge of life: Distribution of nucleotide sequences in viral RNAs 2020-09-21T09:17:23Z The aim of the study is to analyze viruses using parameters obtained from distributions of nucleotide sequences in the viral RNA. Seeking for the input data homogeneity, we analyze single-stranded RNA viruses only. Two approaches are used to obtain the nucleotide sequences; In the first one, chunks of equal length (four nucleotides) are considered. In the second approach, the whole RNA genome is divided into parts by adenine or the most frequent nucleotide as a "space". Rank--frequency distributions are studied in both cases. Within the first approach, the Pólya and the negative hypergeometric distribution yield the best fit. For the distributions obtained within the second approach, we have calculated a set of parameters, including entropy, mean sequence length, and its dispersion. The calculated parameters became the basis for the classification of viruses. We observed that proximity of viruses on planes spanned on various pairs of parameters corresponds to related species. In certain cases, such a proximity is observed for unrelated species as well calling thus for the expansion of the set of parameters used in the classification. We also observed that the fourth most frequent nucleotide sequences obtained within the second approach are of different nature in case of human coronaviruses (different nucleotides for MERS, SARS-CoV, and SARS-CoV-2 versus identical nucleotides for four other coronaviruses). We expect that our findings will be useful as a supplementary tool in the classification of diseases caused by RNA viruses with respect to severity and contagiousness. 2020-09-21T09:17:23Z Biosemiotics 14, No. 2, 253-269 (2021) Mykola Husev Andrij Rovenchak 10.1007/s12304-021-09403-5 http://arxiv.org/abs/2009.10646v1 A stable method for 4D CT-based CFD simulation in the right ventricle of a TGA patient 2020-09-20T10:45:07Z The paper discusses a stabilization of a finite element method for the equations of fluid motion in a time-dependent domain. After experimental convergence analysis, the method is applied to simulate a blood flow in the right ventricle of a post-surgery patient with the transposition of the great arteries disorder. The flow domain is reconstructed from a sequence of 4D CT images. The corresponding segmentation and triangulation algorithms are also addressed in brief. 2020-09-20T10:45:07Z to be published in Russian Journal of Numerical Analysis and Mathematical Modelling Alexander Danilov Yushui Han Chun H. Lin Alexander Lozovskiy Maxim A. Olshanskii Victoria Yu. Salamatova Yuri V. Vassilevski http://arxiv.org/abs/2009.09911v1 Are mouse and cat the missing link in the COVID-19 outbreaks in seafood markets? 2020-09-18T10:23:23Z Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus caused the novel coronavirus disease-2019 (COVID-19) affecting the whole world. Like SARS-CoV and MERS-CoV, SARS-CoV-2 are thought to originate in bats and then spread to humans through intermediate hosts. Identifying intermediate host species is critical to understanding the evolution and transmission mechanisms of COVID-19. However, determining which animals are intermediate hosts remains a key challenge. Virus host-genome similarity (HGS) is an important factor that reflects the adaptability of virus to host. SARS-CoV-2 may retain beneficial mutations to increase HGS and evade the host immune system. This study investigated the HGSs between 399 SARS-CoV-2 strains and 10 hosts of different species, including bat, mouse, cat, swine, snake, dog, pangolin, chicken, human and monkey. The results showed that the HGS between SARS-CoV-2 and bat was the highest, followed by mouse and cat. Human and monkey had the lowest HGS values. In terms of genetic similarity, mouse and monkey are halfway between bat and human. Moreover, given that COVID-19 outbreaks tend to be associated with live poultry and seafood markets, mouse and cat are more likely sources of infection in these places. However, more experimental data are needed to confirm whether mouse and cat are true intermediate hosts. These findings suggest that animals closely related to human life, especially those with high HGS, need to be closely monitored. 2020-09-18T10:23:23Z Daniel H. Tao Weitao Sun http://arxiv.org/abs/2007.05410v3 A Physics Modeling Study of SARS-CoV-2 Transport in Air 2020-09-15T14:22:34Z The health threat from SARS-CoV-2 airborne infection has become a public emergency of international concern. During the ongoing coronavirus pandemic, people have been advised by the Centers for Disease Control and Prevention to maintain social distancing of at least 2 m to limit the risk of exposure to the coronavirus. Experimental data, however, show that infected aerosols and droplets trapped inside a turbulent puff cloud can travel up to 7 to 8 m. We propose a nuclear physics analogy-based modeling of the complex gas cloud and its payload of pathogen-virions. We show that the cloud stopping range is proportional to the product of the puff's diameter and its density. We use our puff model to determine the average density of the buoyant fluid in the turbulent cloud. A fit to the experimental data yields $1.8 < ρ_P/ρ_{\rm air} < 4.0$, where $ρ_P$ and $ρ_{\rm air}$ are the average density of the puff and the air. We demonstrate that temperature variation could cause an ${\cal O}(\pm 8\%)$ effect in the puff stopping range for extreme ambient cold or warmth. We also demonstrate that aerosols and droplets can remain suspended for hours in the air. Therefore, once the puff slows down sufficiently, and its coherence is lost, the eventual spreading of the infected aerosols becomes dependent on the ambient air currents and turbulence. 2020-07-10T14:29:16Z 6 pages, 1 figure SciMedJ 2 (2020) 83-91 Luis A. Anchordoqui James B. Dent Thomas J. Weiler 10.28991/SciMedJ-2020-02-SI-7