https://arxiv.org/api/Mnb/NOw3aZFqakWxzc/akCnVnwA2026-06-21T15:00:31Z159666015http://arxiv.org/abs/2011.00485v1Comparing Machine Learning Algorithms with or without Feature Extraction for DNA Classification2020-11-01T12:04:54ZThe classification of DNA sequences is a key research area in bioinformatics as it enables researchers to conduct genomic analysis and detect possible diseases. In this paper, three state-of-the-art algorithms, namely Convolutional Neural Networks, Deep Neural Networks, and N-gram Probabilistic Models, are used for the task of DNA classification. Furthermore, we introduce a novel feature extraction method based on the Levenshtein distance and randomly generated DNA sub-sequences to compute information-rich features from the DNA sequences. We also use an existing feature extraction method based on 3-grams to represent amino acids and combine both feature extraction methods with a multitude of machine learning algorithms. Four different data sets, each concerning viral diseases such as Covid-19, AIDS, Influenza, and Hepatitis C, are used for evaluating the different approaches. The results of the experiments show that all methods obtain high accuracies on the different DNA datasets. Furthermore, the domain-specific 3-gram feature extraction method leads in general to the best results in the experiments, while the newly proposed technique outperforms all other methods on the smallest Covid-19 dataset2020-11-01T12:04:54Z17 pagesXiangxie ZhangBen BeinkeBerlian Al KindhiMarco Wieringhttp://arxiv.org/abs/2011.01877v1Exploring the Synchrony Between Body Temperature and HR, RR, and Aortic Blood Pressure in Viral/Bacterial Disease Onsets with Signal Dynamics2020-10-30T22:32:04ZSignal-based early detection of illnesses has been a key topic in research and hospital settings; it reduces technological costs and paves the way for quick and effective patient-care operations. Elementary machine learning and signal processing algorithms have proven to be sufficient in classifying the onset of viral and bacterial conditions before clinical symptoms are shown. Inspired by these recent developments, this project employs signal dynamics analysis to infer changes in vital signs (temperature, respiration, and heart rate). The results demonstrate that the trends of one vital function can be predicted from that of another. In particular, it is shown that heart rate and respiration typically change shortly after body temperature, and aortic blood pressure follows. This is not an etiologically specific approach, but if advanced further, it can enable patients and wearable system users to tame these changes and prevent immediate symptoms.2020-10-30T22:32:04ZCamille Dunninghttp://arxiv.org/abs/2011.00002v1Molecular Communications in Viral Infections Research: Modelling, Experimental Data and Future Directions2020-10-30T15:18:35ZHundreds of millions of people worldwide are affected by viral infections each year, and yet, several of them neither have vaccines nor effective treatment during and post-infection. This challenge has been highlighted by the COVID-19 pandemic, showing how viruses can quickly spread and how they can impact society as a whole. Novel techniques that bring in different disciplines must emerge to provide forward-looking strategies to combat viral infections, as well as possible future pandemics. In the past decade, an interdisciplinary area involving bioengineering, nanotechnology and information and communication technology (ICT) has been developing, known as Molecular Communications. This new emerging area uses elements of classical communication systems and maps it to molecular signalling and communication found inside and outside the body, where the aim is to develop new tools that can serve future medicine. In this paper, we provide an extensive and detailed discussion on how Molecular Communications can be integrated into the research on viral infectious diseases modelling, and how possible treatment and vaccines can be developed considering molecules as information carriers. We provide a literature review on the existing models of Molecular Communications for viral infection (in-body and out-body), a deep analysis on their effects on the host and subsequent communication process for other systems within the body (e.g., immune response), sources of experimental data on known viral infections and how it can be used by the Molecular Communications community, as well as open issues and future directions. Since the development of therapeutics/vaccines needs an interdisciplinary approach centred around ICT, we are confident that Molecular Communications can play a central role here by providing a detail characterisation and manipulation of the propagation of molecules in different media.2020-10-30T15:18:35ZSubmitted for journal publicationMichael Taynnan BarrosMladen VeletićMasamitsu KanadaMassimiliano PierobonSeppo VainioIlangko BalasinghamSasitharan Balasubramaniamhttp://arxiv.org/abs/2010.16154v1The Human Cell Atlas & Equity: Lessons Learned2020-10-30T09:50:29ZThe Human Cell Atlas has been undergoing a massive effort to support global scientific equity. The co-leaders of its Equity Working Group share some lessons learned in the process.2020-10-30T09:50:29ZNature Medicine 26 (2020) 1509-1511;Partha P. MajumderMusa M. MhlangaAlex K. Shalek10.1038/s41591-020-1100-4http://arxiv.org/abs/2011.01873v1Self-Organized Networks: Darwinian Evolution of Myosin-12020-10-28T19:06:54ZCytoskeletons are self-organized networks based on polymerized proteins: actin, tubulin, and driven by motor proteins, such as myosin, kinesin and dynein. Their positive Darwinian evolution enables them to approach optimized functionality (self-organized criticality). The principal features of the eukaryotic evolution of the cytoskeleton motor protein myosin-1 parallel those of actin and tubulin, but also show striking differences connected to its dynamical function. Optimized (long) hydropathic waves characterize the molecular level Darwinian evolution towards optimized functionality (self-organized criticality). The N-terminal and central domains of myosin-1 have evolved in eukaryotes at different rates, with the central domain hydropathic extrema being optimally active in humans. A test shows that hydropathic scaling can yield accuracies of better than 1% near optimized functionality. Evolution towards synchronized level extrema is connected to a special function of Mys-1 in humans involving Golgi complexes.2020-10-28T19:06:54Z20 pages, 9 figuresJ. C. Phillipshttp://arxiv.org/abs/1911.12363v3Optimizing Energetic cost of Uncertainty in a Driven System With and Without Feedback2020-10-27T16:25:31ZMany biological functions require the dynamics to be necessarily driven out-of-equilibrium. In contrast, in various contexts, a nonequilibrium dynamics at fast timescales can be described by an effective equilibrium dynamics at a slower timescale. In this work we study the two different aspects, (i) the energy-efficiency tradeoff for a specific nonequilibrium linear dynamics of two variables with feedback, and (ii) the cost of effective parameters in a coarse-grained theory as given by the "hidden" dissipation and entropy production rate in the effective equilibrium limit of the dynamics. To meaningfully discuss the tradeoff between energy consumption and the efficiency of the desired function, a one-to-one mapping between function(s) and energy input is required. The function considered in this work is the variance of one of the variables. We get a one-to-one mapping by considering the minimum variance obtained for a fixed entropy production rate and vice-versa. We find that this minimum achievable variance is a monotonically decreasing function of the given entropy production rate. When there is a timescale separation, in the effective equilibrium limit, the cost of the effective potential and temperature is the associated "hidden" entropy production rate.2019-11-27T18:08:53Z8 pages, 4 figuresPhys. Rev. E 102, 052405 (2020)Amit Singh Vishen10.1103/PhysRevE.102.052405http://arxiv.org/abs/2010.12332v1Old Drugs for JAK-STAT Pathway Inhibition in COVID-192020-10-23T13:09:16ZThe pandemic threat of COVID-19 with more than 37 million cases in which about 5 percent entering critical stage characterized by cytokine storm and hyperinflammatory condition, the state more often leads to admission to intensive care unit with rapid mortality. Janus kinase enzymes of Jak-1, Jak-2, Jak-3, and Tyk2 seem to be good targets for inhibition by medications to control cytokine storm in this context. In the present work, the inhibitory properties of different analgesic drugs on these targets are studied to assess their ability for clinical application from different points of view. Our docking results indicated that naproxen, methadone, and amitriptyline considering their higher binding energy, lower energy variance, and higher hydrophobicity, seem to express more inhibitory effects on Janus kinase enzymes than thats for approved inhibitors i.e. baricitinib and ruxolitinib. Accordingly, we suggest our wide list of candidate drugs including indomethacin, etodolac, buprenorphine, rofecoxib, duloxetine, valdecoxib, naproxen, methadone, and amitriptilin for clinical assessments for their usefulness in COVID-19 treatment, especially taking into account that up to now, there is no approved cure for this disease.2020-10-23T13:09:16ZMohammad Reza Dayer10.13140/RG.2.2.33735.73122http://arxiv.org/abs/2010.10107v1A Global View of Standards for Open Image Data Formats and Repositories2020-10-20T08:00:19ZBiological and biomedical imaging datasets record the constitution, architecture and dynamics of living organisms across several orders of magnitude of space and time. Imaging technologies are now used throughout the life and biomedical sciences to achieve discovery and understanding of biological mechanisms in the basic sciences as well as assessment, diagnosis and therapeutic intervention in clinical trials and animal and human medicine. The universal application and use of imaging raises an important question and opportunity: what is the value and ultimate destination of biological and medical imaging data? In the last few years, several informatics and data science technologies have matured sufficiently so that routine publication of these datasets is now possible. Participants in Global BioImaging from 15 countries and all populated continents have agreed on the need for recommendations and guidelines for the establishment of image data repositories and the formats they use for delivering data to the global scientific community. This white paper presents a shared, global view of criteria for these common, globally applicable guidelines and provisional proposals for open tools and resources that are available now and can provide a foundation for future development.2020-10-20T08:00:19ZJason R. SwedlowPasi KankaanpääUgis SarkansWojtek GoscinskiGraham GallowayRyan P. SullivanClaire M. BrownChris WoodAntje KepplerBen LoosSara ZullinoDario Livio LongoSilvio AimeShuichi Onamihttp://arxiv.org/abs/2010.12067v1A Calculation Model for Estimating Effect of COVID-19 Contact-Confirming Application (COCOA) on Decreasing Infectors2020-10-17T09:32:39ZAs of 2020, COVID-19 is spreading in the world. In Japan, the Ministry of Health, Labor and Welfare developed COVID-19 Contact-Confirming Application (COCOA). The researches to examine the effect of COCOA are still not sufficient. We develop a mathematical model to examine the effect of COCOA and show examined result.2020-10-17T09:32:39Z4 pages, 3 figuresMathematical Biosciences and Engineering, 2021, Volume 18, Issue 5, pp.6506-6526Yuto OmaeJun ToyotaniKazuyuki HaraYasuhiro GonHirotaka Takahashi10.3934/mbe.2021323http://arxiv.org/abs/2010.00957v1Estimands in Hematologic Oncology Trials2020-10-01T15:38:48ZThe estimand framework included in the addendum to the ICH E9 guideline facilitates discussions to ensure alignment between the key question of interest, the analysis, and interpretation. Therapeutic knowledge and drug mechanism play a crucial role in determining the strategy and defining the estimand for clinical trial designs. Clinical trials in patients with hematological malignancies often present unique challenges for trial design due to complexity of treatment options and existence of potential curative but highly risky procedures, e.g. stem cell transplant or treatment sequence across different phases (induction, consolidation, maintenance). Here, we illustrate how to apply the estimand framework in hematological clinical trials and how the estimand framework can address potential difficulties in trial result interpretation.
This paper is a result of a cross-industry collaboration to connect the International Conference on Harmonisation (ICH) E9 addendum concepts to applications. Three randomized phase 3 trials will be used to consider common challenges including intercurrent events in hematologic oncology trials to illustrate different scientific questions and the consequences of the estimand choice for trial design, data collection, analysis, and interpretation. Template language for describing estimand in both study protocols and statistical analysis plans is suggested for statisticians' reference.2020-10-01T15:38:48Z5 tables, 1 figurePharm. Stat., 2021, 20, 793-805Steven SunHans-Jochen WeberEmily ButlerKaspar RufibachSatrajit Roychoudhury10.1002/pst.2108http://arxiv.org/abs/2009.13830v1Unboxing mutations: Connecting mutation types with evolutionary consequences2020-09-29T07:34:24ZMutations are typically classified by their effects on the nucleotide sequence and by their size. Here, we argue that if our main aim is to understand the effect of mutations on evolutionary outcomes (such as adaptation or speciation), we need to instead consider their population genetic and genomic effects, from altering recombination rate to modifying chromatin. We start by reviewing known population genetic and genomic effects of different mutation types and connect these to the major evolutionary processes of drift and selection. We illustrate how mutation type can thus be linked with evolutionary outcomes and provide suggestions for further exploring and quantifying these relationships. This reframing lays a foundation for determining the evolutionary significance of different mutation types.2020-09-29T07:34:24ZEmma L. BerdanAlexandre BlanckaertTanja SlotteAlexander SuhAnja M. WestramInês Fragatahttp://arxiv.org/abs/2009.09702v1On the verge of life: Distribution of nucleotide sequences in viral RNAs2020-09-21T09:17:23ZThe aim of the study is to analyze viruses using parameters obtained from distributions of nucleotide sequences in the viral RNA. Seeking for the input data homogeneity, we analyze single-stranded RNA viruses only. Two approaches are used to obtain the nucleotide sequences; In the first one, chunks of equal length (four nucleotides) are considered. In the second approach, the whole RNA genome is divided into parts by adenine or the most frequent nucleotide as a "space". Rank--frequency distributions are studied in both cases. Within the first approach, the Pólya and the negative hypergeometric distribution yield the best fit. For the distributions obtained within the second approach, we have calculated a set of parameters, including entropy, mean sequence length, and its dispersion. The calculated parameters became the basis for the classification of viruses. We observed that proximity of viruses on planes spanned on various pairs of parameters corresponds to related species. In certain cases, such a proximity is observed for unrelated species as well calling thus for the expansion of the set of parameters used in the classification. We also observed that the fourth most frequent nucleotide sequences obtained within the second approach are of different nature in case of human coronaviruses (different nucleotides for MERS, SARS-CoV, and SARS-CoV-2 versus identical nucleotides for four other coronaviruses). We expect that our findings will be useful as a supplementary tool in the classification of diseases caused by RNA viruses with respect to severity and contagiousness.2020-09-21T09:17:23ZBiosemiotics 14, No. 2, 253-269 (2021)Mykola HusevAndrij Rovenchak10.1007/s12304-021-09403-5http://arxiv.org/abs/2009.10646v1A stable method for 4D CT-based CFD simulation in the right ventricle of a TGA patient2020-09-20T10:45:07ZThe paper discusses a stabilization of a finite element method for the equations of fluid motion in a time-dependent domain. After experimental convergence analysis, the method is applied to simulate a blood flow in the right ventricle of a post-surgery patient with the transposition of the great arteries disorder. The flow domain is reconstructed from a sequence of 4D CT images. The corresponding segmentation and triangulation algorithms are also addressed in brief.2020-09-20T10:45:07Zto be published in Russian Journal of Numerical Analysis and Mathematical ModellingAlexander DanilovYushui HanChun H. LinAlexander LozovskiyMaxim A. OlshanskiiVictoria Yu. SalamatovaYuri V. Vassilevskihttp://arxiv.org/abs/2009.09911v1Are mouse and cat the missing link in the COVID-19 outbreaks in seafood markets?2020-09-18T10:23:23ZSevere acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus caused the novel coronavirus disease-2019 (COVID-19) affecting the whole world. Like SARS-CoV and MERS-CoV, SARS-CoV-2 are thought to originate in bats and then spread to humans through intermediate hosts. Identifying intermediate host species is critical to understanding the evolution and transmission mechanisms of COVID-19. However, determining which animals are intermediate hosts remains a key challenge. Virus host-genome similarity (HGS) is an important factor that reflects the adaptability of virus to host. SARS-CoV-2 may retain beneficial mutations to increase HGS and evade the host immune system. This study investigated the HGSs between 399 SARS-CoV-2 strains and 10 hosts of different species, including bat, mouse, cat, swine, snake, dog, pangolin, chicken, human and monkey. The results showed that the HGS between SARS-CoV-2 and bat was the highest, followed by mouse and cat. Human and monkey had the lowest HGS values. In terms of genetic similarity, mouse and monkey are halfway between bat and human. Moreover, given that COVID-19 outbreaks tend to be associated with live poultry and seafood markets, mouse and cat are more likely sources of infection in these places. However, more experimental data are needed to confirm whether mouse and cat are true intermediate hosts. These findings suggest that animals closely related to human life, especially those with high HGS, need to be closely monitored.2020-09-18T10:23:23ZDaniel H. TaoWeitao Sunhttp://arxiv.org/abs/2007.05410v3A Physics Modeling Study of SARS-CoV-2 Transport in Air2020-09-15T14:22:34ZThe health threat from SARS-CoV-2 airborne infection has become a public emergency of international concern. During the ongoing coronavirus pandemic, people have been advised by the Centers for Disease Control and Prevention to maintain social distancing of at least 2 m to limit the risk of exposure to the coronavirus. Experimental data, however, show that infected aerosols and droplets trapped inside a turbulent puff cloud can travel up to 7 to 8 m. We propose a nuclear physics analogy-based modeling of the complex gas cloud and its payload of pathogen-virions. We show that the cloud stopping range is proportional to the product of the puff's diameter and its density. We use our puff model to determine the average density of the buoyant fluid in the turbulent cloud. A fit to the experimental data yields $1.8 < ρ_P/ρ_{\rm air} < 4.0$, where $ρ_P$ and $ρ_{\rm air}$ are the average density of the puff and the air. We demonstrate that temperature variation could cause an ${\cal O}(\pm 8\%)$ effect in the puff stopping range for extreme ambient cold or warmth. We also demonstrate that aerosols and droplets can remain suspended for hours in the air. Therefore, once the puff slows down sufficiently, and its coherence is lost, the eventual spreading of the infected aerosols becomes dependent on the ambient air currents and turbulence.2020-07-10T14:29:16Z6 pages, 1 figureSciMedJ 2 (2020) 83-91Luis A. AnchordoquiJames B. DentThomas J. Weiler10.28991/SciMedJ-2020-02-SI-7