http://arxiv.org/abs/2307.14790v1 Decoding Microbial Enigmas: Unleashing the Power of Artificial Intelligence in Analyzing Antibiotic-Resistant Pathogens and their Impact on Human Health 2023-07-27T11:42:48Z

In this research, medical information from 1200 patients across various hospitals in Iraq was collected over a period of 3 years, from February 3, 2018, to March 5, 2021. The study encompassed several infections, including urinary tract infections, wound infections, tonsillitis, prostatitis, endometritis, endometrial lining infections, burn infections, pneumonia, and bloodstream infections in children. Multiple bacterial pathogens were identified, and their resistance to various antibiotics was recorded. The data analysis revealed significant patterns of antibiotic resistance among the identified bacterial pathogens. Resistance was observed to several commonly used antibiotics, highlighting the emerging challenge of antimicrobial resistance in Iraq. These findings underscore the importance of implementing effective antimicrobial stewardship programs and infection control measures in healthcare settings to mitigate the spread of antibiotic-resistant infections and ensure optimal patient outcomes. This study contributes valuable insights into the prevalence and patterns of antibiotic resistance in microbial infections, which can guide healthcare practitioners and policymakers in formulating targeted interventions to combat the growing threat of antimicrobial resistance in Iraq's healthcare landscape.

2023-07-27T11:42:48Z 11 pages, 2 figures Maitham G. Yousif http://arxiv.org/abs/2307.13708v1 The Impact of Genomic Variation on Function (IGVF) Consortium 2023-07-24T20:51:25Z

Our genomes influence nearly every aspect of human biology from molecular and cellular functions to phenotypes in health and disease. Human genetics studies have now associated hundreds of thousands of differences in our DNA sequence ("genomic variation") with disease risk and other phenotypes, many of which could reveal novel mechanisms of human biology and uncover the basis of genetic predispositions to diseases, thereby guiding the development of new diagnostics and therapeutics. Yet, understanding how genomic variation alters genome function to influence phenotype has proven challenging. To unlock these insights, we need a systematic and comprehensive catalog of genome function and the molecular and cellular effects of genomic variants. Toward this goal, the Impact of Genomic Variation on Function (IGVF) Consortium will combine approaches in single-cell mapping, genomic perturbations, and predictive modeling to investigate the relationships among genomic variation, genome function, and phenotypes. Through systematic comparisons and benchmarking of experimental and computational methods, we aim to create maps across hundreds of cell types and states describing how coding variants alter protein activity, how noncoding variants change the regulation of gene expression, and how both coding and noncoding variants may connect through gene regulatory and protein interaction networks. These experimental data, computational predictions, and accompanying standards and pipelines will be integrated into an open resource that will catalyze community efforts to explore genome function and the impact of genetic variation on human biology and disease across populations.

2023-07-24T20:51:25Z Draft Marker Paper for the Impact of Genomic Variation on Function (IGVF) Consortium (https://www.igvf.org). Detailed author list (members of the IGVF Consortium) is included in the manuscript IGVF Consortium http://arxiv.org/abs/2307.11109v1 Influence of phytohormones on seed germination of Solanum linnaeanum 2023-07-20T07:53:38Z

The aim of this study was to determine the germination ability and seedling growth of the apple of Sodom by soaking in water, gibberellin (GA3), naphthylacetic acid (NAA), and salicylic acid (SA), separately. The findings showed that NAA at 50 mg L⁻¹ produced superior germination (77.78%), germination speed (1.43 seeds/time interval), hypocotyl length (1.01 cm), hypocotyl diameter (1.13 mm), leaf number (2.66), and root number (17.25), followed by 50 and 100 mg L⁻¹ GA3, particularly in germination percentage. The best root length (5.33 cm) was detected at 100 mg L⁻¹ SA. In contrast, control seeds and water-soaked seeds showed inferior results. The seeds of the apple of Sodom can be germinated successfully as a result of treatment with NAA at 50 mg L⁻¹, followed by GA3 at 50 and 100 mg L⁻¹.

2023-07-20T07:53:38Z Aram Akram Mohammed Haidar Anwar Arkwazee Ayub Karim Mahmood Hemn Abdalla Mustafa Hawar Sleman Halshoy Salam Mahmud Sulaiman Jalal Hamasalih Ismael Nawroz Abdul-razzak Tahir http://arxiv.org/abs/2308.04500v1 Predicting Pathogenicity Of nsSNPs Associated With Rb1 -- An In Silico Approach 2023-07-16T18:55:30Z

Single nucleotide polymorphisms (SNPs) are variations at specific locations in a DNA sequence; they serve as markers for genes associated with disease and for tracking inherited diseases within a family. Such variations in the Rb1 gene can cause retinoblastoma, a cancer of the retina in one or both eyes, as well as osteosarcoma, melanoma, leukemias, and lung and breast cancer. First, the SNP database (dbSNP) hosted by NCBI was used to extract the principal data. The association of Rb1 with other genes was analyzed with GeneMANIA. Ten different computational tools, including SIFT, PolyPhen-2, I-Mutant 3.0, PROVEAN, SNAP2, PhD-SNP, PMut, and SNPs&GO, were used to screen for damaging SNPs. The ConSurf server was used to estimate conserved regions of amino acids, and, to evaluate the structural stability of both native and mutant proteins, Project HOPE was used to examine the structural effects of the mutant protein. GeneMANIA predicted that the RB1 gene has a strong association with 20 other genes, such as CCND1 and RBP2. According to data retrieved from dbSNP, the Rb1 gene probed in this study carried a total of 36,358 SNPs: 345 in the 3'UTR, 65 in the 5'UTR, and 34,543 in intronic regions. Another 844 were coding SNPs, of which 199 were synonymous and 450 were non-synonymous, including 425 missense, five nonsense, and 20 frameshift mutations. All remaining SNPs are of other types. We took the 425 missense SNPs for our investigation. A total of 17 mutations (D332G, R445Q, E492V, P515T, W516G, V531G, E533K, E539K, M558R, W563G, L657Q, A658T, R661Q, D697H, D697E, P796L, and R798W) were predicted to have damaging effects on the structure and function of the Rb1 protein.
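The SNP counts reported in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check on the published numbers, not part of the authors' pipeline; the variable names and the `remainder` helper are illustrative assumptions, and the abstract does not say whether the "other types" remainder refers to the coding SNPs or to the full set, so both are computed.

```python
# Counts copied from the abstract; names below are illustrative, not the authors'.
total_snps = 36_358
by_region = {"3'UTR": 345, "5'UTR": 65, "intron": 34_543, "coding": 844}
synonymous = 199
non_synonymous = {"missense": 425, "nonsense": 5, "frameshift": 20}


def remainder(total, parts):
    """Return what is left of `total` after subtracting every value in `parts`."""
    return total - sum(parts)


# The three non-synonymous classes should add up to the reported 450.
nonsyn_total = sum(non_synonymous.values())

# Coding SNPs not accounted for by the synonymous / non-synonymous split.
coding_unclassified = remainder(by_region["coding"], [synonymous, nonsyn_total])

# SNPs not assigned to any of the four listed regions.
other_region = remainder(total_snps, by_region.values())

print(nonsyn_total, coding_unclassified, other_region)
```

The missense, nonsense, and frameshift counts do sum to the stated 450 non-synonymous SNPs; the two remainders are simply what the reported figures leave unassigned.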

2023-07-16T18:55:30Z Anum Munir http://arxiv.org/abs/2307.03934v1 Better Research Software Tools to Elevate the Rate of Scientific Discovery -- or why we need to invest in research software engineering 2023-07-08T08:46:14Z

In the past decade, enormous progress has been made in advancing the state-of-the-art in bioimage analysis - a young computational field that works in close collaboration with the life sciences on the quantitative analysis of scientific image data. In many cases, tremendous effort has been spent to package these new advances into usable software tools and, as a result, users can nowadays routinely apply cutting-edge methods to their analysis problems using software tools such as ilastik [1], cellprofiler [2], Fiji/ImageJ2 [3,4] and its many modern plugins that build on the BigDataViewer ecosystem [5], and many others. Such software tools have now become part of a critical infrastructure for science [6]. Unfortunately, overshadowed by the few exceptions that have had long-lasting impact, many other potentially useful tools fail to find their way into the hands of users. While there are many reasons for this, we believe that at least some of the underlying problems, which we discuss in more detail below, can be mitigated. In this opinion piece, we specifically argue that embedding teams of research software engineers (RSEs) within imaging and image analysis core facilities would be a major step towards sustainable bioimage analysis software.

2023-07-08T08:46:14Z 8 pages, 0 figures Joran Deschamps Damian Dalle Nogare Florian Jug http://arxiv.org/abs/2307.02502v1 Math Agents: Computational Infrastructure, Mathematical Embedding, and Genomics 2023-07-04T20:16:32Z

The advancement in generative AI could be boosted with more accessible mathematics. Beyond human-AI chat, large language models (LLMs) are emerging in programming, algorithm discovery, and theorem proving, yet their genomics application is limited. This project introduces Math Agents and mathematical embedding as fresh entries to the "Moore's Law of Mathematics", using a GPT-based workflow to convert equations from literature into LaTeX and Python formats. While many digital equation representations exist, there's a lack of automated large-scale evaluation tools. LLMs are pivotal as linguistic user interfaces, providing natural language access for human-AI chat and formal languages for large-scale AI-assisted computational infrastructure. Given the infinite formal possibility spaces, Math Agents, which interact with math, could potentially shift us from "big data" to "big math". Math, unlike the more flexible natural language, has properties subject to proof, enabling its use beyond traditional applications like high-validation math-certified icons for AI alignment aims. This project aims to use Math Agents and mathematical embeddings to address the ageing issue in information systems biology by applying multiscalar physics mathematics to disease models and genomic data. Generative AI with episodic memory could help analyse causal relations in longitudinal health records, using SIR Precision Health models. Genomic data is suggested for addressing the unsolved Alzheimer's disease problem.

2023-07-04T20:16:32Z Melanie Swan Takashi Kido Eric Roland Renato P. dos Santos http://arxiv.org/abs/2307.00036v1 Machine learning for potion development at Hogwarts 2023-06-30T08:47:27Z

Objective: To determine whether machine learning methods can generate useful potion recipes for research and teaching at Hogwarts School of Witchcraft and Wizardry. Design: Using deep neural networks to classify generated recipes into a standard drug classification system. Setting: Hogwarts School of Witchcraft and Wizardry. Data sources: 72 potion recipes from the Hogwarts curriculum, extracted from the Harry Potter Wiki. Results: Most generated recipes fall into the categories of psychoanaleptics and dermatologicals. The number of recipes predicted for each category reflected the number of training recipes. Predicted probabilities were often above 90% but some recipes were classified into 2 or more categories with similar probabilities which complicates anticipating the predicted effects. Conclusions: Machine learning powered methods are able to generate potentially useful potion recipes for teaching and research at Hogwarts. This corresponds to similar efforts in the non-magical world where such methods have been applied to identify potentially effective drug combinations.

2023-06-30T08:47:27Z Christoph F. Kurz Adriana N. König http://arxiv.org/abs/2307.01210v1 AI and Non AI Assessments for Dementia 2023-06-30T03:28:47Z

Progress in artificial intelligence has led to the development of various types of AI-powered dementia assessments, which can be employed to identify patients at an early stage of dementia and could revolutionize dementia care settings. It is essential that the medical community be aware of the various AI assessments and choose among them according to their validity, efficiency, practicality, reliability, and accuracy in the early identification of patients with dementia (PwD). On the other hand, AI developers should be informed about the various non-AI assessments as well as recently developed AI assessments. Thus, this paper, written to be readable by both clinicians and AI engineers, fills a gap in the literature by explaining the existing solutions for the recognition of dementia to clinicians, and the techniques used and the most widespread dementia datasets to AI engineers. It reviews papers on AI and non-AI assessments for dementia to provide valuable information about various dementia assessments for both the AI and medical communities. The discussion and conclusion highlight the most prominent research directions and the maturity of existing solutions.

2023-06-30T03:28:47Z 49 pages Mahboobeh Parsapoor Mah Parsa Hamed Ghodrati Vincenzo Dentamaro Christopher R. Madan Ioulietta Lazarou Spiros Nikolopoulos Ioannis Kompatsiaris http://arxiv.org/abs/2306.15113v1 Minimum information and guidelines for reporting a Multiplexed Assay of Variant Effect 2023-06-26T23:43:03Z

Multiplexed Assays of Variant Effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines has led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.

2023-06-26T23:43:03Z Melina Claussnitzer Victoria N. Parikh Alex H. Wagner Jeremy A. Arbesfeld Carol J. Bult Helen V. Firth Lara A. Muffley Alex N. Nguyen Ba Kevin Riehle Frederick P. Roth Daniel Tabet Benedetta Bolognesi Andrew M. Glazer Alan F. Rubin http://arxiv.org/abs/2306.06699v1 Adapting to the Impact of AI in Scientific Writing: Balancing Benefits and Drawbacks while Developing Policies and Regulations 2023-06-11T15:06:55Z

This article examines the advantages and disadvantages of Large Language Models (LLMs) and Artificial Intelligence (AI) in research and education and proposes the urgent need for an international statement to guide their responsible use. LLMs and AI demonstrate remarkable natural language processing, data analysis, and decision-making capabilities, offering potential benefits such as improved efficiency and transformative solutions. However, concerns regarding ethical considerations, bias, fake publications, and malicious use also arise. The objectives of this paper are to critically evaluate the utility of LLMs and AI in research and education, call for discussions between stakeholders, and discuss the need for an international statement. We identify advantages such as data processing, task automation, and personalized experiences, alongside disadvantages like bias reinforcement, interpretability challenges, inaccurate reporting, and plagiarism. Stakeholders from academia, industry, government, and civil society must engage in open discussions to address the ethical, legal, and societal implications. The proposed international statement should emphasize transparency, accountability, ongoing research, and risk mitigation. Monitoring, evaluation, user education, and awareness are essential components. By fostering discussions and establishing guidelines, we can ensure the responsible and ethical development and use of LLMs and AI, maximizing benefits while minimizing risks.

2023-06-11T15:06:55Z 2 Figure, (in press) Journal of Nature and Science of Medicine 2023, Volume 6, Issue 3 Ahmed S. BaHammam Khaled Trabelsi Seithikurippu R. Pandi-Perumal Hiatham Jahrami http://arxiv.org/abs/2306.06580v1 Unlocking the Power of Health Datasets and Registries: The Need for Urgent Institutional and National Ownership and Governance Regulations for Research Advancement 2023-06-11T04:01:42Z

Health datasets have immense potential to drive research advancements and improve healthcare outcomes. However, realizing this potential requires careful consideration of governance and ownership frameworks. This article explores the importance of nurturing governance and ownership models that facilitate responsible and ethical use of health datasets for research purposes. We highlight the importance of adopting governance and ownership models that enable responsible and ethical utilization of health datasets and clinical data registries for research purposes. The article addresses the important local and international regulations related to the utilization of health data/medical records in research, and emphasizes the urgent need for developing clear institutional and national guidelines on data access, sharing, and utilization, ensuring transparency, privacy, and data protection. By establishing robust governance structures and fostering ownership among stakeholders, collaboration, innovation, and equitable access to health data can be promoted, ultimately unlocking its full power for transformative research and improving global health outcomes.

2023-06-11T04:01:42Z 1 Figure, (in press) Journal of Nature and Science of Medicine 2023, Volume 6, Issue 3 Ahmed S. BaHammam http://arxiv.org/abs/2306.06353v1 Neural Replicator Analysis of the genus Flavivirus 2023-06-10T06:04:45Z

The results of applying neural replicator analysis (NRA) to the genomes of viruses belonging to the genus Flavivirus are presented. It is shown that the viral genomes considered in this study can be placed in five different cells of the viral genome table. Some of these cells appear for the first time and are characterized by 9-periodicity of WS-encoded genomic sequences. It is noteworthy that Japanese encephalitis viral strains and Zika viral strains occupy not one, but two common cells of this table. We also present the results of the NRA of Zika viral strains and suggest that the earliest strain in Asia is an Indian strain that spread from Africa (Uganda) to the East. The fine structure of the sets of Japanese encephalitis viral strains is presented and it is shown that their generally accepted genotypes 1 and 3 can be clearly divided into two subgenotypes. It is also shown that probably not Indonesian, but Indian strains of this virus can be considered the earliest known strains that further evolved and spread in Asian countries.

2023-06-10T06:04:45Z 17 pages, 7 figures Alexandr A. Ezhov http://arxiv.org/abs/2306.10038v1 Comments arising from WJ Thompson "Uncertainty in probabilistic genotyping of low template DNA A case study comparing STRmix and TrueAllele" 2023-06-10T01:51:23Z

Thompson reports a comparison of data from STRmix and TrueAllele. The data he has arise from different inputs to the two software packages. If the input data are made more similar, the outputs become more similar. Thompson argues that the Analytical Threshold (AT) should be varied in casework. Doing so produces different likelihood ratios (LRs), but the analyst would be left deciding what to do with these options. That decision cannot be based on the LRs; it should be based on whether any movement in the AT adds reliable or unreliable data. This is how most laboratories set their AT in the first place. Hence it is pointless, and potentially dangerous, to experimentally vary the AT in casework. The profile is low level and shows at most three peaks. Thompson argues that LR results assuming that the number of contributors (NoC) is 2 or 3 should be reported. Uncertainty in NoC should instead be treated as a nuisance variable and summed out.
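As an illustration of what summing out the number of contributors can look like, the sketch below applies the law of total probability to the numerator and the denominator of the LR separately: LR = Σ_n Pr(E | Hp, n) Pr(n | Hp) / Σ_n Pr(E | Hd, n) Pr(n | Hd). This is a generic marginalization, not the procedure implemented in STRmix or TrueAllele; the function name `marginal_lr`, the likelihood values, and the equal NoC weights are hypothetical and chosen only to make the arithmetic visible.

```python
def marginal_lr(lik_hp, lik_hd, prior_hp, prior_hd):
    """Likelihood ratio with the number of contributors (NoC) summed out.

    lik_hp[n]   : Pr(E | Hp, NoC = n)
    lik_hd[n]   : Pr(E | Hd, NoC = n)
    prior_hp[n] : Pr(NoC = n | Hp)
    prior_hd[n] : Pr(NoC = n | Hd)
    """
    numerator = sum(lik_hp[n] * prior_hp[n] for n in prior_hp)
    denominator = sum(lik_hd[n] * prior_hd[n] for n in prior_hd)
    return numerator / denominator


# Hypothetical values for illustration only; they do not come from the case.
lik_hp = {2: 1e-6, 3: 4e-7}
lik_hd = {2: 1e-9, 3: 2e-9}
noc_weights = {2: 0.5, 3: 0.5}  # assumed equal weights under both propositions

lr_given_2 = lik_hp[2] / lik_hd[2]  # LR conditional on NoC = 2
lr_given_3 = lik_hp[3] / lik_hd[3]  # LR conditional on NoC = 3
lr = marginal_lr(lik_hp, lik_hd, noc_weights, noc_weights)

print(lr_given_2, lr_given_3, lr)
```

With these made-up inputs the single marginal LR falls between the two conditional LRs, so one number is reported rather than a pair the analyst must choose between.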

2023-06-10T01:51:23Z 9 pages 1 figure Tim Kalafut James Curran Mike Coble John Buckleton http://arxiv.org/abs/2306.03255v1 Evaluation of software impact designed for biomedical research: Are we measuring what's meaningful? 2023-06-05T21:15:05Z

Software is vital for the advancement of biology and medicine. Analysis of usage and impact metrics can help developers determine user and community engagement, justify additional funding, encourage additional use, identify unanticipated use cases, and help define improvement areas. However, there are challenges associated with these analyses including distorted or misleading metrics, as well as ethical and security concerns. More attention to the nuances involved in capturing impact across the spectrum of biological software is needed. Furthermore, some tools may be especially beneficial to a small audience, yet may not have compelling typical usage metrics. We propose more general guidelines, as well as strategies for more specific types of software. We highlight outstanding issues regarding how communities measure or evaluate software impact. To get a deeper understanding of current practices for software evaluations, we performed a survey of participants in the Informatics Technology for Cancer Research (ITCR) program funded by the National Cancer Institute (NCI). We also investigated software among this community and others to assess how often infrastructure that supports such evaluations is implemented and how this impacts rates of papers describing usage of the software. We find that developers recognize the utility of analyzing software usage, but struggle to find the time or funding for such analyses. We also find that infrastructure such as social media presence, more in-depth documentation, the presence of software health metrics, and clear information on how to contact developers seem to be associated with increased usage rates. Our findings can help scientific software developers make the most out of evaluations of their software.

2023-06-05T21:15:05Z 25 total pages (17 pages for manuscript and 8 pages for the supplement). There are 2 figures Awan Afiaz Department of Biostatistics, University of Washington, Seattle, WA Biostatistics Program, Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA Andrey Ivanov Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Emory University, Atlanta, GA John Chamberlin Department of Biomedical Informatics, University of Utah, Salt Lake City, UT David Hanauer Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, MI Candace Savonen Biostatistics Program, Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA Mary J Goldman University of California Santa Cruz, Santa Cruz, CA Martin Morgan Roswell Park Comprehensive Cancer Center, Buffalo, NY Michael Reich University of California, San Diego, La Jolla, CA Alexander Getka University of Pennsylvania, Philadelphia, PA Aaron Holmes Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA Institute for Precision Health, University of California, Los Angeles, CA Department of Human Genetics, University of California, Los Angeles, CA Department of Urology, University of California, Los Angeles, CA Sarthak Pati University of Pennsylvania, Philadelphia, PA Dan Knight Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA Institute for Precision Health, University of California, Los Angeles, CA Department of Human Genetics, University of California, Los Angeles, CA Department of Urology, University of California, Los Angeles, CA Paul C. Boutros Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA Institute for Precision Health, University of California, Los Angeles, CA Department of Human Genetics, University of California, Los Angeles, CA Department of Urology, University of California, Los Angeles, CA Spyridon Bakas University of Pennsylvania, Philadelphia, PA J. Gregory Caporaso Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ Guilherme Del Fiol Department of Biomedical Informatics, University of Utah, Salt Lake City, UT Harry Hochheiser Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA Brian Haas Methods Development Laboratory, Broad Institute, Cambridge, MA Patrick D. Schloss Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI James A. Eddy Sage Bionetworks, Seattle, WA Jake Albrecht Sage Bionetworks, Seattle, WA Andrey Fedorov Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA Levi Waldron Department of Epidemiology and Biostatistics, City University of New York Graduate School of Public Health and Health Policy, New York, NY Ava M. Hoffman Biostatistics Program, Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA Richard L. Bradshaw Department of Biomedical Informatics, University of Utah, Salt Lake City, UT Jeffrey T. Leek Biostatistics Program, Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA Carrie Wright Biostatistics Program, Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA http://arxiv.org/abs/2301.08559v2 The Lost Art of Mathematical Modelling 2023-06-02T09:03:19Z

We provide a critique of mathematical biology in light of rapid developments in modern machine learning. We argue that out of the three modelling activities -- (1) formulating models; (2) analysing models; and (3) fitting or comparing models to data -- inherent to mathematical biology, researchers currently focus too much on activity (2) at the cost of (1). This trend, we propose, can be reversed by realising that any given biological phenomenon can be modelled in an infinite number of different ways, through the adoption of an open/pluralistic approach. We explain the open approach using fish locomotion as a case study and illustrate some of the pitfalls -- universalism, creating models of models, etc. -- that hinder mathematical biology. We then ask how we might rediscover a lost art: that of creative mathematical modelling. This article is dedicated to the memory of Edmund Crampin.

2023-01-19T13:16:31Z Linnéa Gyllingberg Abeba Birhane David J. T. Sumpter