https://arxiv.org/api//kEDlBrNUDfRgh82W4SwTYYhhtM 2026-03-18T10:17:27Z 1549 15 15 http://arxiv.org/abs/2602.13502v1 Translating Dietary Standards into Healthy Meals with Minimal Substitutions 2026-02-13T22:18:36Z An important goal for personalized diet systems is to improve nutritional quality without compromising convenience or affordability. We present an end-to-end framework that converts dietary standards into complete meals with minimal change. Using the What We Eat in America (WWEIA) intake data for 135,491 meals, we identify 34 interpretable meal archetypes that we then use to condition a generative model and a portion predictor to meet USDA nutritional targets. In comparisons within archetypes, generated meals are better at following recommended daily intake (RDI) targets by 47.0%, while remaining compositionally close to real meals. Our results show that by allowing one to three food substitutions, we were able to create meals that were 10% more nutritious, while reducing costs 19-32%, on average. By turning dietary guidelines into realistic, budget-aware meals and simple swaps, this framework can underpin clinical decision support, public-health programs, and consumer apps that deliver scalable, equitable improvements in everyday nutrition. 2026-02-13T22:18:36Z 49 pages, 4 figures Trevor Chan Ilias Tagkopoulos http://arxiv.org/abs/2506.10037v3 The Cell Ontology in the age of single-cell omics 2026-02-13T16:11:42Z Single-cell omics technologies have transformed our understanding of cellular diversity by enabling high-resolution profiling of individual cells. However, the unprecedented scale and heterogeneity of these datasets demand robust frameworks for data integration and annotation. 
The Cell Ontology (CL) has emerged as a pivotal resource for achieving FAIR (Findable, Accessible, Interoperable, and Reusable) data principles by providing standardized, species-agnostic terms for canonical cell types - forming a core component of a wide range of platforms and tools. In this paper, we describe the wide variety of uses of CL in these platforms and tools and detail ongoing work to improve and extend CL content including the addition of transcriptomic types, working closely with major atlasing efforts including the Human Cell Atlas and the Brain Initiative Cell Atlas Network to support their needs. We cover the challenges and future plans for harmonising classical and transcriptomic cell type definitions, integrating markers and using Large Language Models (LLMs) to improve content and efficiency of CL workflows. 2025-06-10T21:38:26Z 48 pages, 8 Figures Shawn Zheng Kai Tan Aleix Puig-Barbe Damien Goutte-Gattat Caroline Eastwood Brian Aevermann Alida Avola James P Balhoff Ismail Ugur Bayindir Jasmine Belfiore Anita Reane Caron David S Fischer Nancy George Benjamin M Gyori Melissa A Haendel Charles Tapley Hoyt Huseyin Kir Tiago Lubiana Nicolas Matentzoglu James A Overton Beverly Peng Bjoern Peters Ellen M Quardokus Patrick L Ray Paola Roncaglia Andrea D Rivera Ray Stefancsik Wei Kheng Teh Sabrina Toro Nicole Vasilevsky Chuan Xu Yun Zhang Richard H Scheuermann Christopher J Mungall Alexander D Diehl David Osumi-Sutherland http://arxiv.org/abs/2409.17038v2 Omnibenchmark: transparent, reproducible, extensible and standardized orchestration of solo and collaborative benchmarks 2026-02-11T11:27:20Z Benchmarking involves designing, running and disseminating rigorous performance assessments of methods, most often for data analysis and software tools, but the process can also be applied to experimental systems. 
Ideally, a benchmarking system is used to facilitate the benchmarking process by providing a structured entrypoint to design, coordinate, execute, and store standardized benchmarks. We describe a novel benchmarking system, Omnibenchmark, that facilitates benchmark formalization and execution in both solo and community efforts. Omnibenchmark provides a flexible benchmark plan syntax (i.e., a configuration YAML file), dynamic workflow generation based on Snakemake, S3-compatible storage handling, and reproducible software environments using environment modules, Apptainer or Conda. This setup provides unprecedented flexibility: existing benchmark designs can be forked and extended, and run separately or collaboratively, yielding versioned and standardized result outputs and therefore much-needed transparency in the analysis and interpretation of benchmark results. Tutorials and installation instructions are available from https://omnibenchmark.org. 2024-09-25T15:46:29Z 20 pages, 2 figures Izaskun Mallona Almut Luetge Ben Carrillo Daniel Incicau Reto Gerber Aidan Meara Anthony Sonrel Charlotte Soneson Mark D. Robinson http://arxiv.org/abs/2602.10011v1 Towards a topological view of blood pressure regulation 2026-02-10T17:35:01Z Blood pressure regulation is commonly addressed in terms of local mechanisms such as vascular resistance, compliance and neurohumoral control. However, the human vasculature encompasses multiple quasi-closed flow loops under both physiological and pathological conditions. To test whether these loops could influence pressure dynamics beyond local control, we address the role of vascular topology in blood pressure regulation. Using one-dimensional flow simulation models, we compared pressure dynamics in open vascular segments and closed vascular loops. 
We found that in open segments pressure fades away and remains spatially localized, whereas in closed loops pressure can keep circulating around the loop even if resistance in one spot is modified. Since parallel pathways within loops are dynamically coupled rather than independent, pressure changes in one place can affect the entire closed loop, allowing system-level pressure patterns to emerge. We also assessed the temporal evolution of pressure fluctuations within closed vascular loops in normotensive and hypertensive parameter regimes, before and after a loop-breaking intervention. This topological approach helps clarify why drugs or local interventions may fail to lower blood pressure in looped vascular architectures, providing a theoretical interpretation of some forms of resistant hypertension. Because disrupting a loop restores pressure relaxation, it may also help explain the disproportionate pressure changes observed after topology-altering events like thrombosis, vascular surgery or embolization of arteriovenous malformations and shunts. Therefore, vascular topology can influence cardiovascular physiology by coupling local pressure-flow relations to global constraints on blood pressure regulation, with physiological, pathological and clinical implications. 2026-02-10T17:35:01Z 9 pages, one figure Arturo Tozzi http://arxiv.org/abs/2602.08061v1 Securing Dual-Use Pathogen Data of Concern 2026-02-08T17:11:19Z Training data is an essential input into creating competent artificial intelligence (AI) models. AI models for biology are trained on large volumes of data, including data related to biological sequences, structures, images, and functions. The type of data used to train a model is intimately tied to the capabilities it ultimately possesses, including those of biosecurity concern. 
For this reason, an international group of more than 100 researchers at the recent 50th anniversary Asilomar Conference endorsed data controls to prevent the use of AI for harmful applications such as bioweapons development. To help design such controls, we introduce a five-tier Biosecurity Data Level (BDL) framework for categorizing pathogen data. Each level contains specific data types, based on their expected ability to contribute to capabilities of concern when used to train AI models. For each BDL tier, we propose technical restrictions appropriate to its level of risk. Finally, we outline a novel governance framework for newly created dual-use pathogen data. In a world with widely accessible computational and coding resources, data controls may be among the highest-leverage interventions available to reduce the proliferation of concerning biological AI capabilities. 2026-02-08T17:11:19Z 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: Biosecurity Safeguards for Generative AI Doni Bloomfield Allison Berke Moritz S. Hanke Aaron Maiwald James R. M. Black Toby Webster Tina Hernandez-Boussard Oliver M. Crook Jassi Pannu http://arxiv.org/abs/2602.07076v1 Bifacial weakness with paresthesias (BFP) secondary to trauma: a case report 2026-02-06T01:51:46Z This case report details the diagnosis and treatment of a patient with bilateral facial nerve palsy accompanied by limb sensory disturbance secondary to head trauma, who was ultimately diagnosed with bifacial weakness with paresthesias (BFP), a rare variant of Guillain-Barré syndrome (GBS). The patient underwent plasma exchange therapy and showed favorable recovery. In this article, we report for the first time a case of BFP secondary to trauma. 
2026-02-06T01:51:46Z 5 pages, 4 figures, 1083 words Jingjing Chen Xuxia Tang Shuo Dai Xiao He http://arxiv.org/abs/2602.00259v1 Intelligent Reasoning Cues: A Framework and Case Study of the Roles of AI Information in Complex Decisions 2026-01-30T19:22:23Z Artificial intelligence (AI)-based decision support systems can be highly accurate yet still fail to support users or improve decisions. Existing theories of AI-assisted decision-making focus on calibrating reliance on AI advice, leaving it unclear how different system designs might influence the reasoning processes underneath. We address this gap by reconsidering AI interfaces as collections of intelligent reasoning cues: discrete pieces of AI information that can individually influence decision-making. We then explore the roles of eight types of reasoning cues in a high-stakes clinical decision (treating patients with sepsis in intensive care). Through contextual inquiries with six teams and a think-aloud study with 25 physicians, we find that reasoning cues have distinct patterns of influence that can directly inform design. Our results also suggest that reasoning cues should prioritize tasks with high variability and discretion, adapt to ensure compatibility with evolving decision needs, and provide complementary, rigorous insights on complex cases. 2026-01-30T19:22:23Z Accepted at CHI 2026 Venkatesh Sivaraman Eric P. Mason Mengfan Ellen Li Jessica Tong Andrew J. King Jeremy M. Kahn Adam Perer 10.1145/3772318.3790953 http://arxiv.org/abs/2601.19852v1 Hyperdisorder in tumor growth 2026-01-27T18:01:43Z Tumor growth is constrained by spatial, mechanical, and metabolic factors whose alignment progressively breaks down across cellular, mesoscopic, and tissue scales as tumors expand. We hypothesize that this misalignment drives tumors toward a distinct architectural regime, termed hyperdisorder. 
Hyperdisorder is not defined by increased heterogeneity alone, but by the coexistence of elevated disorder across scales and spatial nonstationarity within the same tumor. Unlike ordinary randomness, where independent fluctuations diminish under spatial averaging, disorder here persists, reorganizes, or even amplifies with increasing observation scale, preventing convergence toward a stable architectural description. Using hematoxylin and eosin stained whole-slide images of gastric cancer from The Cancer Genome Atlas, we quantify tumor architecture using tile-based metrics that capture complementary aspects of organization, including texture entropy, microstructural fragmentation, orientation isotropy, and multiscale entropy variation. These measures are combined into a standardized hyperdisorder index, enabling unsupervised comparison across spatial regions. We find that architectural disruption is unevenly distributed and partially decoupled across scales within individual slides, consistent with growth-driven multiscale incoherence rather than uniform stochastic variability. Testable consequences include anomalous scaling of heterogeneity with sampling size, failure of coarse graining to converge, and systematic differences between tumor cores and invasive fronts. In diagnostic and clinical contexts, this framework clarifies when measurements from limited tissue samples are representative of the whole tumor and when they are dominated by scale- and location-dependent effects. 2026-01-27T18:01:43Z 10 pages, 1 figure Arturo Tozzi http://arxiv.org/abs/2509.15278v3 Assessing metadata privacy in neuroimaging 2026-01-27T09:59:44Z The ethical and legal imperative to share research data without causing harm requires careful attention to privacy risks. While mounting evidence demonstrates that data sharing benefits science, legitimate concerns persist regarding the potential leakage of personal information that could lead to reidentification and subsequent harm. 
We reviewed metadata accompanying neuroimaging datasets from heterogeneous studies openly available on OpenNeuro, involving participants across the lifespan, from children to older adults, with and without clinical diagnoses, and including associated clinical score data. Using metaprivBIDS (https://github.com/CPernet/metaprivBIDS), a software application for BIDS-compliant tsv/json files that computes and reports different privacy metrics (k-anonymity, k-global, l-diversity, SUDA, PIF), we found that privacy is generally well maintained, with serious vulnerabilities being rare. Nonetheless, issues were identified in nearly all datasets and warrant mitigation. Notably, clinical score data (e.g., neuropsychological results) posed minimal reidentification risk, whereas demographic variables (age, sex assigned at birth, sexual orientation, race, income, and geolocation) represented the principal privacy vulnerabilities. We outline practical measures to address these risks, enabling safer data sharing practices. 2025-09-18T12:56:03Z 19 pages, 7 tables, 1 figure, original analysis of 6 Open Datasets Emilie Kibsgaard Anita Sue Jwa Christopher J Markiewicz David Rodriguez Gonzalez Judith Sainz Pardo Russell A. Poldrack Cyril R. Pernet http://arxiv.org/abs/2304.05411v18 Precision Oncology: Targeting Genomic Alterations and Cancer Signaling with Integrative Multi-Omics, Deep Learning and Network Biology in Medical Oncology 2026-01-26T19:26:46Z Cancer is a complex genetic disease involving uncontrolled cell growth and proliferation, and necessitates effective targeting of dysregulated cellular pathways underlying cancer progression. Multiple genetic and epigenetic alterations characterize tumor progression and define hallmarks of cancer. Importantly, patients with the same cancer type respond differently to available cancer treatments, likely due to tumor-specific DNA, RNA, and proteins, indicating the need for patient-specific treatment options. 
Precision oncology has evolved as a form of cancer therapy that is focused on genetic and molecular profiling of tumors to identify specific molecular alterations involved in carcinogenesis for tailored individualized cancer treatment. Advances in high-throughput sequencing technologies have enabled gene expression profiling, providing multiomics data for detailed molecular characterization of various tumors. Integration and analysis of various multiomic sequencing data are crucial in this regard, as they can reveal critical molecular changes, such as cancer-driving mutations, post-translational modifications, gene fusions, amplifications, and alterations in signaling networks within tumors. Furthermore, the role of computational techniques, such as artificial intelligence and deep learning, in analyzing complex data and identifying patterns of disease development for better outcomes is now well established in precision medicine. Additionally, AI-powered multi-omics and network biology have been harnessed to integrate and analyze biological data through networks, which may prove crucial in solving key problems facing precision oncology. This article aims to briefly explain the foundations and frontiers of precision oncology in the context of cutting-edge developments in the associated tools and techniques, and to assess its scope and importance in achieving the intended goals. 2023-04-11T17:13:08Z Pictures and other related data have been taken from sources freely available for reuse, or permission for their use can be obtained upon request. Picture no. 1 has been added to the text with permission from Elsevier (Order No. 5521991271884, dated 4th April 2023). 40 pages, 2 figures, and 2 tables Manish Kumar 10.17632/s9pcj32yw2.1 http://arxiv.org/abs/2601.15854v1 Towards mathematical spaces for biological processes 2026-01-22T11:02:03Z Physics relies on mathematical spaces carefully matched to the phenomena under study. 
Phase space in classical mechanics, Hilbert space in quantum theory, configuration spaces in field theory all provide representations in which physical laws, stability and invariants become expressible and testable. In contrast, biology lacks an agreed-upon notion of space capturing context dependence, partial observability, degeneracy and irreversible dynamics. To address this gap, we introduce a unified mathematical space tailored to biological processes where states are represented in locally convex spaces indexed by context, where context includes both environment and history. Within our setting, proximity is defined through families of seminorms rather than a single global metric, allowing biological relevance to vary across conditions. Admissible sets encode biological constraints, observation maps formalize partial observability and many-to-one relations between state and dynamics capture irreversibility without requiring convergence to fixed points. Stabilization is characterized by neighborhood inclusion and degeneracy arises naturally through quotient structures induced by observation. We develop explicit constructions, operators and bounds within this space, yielding quantitative predictions dictated by its structure. A worked example based on EGFR-mutant non-small-cell lung cancer shows how single-cell data can be mapped into our framework, how numerical thresholds can be calibrated from the literature and how testable predictions can be formulated concerning rare tolerant states, context-dependent proximity and early stabilization. Overall, by providing biology with a space playing a role analogous to those used in physics, we aim to support structurally grounded and quantitative analyses of biological systems across contexts. 
2026-01-22T11:02:03Z 17 pages, 1 figure, 2 tables Arturo Tozzi http://arxiv.org/abs/2601.15483v1 Data complexity signature predicts quantum projected learning benefit for antibiotic resistance 2026-01-21T21:35:28Z This study presents the first large-scale empirical evaluation of quantum machine learning for predicting antibiotic resistance in clinical urine cultures. Antibiotic resistance is amongst the top threats to humanity, and inappropriate antibiotic use is a main driver of resistance. We developed a Quantum Projective Learning (QPL) approach and executed 60 qubit experiments on IBM Eagle and Heron quantum processing units. While QPL did not consistently outperform classical baselines, potentially reflecting current quantum hardware limitations, it did achieve parity or superiority in specific scenarios, notably for the antibiotic nitrofurantoin and selected data splits, revealing that quantum advantage may be data-dependent. Analysis of data complexity measures uncovered a multivariate signature, which comprised Shannon entropy, Fisher Discriminant Ratio, standard deviation of kurtosis, number of low-variance features, and total correlations. The multivariate model accurately (AUC = 0.88, $p$-value = 0.03) distinguished cases wherein QPL executed on quantum hardware would outperform classical models. This signature suggests that quantum kernels excel in feature spaces with high entropy and structural complexity. These findings point to complexity-driven adaptive model selection as a promising strategy for optimizing hybrid quantum-classical workflows in healthcare. Overall, this investigation marks the first application of quantum machine learning in urology, and in antibiotic resistance prediction. Further, this work highlights conditional quantum utility and introduces a principled approach for leveraging data complexity signatures to guide quantum machine learning deployment in biomedical applications. 
2026-01-21T21:35:28Z Kahn Rhrissorrakrai Filippo Utro Alex Milinovich Sandip Vasavada Daniel Rhoads Laxmi Parida Glenn T. Werneburg http://arxiv.org/abs/2507.19992v4 Development and Evaluation of an Ontology for Non-Invasive Respiratory Support in Acute Care 2026-01-21T05:55:49Z Managing patients with respiratory failure increasingly involves noninvasive respiratory support (NIRS) strategies to support respiration, often preventing the need for invasive mechanical ventilation. However, despite the rapidly expanding use of NIRS, there remains a significant challenge to its optimal use across all medical circumstances. NIRS lacks a unified ontological structure, complicating guidance on NIRS modalities across healthcare systems. This study introduced the NIRS ontology to support knowledge representation in acute care settings by providing a unified framework that enhances data clarity and interoperability, laying the groundwork for future clinical decision-making. We developed the NIRS ontology using the Web Ontology Language (OWL) and Protege to organize clinical concepts and relationships. To enable rule-based clinical reasoning beyond hierarchical structures, we added Semantic Web Rule Language (SWRL) rules. We evaluated logical reasoning by adding a sample of 6 patient scenarios and used SPARQL queries to retrieve and test targeted inferences. The ontology has 145 classes, 11 object properties, and 18 data properties across 949 axioms that establish concept relationships. To standardize clinical concepts, we added 392 annotations, including descriptive definitions based on controlled vocabularies. SPARQL query evaluations across clinical scenarios confirmed the ontology's ability to support rule-based reasoning and therapy recommendations, providing a foundation for consistent documentation practices, integration into clinical data models, and advanced analysis of NIRS outcomes. 
In conclusion, we unified NIRS concepts into an ontological framework and demonstrated its applicability through the evaluation of patient scenarios and alignment with standardized vocabularies. 2025-07-26T16:05:20Z Md Fantacher Islam Jarrod Mosier Vignesh Subbian http://arxiv.org/abs/2601.13985v1 Component systems: do null models explain everything? 2026-01-20T14:01:01Z Component systems - ensembles of realizations built from a shared repertoire of modular parts - are ubiquitous in biological, ecological, technological, and socio-cultural domains. From genomes to texts, cities, and software, these systems exhibit statistical regularities that often meet the "bona fide" requirements of laws in the physical sciences. Here, we argue that the generality and simplicity of those laws are often due to basic combinatorial or sampling constraints, raising the question of whether such patterns are actually revealing system-specific mechanisms and how we might move beyond them. To this end, we first present a unifying mathematical framework, which allows us to compare modular systems in different fields and highlights the common "null" trends as well as the system-specific uniqueness, which, arguably, are signatures of the underlying generative dynamics. Next, we can exploit the framework with statistical mechanics and modern machine-learning tools for a twofold objective. (i) Explaining why the general regularities emerge, highlighting the constraints between them and the general principles at their origins, and (ii) "subtracting" them from data, which will isolate the informative features for inferring hidden system-specific generative processes, mechanistic and causal aspects. 
2026-01-20T14:01:01Z Andrea Mazzolini Mattia Corigliano Rossana Droghetti Matteo Osella Marco Cosentino-Lagomarsino http://arxiv.org/abs/2601.13504v1 Modeling Age-Adjusted Mortality in the United States 2026-01-20T01:39:21Z This research explores how total mortality figures relate to age-standardized death rates within the United States, using the complete historical record of national mortality statistics. Through a detailed investigation of both all-cause and cause-specific mortality trends, the study evaluates the impact of demographic standardization on interpreting mortality data across different time periods and geographic regions. Results indicate a robust and persistent association between crude death totals and age-adjusted rates. However, the findings also demonstrate that without adjusting for age, comparisons over time or across locations may misrepresent underlying epidemiological shifts, largely due to evolving population age structures. The study underscores the critical role of age adjustment as a methodological tool for generating accurate, interpretable, and comparable measures of public health outcomes. 2026-01-20T01:39:21Z 29 pages, 5 figures, 1 table Brandon Dunbar Paramahansa Pramanik Haley Kate Robinson