https://arxiv.org/api//kEDlBrNUDfRgh82W4SwTYYhhtM
2026-03-18T10:17:27Z
1549
15
15
http://arxiv.org/abs/2602.13502v1
Translating Dietary Standards into Healthy Meals with Minimal Substitutions
2026-02-13T22:18:36Z
An important goal for personalized diet systems is to improve nutritional quality without compromising convenience or affordability. We present an end-to-end framework that converts dietary standards into complete meals with minimal change. Using the What We Eat in America (WWEIA) intake data for 135,491 meals, we identify 34 interpretable meal archetypes that we then use to condition a generative model and a portion predictor to meet USDA nutritional targets. In comparisons within archetypes, generated meals are better at following recommended daily intake (RDI) targets by 47.0%, while remaining compositionally close to real meals. Our results show that by allowing one to three food substitutions, we were able to create meals that were 10% more nutritious, while reducing costs by 19-32%, on average. By turning dietary guidelines into realistic, budget-aware meals and simple swaps, this framework can underpin clinical decision support, public-health programs, and consumer apps that deliver scalable, equitable improvements in everyday nutrition.
2026-02-13T22:18:36Z
49 pages, 4 figures
Trevor Chan
Ilias Tagkopoulos
http://arxiv.org/abs/2506.10037v3
The Cell Ontology in the age of single-cell omics
2026-02-13T16:11:42Z
Single-cell omics technologies have transformed our understanding of cellular diversity by enabling high-resolution profiling of individual cells. However, the unprecedented scale and heterogeneity of these datasets demand robust frameworks for data integration and annotation. The Cell Ontology (CL) has emerged as a pivotal resource for achieving FAIR (Findable, Accessible, Interoperable, and Reusable) data principles by providing standardized, species-agnostic terms for canonical cell types - forming a core component of a wide range of platforms and tools. In this paper, we describe the wide variety of uses of CL in these platforms and tools and detail ongoing work to improve and extend CL content including the addition of transcriptomic types, working closely with major atlasing efforts including the Human Cell Atlas and the Brain Initiative Cell Atlas Network to support their needs. We cover the challenges and future plans for harmonising classical and transcriptomic cell type definitions, integrating markers and using Large Language Models (LLMs) to improve content and efficiency of CL workflows.
2025-06-10T21:38:26Z
48 pages, 8 figures
Shawn Zheng Kai Tan
Aleix Puig-Barbe
Damien Goutte-Gattat
Caroline Eastwood
Brian Aevermann
Alida Avola
James P Balhoff
Ismail Ugur Bayindir
Jasmine Belfiore
Anita Reane Caron
David S Fischer
Nancy George
Benjamin M Gyori
Melissa A Haendel
Charles Tapley Hoyt
Huseyin Kir
Tiago Lubiana
Nicolas Matentzoglu
James A Overton
Beverly Peng
Bjoern Peters
Ellen M Quardokus
Patrick L Ray
Paola Roncaglia
Andrea D Rivera
Ray Stefancsik
Wei Kheng Teh
Sabrina Toro
Nicole Vasilevsky
Chuan Xu
Yun Zhang
Richard H Scheuermann
Christopher J Mungall
Alexander D Diehl
David Osumi-Sutherland
http://arxiv.org/abs/2409.17038v2
Omnibenchmark: transparent, reproducible, extensible and standardized orchestration of solo and collaborative benchmarks
2026-02-11T11:27:20Z
Benchmarking involves designing, running and disseminating rigorous performance assessments of methods, most often for data analysis and software tools, but the process can also be applied to experimental systems. Ideally, a benchmarking system is used to facilitate the benchmarking process by providing a structured entrypoint to design, coordinate, execute, and store standardized benchmarks. We describe a novel benchmarking system, Omnibenchmark, that facilitates benchmark formalization and execution in both solo and community efforts. Omnibenchmark provides a flexible benchmark plan syntax (i.e., a configuration YAML file), dynamic workflow generation based on Snakemake, S3-compatible storage handling, and reproducible software environments using environment modules, Apptainer or Conda. Such a setup provides unprecedented flexibility, such that existing benchmark designs can be forked and extended, run separately or collaboratively, giving versioned and standardized result outputs and therefore much-needed transparency to the analysis and interpretation of benchmark results. Tutorials and installation instructions are available from https://omnibenchmark.org.
2024-09-25T15:46:29Z
20 pages, 2 figures
Izaskun Mallona
Almut Luetge
Ben Carrillo
Daniel Incicau
Reto Gerber
Aidan Meara
Anthony Sonrel
Charlotte Soneson
Mark D. Robinson
http://arxiv.org/abs/2602.10011v1
Towards a topological view of blood pressure regulation
2026-02-10T17:35:01Z
Blood pressure regulation is commonly addressed in terms of local mechanisms such as vascular resistance, compliance and neurohumoral control. However, the human vasculature encompasses multiple quasi-closed flow loops under both physiological and pathological conditions. To test whether these loops could influence pressure dynamics beyond local control, we address the role of vascular topology in blood pressure regulation. Using one-dimensional flow simulation models, we compared pressure dynamics in open vascular segments and closed vascular loops. We found that in open segments pressure fades away and remains spatially localized, whereas in closed loops pressure can keep circulating around the loop even if resistance in one spot is modified. Since parallel pathways within loops are dynamically coupled rather than independent, pressure changes in one place can affect the entire closed loop, allowing system-level pressure patterns to emerge. We also assessed the temporal evolution of pressure fluctuations within closed vascular loops in normotensive and hypertensive parameter regimes, before and after a loop-breaking intervention. This topological approach helps clarify why drugs or local interventions may fail to lower blood pressure in looped vascular architectures, providing a theoretical interpretation of some forms of resistant hypertension. Because disrupting a loop restores pressure relaxation, it may also help explain the disproportionate pressure changes observed after topology-altering events like thrombosis, vascular surgery or embolization of arteriovenous malformations and shunts. Therefore, vascular topology can influence cardiovascular physiology by coupling local pressure-flow relations to global constraints on blood pressure regulation, with physiological, pathological and clinical implications.
2026-02-10T17:35:01Z
9 pages, one figure
Arturo Tozzi
http://arxiv.org/abs/2602.08061v1
Securing Dual-Use Pathogen Data of Concern
2026-02-08T17:11:19Z
Training data is an essential input into creating competent artificial intelligence (AI) models. AI models for biology are trained on large volumes of data, including data related to biological sequences, structures, images, and functions. The type of data used to train a model is intimately tied to the capabilities it ultimately possesses--including those of biosecurity concern. For this reason, an international group of more than 100 researchers at the recent 50th anniversary Asilomar Conference endorsed data controls to prevent the use of AI for harmful applications such as bioweapons development. To help design such controls, we introduce a five-tier Biosecurity Data Level (BDL) framework for categorizing pathogen data. Each level contains specific data types, based on their expected ability to contribute to capabilities of concern when used to train AI models. For each BDL tier, we propose technical restrictions appropriate to its level of risk. Finally, we outline a novel governance framework for newly created dual-use pathogen data. In a world with widely accessible computational and coding resources, data controls may be among the most high-leverage interventions available to reduce the proliferation of concerning biological AI capabilities.
2026-02-08T17:11:19Z
39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: Biosecurity Safeguards for Generative AI
Doni Bloomfield
Allison Berke
Moritz S. Hanke
Aaron Maiwald
James R. M. Black
Toby Webster
Tina Hernandez-Boussard
Oliver M. Crook
Jassi Pannu
http://arxiv.org/abs/2602.07076v1
Bifacial weakness with paresthesias (BFP) secondary to trauma: a case report
2026-02-06T01:51:46Z
This case report details the diagnosis and treatment of a patient with bilateral facial nerve palsy accompanied by limb sensory disturbance secondary to head trauma, who was ultimately diagnosed with bifacial weakness with paresthesias (BFP), a rare variant of Guillain-Barré syndrome (GBS). The patient underwent plasma exchange therapy and showed favorable recovery. In this article, we report for the first time a case of BFP secondary to trauma.
2026-02-06T01:51:46Z
5 pages, 4 figures, 1083 words
Jingjing Chen
Xuxia Tang
Shuo Dai
Xiao He
http://arxiv.org/abs/2602.00259v1
Intelligent Reasoning Cues: A Framework and Case Study of the Roles of AI Information in Complex Decisions
2026-01-30T19:22:23Z
Artificial intelligence (AI)-based decision support systems can be highly accurate yet still fail to support users or improve decisions. Existing theories of AI-assisted decision-making focus on calibrating reliance on AI advice, leaving it unclear how different system designs might influence the reasoning processes underneath. We address this gap by reconsidering AI interfaces as collections of intelligent reasoning cues: discrete pieces of AI information that can individually influence decision-making. We then explore the roles of eight types of reasoning cues in a high-stakes clinical decision (treating patients with sepsis in intensive care). Through contextual inquiries with six teams and a think-aloud study with 25 physicians, we find that reasoning cues have distinct patterns of influence that can directly inform design. Our results also suggest that reasoning cues should prioritize tasks with high variability and discretion, adapt to ensure compatibility with evolving decision needs, and provide complementary, rigorous insights on complex cases.
2026-01-30T19:22:23Z
Accepted at CHI 2026
Venkatesh Sivaraman
Eric P. Mason
Mengfan Ellen Li
Jessica Tong
Andrew J. King
Jeremy M. Kahn
Adam Perer
10.1145/3772318.3790953
http://arxiv.org/abs/2601.19852v1
Hyperdisorder in tumor growth
2026-01-27T18:01:43Z
Tumor growth is constrained by spatial, mechanical, and metabolic factors whose alignment progressively breaks down across cellular, mesoscopic, and tissue scales as tumors expand. We hypothesize that this misalignment drives tumors toward a distinct architectural regime, termed hyperdisorder. Hyperdisorder is not defined by increased heterogeneity alone, but by the coexistence of elevated disorder across scales and spatial nonstationarity within the same tumor. Unlike ordinary randomness, where independent fluctuations diminish under spatial averaging, disorder here persists, reorganizes, or even amplifies with increasing observation scale, preventing convergence toward a stable architectural description. Using hematoxylin and eosin stained whole-slide images of gastric cancer from The Cancer Genome Atlas, we quantify tumor architecture using tile-based metrics that capture complementary aspects of organization, including texture entropy, microstructural fragmentation, orientation isotropy, and multiscale entropy variation. These measures are combined into a standardized hyperdisorder index, enabling unsupervised comparison across spatial regions. We find that architectural disruption is unevenly distributed and partially decoupled across scales within individual slides, consistent with growth-driven multiscale incoherence rather than uniform stochastic variability. Testable consequences include anomalous scaling of heterogeneity with sampling size, failure of coarse graining to converge, and systematic differences between tumor cores and invasive fronts. In diagnostic and clinical contexts, this framework clarifies when measurements from limited tissue samples are representative of the whole tumor and when they are dominated by scale- and location-dependent effects.
2026-01-27T18:01:43Z
10 pages, 1 figure
Arturo Tozzi
http://arxiv.org/abs/2509.15278v3
Assessing metadata privacy in neuroimaging
2026-01-27T09:59:44Z
The ethical and legal imperative to share research data without causing harm requires careful attention to privacy risks. While mounting evidence demonstrates that data sharing benefits science, legitimate concerns persist regarding the potential leakage of personal information that could lead to reidentification and subsequent harm. We reviewed metadata accompanying neuroimaging datasets from heterogeneous studies openly available on OpenNeuro, involving participants across the lifespan, from children to older adults, with and without clinical diagnoses, and including associated clinical score data. Using metaprivBIDS (https://github.com/CPernet/metaprivBIDS), a software application for BIDS-compliant tsv/json files that computes and reports different privacy metrics (k-anonymity, k-global, l-diversity, SUDA, PIF), we found that privacy is generally well maintained, with serious vulnerabilities being rare. Nonetheless, issues were identified in nearly all datasets and warrant mitigation. Notably, clinical score data (e.g., neuropsychological results) posed minimal reidentification risk, whereas demographic variables (age, sex assigned at birth, sexual orientation, race, income, and geolocation) represented the principal privacy vulnerabilities. We outline practical measures to address these risks, enabling safer data sharing practices.
2025-09-18T12:56:03Z
19 pages, 7 tables, 1 figure, original analysis of 6 Open Datasets
Emilie Kibsgaard
Anita Sue Jwa
Christopher J Markiewicz
David Rodriguez Gonzalez
Judith Sainz Pardo
Russell A. Poldrack
Cyril R. Pernet
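The k-anonymity metric named in the abstract above can be illustrated with a minimal sketch. This is not the metaprivBIDS implementation; the function name `k_anonymity`, the column names, and the participant rows are all hypothetical. A table is k-anonymous for a set of quasi-identifiers when every combination of their values is shared by at least k rows, so the sketch reports the size of the smallest such group.

```python
from collections import Counter


def k_anonymity(rows, quasi_identifiers):
    """Return k: the size of the smallest group of rows that share
    the same values for all quasi-identifier columns."""
    groups = Counter(tuple(row[q] for q in quasi_identifiers) for row in rows)
    return min(groups.values())


# Hypothetical participants table, not taken from any OpenNeuro dataset.
participants = [
    {"age": 25, "sex": "F", "score": 31},
    {"age": 25, "sex": "F", "score": 28},
    {"age": 25, "sex": "M", "score": 30},
    {"age": 40, "sex": "M", "score": 27},
    {"age": 40, "sex": "M", "score": 33},
]

# One participant is unique on (age, sex), so the table is only 1-anonymous.
k = k_anonymity(participants, ["age", "sex"])
print(k)
```

A value of k = 1 means at least one row is unique on the chosen demographic columns, which is the kind of reidentification risk the abstract attributes to demographic variables rather than clinical scores.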
http://arxiv.org/abs/2304.05411v18
Precision Oncology: Targeting Genomic Alterations and Cancer Signaling with Integrative Multi-Omics, Deep Learning and Network Biology in Medical Oncology
2026-01-26T19:26:46Z
Cancer is a complex genetic disease involving uncontrolled cell growth and proliferation, and necessitates effective targeting of dysregulated cellular pathways underlying cancer progression. Multiple genetic and epigenetic alterations characterize tumor progression and define hallmarks of cancer. Importantly, patients with the same cancer type respond differently to available cancer treatments, likely due to tumor-specific DNA, RNA, and proteins, indicating the need for patient-specific treatment options. Precision oncology has evolved as a form of cancer therapy that is focused on genetic and molecular profiling of tumors to identify specific molecular alterations involved in carcinogenesis for tailored individualized cancer treatment. Advances in high-throughput sequencing technologies have enabled gene expression profiling, providing multiomics data for detailed molecular characterization of various tumors. Integration and analysis of various multiomic sequencing data are crucial in this regard, as they can reveal critical molecular changes, such as cancer-driving mutations, post-translational modifications, gene fusions, amplifications, and alterations in signaling networks within tumors. Furthermore, the role of computational techniques such as artificial intelligence and deep learning, in analyzing complex data and identifying patterns of disease development for better outcomes is now well established in precision medicine. Additionally, AI-powered multi-omics and network biology have been harnessed to integrate and analyze biological data through networks, which may prove crucial in solving key problems facing precision oncology. This article aims to briefly explain the foundations and frontiers of precision oncology in the context of cutting-edge developments in tools and techniques associated with it, and try to assess its scope and importance in achieving the intended goals.
2023-04-11T17:13:08Z
Pictures and other related data have been taken from sources freely available for reuse or permission for the same can be obtained upon request. Picture no. 1 has been added to the text with permission from Elsevier. (Order No. 5521991271884, dated 4th April 2023). 40 pages, 2 figures, and 2 tables
Manish Kumar
10.17632/s9pcj32yw2.1
http://arxiv.org/abs/2601.15854v1
Towards mathematical spaces for biological processes
2026-01-22T11:02:03Z
Physics relies on mathematical spaces carefully matched to the phenomena under study. Phase space in classical mechanics, Hilbert space in quantum theory, configuration spaces in field theory all provide representations in which physical laws, stability and invariants become expressible and testable. In contrast, biology lacks an agreed-upon notion of space capturing context dependence, partial observability, degeneracy and irreversible dynamics. To address this gap, we introduce a unified mathematical space tailored to biological processes where states are represented in locally convex spaces indexed by context, where context includes both environment and history. Within our setting, proximity is defined through families of seminorms rather than a single global metric, allowing biological relevance to vary across conditions. Admissible sets encode biological constraints, observation maps formalize partial observability and many-to-one relations between state and dynamics capture irreversibility without requiring convergence to fixed points. Stabilization is characterized by neighborhood inclusion and degeneracy arises naturally through quotient structures induced by observation. We develop explicit constructions, operators and bounds within this space, yielding quantitative predictions dictated by its structure. A worked example based on EGFR-mutant non-small-cell lung cancer shows how single-cell data can be mapped into our framework, how numerical thresholds can be calibrated from the literature and how testable predictions can be formulated concerning rare tolerant states, context-dependent proximity and early stabilization. Overall, by providing biology with a space playing a role analogous to those used in physics, we aim to support structurally grounded and quantitative analyses of biological systems across contexts.
2026-01-22T11:02:03Z
17 pages, 1 figure, 2 tables
Arturo Tozzi
http://arxiv.org/abs/2601.15483v1
Data complexity signature predicts quantum projected learning benefit for antibiotic resistance
2026-01-21T21:35:28Z
This study presents the first large-scale empirical evaluation of quantum machine learning for predicting antibiotic resistance in clinical urine cultures. Antibiotic resistance is amongst the top threats to humanity, and inappropriate antibiotic use is a main driver of resistance. We developed a Quantum Projective Learning (QPL) approach and executed 60 qubit experiments on IBM Eagle and Heron quantum processing units. While QPL did not consistently outperform classical baselines, potentially reflecting current quantum hardware limitations, it did achieve parity or superiority in specific scenarios, notably for the antibiotic nitrofurantoin and selected data splits, revealing that quantum advantage may be data-dependent. Analysis of data complexity measures uncovered a multivariate signature, which comprised Shannon entropy, Fisher Discriminant Ratio, standard deviation of kurtosis, number of low-variance features, and total correlations. The multivariate model accurately (AUC = 0.88, $p$-value = 0.03) distinguished cases wherein QPL executed on quantum hardware would outperform classical models. This signature suggests that quantum kernels excel in feature spaces with high entropy and structural complexity. These findings point to complexity-driven adaptive model selection as a promising strategy for optimizing hybrid quantum-classical workflows in healthcare. Overall, this investigation marks the first application of quantum machine learning in urology, and in antibiotic resistance prediction. Further, this work highlights conditional quantum utility and introduces a principled approach for leveraging data complexity signatures to guide quantum machine learning deployment in biomedical applications.
2026-01-21T21:35:28Z
Kahn Rhrissorrakrai
Filippo Utro
Alex Milinovich
Sandip Vasavada
Daniel Rhoads
Laxmi Parida
Glenn T. Werneburg
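Two of the data complexity measures listed in the abstract above, Shannon entropy and the Fisher Discriminant Ratio, have common textbook forms that can be sketched briefly. The paper's exact definitions are not given in the abstract, so the formulas below are assumptions: entropy of a discrete feature in bits, and the single-feature, two-class ratio (difference of class means squared over the sum of class variances). The function names and sample values are hypothetical.

```python
import math
from collections import Counter
from statistics import mean, pvariance


def shannon_entropy(values):
    """Shannon entropy (in bits) of the empirical distribution of values."""
    counts = Counter(values)
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def fisher_discriminant_ratio(class_a, class_b):
    """Single-feature, two-class Fisher ratio:
    (mean_a - mean_b)^2 / (var_a + var_b), using population variances.
    Raises ZeroDivisionError if both classes have zero variance."""
    between = (mean(class_a) - mean(class_b)) ** 2
    within = pvariance(class_a) + pvariance(class_b)
    return between / within


# Hypothetical feature values, for illustration only.
entropy_bits = shannon_entropy(["a", "b", "a", "b"])
fdr = fisher_discriminant_ratio([1, 2, 3], [4, 5, 6])
print(entropy_bits, fdr)
```

A higher entropy indicates a more evenly spread feature, and a higher Fisher ratio indicates classes that are easier to separate on that feature. The remaining measures in the abstract (standard deviation of kurtosis, number of low-variance features, total correlations) are not covered here.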
http://arxiv.org/abs/2507.19992v4
Development and Evaluation of an Ontology for Non-Invasive Respiratory Support in Acute Care
2026-01-21T05:55:49Z
Managing patients with respiratory failure increasingly involves noninvasive respiratory support (NIRS) strategies to support respiration, often preventing the need for invasive mechanical ventilation. However, despite the rapidly expanding use of NIRS, there remains a significant challenge to its optimal use across all medical circumstances. NIRS lacks a unified ontological structure, complicating guidance on NIRS modalities across healthcare systems. This study introduced a NIRS ontology to support knowledge representation in acute care settings by providing a unified framework that enhances data clarity and interoperability, laying the groundwork for future clinical decision-making. We developed the NIRS ontology using the Web Ontology Language (OWL) and Protege to organize clinical concepts and relationships. To enable rule-based clinical reasoning beyond hierarchical structures, we added Semantic Web Rule Language (SWRL) rules. We evaluated logical reasoning by adding a sample of 6 patient scenarios and used SPARQL queries to retrieve and test targeted inferences. The ontology has 145 classes, 11 object properties, and 18 data properties across 949 axioms that establish concept relationships. To standardize clinical concepts, we added 392 annotations, including descriptive definitions based on controlled vocabularies. SPARQL query evaluations across clinical scenarios confirmed the ontology's ability to support rule-based reasoning and therapy recommendations, providing a foundation for consistent documentation practices, integration into clinical data models, and advanced analysis of NIRS outcomes. In conclusion, we unified NIRS concepts into an ontological framework and demonstrated its applicability through the evaluation of patient scenarios and alignment with standardized vocabularies.
2025-07-26T16:05:20Z
Md Fantacher Islam
Jarrod Mosier
Vignesh Subbian
http://arxiv.org/abs/2601.13985v1
Component systems: do null models explain everything?
2026-01-20T14:01:01Z
Component systems - ensembles of realizations built from a shared repertoire of modular parts - are ubiquitous in biological, ecological, technological, and socio-cultural domains. From genomes to texts, cities, and software, these systems exhibit statistical regularities that often meet the "bona fide" requirements of laws in the physical sciences. Here, we argue that the generality and simplicity of those laws are often due to basic combinatorial or sampling constraints, raising the question of whether such patterns are actually revealing system-specific mechanisms and how we might move beyond them. To this end, we first present a unifying mathematical framework, which allows us to compare modular systems in different fields and highlights the common "null" trends as well as the system-specific uniqueness, which, arguably, are signatures of the underlying generative dynamics. Next, we can exploit the framework with statistical mechanics and modern machine-learning tools for a twofold objective: (i) explaining why the general regularities emerge, highlighting the constraints between them and the general principles at their origins, and (ii) "subtracting" them from data, which will isolate the informative features for inferring hidden system-specific generative processes, mechanistic and causal aspects.
2026-01-20T14:01:01Z
Andrea Mazzolini
Mattia Corigliano
Rossana Droghetti
Matteo Osella
Marco Cosentino-Lagomarsino
http://arxiv.org/abs/2601.13504v1
Modeling Age-Adjusted Mortality in the United States
2026-01-20T01:39:21Z
This research explores how total mortality figures relate to age-standardized death rates within the United States, using the complete historical record of national mortality statistics. Through a detailed investigation of both all-cause and cause-specific mortality trends, the study evaluates the impact of demographic standardization on interpreting mortality data across different time periods and geographic regions. Results indicate a robust and persistent association between crude death totals and age-adjusted rates. However, the findings also demonstrate that without adjusting for age, comparisons over time or across locations may misrepresent underlying epidemiological shifts, largely due to evolving population age structures. The study underscores the critical role of age adjustment as a methodological tool for generating accurate, interpretable, and comparable measures of public health outcomes.
2026-01-20T01:39:21Z
29 pages, 5 figures, 1 table
Brandon Dunbar
Paramahansa Pramanik
Haley Kate Robinson
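The contrast between crude and age-adjusted rates described in the abstract above can be made concrete with a minimal sketch of direct age standardization, in which each age-specific death rate is weighted by that age group's share of a fixed standard population. This is the general textbook method, not necessarily the study's own procedure; the function names, the two regions, the two age groups, and all counts are hypothetical.

```python
def crude_rate(deaths, population):
    """Crude death rate per 100,000: total deaths over total population."""
    return 100_000 * sum(deaths) / sum(population)


def age_adjusted_rate(deaths, population, standard_population):
    """Directly standardized death rate per 100,000: age-specific rates
    weighted by each age group's share of the standard population."""
    total_std = sum(standard_population)
    return 100_000 * sum(
        (d / p) * (s / total_std)
        for d, p, s in zip(deaths, population, standard_population)
    )


# Hypothetical data with two age groups (younger, older).
# Both regions have identical age-specific rates (0.001 and 0.01),
# but region B has a much older population.
standard = [70_000, 30_000]
region_a = {"deaths": [80, 200], "population": [80_000, 20_000]}
region_b = {"deaths": [50, 500], "population": [50_000, 50_000]}

crude_a = crude_rate(region_a["deaths"], region_a["population"])
crude_b = crude_rate(region_b["deaths"], region_b["population"])
adj_a = age_adjusted_rate(region_a["deaths"], region_a["population"], standard)
adj_b = age_adjusted_rate(region_b["deaths"], region_b["population"], standard)
print(crude_a, crude_b, adj_a, adj_b)
```

In this made-up example the crude rate of region B is nearly double that of region A even though the underlying age-specific risks are the same, while the two age-adjusted rates coincide at about 370 per 100,000. This is the sense in which unadjusted comparisons can misrepresent epidemiological differences when age structures differ.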