https://arxiv.org/api/CI5rNu56E5rmv3s/1AG7zGbURaU 2026-03-20T18:36:52Z 2597 105 15 http://arxiv.org/abs/2506.06810v2 The influence of cell phenotype on collective cell invasion into the extracellular matrix 2025-10-01T16:36:02Z Understanding the interactions between cells and the extracellular matrix (ECM) during collective cell invasion is crucial for advancements in tissue engineering, cancer therapies, and regenerative medicine. This study focuses on the roles of contact guidance and ECM remodelling in directing cell behaviour, with a particular emphasis on exploring how differences in cell phenotype impact collective cell invasion. We present a computationally tractable two-dimensional hybrid model of collective cell migration within the ECM, where cells are modelled as individual entities and collagen fibres as a continuous tensorial field. Our model incorporates random motility, contact guidance, cell-cell adhesion, volume filling, and the dynamic remodelling of collagen fibres through cellular secretion and degradation. Through a comprehensive parameter sweep, we provide valuable insights into how differences in the cell phenotype, in terms of the ability of the cell to migrate, secrete, degrade, and respond to contact guidance cues from the ECM, impacts the characteristics of collective cell invasion. 2025-06-07T14:18:47Z 36 pages, 6 figures Yuan Yin Sarah L. Waters Ruth E. Baker http://arxiv.org/abs/2510.06232v1 Neu-RadBERT for Enhanced Diagnosis of Brain Injuries and Conditions 2025-10-01T11:54:33Z Objective: We sought to develop a classification algorithm to extract diagnoses from free-text radiology reports of brain imaging performed in patients with acute respiratory failure (ARF) undergoing invasive mechanical ventilation. Methods: We developed and fine-tuned Neu-RadBERT, a BERT-based model, to classify unstructured radiology reports. We extracted all the brain imaging reports (computed tomography and magnetic resonance imaging) from MIMIC-IV database, performed in patients with ARF. Initial manual labelling was performed on a subset of reports for various brain abnormalities, followed by fine-tuning Neu-RadBERT using three strategies: 1) baseline RadBERT, 2) Neu-RadBERT with Masked Language Modeling (MLM) pretraining, and 3) Neu-RadBERT with MLM pretraining and oversampling to address data skewness. We compared the performance of this model to Llama-2-13B, an autoregressive LLM. Results: The Neu-RadBERT model, particularly with oversampling, demonstrated significant improvements in diagnostic accuracy compared to baseline RadBERT for brain abnormalities, achieving up to 98.0% accuracy for acute brain injuries. Llama-2-13B exhibited relatively lower performance, peaking at 67.5% binary classification accuracy. This result highlights potential limitations of current autoregressive LLMs for this specific classification task, though it remains possible that larger models or further fine-tuning could improve performance. Conclusion: Neu-RadBERT, enhanced through target domain pretraining and oversampling techniques, offered a robust tool for accurate and reliable diagnosis of neurological conditions from radiology reports. This study underscores the potential of transformer-based NLP models in automatically extracting diagnoses from free text reports with potential applications to both research and patient care. 2025-10-01T11:54:33Z Both Manpreet Singh and Sean Macrae contributed equally and should be considered co-first authors. Corresponding author: Yiorgos Alexandros Cavayas Manpreet Singh Équipe de Recherche en Soins Intensifs, Centre de recherche du Centre intégré universitaire de santé et de services sociaux du Nord-de-l'Île-de-Montréal Sean Macrae Faculté de Médecine, Université de Montréal Pierre-Marc Williams Faculté de Médecine, Université de Montréal Nicole Hung Faculté de Médecine, Université de Montréal Sabrina Araujo de Franca Équipe de Recherche en Soins Intensifs, Centre de recherche du Centre intégré universitaire de santé et de services sociaux du Nord-de-l'Île-de-Montréal Laurent Letourneau-Guillon Faculté de Médecine, Université de Montréal Department of Radiology, Centre Hospitalier de l'Université de Montréal François-Martin Carrier Faculté de Médecine, Université de Montréal Department of Anesthesia, Centre Hospitalier de l'Université de Montréal Bang Liu Applied Research in Computer Linguistics Laboratory, Department of Computer Science and Operations Research, Université de Montréal Yiorgos Alexandros Cavayas Équipe de Recherche en Soins Intensifs, Centre de recherche du Centre intégré universitaire de santé et de services sociaux du Nord-de-l'Île-de-Montréal Faculté de Médecine, Université de Montréal Division of Critical Care Medicine, Department of Medicine, Hôpital du Sacré-Cœur de Montréal http://arxiv.org/abs/2510.06230v1 Robust Federated Anomaly Detection Using Dual-Signal Autoencoders: Application to Kidney Stone Identification in Ureteroscopy 2025-10-01T00:17:13Z This work introduces Federated Adaptive Gain via Dual Signal Trust (FedAgain), a novel federated learning algorithm designed to enhance anomaly detection in medical imaging under decentralized and heterogeneous conditions. Focusing on the task of kidney stone classification, FedAgain addresses the common challenge of corrupted or low-quality client data in real-world clinical environments by implementing a dual-signal trust mechanism based on reconstruction error and model divergence. This mechanism enables the central server to dynamically down-weight updates from untrustworthy clients without accessing their raw data, thereby preserving both model integrity and data privacy. FedAgain employs deep convolutional autoencoders trained in two diverse kidney stone datasets and is evaluated in 16 types of endoscopy-specific corruption at five severity levels. Extensive experiments demonstrate that FedAgain effectively suppresses "expert forger" clients, enhances robustness to image corruptions, and offers a privacy-preserving solution for collaborative medical anomaly detection. Compared to traditional FedAvg, FedAgain achieves clear improvements in all 16 types of corruption, with precision gains of up to +14.49\% and F1 score improvements of up to +10.20\%, highlighting its robustness and effectiveness in challenging imaging scenarios. 2025-10-01T00:17:13Z Ivan Reyes-Amezcua Francisco Lopez-Tiro Clément Larose Christian Daul Andres Mendez-Vazquez Gilberto Ochoa-Ruiz http://arxiv.org/abs/2509.26259v1 Millennium Pathways for Tractography: 40 grand challenges to shape the future of tractography 2025-09-30T13:46:21Z In the spirit of the historic Millennium Prize Problems that heralded a new era for mathematics, the newly formed International Society for Tractography (IST) has launched the Millennium Pathways for Tractography, a community-driven roadmap designed to shape the future of the field. Conceived during the inaugural Tract-Anat Retreat, this initiative reflects a collective vision for advancing tractography over the coming decade and beyond. The roadmap consists of 40 grand challenges, developed by international experts and organized into seven categories spanning three overarching themes: neuroanatomy, tractography methods, and clinical applications. By defining shared short-, medium-, and long-term goals, these pathways provide a structured framework to confront fundamental limitations, promote rigorous validation, and accelerate the translation of tractography into a robust tool for neuroscience and medicine. Ultimately, the Millennium Pathways aim to guide and inspire future research and collaboration, ensuring the continued scientific and clinical relevance of tractography well into the future. 2025-09-30T13:46:21Z Millennium Pathways (30 Sept, 2025) Maxime Descoteaux Kurt G. Schilling Dogu Baran Aydogan Christian Beaulieu Elena Borra Maxime Chamberland Alessandro Daducci Alberto De Luca Flavio Dell'Acqua Jessica Dubois Tim B. Dyrby Shawna Farquharson Stephanie Forkel Martijn Froeling Alessandra Griffa Mareike Grotheer Pamela Guevara Suzanne N. Haber Vinod Kumar Jangir Alexander Leemans Joel Lefebvre Ching-Po Lin Graham Little Chun-Yi Zac Lo Chiara Maffei Helen S. Mayberg Jennifer A. McNab Pratik Mukherjee Lauren J. O'Donnell Martin Parent Carlo Pierpaoli Francois Rheault Kathleen S. Rockland Alard Roebroeck Ariel Rokem R. Jarrett Rushmore Silvio Sarubbo Simona Schiavi Stamatios N Sotiropoulos Diego Szczupak Michel Thiebaut de Schotten J-Donald Tournier Francesco Vergani Joseph Yuan-Mou Yang Fan Zhang Derek Jones Laurent Petit http://arxiv.org/abs/2509.25069v1 Effectiveness and Safety of Selective IL-23 Receptor Antagonists in Moderate to Severe Ulcerative Colitis: A Systematic Review, Meta-Analysis and Trial Sequential Analysis 2025-09-29T17:13:45Z Selective interleukin-23 receptor antagonists (IL-23RA) show promise for treating moderate to severe ulcerative colitis (UC) but their efficacy and safety are not fully understood. We performed a systematic review and meta-analysis of randomized controlled trials comparing IL-23RA with placebo in moderate to severe UC. Outcomes included clinical and endoscopic remission, response rates, and adverse events. Nine trials including 3808 patients in the induction phase and 1734 in the maintenance phase were analyzed. IL-23RA improved clinical remission (induction risk ratio 2.63, 95 percent confidence interval 2.05-3.36; maintenance 1.99, 95 percent confidence interval 1.63-2.44) and endoscopic remission (induction 2.36, 95 percent confidence interval 1.70-2.20; maintenance 1.96, 95 percent confidence interval 1.63-2.37). IL-23RA reduced serious adverse events in the induction phase (0.40, 95 percent confidence interval 0.27-0.69) with no difference during maintenance (0.75, 95 percent confidence interval 0.31-1.84). No significant differences were observed in overall adverse events or specific events such as headache or nasopharyngitis. Trial sequential analysis confirmed sufficient sample size for clinical endpoints. IL-23RA showed superior effectiveness and similar safety compared with placebo in moderate to severe UC. 2025-09-29T17:13:45Z Short title: IL-23 antagonists in Ulcerative Colitis Word count: 3,819 words. Figures: 3. Tables: 2 Wellgner Fernandes Oliveira Amador Isabelle Castro Vitor Milena Ramos Tome Diogo Delgado Dotta Rodrigo V Motta http://arxiv.org/abs/2509.21206v2 Data-driven Neural Networks for Windkessel Parameter Calibration 2025-09-26T16:54:58Z In this work, we propose a novel method for calibrating Windkessel (WK) parameters in a dimensionally reduced 1D-0D coupled blood flow model. To this end, we design a data-driven neural network (NN)trained on simulated blood pressures in the left brachial artery. Once trained, the NN emulates the pressure pulse waves across the entire simulated domain, i.e., over time, space and varying WK parameters, with negligible error and computational effort. To calibrate the WK parameters on a measured pulse wave, the NN is extended by dummy neurons and retrained only on these. The main objective of this work is to assess the effectiveness of the method in various scenarios -- particularly, when the exact measurement location is unknown or the data are affected by noise. 2025-09-25T14:14:53Z 32 pages, 15 figures, for associated git see https://github.com/bhoock/WKcalNN, submitted to International Journal for Numerical Methods in Biomedical Engineering Benedikt Hoock Tobias Köppl http://arxiv.org/abs/2509.16328v2 The Role of High-Performance GPU Resources in Large Language Model Based Radiology Imaging Diagnosis 2025-09-24T19:08:24Z Large-language models (LLMs) are rapidly being applied to radiology, enabling automated image interpretation and report generation tasks. Their deployment in clinical practice requires both high diagnostic accuracy and low inference latency, which in turn demands powerful hardware. High-performance graphical processing units (GPUs) provide the necessary compute and memory throughput to run large LLMs on imaging data. We review modern GPU architectures (e.g. NVIDIA A100/H100, AMD Instinct MI250X/MI300) and key performance metrics of floating-point throughput, memory bandwidth, VRAM capacity. We show how these hardware capabilities affect radiology tasks: for example, generating reports or detecting findings on CheXpert and MIMIC-CXR images is computationally intensive and benefits from GPU parallelism and tensor-core acceleration. Empirical studies indicate that using appropriate GPU resources can reduce inference time and improve throughput. We discuss practical challenges including privacy, deployment, cost, power and optimization strategies: mixed-precision, quantization, compression, and multi-GPU scaling. Finally, we anticipate that next-generation features (8-bit tensor cores, enhanced interconnect) will further enable on-premise and federated radiology AI. Advancing GPU infrastructure is essential for safe, efficient LLM-based radiology diagnostics. 2025-09-19T18:13:12Z Jyun-Ping Kao http://arxiv.org/abs/2509.17854v2 Magnetically Guided Endothelial BioBots: A Next-Generation Strategy for Treating Complex Cerebral Aneurysms 2025-09-23T23:07:10Z Cerebral aneurysms affect three to five percent of the population, and rupture remains a major cause of stroke-related death and disability. Current therapies, surgical clipping, endovascular coiling, and flow diversion, have improved outcomes but each carries limitations. Clipping is invasive and often unsuitable for deep or posterior lesions. Coiling is prone to recurrence from compaction or incomplete occlusion, particularly in wide-neck or fusiform aneurysms. Flow diverters offer improved durability but rely on rigid metallic scaffolds that may malappose in tortuous vessels, compromise branch arteries, delay endothelialization, and necessitate long-term dual antiplatelet therapy. These shortcomings highlight a gap in current management: devices primarily provide mechanical occlusion but fail to conform to complex geometries or reliably promote rapid, complete endothelialization. As a result, aneurysm necks may remain exposed to persistent flow, delayed healing, and thrombosis. To address this, we propose magnetically guided endothelial BioBots as a next-generation therapeutic strategy. BioBots are biodegradable hydrogel carriers embedded with magnetic nanoparticles and coated with primed endothelial progenitor cells. Delivered through microcatheters and guided by external electromagnetic fields, they can assemble across aneurysm defects. Once localized, they form a conformal, geometry-adaptive endothelial patch that provides immediate antithrombotic protection and, as the hydrogel degrades, leaves behind a stable, functional endothelial lining. By integrating microrobotic navigation with regenerative vascular biology, BioBots may overcome the central limitations of current devices and enable safer, more durable treatment for complex aneurysms. 2025-09-22T14:48:49Z 14 pages, 2 figures. Review/Technical Report. Currently under journal peer review Duong Le Department of Biomedical Engineering, University of Massachusetts Amherst, Amherst, MA http://arxiv.org/abs/2509.17924v1 Medical priority fusion: achieving dual optimization of sensitivity and interpretability in nipt anomaly detection 2025-09-22T15:49:20Z Clinical machine learning faces a critical dilemma in high-stakes medical applications: algorithms achieving optimal diagnostic performance typically sacrifice the interpretability essential for physician decision-making, while interpretable methods compromise sensitivity in complex scenarios. This paradox becomes particularly acute in non-invasive prenatal testing (NIPT), where missed chromosomal abnormalities carry profound clinical consequences yet regulatory frameworks mandate explainable AI systems. We introduce Medical Priority Fusion (MPF), a constrained multi-objective optimization framework that resolves this fundamental trade-off by systematically integrating Naive Bayes probabilistic reasoning with Decision Tree rule-based logic through mathematically-principled weighted fusion under explicit medical constraints. Rigorous validation on 1,687 real-world NIPT samples characterized by extreme class imbalance (43.4:1 normal-to-abnormal ratio) employed stratified 5-fold cross-validation with comprehensive ablation studies and statistical hypothesis testing using McNemar's paired comparisons. MPF achieved simultaneous optimization of dual objectives: 89.3% sensitivity (95% CI: 83.9-94.7%) with 80% interpretability score, significantly outperforming individual algorithms (McNemar's test, p < 0.001). The optimal fusion configuration achieved Grade A clinical deployment criteria with large effect size (d = 1.24), establishing the first clinically-deployable solution that maintains both diagnostic accuracy and decision transparency essential for prenatal care. This work demonstrates that medical-constrained algorithm fusion can resolve the interpretability-performance trade-off, providing a mathematical framework for developing high-stakes medical decision support systems that meet both clinical efficacy and explainability requirements. 2025-09-22T15:49:20Z 24 pages, 47 figures, publish to BIBM Xiuqi Ge Zhibo Yao Yaosong Du http://arxiv.org/abs/2509.15854v1 Changes in Liver Fibrosis in Patients with Chronic Hepatitis B Treated with Pegylated Interferon Combined with Oral Antiviral Agents: A 48-Week Observation from a Prospective Cohort Study 2025-09-19T10:44:30Z Background and Aims: Pegylated interferon (PEG-IFN) combined with oral antiviral agents is currently the most widely used and highly effective treatment regimen for chronic hepatitis B virus (HBV) infection. While effectively suppressing HBV replication, its impact on liver histopathological fibrosis and inflammation remains a critical concern for clinicians and patients. Methods : A total of 625 patients who completed 48 weeks of PEG-IFN combined with oral antiviral therapy were enrolled in this real-world study. Based on their virological response at 48 weeks, patients were categorized into Clearance group and Non-clearance group. Changes in liver biochemistry, fibrosis, and renal function were compared between groups and before/after treatment. Results: No significant differences were observed in baseline blood tests, liver biochemical markers, or histopathological features between the Clearance group and Non-clearance group. Similarly, baseline renal function showed no significant variation. Further analysis revealed that the Clearance group exhibited significant aggravation of liver fibrosis after 48 weeks of treatment, which correlated strongly with alterations in liver enzyme levels. However, one patient who underwent paired liver biopsies before and after treatment demonstrated marked histopathological improvement in fibrosis. This finding underscores the irreplaceable role of liver histopathology in assessing fibrosis and inflammation. Conclusion: PEG-IFN combined with oral antiviral therapy exerts favorable effects on liver fibrosis and inflammation in chronic HBV patients. Non-invasive fibrosis assessment models can monitor fibrotic progression but are susceptible to confounding by hepatic inflammation. 2025-09-19T10:44:30Z Zhao Jinhua http://arxiv.org/abs/2509.16255v1 RootletSeg: Deep learning method for spinal rootlets segmentation across MRI contrasts 2025-09-17T19:44:50Z Purpose: To develop a deep learning method for the automatic segmentation of spinal nerve rootlets on various MRI scans. Material and Methods: This retrospective study included MRI scans from two open-access and one private dataset, consisting of 3D isotropic 3T TSE T2-weighted (T2w) and 7T MP2RAGE (T1-weighted [T1w] INV1 and INV2, and UNIT1) MRI scans. A deep learning model, RootletSeg, was developed to segment C2-T1 dorsal and ventral spinal rootlets. Training was performed on 76 scans and testing on 17 scans. The Dice score was used to compare the model performance with an existing open-source method. Spinal levels derived from RootletSeg segmentations were compared with vertebral levels defined by intervertebral discs using Bland-Altman analysis. Results: The RootletSeg model developed on 93 MRI scans from 50 healthy adults (mean age, 28.70 years $\pm$ 6.53 [SD]; 28 [56%] males, 22 [44%] females) achieved a mean $\pm$ SD Dice score of 0.67 $\pm$ 0.09 for T1w-INV2, 0.65 $\pm$ 0.11 for UNIT1, 0.64 $\pm$ 0.08 for T2w, and 0.62 $\pm$ 0.10 for T1w-INV1 contrasts. Spinal-vertebral level correspondence showed a progressively increasing rostrocaudal shift, with Bland-Altman bias ranging from 0.00 to 8.15 mm (median difference between level midpoints). Conclusion: RootletSeg accurately segmented C2-T1 spinal rootlets across MRI contrasts, enabling the determination of spinal levels directly from MRI scans. The method is open-source and can be used for a variety of downstream analyses, including lesion classification, neuromodulation therapy, and functional MRI group analysis. 2025-09-17T19:44:50Z 26 pages, 6 figures, 4 tables Katerina Krejci Jiri Chmelik Sandrine Bédard Falk Eippert Ulrike Horn Virginie Callot Julien Cohen-Adad Jan Valosek http://arxiv.org/abs/2509.16254v1 Imaging Modalities-Based Classification for Lung Cancer Detection 2025-09-17T19:18:05Z Lung cancer continues to be the predominant cause of cancer-related mortality globally. This review analyzes various approaches, including advanced image processing methods, focusing on their efficacy in interpreting CT scans, chest radiographs, and biological markers. Notably, we identify critical gaps in the previous surveys, including the need for robust models that can generalize across diverse populations and imaging modalities. This comprehensive synthesis aims to serve as a foundational resource for researchers and clinicians, guiding future efforts toward more accurate and efficient lung cancer detection. Key findings reveal that 3D CNN architectures integrated with CT scans achieve the most superior performances, yet challenges such as high false positives, dataset variability, and computational complexity persist across modalities. 2025-09-17T19:18:05Z Accepted at ICMI 2025 Sajim Ahmed Muhammad Zain Chaudhary Muhammad Zohaib Chaudhary Mahmoud Abbass Ahmed Sherif Mohammad Mahbubur Rahman Khan Mamun http://arxiv.org/abs/2509.16251v1 R-Net: A Reliable and Resource-Efficient CNN for Colorectal Cancer Detection with XAI Integration 2025-09-17T18:29:44Z State-of-the-art (SOTA) Convolutional Neural Networks (CNNs) are criticized for their extensive computational power, long training times, and large datasets. To overcome this limitation, we propose a reasonable network (R-Net), a lightweight CNN only to detect and classify colorectal cancer (CRC) using the Enteroscope Biopsy Histopathological Hematoxylin and Eosin Image Dataset (EBHI). Furthermore, six SOTA CNNs, including Multipath-based CNNs (DenseNet121, ResNet50), Depth-based CNNs (InceptionV3), width-based multi-connection CNNs (Xception), depth-wise separable convolutions (MobileNetV2), spatial exploitation-based CNNs (VGG16), Transfer learning, and two ensemble models are also tested on the same dataset. The ensemble models are a multipath-depth-width combination (DenseNet121-InceptionV3-Xception) and a multipath-depth-spatial combination (ResNet18-InceptionV3-VGG16). However, the proposed R-Net lightweight achieved 99.37% accuracy, outperforming MobileNet (95.83%) and ResNet50 (96.94%). Most importantly, to understand the decision-making of R-Net, Explainable AI such as SHAP, LIME, and Grad-CAM are integrated to visualize which parts of the EBHI image contribute to the detection and classification process of R-Net. The main novelty of this research lies in building a reliable, lightweight CNN R-Net that requires fewer computing resources yet maintains strong prediction results. SOTA CNNs, transfer learning, and ensemble models also extend our knowledge on CRC classification and detection. XAI functionality and the impact of pixel intensity on correct and incorrect classification images are also some novelties in CRC detection and classification. 2025-09-17T18:29:44Z Rokonozzaman Ayon Md Taimur Ahad Bo Song Yan Li http://arxiv.org/abs/2509.16250v1 A study on Deep Convolutional Neural Networks, transfer learning, and Mnet model for Cervical Cancer Detection 2025-09-17T18:11:09Z Early and accurate detection through Pap smear analysis is critical to improving patient outcomes and reducing mortality of Cervical cancer. State-of-the-art (SOTA) Convolutional Neural Networks (CNNs) require substantial computational resources, extended training time, and large datasets. In this study, a lightweight CNN model, S-Net (Simple Net), is developed specifically for cervical cancer detection and classification using Pap smear images to address these limitations. Alongside S-Net, six SOTA CNNs were evaluated using transfer learning, including multi-path (DenseNet201, ResNet152), depth-based (Serasnet152), width-based multi-connection (Xception), depth-wise separable convolutions (MobileNetV2), and spatial exploitation-based (VGG19). All models, including S-Net, achieved comparable accuracy, with S-Net reaching 99.99%. However, S-Net significantly outperforms the SOTA CNNs in terms of computational efficiency and inference time, making it a more practical choice for real-time and resource-constrained applications. A major limitation in CNN-based medical diagnosis remains the lack of transparency in the decision-making process. To address this, Explainable AI (XAI) techniques, such as SHAP, LIME, and Grad-CAM, were employed to visualize and interpret the key image regions influencing model predictions. The novelty of this study lies in the development of a highly accurate yet computationally lightweight model (S-Net) caPable of rapid inference while maintaining interpretability through XAI integration. Furthermore, this work analyzes the behavior of SOTA CNNs, investigates the effects of negative transfer learning on Pap smear images, and examines pixel intensity patterns in correctly and incorrectly classified samples. 2025-09-17T18:11:09Z Saifuddin Sagor Md Taimur Ahad Faruk Ahmed Rokonozzaman Ayon Sanzida Parvin http://arxiv.org/abs/2509.10369v1 Data distribution impacts the performance and generalisability of contrastive learning-based foundation models of electrocardiograms 2025-09-12T16:01:18Z Contrastive learning is a widely adopted self-supervised pretraining strategy, yet its dependence on cohort composition remains underexplored. We present Contrasting by Patient Augmented Electrocardiograms (CAPE) foundation model and pretrain on four cohorts (n = 5,203,352), from diverse populations across three continents (North America, South America, Asia). We systematically assess how cohort demographics, health status, and population diversity influence the downstream performance for prediction tasks also including two additional cohorts from another continent (Europe). We find that downstream performance depends on the distributional properties of the pretraining cohort, including demographics and health status. Moreover, while pretraining with a multi-centre, demographically diverse cohort improves in-distribution accuracy, it reduces out-of-distribution (OOD) generalisation of our contrastive approach by encoding cohort-specific artifacts. To address this, we propose the In-Distribution Batch (IDB) strategy, which preserves intra-cohort consistency during pretraining and enhances OOD robustness. This work provides important insights for developing clinically fair and generalisable foundation models. 2025-09-12T16:01:18Z Currently under review at npj Digital Medicine Gul Rukh Khattak Konstantinos Patlatzoglou Joseph Barker Libor Pastika Boroumand Zeidaabadi Ahmed El-Medany Hesham Aggour Yixiu Liang Antonio H. Ribeiro Jeffrey Annis Antonio Luiz Pinho Ribeiro Junbo Ge Daniel B. Kramer Jonathan W. Waks Evan Brittain Nicholas Peters Fu Siong Ng Arunashis Sau