http://arxiv.org/abs/2506.06810v2
The influence of cell phenotype on collective cell invasion into the extracellular matrix
2025-10-01T16:36:02Z
Understanding the interactions between cells and the extracellular matrix (ECM) during collective cell invasion is crucial for advancements in tissue engineering, cancer therapies, and regenerative medicine. This study focuses on the roles of contact guidance and ECM remodelling in directing cell behaviour, with a particular emphasis on exploring how differences in cell phenotype impact collective cell invasion. We present a computationally tractable two-dimensional hybrid model of collective cell migration within the ECM, where cells are modelled as individual entities and collagen fibres as a continuous tensorial field. Our model incorporates random motility, contact guidance, cell-cell adhesion, volume filling, and the dynamic remodelling of collagen fibres through cellular secretion and degradation. Through a comprehensive parameter sweep, we provide valuable insights into how differences in the cell phenotype, in terms of the ability of the cell to migrate, secrete, degrade, and respond to contact guidance cues from the ECM, impact the characteristics of collective cell invasion.
2025-06-07T14:18:47Z
36 pages, 6 figures
Yuan Yin, Sarah L. Waters, Ruth E. Baker

http://arxiv.org/abs/2510.06232v1
Neu-RadBERT for Enhanced Diagnosis of Brain Injuries and Conditions
2025-10-01T11:54:33Z
Objective: We sought to develop a classification algorithm to extract diagnoses from free-text radiology reports of brain imaging performed in patients with acute respiratory failure (ARF) undergoing invasive mechanical ventilation. Methods: We developed and fine-tuned Neu-RadBERT, a BERT-based model, to classify unstructured radiology reports.
We extracted from the MIMIC-IV database all brain imaging reports (computed tomography and magnetic resonance imaging) performed in patients with ARF. Initial manual labelling was performed on a subset of reports for various brain abnormalities, followed by fine-tuning Neu-RadBERT using three strategies: 1) baseline RadBERT, 2) Neu-RadBERT with Masked Language Modeling (MLM) pretraining, and 3) Neu-RadBERT with MLM pretraining and oversampling to address data skewness. We compared the performance of this model to Llama-2-13B, an autoregressive LLM. Results: The Neu-RadBERT model, particularly with oversampling, demonstrated significant improvements in diagnostic accuracy compared to baseline RadBERT for brain abnormalities, achieving up to 98.0% accuracy for acute brain injuries. Llama-2-13B exhibited relatively lower performance, peaking at 67.5% binary classification accuracy. This result highlights potential limitations of current autoregressive LLMs for this specific classification task, though it remains possible that larger models or further fine-tuning could improve performance. Conclusion: Neu-RadBERT, enhanced through target domain pretraining and oversampling techniques, offered a robust tool for accurate and reliable diagnosis of neurological conditions from radiology reports. This study underscores the potential of transformer-based NLP models in automatically extracting diagnoses from free-text reports with potential applications to both research and patient care.
2025-10-01T11:54:33Z
Both Manpreet Singh and Sean Macrae contributed equally and should be considered co-first authors.
Corresponding author: Yiorgos Alexandros Cavayas
Manpreet Singh (Équipe de Recherche en Soins Intensifs, Centre de recherche du Centre intégré universitaire de santé et de services sociaux du Nord-de-l'Île-de-Montréal)
Sean Macrae (Faculté de Médecine, Université de Montréal)
Pierre-Marc Williams (Faculté de Médecine, Université de Montréal)
Nicole Hung (Faculté de Médecine, Université de Montréal)
Sabrina Araujo de Franca (Équipe de Recherche en Soins Intensifs, Centre de recherche du Centre intégré universitaire de santé et de services sociaux du Nord-de-l'Île-de-Montréal)
Laurent Letourneau-Guillon (Faculté de Médecine, Université de Montréal; Department of Radiology, Centre Hospitalier de l'Université de Montréal)
François-Martin Carrier (Faculté de Médecine, Université de Montréal; Department of Anesthesia, Centre Hospitalier de l'Université de Montréal)
Bang Liu (Applied Research in Computer Linguistics Laboratory, Department of Computer Science and Operations Research, Université de Montréal)
Yiorgos Alexandros Cavayas (Équipe de Recherche en Soins Intensifs, Centre de recherche du Centre intégré universitaire de santé et de services sociaux du Nord-de-l'Île-de-Montréal; Faculté de Médecine, Université de Montréal; Division of Critical Care Medicine, Department of Medicine, Hôpital du Sacré-Cœur de Montréal)

http://arxiv.org/abs/2510.06230v1
Robust Federated Anomaly Detection Using Dual-Signal Autoencoders: Application to Kidney Stone Identification in Ureteroscopy
2025-10-01T00:17:13Z
This work introduces Federated Adaptive Gain via Dual Signal Trust (FedAgain), a novel federated learning algorithm designed to enhance anomaly detection in medical imaging under decentralized and heterogeneous conditions. Focusing on the task of kidney stone classification, FedAgain addresses the common challenge of corrupted or low-quality client data in real-world clinical environments by implementing a dual-signal trust mechanism based on reconstruction error and model divergence.
This mechanism enables the central server to dynamically down-weight updates from untrustworthy clients without accessing their raw data, thereby preserving both model integrity and data privacy. FedAgain employs deep convolutional autoencoders trained on two diverse kidney stone datasets and is evaluated on 16 types of endoscopy-specific corruption at five severity levels. Extensive experiments demonstrate that FedAgain effectively suppresses "expert forger" clients, enhances robustness to image corruptions, and offers a privacy-preserving solution for collaborative medical anomaly detection. Compared to traditional FedAvg, FedAgain achieves clear improvements across all 16 types of corruption, with precision gains of up to +14.49% and F1 score improvements of up to +10.20%, highlighting its robustness and effectiveness in challenging imaging scenarios.
2025-10-01T00:17:13Z
Ivan Reyes-Amezcua, Francisco Lopez-Tiro, Clément Larose, Christian Daul, Andres Mendez-Vazquez, Gilberto Ochoa-Ruiz

http://arxiv.org/abs/2509.26259v1
Millennium Pathways for Tractography: 40 grand challenges to shape the future of tractography
2025-09-30T13:46:21Z
In the spirit of the historic Millennium Prize Problems that heralded a new era for mathematics, the newly formed International Society for Tractography (IST) has launched the Millennium Pathways for Tractography, a community-driven roadmap designed to shape the future of the field. Conceived during the inaugural Tract-Anat Retreat, this initiative reflects a collective vision for advancing tractography over the coming decade and beyond. The roadmap consists of 40 grand challenges, developed by international experts and organized into seven categories spanning three overarching themes: neuroanatomy, tractography methods, and clinical applications.
By defining shared short-, medium-, and long-term goals, these pathways provide a structured framework to confront fundamental limitations, promote rigorous validation, and accelerate the translation of tractography into a robust tool for neuroscience and medicine. Ultimately, the Millennium Pathways aim to guide and inspire future research and collaboration, ensuring the continued scientific and clinical relevance of tractography well into the future.
2025-09-30T13:46:21Z
Millennium Pathways (30 Sept, 2025)
Maxime Descoteaux, Kurt G. Schilling, Dogu Baran Aydogan, Christian Beaulieu, Elena Borra, Maxime Chamberland, Alessandro Daducci, Alberto De Luca, Flavio Dell'Acqua, Jessica Dubois, Tim B. Dyrby, Shawna Farquharson, Stephanie Forkel, Martijn Froeling, Alessandra Griffa, Mareike Grotheer, Pamela Guevara, Suzanne N. Haber, Vinod Kumar Jangir, Alexander Leemans, Joel Lefebvre, Ching-Po Lin, Graham Little, Chun-Yi Zac Lo, Chiara Maffei, Helen S. Mayberg, Jennifer A. McNab, Pratik Mukherjee, Lauren J. O'Donnell, Martin Parent, Carlo Pierpaoli, Francois Rheault, Kathleen S. Rockland, Alard Roebroeck, Ariel Rokem, R. Jarrett Rushmore, Silvio Sarubbo, Simona Schiavi, Stamatios N Sotiropoulos, Diego Szczupak, Michel Thiebaut de Schotten, J-Donald Tournier, Francesco Vergani, Joseph Yuan-Mou Yang, Fan Zhang, Derek Jones, Laurent Petit

http://arxiv.org/abs/2509.25069v1
Effectiveness and Safety of Selective IL-23 Receptor Antagonists in Moderate to Severe Ulcerative Colitis: A Systematic Review, Meta-Analysis and Trial Sequential Analysis
2025-09-29T17:13:45Z
Selective interleukin-23 receptor antagonists (IL-23RA) show promise for treating moderate to severe ulcerative colitis (UC), but their efficacy and safety are not fully understood. We performed a systematic review and meta-analysis of randomized controlled trials comparing IL-23RA with placebo in moderate to severe UC. Outcomes included clinical and endoscopic remission, response rates, and adverse events. Nine trials including 3808 patients in the induction phase and 1734 in the maintenance phase were analyzed.
IL-23RA improved clinical remission (induction risk ratio 2.63, 95 percent confidence interval 2.05-3.36; maintenance 1.99, 95 percent confidence interval 1.63-2.44) and endoscopic remission (induction 2.36, 95 percent confidence interval 1.70-2.20; maintenance 1.96, 95 percent confidence interval 1.63-2.37). IL-23RA reduced serious adverse events in the induction phase (0.40, 95 percent confidence interval 0.27-0.69) with no difference during maintenance (0.75, 95 percent confidence interval 0.31-1.84). No significant differences were observed in overall adverse events or specific events such as headache or nasopharyngitis. Trial sequential analysis confirmed sufficient sample size for clinical endpoints. IL-23RA showed superior effectiveness and similar safety compared with placebo in moderate to severe UC.
2025-09-29T17:13:45Z
Short title: IL-23 antagonists in Ulcerative Colitis. Word count: 3,819 words. Figures: 3. Tables: 2
Wellgner Fernandes Oliveira Amador, Isabelle Castro Vitor, Milena Ramos Tome, Diogo Delgado Dotta, Rodrigo V Motta

http://arxiv.org/abs/2509.21206v2
Data-driven Neural Networks for Windkessel Parameter Calibration
2025-09-26T16:54:58Z
In this work, we propose a novel method for calibrating Windkessel (WK) parameters in a dimensionally reduced 1D-0D coupled blood flow model. To this end, we design a data-driven neural network (NN) trained on simulated blood pressures in the left brachial artery. Once trained, the NN emulates the pressure pulse waves across the entire simulated domain, i.e., over time, space and varying WK parameters, with negligible error and computational effort. To calibrate the WK parameters on a measured pulse wave, the NN is extended by dummy neurons and retrained only on these.
The main objective of this work is to assess the effectiveness of the method in various scenarios, particularly when the exact measurement location is unknown or the data are affected by noise.
2025-09-25T14:14:53Z
32 pages, 15 figures, for associated git see https://github.com/bhoock/WKcalNN, submitted to International Journal for Numerical Methods in Biomedical Engineering
Benedikt Hoock, Tobias Köppl

http://arxiv.org/abs/2509.16328v2
The Role of High-Performance GPU Resources in Large Language Model Based Radiology Imaging Diagnosis
2025-09-24T19:08:24Z
Large language models (LLMs) are rapidly being applied to radiology, enabling automated image interpretation and report generation tasks. Their deployment in clinical practice requires both high diagnostic accuracy and low inference latency, which in turn demands powerful hardware. High-performance graphical processing units (GPUs) provide the necessary compute and memory throughput to run large LLMs on imaging data. We review modern GPU architectures (e.g. NVIDIA A100/H100, AMD Instinct MI250X/MI300) and key performance metrics: floating-point throughput, memory bandwidth, and VRAM capacity. We show how these hardware capabilities affect radiology tasks: for example, generating reports or detecting findings on CheXpert and MIMIC-CXR images is computationally intensive and benefits from GPU parallelism and tensor-core acceleration. Empirical studies indicate that using appropriate GPU resources can reduce inference time and improve throughput. We discuss practical challenges, including privacy, deployment, cost, and power, as well as optimization strategies: mixed-precision, quantization, compression, and multi-GPU scaling. Finally, we anticipate that next-generation features (8-bit tensor cores, enhanced interconnect) will further enable on-premise and federated radiology AI.
Advancing GPU infrastructure is essential for safe, efficient LLM-based radiology diagnostics.
2025-09-19T18:13:12Z
Jyun-Ping Kao

http://arxiv.org/abs/2509.17854v2
Magnetically Guided Endothelial BioBots: A Next-Generation Strategy for Treating Complex Cerebral Aneurysms
2025-09-23T23:07:10Z
Cerebral aneurysms affect three to five percent of the population, and rupture remains a major cause of stroke-related death and disability. Current therapies (surgical clipping, endovascular coiling, and flow diversion) have improved outcomes, but each carries limitations. Clipping is invasive and often unsuitable for deep or posterior lesions. Coiling is prone to recurrence from compaction or incomplete occlusion, particularly in wide-neck or fusiform aneurysms. Flow diverters offer improved durability but rely on rigid metallic scaffolds that may malappose in tortuous vessels, compromise branch arteries, delay endothelialization, and necessitate long-term dual antiplatelet therapy. These shortcomings highlight a gap in current management: devices primarily provide mechanical occlusion but fail to conform to complex geometries or reliably promote rapid, complete endothelialization. As a result, aneurysm necks may remain exposed to persistent flow, delayed healing, and thrombosis.
To address this, we propose magnetically guided endothelial BioBots as a next-generation therapeutic strategy. BioBots are biodegradable hydrogel carriers embedded with magnetic nanoparticles and coated with primed endothelial progenitor cells. Delivered through microcatheters and guided by external electromagnetic fields, they can assemble across aneurysm defects. Once localized, they form a conformal, geometry-adaptive endothelial patch that provides immediate antithrombotic protection and, as the hydrogel degrades, leaves behind a stable, functional endothelial lining. By integrating microrobotic navigation with regenerative vascular biology, BioBots may overcome the central limitations of current devices and enable safer, more durable treatment for complex aneurysms.
2025-09-22T14:48:49Z
14 pages, 2 figures. Review/Technical Report. Currently under journal peer review
Duong Le (Department of Biomedical Engineering, University of Massachusetts Amherst, Amherst, MA)

http://arxiv.org/abs/2509.17924v1
Medical priority fusion: achieving dual optimization of sensitivity and interpretability in nipt anomaly detection
2025-09-22T15:49:20Z
Clinical machine learning faces a critical dilemma in high-stakes medical applications: algorithms achieving optimal diagnostic performance typically sacrifice the interpretability essential for physician decision-making, while interpretable methods compromise sensitivity in complex scenarios. This paradox becomes particularly acute in non-invasive prenatal testing (NIPT), where missed chromosomal abnormalities carry profound clinical consequences yet regulatory frameworks mandate explainable AI systems. We introduce Medical Priority Fusion (MPF), a constrained multi-objective optimization framework that resolves this fundamental trade-off by systematically integrating Naive Bayes probabilistic reasoning with Decision Tree rule-based logic through mathematically-principled weighted fusion under explicit medical constraints.
Rigorous validation on 1,687 real-world NIPT samples characterized by extreme class imbalance (43.4:1 normal-to-abnormal ratio) employed stratified 5-fold cross-validation with comprehensive ablation studies and statistical hypothesis testing using McNemar's paired comparisons. MPF achieved simultaneous optimization of dual objectives: 89.3% sensitivity (95% CI: 83.9-94.7%) with 80% interpretability score, significantly outperforming individual algorithms (McNemar's test, p < 0.001). The optimal fusion configuration achieved Grade A clinical deployment criteria with large effect size (d = 1.24), establishing the first clinically-deployable solution that maintains both diagnostic accuracy and decision transparency essential for prenatal care. This work demonstrates that medical-constrained algorithm fusion can resolve the interpretability-performance trade-off, providing a mathematical framework for developing high-stakes medical decision support systems that meet both clinical efficacy and explainability requirements.
2025-09-22T15:49:20Z
24 pages, 47 figures, publish to BIBM
Xiuqi Ge, Zhibo Yao, Yaosong Du

http://arxiv.org/abs/2509.15854v1
Changes in Liver Fibrosis in Patients with Chronic Hepatitis B Treated with Pegylated Interferon Combined with Oral Antiviral Agents: A 48-Week Observation from a Prospective Cohort Study
2025-09-19T10:44:30Z
Background and Aims: Pegylated interferon (PEG-IFN) combined with oral antiviral agents is currently the most widely used and highly effective treatment regimen for chronic hepatitis B virus (HBV) infection. While effectively suppressing HBV replication, its impact on liver histopathological fibrosis and inflammation remains a critical concern for clinicians and patients. Methods: A total of 625 patients who completed 48 weeks of PEG-IFN combined with oral antiviral therapy were enrolled in this real-world study. Based on their virological response at 48 weeks, patients were categorized into a Clearance group and a Non-clearance group.
Changes in liver biochemistry, fibrosis, and renal function were compared between groups and before/after treatment. Results: No significant differences were observed in baseline blood tests, liver biochemical markers, or histopathological features between the Clearance group and Non-clearance group. Similarly, baseline renal function showed no significant variation. Further analysis revealed that the Clearance group exhibited significant aggravation of liver fibrosis after 48 weeks of treatment, which correlated strongly with alterations in liver enzyme levels. However, one patient who underwent paired liver biopsies before and after treatment demonstrated marked histopathological improvement in fibrosis. This finding underscores the irreplaceable role of liver histopathology in assessing fibrosis and inflammation. Conclusion: PEG-IFN combined with oral antiviral therapy exerts favorable effects on liver fibrosis and inflammation in chronic HBV patients. Non-invasive fibrosis assessment models can monitor fibrotic progression but are susceptible to confounding by hepatic inflammation.
2025-09-19T10:44:30Z
Zhao Jinhua

http://arxiv.org/abs/2509.16255v1
RootletSeg: Deep learning method for spinal rootlets segmentation across MRI contrasts
2025-09-17T19:44:50Z
Purpose: To develop a deep learning method for the automatic segmentation of spinal nerve rootlets on various MRI scans. Material and Methods: This retrospective study included MRI scans from two open-access and one private dataset, consisting of 3D isotropic 3T TSE T2-weighted (T2w) and 7T MP2RAGE (T1-weighted [T1w] INV1 and INV2, and UNIT1) MRI scans. A deep learning model, RootletSeg, was developed to segment C2-T1 dorsal and ventral spinal rootlets. Training was performed on 76 scans and testing on 17 scans. The Dice score was used to compare the model performance with an existing open-source method.
Spinal levels derived from RootletSeg segmentations were compared with vertebral levels defined by intervertebral discs using Bland-Altman analysis. Results: The RootletSeg model developed on 93 MRI scans from 50 healthy adults (mean age, 28.70 years ± 6.53 [SD]; 28 [56%] males, 22 [44%] females) achieved a mean ± SD Dice score of 0.67 ± 0.09 for T1w-INV2, 0.65 ± 0.11 for UNIT1, 0.64 ± 0.08 for T2w, and 0.62 ± 0.10 for T1w-INV1 contrasts. Spinal-vertebral level correspondence showed a progressively increasing rostrocaudal shift, with Bland-Altman bias ranging from 0.00 to 8.15 mm (median difference between level midpoints). Conclusion: RootletSeg accurately segmented C2-T1 spinal rootlets across MRI contrasts, enabling the determination of spinal levels directly from MRI scans. The method is open-source and can be used for a variety of downstream analyses, including lesion classification, neuromodulation therapy, and functional MRI group analysis.
2025-09-17T19:44:50Z
26 pages, 6 figures, 4 tables
Katerina Krejci, Jiri Chmelik, Sandrine Bédard, Falk Eippert, Ulrike Horn, Virginie Callot, Julien Cohen-Adad, Jan Valosek

http://arxiv.org/abs/2509.16254v1
Imaging Modalities-Based Classification for Lung Cancer Detection
2025-09-17T19:18:05Z
Lung cancer continues to be the predominant cause of cancer-related mortality globally. This review analyzes various approaches, including advanced image processing methods, focusing on their efficacy in interpreting CT scans, chest radiographs, and biological markers. Notably, we identify critical gaps in the previous surveys, including the need for robust models that can generalize across diverse populations and imaging modalities. This comprehensive synthesis aims to serve as a foundational resource for researchers and clinicians, guiding future efforts toward more accurate and efficient lung cancer detection.
Key findings reveal that 3D CNN architectures integrated with CT scans achieve the best performance, yet challenges such as high false positives, dataset variability, and computational complexity persist across modalities.
2025-09-17T19:18:05Z
Accepted at ICMI 2025
Sajim Ahmed, Muhammad Zain Chaudhary, Muhammad Zohaib Chaudhary, Mahmoud Abbass, Ahmed Sherif, Mohammad Mahbubur Rahman Khan Mamun

http://arxiv.org/abs/2509.16251v1
R-Net: A Reliable and Resource-Efficient CNN for Colorectal Cancer Detection with XAI Integration
2025-09-17T18:29:44Z
State-of-the-art (SOTA) Convolutional Neural Networks (CNNs) are criticized for requiring extensive computational power, long training times, and large datasets. To overcome these limitations, we propose a reasonable network (R-Net), a lightweight CNN dedicated to detecting and classifying colorectal cancer (CRC) using the Enteroscope Biopsy Histopathological Hematoxylin and Eosin Image Dataset (EBHI). Furthermore, six SOTA CNNs, including Multipath-based CNNs (DenseNet121, ResNet50), Depth-based CNNs (InceptionV3), width-based multi-connection CNNs (Xception), depth-wise separable convolutions (MobileNetV2), spatial exploitation-based CNNs (VGG16), Transfer learning, and two ensemble models are also tested on the same dataset. The ensemble models are a multipath-depth-width combination (DenseNet121-InceptionV3-Xception) and a multipath-depth-spatial combination (ResNet18-InceptionV3-VGG16). The proposed lightweight R-Net achieved 99.37% accuracy, outperforming MobileNet (95.83%) and ResNet50 (96.94%). Most importantly, to understand the decision-making of R-Net, Explainable AI (XAI) techniques such as SHAP, LIME, and Grad-CAM are integrated to visualize which parts of the EBHI image contribute to the detection and classification process of R-Net. The main novelty of this research lies in building a reliable, lightweight CNN R-Net that requires fewer computing resources yet maintains strong prediction results.
The experiments with SOTA CNNs, transfer learning, and ensemble models also extend our knowledge of CRC classification and detection. The XAI functionality and the analysis of how pixel intensity affects correctly and incorrectly classified images are further novelties in CRC detection and classification.
2025-09-17T18:29:44Z
Rokonozzaman Ayon, Md Taimur Ahad, Bo Song, Yan Li

http://arxiv.org/abs/2509.16250v1
A study on Deep Convolutional Neural Networks, transfer learning, and Mnet model for Cervical Cancer Detection
2025-09-17T18:11:09Z
Early and accurate detection through Pap smear analysis is critical to improving patient outcomes and reducing mortality from cervical cancer. State-of-the-art (SOTA) Convolutional Neural Networks (CNNs) require substantial computational resources, extended training time, and large datasets. In this study, a lightweight CNN model, S-Net (Simple Net), is developed specifically for cervical cancer detection and classification using Pap smear images to address these limitations. Alongside S-Net, six SOTA CNNs were evaluated using transfer learning, including multi-path (DenseNet201, ResNet152), depth-based (Serasnet152), width-based multi-connection (Xception), depth-wise separable convolutions (MobileNetV2), and spatial exploitation-based (VGG19). All models, including S-Net, achieved comparable accuracy, with S-Net reaching 99.99%. However, S-Net significantly outperforms the SOTA CNNs in terms of computational efficiency and inference time, making it a more practical choice for real-time and resource-constrained applications. A major limitation in CNN-based medical diagnosis remains the lack of transparency in the decision-making process. To address this, Explainable AI (XAI) techniques, such as SHAP, LIME, and Grad-CAM, were employed to visualize and interpret the key image regions influencing model predictions.
The novelty of this study lies in the development of a highly accurate yet computationally lightweight model (S-Net) capable of rapid inference while maintaining interpretability through XAI integration. Furthermore, this work analyzes the behavior of SOTA CNNs, investigates the effects of negative transfer learning on Pap smear images, and examines pixel intensity patterns in correctly and incorrectly classified samples.
2025-09-17T18:11:09Z
Saifuddin Sagor, Md Taimur Ahad, Faruk Ahmed, Rokonozzaman Ayon, Sanzida Parvin

http://arxiv.org/abs/2509.10369v1
Data distribution impacts the performance and generalisability of contrastive learning-based foundation models of electrocardiograms
2025-09-12T16:01:18Z
Contrastive learning is a widely adopted self-supervised pretraining strategy, yet its dependence on cohort composition remains underexplored. We present the Contrasting by Patient Augmented Electrocardiograms (CAPE) foundation model and pretrain it on four cohorts (n = 5,203,352) from diverse populations across three continents (North America, South America, Asia). We systematically assess how cohort demographics, health status, and population diversity influence downstream performance on prediction tasks, which also include two additional cohorts from another continent (Europe). We find that downstream performance depends on the distributional properties of the pretraining cohort, including demographics and health status. Moreover, while pretraining with a multi-centre, demographically diverse cohort improves in-distribution accuracy, it reduces out-of-distribution (OOD) generalisation of our contrastive approach by encoding cohort-specific artifacts. To address this, we propose the In-Distribution Batch (IDB) strategy, which preserves intra-cohort consistency during pretraining and enhances OOD robustness.
This work provides important insights for developing clinically fair and generalisable foundation models.
2025-09-12T16:01:18Z
Currently under review at npj Digital Medicine
Gul Rukh Khattak, Konstantinos Patlatzoglou, Joseph Barker, Libor Pastika, Boroumand Zeidaabadi, Ahmed El-Medany, Hesham Aggour, Yixiu Liang, Antonio H. Ribeiro, Jeffrey Annis, Antonio Luiz Pinho Ribeiro, Junbo Ge, Daniel B. Kramer, Jonathan W. Waks, Evan Brittain, Nicholas Peters, Fu Siong Ng, Arunashis Sau