http://arxiv.org/abs/2511.06163v2Cross-Modal Fine-Tuning of 3D Convolutional Foundation Models for ADHD Classification with Low-Rank Adaptation2026-01-15T05:18:46ZEarly diagnosis of attention-deficit/hyperactivity disorder (ADHD) in children plays a crucial role in improving outcomes in education and mental health. Diagnosing ADHD using neuroimaging data, however, remains challenging due to heterogeneous presentations and overlapping symptoms with other conditions. To address this, we propose a novel parameter-efficient transfer learning approach that adapts a large-scale 3D convolutional foundation model, pre-trained on CT images, to an MRI-based ADHD classification task. Our method introduces Low-Rank Adaptation (LoRA) in 3D by factorizing 3D convolutional kernels into 2D low-rank updates, dramatically reducing trainable parameters while achieving superior performance. In a five-fold cross-validated evaluation on a public diffusion MRI database, our 3D LoRA fine-tuning strategy achieved state-of-the-art results, with one model variant reaching 71.9% accuracy and another attaining an AUC of 0.716. Both variants use only 1.64 million trainable parameters (over 113x fewer than a fully fine-tuned foundation model). Our results represent one of the first successful cross-modal (CT-to-MRI) adaptations of a foundation model in neuroimaging, establishing a new benchmark for ADHD classification while greatly improving efficiency.2025-11-08T23:29:28ZAccepted for presentation at the IEEE International Symposium on Biomedical Imaging (ISBI) 2026Jyun-Ping KaoShinyeong RhoShahar LazarevHyun-Hae ChoFangxu XingTaehoon ShinC. -C. 
Jay KuoJonghye Woohttp://arxiv.org/abs/2601.09439v1DeepLight: A Sobolev-trained Image-to-Image Surrogate Model for Light Transport in Tissue2026-01-14T12:40:02ZIn optoacoustic imaging, recovering the absorption coefficients of tissue by inverting the light transport remains a challenging problem. Improvements in solving this problem can greatly benefit the clinical value of optoacoustic imaging. Existing variational inversion methods require an accurate and differentiable model of this light transport. As neural surrogate models allow fast and differentiable simulations of complex physical processes, they are considered promising candidates to be used in solving such inverse problems. However, there are in general no guarantees that the derivatives of these surrogate models accurately match those of the underlying physical operator. As accurate derivatives are central to solving inverse problems, errors in the model derivative can considerably hinder high fidelity reconstructions. To overcome this limitation, we present a surrogate model for light transport in tissue that uses Sobolev training to improve the accuracy of the model derivatives. Additionally, the form of Sobolev training we used is suitable for high-dimensional models in general. Our results demonstrate that Sobolev training for a light transport surrogate model not only improves derivative accuracy but also reduces generalization error for in-distribution and out-of-distribution samples. 
These improvements promise to considerably enhance the utility of the surrogate model in downstream tasks, especially in solving inverse problems.2026-01-14T12:40:02ZPhilipp HaimVasilis NtziachristosTorsten EnßlinDominik Jüstelhttp://arxiv.org/abs/2601.09235v1Use of synthetic data for training dose estimation neural networks in CT dosimetry2026-01-14T07:15:59ZPersonalized computed tomography (CT) dosimetry has great potential in assessing patient-specific radiation exposure, supporting risk assessment, and optimizing clinical protocols. The aim of this study is to evaluate the potential of synthetic anatomical data for improving machine learning-based personalized CT dosimetry. It is investigated whether the combination of synthetic human body geometries with real patient data can improve model accuracy and generalization for CT organ dose estimation while maintaining the uncertainty requirements outlined in IAEA TRS-457. Deep learning models for organ dose prediction are trained using datasets with varying proportions of real and synthetic data. Synthetic datasets are generated from computational human phantoms with controlled distributions of organ volumes and body. A dedicated model uncertainty evaluation method is implemented to quantify prediction reliability and verify compliance with TRS-457 accuracy limits. Model performance and uncertainty are compared across different training data compositions, including a model trained solely on real patient data. Validated Monte Carlo simulation is used as the baseline. Models trained solely on synthetic data show limited predictive accuracy, particularly for small or peripheral organs. Incorporating as little as 10% real patient data significantly improves both statistical accuracy and uncertainty estimates, achieving a performance comparable to that of real-only models. 
The hybrid training approach improves robustness across different anatomies while maintaining TRS-457-compliant uncertainty levels (k=2 uncertainty < 20% for adults). The results indicate that combining real and synthetic data with a systematic uncertainty assessment supports the development of CT dosimetry models while reducing the amount of real data required.2026-01-14T07:15:59ZMarie-Luise KuhlmannJörg MartinStefan Pojtingerhttp://arxiv.org/abs/2601.09008v1Changes in Visual Attention Patterns for Detection Tasks due to Dependencies on Signal and Background Spatial Frequencies2026-01-13T22:12:27ZWe aim to investigate the impact of image and signal properties on visual attention mechanisms during a signal detection task in digital images. The insights from this work apply to many areas of digital imaging where signal or pattern recognition takes place against complex heterogeneous backgrounds. We used simulated tomographic breast images as the platform to investigate this question. While radiologists are highly effective at analyzing medical images to detect and diagnose diseases, misdiagnosis still occurs. We selected digital breast tomosynthesis (DBT) images as sample medical images with different breast densities and structures using digital breast phantoms (Bakic and XCAT). Two types of lesions (with distinct spatial frequency properties) were randomly inserted in the phantoms during projections to generate abnormal cases. Six human observers participated in an observer study designed for locating and detecting a 3-mm sphere lesion and a 6-mm spicule lesion in reconstructed in-plane DBT slices. We collected eye-gaze data to estimate gaze metrics and to examine differences in visual attention mechanisms. We found that detection performance in complex visual environments is strongly constrained by later perceptual stages, with decision failures accounting for the largest proportion of errors. 
Signal detectability is jointly influenced by both target morphology and background complexity, revealing a critical interaction between local signal features and global anatomical noise. Increased fixation duration on spiculated lesions suggests that visual attention is differentially engaged depending on background and signal spatial frequency dependencies.2026-01-13T22:12:27Z21 pages, 7 imagesAmar KavuriHoward C. GiffordMini Dashttp://arxiv.org/abs/2503.15383v6Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers2026-01-13T16:26:34ZPhoton-counting computed tomography (PCCT) has emerged as a promising imaging technique, enabling spectral imaging and material decomposition (MD). However, images typically suffer from a low signal-to-noise ratio (SNR) due to constraints such as low photon counts and sparse-view settings which provoke artifacts. To prevent this, variational methods minimize a data-fit function coupled with handcrafted regularizers that mimic a prior by enforcing image properties such as gradient sparsity. In the last few years, diffusion models (DMs) have become predominant in the field of generative models and have been used as a learned prior for image reconstruction. This work investigates the use of DMs as regularizers for MD tasks in PCCT, specifically using diffusion posterior sampling (DPS) guidance. Three DPS-based approaches -- image-domain two-step DPS (im-TDPS), projection-domain two-step DPS (proj-TDPS), and one-step DPS (ODPS) -- are evaluated. The first two methods achieve MD in two steps by performing reconstruction and MD separately. The last method, ODPS, samples the material images directly from the measurement data. The results indicate that ODPS achieves superior performance compared to im-TDPS and proj-TDPS, providing sharper, noise-free and crosstalk-free images. 
Furthermore, we introduce a novel hybrid method for scenarios involving materials absent from the training dataset which combines DM priors with standard variational handcrafted regularizers for the materials unknown to the DM. This hybrid method demonstrates improved MD quality compared to a standard variational method and does not require additional training of the DM neural network (NN).2025-03-19T16:21:16Z13 pages, 10 figures, 4 tablesCorentin VaziaThore DassowAlexandre BousseJacques FromentBéatrice VedelFranck VermetAlessandro PerelliJean-Pierre TasuDimitris Visvikis10.1109/TRPMS.2026.3651354http://arxiv.org/abs/2601.08644v1Percentile-based probabilistic optimization for systematic and random uncertainties in radiation therapy2026-01-13T15:15:43ZGeometric uncertainty can degrade treatment quality in radiation therapy. While margins and robust optimization mitigate these effects, they provide only implicit control over clinical goal fulfillment probability. We therefore develop a probabilistic planning framework using a percentile-based optimization function that targets a specified probability of clinical goal fulfillment.
Systematic and random uncertainties were explicitly modeled over full treatment courses. A scenario dose approximation method based on interpolation between a fixed set of doses was used, enabling efficient simulation of treatment courses during optimization. The framework was evaluated on a prostate case treated with volumetric-modulated arc therapy (VMAT) and a brain case treated with pencil beam scanning (PBS) proton therapy. Plans were compared to conventional margin-based and worst-case robust optimization using probabilistic evaluation.
For the prostate case, probabilistic optimization improved organ at risk (OAR) sparing while maintaining target coverage compared to margin-based planning, increasing average OAR goal fulfillment probability by 13.3 percentage points and reducing 90th percentile OAR doses by an average of 3.5 Gy. For the brain case, probabilistic optimization improved target minimum dose passing probabilities (e.g., 88% vs. 22% for $D_{95}$) and brainstem maximum dose passing probability (70% vs. 30%), while maintaining comparable or improved OAR sparing compared to worst-case optimization.
Probabilistic optimization enables explicit and interpretable control over goal fulfillment probabilities. Combining full treatment course modeling with efficient approximate dose calculation, the proposed framework improved the trade-off between target coverage and OAR sparing compared to conventional planning approaches in both photon and proton therapy.2026-01-13T15:15:43Z15 pages, 5 figuresAlbin FredrikssonErik EngwallJenneke de JongJohan Sundströmhttp://arxiv.org/abs/2509.13614v2Generative Consistency Models for Estimation of Kinetic Parametric Image Posteriors in Total-Body PET2026-01-13T01:24:34ZDynamic total body positron emission tomography (TB-PET) makes it feasible to measure the kinetics of all organs in the body simultaneously which may lead to important applications in multi-organ disease and systems physiology. Since whole-body kinetics are highly heterogeneous with variable signal-to-noise ratios, parametric images should ideally comprise not only point estimates but also measures of posterior statistical uncertainty. However, standard Bayesian techniques, such as Markov chain Monte Carlo (MCMC), are computationally prohibitive at the total body scale. We introduce a generative consistency model (CM) that generates samples from the posterior distributions of the kinetic model parameters given measured time-activity curves and arterial input function. CM is able to collapse the hundreds of iterations required by standard diffusion models into just 3 denoising steps. When trained on 500,000 physiologically realistic two-tissue compartment model simulations, the CM produces similar accuracy to MCMC (median absolute percent error < 5%; median K-L divergence < 0.5) but is more than five orders of magnitude faster. CM produces more reliable Ki images than the Patlak method by avoiding the assumption of irreversibility, while also offering valuable information on statistical uncertainty of parameter estimates and the underlying model. 
The proposed framework removes the computational barrier to routine, fully Bayesian parametric imaging in TB-PET and is readily extensible to other tracers and compartment models.2025-09-17T01:13:48ZYun ZhaoQinlin GuGeorgios I. AngelisAndrew J. ReaderYanan FanSteven R. Meiklehttp://arxiv.org/abs/2601.07998v1Predicting Region of Interest in Human Visual Search Based on Statistical Texture and Gabor Features2026-01-12T21:06:23ZUnderstanding human visual search behavior is a fundamental problem in vision science and computer vision, with direct implications for modeling how observers allocate attention in location-unknown search tasks. In this study, we investigate the relationship between Gabor-based features and gray-level co-occurrence matrix (GLCM) based texture features in modeling early-stage visual search behavior. Two feature-combination pipelines are proposed to integrate Gabor and GLCM features for narrowing the region of possible human fixations. The pipelines are evaluated using simulated digital breast tomosynthesis images. Results show qualitative agreement among fixation candidates predicted by the proposed pipelines and a threshold-based model observer. A strong correlation is observed between GLCM mean and Gabor feature responses, indicating that these features encode related image information despite their different formulations. Eye-tracking data from human observers further suggest consistency between predicted fixation regions and early-stage gaze behavior. These findings highlight the value of combining structural and texture-based features for modeling visual search and support the development of perceptually informed observer models.2026-01-12T21:06:23Z10 pages, 6 figuresHongwei LinDiego AndradeMini DasHoward C. 
Giffordhttp://arxiv.org/abs/2601.07976v1Application of Ideal Observer for Thresholded Data in Search Task2026-01-12T20:18:02ZThis study advances task-based image quality assessment by developing an anthropomorphic thresholded visual-search model observer. The model is an ideal observer for thresholded data inspired by the human visual system, allowing selective processing of high-salience features to improve discrimination performance. By filtering out irrelevant variability, the model enhances diagnostic accuracy and computational efficiency.
The observer employs a two-stage framework: candidate selection and decision-making. Using thresholded data during candidate selection refines regions of interest, while stage-specific feature processing optimizes performance. Simulations were conducted to evaluate the effects of thresholding on feature maps, candidate localization, and multi-feature scenarios. Results demonstrate that thresholding improves observer performance by excluding low-salience features, particularly in noisy environments. Intermediate thresholds often outperform no thresholding, indicating that retaining only relevant features is more effective than keeping all features.
Additionally, the model demonstrates effective training with fewer images while maintaining alignment with human performance. These findings suggest that the proposed novel framework can predict human visual search performance in clinically realistic tasks and provide solutions for model observer training with limited resources. Our novel approach has applications in other areas where human visual search and detection tasks are modeled, such as computer vision, machine learning, and defense and security image analysis.2026-01-12T20:18:02Z13 pages, 6 figuresHongwei LinHoward C. Giffordhttp://arxiv.org/abs/2601.07680v1A density functional theory study of amino acids on Mg and Mg-based alloys2026-01-12T16:09:36ZMagnesium (Mg) has mechanical properties similar to bone tissue, and Mg ions take part in the metabolism. This makes Mg of interest for biocompatible degradable body implants, provided that its high corrosion rate can be inhibited. Slightly alloying Mg and adding surface coatings can slow down the corrosion processes without significantly changing the mechanical properties. Use of coating molecules that are native to the body increases the likelihood of making the surface biocompatible, for example by use of amino acids. We here present a density functional theory (DFT) study of the adsorption on Mg(0001) of the amino acids glycine, L-proline, and L-hydroxyproline (Hyp), the main amino acid content of collagen. 
We investigate how binding of the functional groups of Hyp is affected when Mg(0001) is slightly alloyed with zinc, lithium or aluminium, and we also model the immersion of the systems in a water environment to see how this affects the binding.2026-01-12T16:09:36Z10 pages, 6 figures, 3 tablesJohn BolinAmanda GooldOlof HildebergAlva LimbäckElsebeth Schröderhttp://arxiv.org/abs/2601.07458v1Overcoming the limitations of NMR Field Probes: A Novel Integrated Sensor Utilizing Pre-Polarization for (Ultra) Low Field MRI2026-01-12T12:05:44ZAccess to magnetic resonance imaging (MRI) remains severely limited in low- and middle-income countries, especially in sub-Saharan Africa, despite rising rates of non-communicable diseases. Low-field MRI presents an affordable, locally developable diagnostic solution, but its performance is constrained by magnetic field instability. We present a novel NMR field probe designed to overcome these challenges using a rapid non-adiabatic switch-off of a pre-polarization field resulting in precessing spin magnetization. Achieved by first use of high-voltage silicon carbide transistors operating in controlled avalanche breakdown, it measures the Larmor frequency without prior field knowledge, unlike conventional probes. This capability is crucial during magnet development with often unknown fields, allowing early detection of magnet issues, and offering an urgently needed tool for magnet design and image-quality improvement. 
Validated from 1 mT to 45 mT (up to 1,000 times stronger than similar systems), its low-cost, modular design supports replication, upgrades, and enhanced field control, helping expand global MRI access.2026-01-12T12:05:44ZSubmitted to Nature Communications EngineeringPavel PovolniDominique GoernerPraveen Iyyappan ValsalaLukas GebertGeorgiy SolomakhaNicolas KempfFelix GlangJudith SamlowRuben SchnitzlerIngmar KallfassKai BuckenmaierKlaus Schefflerhttp://arxiv.org/abs/2601.07363v1A Pilot Kinematic Study on the Forehand Reverse Flick: Feasibility of a Novel Short Return Technique in Table Tennis2026-01-12T09:38:21ZBackground: Following changes in table tennis ball materials, offensive returns have become more important for initiating sustained topspin offense. However, using the backhand flick (BF) to return forehand short balls often increases the difficulty of recovery and continuity, revealing a technical gap. This study preliminarily verified a novel forehand short return technique, the forehand reverse flick (FRF), and analyzed its similarities and differences with the BF. Methods: Four elite athletes completed seven consecutive days of FRF-specific training. Infrared motion capture and ultra-high-speed cameras were used to collect data on racket kinematics, movement duration, and ball performance. Results: The success rate of the FRF increased steadily, reaching 86%. Racket trajectories of the two techniques were highly similar along the X (r = 1) and Y (r = 0.99) axes but differed along the Z (r = -0.04) axis. Racket and ball velocities were comparable between techniques, whereas the FRF showed lower resultant acceleration (approximately 265.57 m/s) and required about 0.03 s more for movement duration. The FRF also generated lower ball spin (approximately 76.61 r/s), about 64% of the BF value (approximately 120.13 r/s). The highest participant mean spin rate reached 93 r/s, about 77% of the BF mean. 
Conclusion: Overall, the FRF was found to have favorable learnability and training value, with potential for further optimization and competitive application.2026-01-12T09:38:21Z20 pages, 7 figuresPengfei JinJie RenChen YangQingtao KongQingshan ZhangNan GuBin ChenQin ZhangZhe Fenghttp://arxiv.org/abs/2601.07225v1On optimization of Paganin's method for propagation-based X-ray phase-contrast imaging and tomography2026-01-12T05:43:28ZPaganin's method for image reconstruction in propagation-based phase-contrast X-ray imaging and tomography has enjoyed broad acceptance in recent years, with over one thousand publications citing its use. The present paper discusses approaches to optimization of the method with respect to simple image quality metrics, such as signal-to-noise ratio and spatial resolution, as well as a reference-based metric corresponding to the relative mean squared difference between the reconstructed image and the "ground truth" image that would be obtained in a setup with perfect spatial resolution and no noise. The problem of optimization of the intrinsic regularization parameter of Paganin's method with respect to spatial resolution in the reconstructed image is studied in detail. It is also demonstrated that a combination of Paganin's method with a Tikhonov-regularized deconvolution of the point-spread function of the imaging system can provide significantly higher image quality compared to the standard version of the method. Analytical expressions for some relevant image quality metrics are obtained and compared with results of numerical simulations. Advantages and shortcomings of optimization approaches using a number of different image quality metrics are discussed. The results of this study are expected to be useful in practical X-ray imaging and training of deep machine learning models for image denoising and segmentation.2026-01-12T05:43:28Z29 pages, 4 figuresTimur E. GureyevDavid M. PaganinAshkan PakzadHarry M. 
Quineyhttp://arxiv.org/abs/2601.07187v1Critical Shortfall in NIH Support for Medical Physics Research2026-01-12T04:21:49ZThis report summarizes changes in federal research funding to the medical physics community between FY24 and FY25. By linking the AAPM membership database with NIH RePORTER records, we quantified the distribution of NIH funding for projects led by AAPM researchers. Although total NIH funding to AAPM members remained relatively stable across the two years, the composition of that funding shifted substantially. Competing (new and renewal) awards declined 50%, driven largely by an 80% collapse in new R01 grants from the National Cancer Institute (NCI). In contrast, noncompeting continuation awards increased by 10%, following a shift in how NIH funds multi-year projects. These changes occurred in the context of widespread disruptions to NIH review and grantmaking, including delayed study sections and more stringent administrative requirements. Federal funding is essential to sustaining innovation, supporting early-stage investigators, and ensuring that patients receive the best possible care. The trends identified here raise concerns about the long-term vitality and stability of the medical physics research pipeline.2026-01-12T04:21:49ZGuillem PratxWensha YangMatthew L Scarpellihttp://arxiv.org/abs/2511.19329v2A primer on treatment planning aspects for temporally modulated pulsed radiation therapy2026-01-11T22:16:46ZTemporally modulated pulsed radiotherapy (TMPRT) delivers conventional fraction doses of radiation using temporally separated pulses of low doses (<30 cGy), yielding fraction-effective dose rates of around 6.7 cGy/min, with the goal of exploiting tumor radiation hypersensitivity, which was observed in both preclinical models and human clinical trials. To facilitate TMPRT, volumetric modulated arc therapy (VMAT) and 3D-CRT planning techniques were developed following the guidelines of the proposed NRG CC-017 trial. 
Plans were evaluated with respect to homogeneity, conformality, and adherence to dose constraints. Deliverability of plans was assessed using in-phantom measurements for absorbed dose accuracy at low dose rates and using EPID for isodose verification. For VMAT, only single-arc plans were found to be acceptable due to otherwise unacceptably heterogeneous field doses, while for dynamic conformal arcs, machine limitations on the number of monitor units per degree require the use of partial arcs for each pulse. Delivery of plans at low dose rates (< 100 MU/min) was accurate with high Gamma pass rates on modern LINACs and moderate pass rates on legacy LINACs, in line with their general performance. Generally, VMAT is preferred to achieve optimal homogeneity, conformality, and organ-at-risk sparing, while the use of 3D-CRT can increase the availability of TMPRT for more patients and clinics.2025-11-24T17:21:06ZChristian VeltenAdam BaylissJiayi HuangWolfgang A. Tomé