https://arxiv.org/api/qMC5/2cw0z8M297ZGKqvi9cQj7c 2026-06-09T21:31:24Z 10640 30 15 http://arxiv.org/abs/2606.07258v1 CaliPPer: quantifying, predicting and improving AI model performance for binding prediction 2026-06-05T13:34:47Z Binding prediction models accelerate therapeutic antibody and TCR discovery, but their performance on new datasets is unpredictable, often leading to low discovery rates. Density-ratio methods (PAPE, M-CBPE) provide label-free performance estimation for binary classification, but their assumptions and aggregate-only outputs limit binding prediction on neoepitopes, antigen variants and chemical scaffolds. Here we present CaliPPer (Calibration and Prediction of Performance), a post-hoc framework pairing a multi-chain Sample-to-Domain Distance (S2DD) with distance-aware Bayesian recalibration, operating at three resolutions: generalisability score, aggregate performance prediction, and per-sample confidence. Across ten models, eight architectures and two immune-receptor domains, CaliPPer attains distance--performance correlations $|r|=0.80\text{--}0.92$, predicts AUROC/AP/F1 with mean absolute errors $0.008\text{--}0.070$, and improves AUROC by up to $+0.20$ on unseen epitopes/variants. Applied retrospectively to five published TCR, BCR, MHC--peptide and small-molecule studies, CaliPPer raises true discovery rates in all five (e.g.\ $0/5 \to 3/5$ confirmed neoantigens), providing a triage layer between computational prediction and experimental validation. 2026-06-05T13:34:47Z Jian-Qing Zheng Hantao Lou Zinan Yin Sam Farrar Yuze Zhou Elie Antoun Xiangxi Wang Xuetao Cao Tao Dong http://arxiv.org/abs/2606.07215v1 A Comparative Study of Deep Learning Models for Geological Carbon Sequestration 2026-06-05T12:27:22Z Numerical reservoir simulations are extremely computationally expensive, as they require the repeated solution of large nonlinear algebraic systems derived from the discretized governing equations. With growing demand for real-time optimization, uncertainty quantification, and history matching in digital twin applications, reducing computational cost has become essential. Deep learning (DL)--based surrogate models have emerged as an effective approach for accelerating subsurface flow simulations. Here, we seek to determine which DL architectures are best suited for high-dimensional, transient subsurface flow problems. In this study, we examine the advantages and relative costs associated with training such models, including memory requirements, training speed, accuracy, robustness, and generalization. We conduct a comparative study of several DL architectures commonly used as surrogate models for subsurface flow problems, including U-Net, V-Net, Temporal Convolutional Networks, Fourier Neural Operators (FNO), and a U-Net--enhanced FNO (U-FNO). As a benchmark, we compare the performance of the studied models for geological carbon sequestration to predict transient pressure build-up and CO$_2$ saturation fields. We study the problem of CO$_2$ injection into a single wellbore in a two-dimensional domain, which is parameterized by anisotropic, heterogeneous permeability and porosity fields, injection configurations, and reservoir properties. Results demonstrate that surrogate model performance is strongly dependent on the underlying PDE type (i.e., hyperbolic vs. elliptic). The U-FNO achieves the highest accuracy for predicting CO$_2$ saturation fields, while the FNO provides the best performance for pressure build-up prediction. 2026-06-05T12:27:22Z Giovanni Zingaro Robert Gracie Yuri Leonenko http://arxiv.org/abs/2606.07146v1 Decision-Aware Evaluation of Physics-Informed Surrogates 2026-06-05T11:00:46Z Physics-informed machine learning is often assessed by curve error, although engineering use depends on downstream decisions: ranking candidates, avoiding infeasible designs and limiting regret. We introduce pinn-gym, an open benchmark for material-conditioned lattice design that couples a transparent reduced-order crush-and-impact oracle with five printable polymer cards, dimensionless force-response targets and a protocol spanning curve fidelity, physical admissibility, top-k retrieval and mass regret. Across per-material, pooled and cross-material settings, low nRMSE is frequently insufficient to identify useful design selections. Physics-informed losses alter trade-offs rather than monotonically improving all metrics, and dimensionless conditioning improves comparability without making transfer symmetric. The benchmark is not a certified material model; within the released oracle, candidate generator and material cards, pinn-gym provides a reproducible testbed for evaluating PIML surrogates as decision systems rather than curve predictors alone. 2026-06-05T11:00:46Z 12 pages, 5 figures, 9 tables. Code and data available at https://github.com/Dyniel/pinn-gym Daniel Cieślak Andrzej Czyżewski http://arxiv.org/abs/2606.02475v2 Physics-Informed Residuals for Adaptive Mesh Refinement in Finite-Difference PDE Solvers 2026-06-05T09:36:44Z Classical finite-difference solvers remain reliable tools for partial differential equations, but their efficiency depends on where mesh resolution is placed. Uniform refinement can waste degrees of freedom when solution difficulty is localised near sharp gradients, fronts, oscillations, or constraint-sensitive regions. This paper studies a hybrid strategy in which a physics-informed neural network (PINN) is used not as the final solver, but as an off-grid residual probe for adaptive mesh refinement. The PINN residual is sampled over the domain, converted into cellwise indicators, and used to guide refinement before the final approximation is computed by a finite-difference solver. The method is evaluated on three benchmarks. The main full-solver validation uses the one-dimensional viscous Burgers equation with a nonuniform finite-difference solve on the adapted meshes. PINN-threshold refinement attains final relative $L^2$ error $0.021067$ with $60$ degrees of freedom, compared with $0.022617$ for uniform refinement with $192$ degrees of freedom. At matched mesh size, PINN-threshold reduces the error by about $67.5\%$. PINN-Dörfler refinement gives similar performance, with error $0.021264$ using $58$ degrees of freedom. A gradient indicator remains slightly more accurate, so the result supports usefulness rather than universal superiority. Manufactured 2D and 3D proxy tests, based on a nonlinear Schrödinger equation and an incompressible Navier--Stokes system, show that PINN residuals can organise structured refinement and improve over random refinement, although they do not consistently outperform gradient or uniform baselines. The results support PINN-guided AMR as a residual-indicator strategy for transferring physics-informed diagnostic information into finite-difference mesh adaptation while preserving the classical solver as the final approximation engine. 2026-06-01T16:47:21Z 20 pages, 5 tables, 5 figures Henry Kasumba Ronald Katende http://arxiv.org/abs/2606.07036v1 STREAM: Stochastic Riemannian Flow Matching with Anisotropic Decoder for Digital Histopathology Image Generation 2026-06-05T08:26:44Z Synthetic histopathology image generation addresses critical challenges in computational pathology, including patient privacy and the growing need for large-scale training data for foundation models. Latent diffusion models have dominated the image generation domain, with recent works emphasizing that the choice of latent space is critical to the quality of generated images. Existing state-of-the-art generative models in histopathology use pretrained Vision Foundation Models (VFMs) as conditioning signals, and we observe that this leads to "conditioning collapse," where the conditioning signal dominates the latent space and lowers the quality and diversity of generated samples. Therefore, we instead use pretrained histopathology VFMs as the latent space itself, leveraging their patch-token features that encode rich semantic information. We empirically show that these features are $\ell_2$-normalized and lie on the unit hypersphere $\mathcal{S}^{d-1}$ with strong angular dominance and intrinsic curvature, making them naturally suited for a Riemannian formulation. We therefore present STREAM, the first framework to apply Riemannian flow matching in the pathology domain. STREAM consists of two stages: 1) a bridge-type stochastic perturbation that establishes per-token rectifiability on $\mathcal{S}^{d-1}$ for training a Diffusion Transformer (DiT) in latent space, and 2) a novel anisotropic decoder that allocates robustness to low-energy directions of the velocity-field Jacobian while preserving fidelity along its high-energy directions. Together, STREAM achieves state-of-the-art reconstruction and generation performance on breast and colorectal cancer datasets. The code will be publicly released upon acceptance. 2026-06-05T08:26:44Z 27 pages, 7 figures Won June Cho Daeky Jeong Hyeongyeol Lim Hongjun Yoon http://arxiv.org/abs/2606.06882v1 Learning to Strategically Acquire Resources in Competition 2026-06-05T03:52:58Z We consider multiple agents competing to acquire some costly divisible resource (e.g. shares of a financial asset, compute resources, etc.) over time. Leveraging a standard model for price dynamics, we propose a novel game-theoretic model for this problem, generalizing settings studied in diverse literatures. Our analysis considers different assumptions on the information available to agents. Under partial-information with a common prior (which subsumes complete information as a special case), we establish the existence, uniqueness, and efficient computability of the Bayesian Nash equilibrium (BNE), and bound the price of anarchy. Next and more generally, we consider agents with no common prior learning to act optimally given realistic market feedback from repeated interactions. We provide sufficient conditions on agents doing simultaneous learning dynamics for last-iterate convergence to the BNE. For all settings, we provide simulations based on real financial data to illustrate our theoretical results and offer new insights on strategic behavior in the context of trading and resource acquisition. 2026-06-05T03:52:58Z Safwan Hossain Mirah Shi Andrew Bennett Neil Andrew Chriss Michael Kearns Anderson Schneider Yuriy Nevmyvaka http://arxiv.org/abs/2606.07681v1 Systematic LLM Translation of Legacy Scientific Code to Differentiable Frameworks: Application to a Land Surface Model 2026-06-04T22:18:09Z Differentiable programming offers transformative capabilities for scientific modeling, enabling gradient-based parameter estimation, sensitivity analysis, and data assimilation. Yet, migrating legacy codebases into differentiable frameworks remains a challenge. We present a five-phase LLM-based agentic pipeline that translates legacy Fortran into JAX: static dependency analysis determines module translation order from the full call graph; iterative compile-repair loops correct errors autonomously; and a Fortran reference oracle enforces numerical parity at the module level before integration and gradient verification. We instantiate and evaluate the pipeline on CLM-ml-v2, a 19,000-line Fortran land surface model, and analyze agent behavior across 73 module translation tasks. The resulting differentiable model computes the complete Jacobian in a single backward pass, recovers physical parameters in eight times fewer steps than gradient-free optimization, and achieves a 24 times wall-clock speedup over sequential Fortran at ensemble size N=2,048. Both the translated model and pipeline infrastructure are released as a reusable framework for differentiating other Earth system model components. 2026-06-04T22:18:09Z Aya Lahlou Linnia Hawkins Pierre Gentine http://arxiv.org/abs/2511.04567v2 Machine Learning for Electron-Scale Turbulence Modeling in W7-X 2026-06-04T22:11:09Z Constructing reduced models for turbulent transport is essential for accelerating profile predictions and enabling many-query tasks such as parameter exploration and design optimization. This work investigates machine-learning-driven reduced models for Electron Temperature Gradient (ETG) turbulence in the Wendelstein 7-X (W7-X) stellarator. We develop physics-guided scaling laws to predict the ETG heat flux at seven radial locations as functions of three key plasma parameters: the normalized electron temperature gradient ($ω_{T_e}$), the ratio of normalized electron temperature and density gradients ($η_e$), and the electron-to-ion temperature ratio ($τ$). The model coefficients are determined through regression combined with an active learning strategy. The procedure initializes the scaling laws using low-cardinality sparse-grid training data and iteratively enriches the training set by selecting maximally informative samples from an existing simulation database. The predictive performance of the models is assessed using out-of-sample datasets comprising more than $393$ points per radial location. Using the coefficients identified at the seven training radial locations, we further derive regression-based parameterizations for the scaling-law coefficients as functions of radial position. The resulting models are then evaluated at three additional radial locations not used during training, including both interpolation and moderate extrapolation cases. Overall, our reduced models demonstrate good predictive performance and achieve accuracy comparable to the original reference simulations, including in interpolation and moderate extrapolation regimes. An important finding is that a single radius-independent model cannot adequately describe ETG transport across the W7-X core, suggesting the presence of geometry-dependent physics not captured by the present formulation. 2025-11-06T17:24:37Z 15 pages, 7 tables, 14 figures Phys. Plasmas 33, 000000 (2026) Ionut-Gabriel Farcas Don Lawrence Carl Agapito Fernando Alejandro Banon Navarro Gabriele Merlo Frank Jenko 10.1063/5.0311057 http://arxiv.org/abs/2605.19116v2 Robust Restless Multi-Armed Bandit for Data Center Flexibility Services Through Virtual Machine Scheduling 2026-06-04T22:06:57Z Energy demands from data centers have surged and stressed the grid in recent years. Electric grids require balancing supply and demand every second, motivating demand response (reduction) from large loads, including data centers. This can be achieved by rescheduling jobs on a physical machine. Its real-time implementation is uncertain due to fluctuating resource utilization, and rescheduling incurs quality-of-service (QoS) losses that providers are unwilling to disclose. We propose a restless multi-armed bandit (RMAB) framework, in which the grid operator requests load reductions without access to detailed job-rescheduling procedures. Using open-source virtual machine (VM) datasets, we model job arrivals and rescheduling at each data center as a restless arm in a Markov decision process (MDP) and derive Whittle-index-based policies using the learned transition function via Thompson sampling. To overcome the weakness of an increasingly long learning process due to an enlarged state space, we use a mixed strategy that includes a global upper confidence bound (UCB) and encodes trust indices to enhance robustness and accelerate learning. Results show that the proposed mixed-strategy algorithm remains robust across varying state-space sizes and consistently outperforms the pure Thompson-Whittle (TW) algorithm, especially when contextual information is noisy. It also demonstrates superior performance compared to the state-of-the-art EXP4 framework. We provided open-source code to ensure reproducibility. 2026-05-18T21:03:10Z Zixi Chen Yifu Ding Thomas Magnanti http://arxiv.org/abs/2606.06652v1 Probabilistic Risk Sensitivity and Loss Aversion in Cumulative Prospect Theory 2026-06-04T18:58:48Z This paper develops a binary-gamble framework for characterizing risk sensitivity and loss aversion in Cumulative Prospect Theory (CPT). The proposed probabilistic risk-sensitivity metric is defined as a probability-threshold ratio that determines acceptance and preference thresholds in choice problems involving either a certain outcome and a binary gamble or two binary gambles. We show how standard notions of symmetric and non-symmetric bet aversion can be recovered within this framework, and we compare the resulting threshold-based conditions with utility premia, probability premia, and Arrow--Pratt curvature measures. The analysis clarifies when these criteria coincide and when they diverge, particularly for increasing aversion conditions, binary gambles with unequal probability distributions, and settings involving probability weighting functions. We also identify technical restrictions that arise when CPT-utility functions are used to represent loss aversion at the reference point. The resulting framework provides a decision-theoretic interpretation of risk sensitivity that is directly tied to probability thresholds and complements existing premium-based approaches. 2026-06-04T18:58:48Z This paper has been submitted for publication Symeon Vaidanis Marios Kountouris http://arxiv.org/abs/2606.06405v1 Bridging CAD and Data-Driven Design: Attributed Feature Graphs for Engineering Design 2026-06-04T17:11:41Z Engineering design is an iterative, simulation-driven process where traditional workflows rely heavily on computationally expensive analyses such as finite element and computational fluid dynamics. Although data-driven methods have accelerated design evaluation and optimization, most existing geometric representations discard parametric and feature-level semantics, limiting their integration with CAD-driven design workflows and reducing model interpretability. To address this gap, this work introduces Attributed Feature Graphs (AFGs), a feature-based representation that encodes design features, such as extrusions, ribs, and pockets, as nodes and their geometric or dependency relations as directed edges. AFGs preserve design intent and parametric structure while remaining compatible with standard graph-based learning methods, enabling end-to-end learning directly on CAD-derived feature graphs. The paper demonstrates the proposed representation through a surrogate-modeling case study on the CarHoods10K automotive hood frame dataset, where a Graph Neural Network (GNN) is trained as an evaluation engine to predict performance metrics from AFG inputs. The learned model achieves competitive surrogate performance compared with traditional data-driven approaches, but with the added benefit that engineers can map predictions back to specific CAD features and interpret how individual design elements influence system behavior. Furthermore, because AFGs are built from native CAD features, engineers can directly edit the underlying geometry in the CAD environment and reevaluate the design through the same learned model. 2026-06-04T17:11:41Z Accepted to ASME IDETC-CIE 2026 Abhishek Indupally Ibraheem Alawadhi Satchit Ramnath Jami J. Shah http://arxiv.org/abs/2606.06325v1 Data valuation model for non-monetary exchanges 2026-06-04T16:00:24Z In the evolving landscape of data product exchange platforms, traditional economic valuation models fall short due to the non-rival nature of data and the prevalence of non-monetary data product exchanges. This paper introduces a normative, choice-based metric for valuing data products within intracompany exchanges, where conventional pricing mechanisms are absent. By modeling consumer attention and preferences, the proposed metric quantifies the value of data offerings based solely on user selection behavior, without relying on cost, demand, or competitive pricing data. We show that this metric can be formally cast as a cooperative game with a closed-form Shapley value, providing a principled and fairness-based allocation of value across offerings. The model rewards uniqueness and discriminative consumption, effectively addressing the limitations of popularity-based metrics and incentivizing the creation of high-value, long-tail data products. Through theoretical analysis and illustrative examples, the metric is shown to align with economic principles, support equitable valuation, and contribute to a robust framework for measuring gross data product value. Future research directions include exploring bundling strategies and quantifying product complementarity. 2026-06-04T16:00:24Z Julia Blyumen Eitan Farchi http://arxiv.org/abs/2606.06135v1 Adaptation of the hybrid fictitious domain-immersed boundary method for Reynolds-averaged turbulence modeling 2026-06-04T13:19:15Z Engineering practice often calls for shape or topology optimization (TO) of fluid defining components, while the ever-increasing computing power allows the optimized cost functions to be based on computational fluid dynamics (CFD). However, a common bottleneck in CFD-based TO frameworks is the requirement for frequent remeshing. In order to alleviate this bottleneck, we propose an adaptation of an immersed boundary (IB) method variant, the hybrid fictitious domain-immersed boundary method, to leverage Reynolds-averaged Navier-Stokes (RANS) equations and wall function. The main contribution of the present work lies in the design and open-source implementation of the IB-aware steady-state solution of the RANS equations via the SIMPLE algorithm in the OpenFOAM library. For the most common two-equation RANS models, Reynolds numbers from $10^1$ to $10^6$, and several benchmarks, such as flow over a backwards facing step or an Ahmed body, the framework gives results consistent with the standard body-fitted CFD. Furthermore, given the intended application in TO, special emphasis is placed on the robustness and applicability of the approach to general geometries, which is tested on a NACA profile under various angles of attack. 2026-06-04T13:19:15Z preprint Lucie Kubíčková Martin Isoz http://arxiv.org/abs/2604.01349v4 PI-JEPA: Label-Free Surrogate Pretraining for Coupled Multiphysics Simulation via Operator-Split Latent Prediction 2026-06-04T12:22:14Z Reservoir simulation workflows face a fundamental data asymmetry: input parameter fields (geostatistical permeability realizations, porosity distributions) are free to generate in arbitrary quantities, yet existing neural operator surrogates require large corpora of expensive labeled simulation trajectories and cannot exploit this unlabeled structure. We introduce \textbf{PI-JEPA} (Physics-Informed Joint Embedding Predictive Architecture), a surrogate pretraining framework that trains \emph{without any completed PDE solves}, using masked latent prediction on unlabeled parameter fields under per-sub-operator PDE residual regularization. The predictor bank is structurally aligned with the Lie--Trotter operator-splitting decomposition of the governing equations, dedicating a separate physics-constrained latent module to each sub-process (pressure, saturation transport, reaction), enabling fine-tuning with as few as 100 labeled simulation runs. On single-phase Darcy flow, PI-JEPA achieves $1.9\times$ lower error than FNO and $2.4\times$ lower error than DeepONet at $N_\ell{=}100$, with 24\% improvement over supervised-only training at $N_\ell{=}500$, demonstrating that label-free surrogate pretraining substantially reduces the simulation budget required for multiphysics surrogate deployment. 2026-04-01T20:02:56Z Substantial Revision Required Brandon Yee Pairie Koh http://arxiv.org/abs/2605.15212v2 Fault tolerance estimation in digital circuits with visualised generative networks 2026-06-04T12:20:17Z We propose a new numerical method to estimate the fault tolerance of failure modes in digital circuit structures with a generative network sampling technique. From a random input of generated bitwise configurations of ideally digitalised analog currents in the digital circuit design with classical logical gates, expected output currents are compared to the realistic signals of a numerical experiment at the discriminator part of the Generative Adversarial Network (GAN) to calculate the deviation from ideal digital electronic signals, including various error modes, such as missing or interchanged logical devices. From the present analysis of a representation of the GAN in terms of complex variables, it is possible to evaluate the robustness in electronic designs by differentiating the impact of failure modes associated with different classical logical elements in the circuit. 2026-05-09T18:44:02Z 7 pages, 7 figures, 1 table Sascha Biel Carl Alexander Gaede Amiel Glaser Jan Wolter Alexej Schelle