https://arxiv.org/api/Rr1Q7o2d8wmsFKNFRcPT0mnjk6I2026-06-09T23:28:26Z106406015http://arxiv.org/abs/2606.04228v1When Does Structure Help? The Information Bonus of AlphaFold2 Representations over Protein Language Models2026-06-02T21:26:09ZAI scientist systems increasingly choose biological foundation models before they choose experiments. In protein pipelines, this creates a concrete engineering and scientific question: when is the cost of structural inference worth paying over a cheaper sequence-only model? We introduce the information bonus (IB), a task-level metric that measures the linearly accessible advantage of frozen single-sequence AlphaFold2 Evoformer representations over frozen ESM-2 embeddings under protein-level cross-validation. Across binding affinity regression (PDBbind, n=5,680), conformational flexibility (ATLAS molecular dynamics, 268 proteins), and allosteric-site classification (AlloSigDB, n=9,925 residues), IB is sharply mechanism-dependent. ESM-2 dominates binding affinity (IB=-0.141; Pearson r=0.449 vs. 0.307) and binary flexibility (IB=-0.060; AUROC 0.824 vs. 0.764; p=0.0017). AF2 single representations give the only above-chance allostery predictions (IB=+0.064; AUROC 0.548 vs. 0.485), revealing long-range geometric signal not recovered from sequence alone. We also identify a residue-level leakage artifact: naive residue splits inflate RMSF performance by 27-39% depending on the representation, enough to reverse representation rankings. These results turn representation selection into a measurable decision for AI-for-science systems.2026-06-02T21:26:09ZKargi Chauhanhttp://arxiv.org/abs/2606.04220v1Dead Science Walking: Publication Bias and the AI Scientist Pipeline2026-06-02T21:15:54ZAI scientist systems are beginning to automate the production, evaluation, and iteration of scientific hypotheses. Their promise is speed; their risk is that speed also scales errors embedded in the scientific record. We argue that a near-term risk is corpus failure: AI scientist systems are trained on and grounded in a literature that over-represents positive results and under-represents null findings. We formalise this distortion as the null result gap, estimate it across three domains (drug discovery ~0.60, psychology ~0.56, cancer biology ~0.35), and introduce an amplification index for reasoning about how retrieval, generation, and automated evaluation can compound the raw gap. Using first-order estimates, we argue that a standard three-stage pipeline can amplify corpus distortion by a factor of 2.18x, with the conclusion unchanged under more conservative multipliers. We identify four governance failure modes: confident rediscovery, ghost evidence accumulation, replication laundering, and confidence miscalibration. We then propose three interventions: null-result databases as training infrastructure, retraction-aware evaluation metrics, and mandatory training corpus disclosure. The central takeaway is that AI scientists will not only accelerate science. Without governance, they will accelerate science's blind spots before they accelerate its discoveries.2026-06-02T21:15:54ZKargi Chauhanhttp://arxiv.org/abs/2606.03933v1Physics-Informed Single Atom Matching Pursuit: Guided-Waves Wavenumbers and Propagation Distance Estimation for Damage Localization in Structural Health Monitoring2026-06-02T17:25:34ZStructural Health Monitoring (SHM) aims at the real-time monitoring of the integrity of engineering structures, with Guided-waves (GWs) providing high sensitivity to damage presence and to ageing effects for thin-walled components. In conventional GW-based SHM, a bonded piezoelectric transducer (PZT) emits a short tone burst that produces an Initial Wave Packet (IWP) propagating through the structure. As this packet interacts with boundaries and potential damages, additional scattered wave packets are produced. A major limitation of such approaches lies in the simultaneous excitation of multiple dispersive GW modes by a single PZT, which significantly complicates signal interpretation and damage monitoring. In this context, this work proposes the Physics-Informed Single Atom Matching Pursuit (PISAMP) method, a signal decomposition method grounded in the physical principles governing wave propagation. In contrast with purely data-driven or numerically intensive techniques, the proposed approach embeds strong physical constraints into a low-dimensional and computationally efficient signal representation. This formulation enables the direct identification of key physically meaningful features, including modal wavenumber functions and propagation distances between actuator, damage and sensors. These extracted features, especially source-damage-sensor distances, allows to subsequently perform damage location using well established Elliptical Localization techniques. The principal novelty of this study lies in integrating wave propagation physics into a compact signal decomposition framework and developing an interpretable damage localization methodology for GW-SHM applications.2026-06-02T17:25:34ZSebastian RodriguezBorja FerrandizFrancisco ChinestaNazih MechbalMarc Rébillathttp://arxiv.org/abs/2606.03816v1Learning finite viscoelasticity with DAVIS: A supervised framework for generalized standard materials2026-06-02T15:58:04ZThis work revisits the recently proposed data-adaptive viscoelasticity (DAVIS) framework, a spline-based formulation of finite viscoelasticity within the generalized standard materials setting. DAVIS enables a data-driven representation of equilibrium and non-equilibrium constitutive functions while retaining thermodynamic consistency and supporting parameter identification via finite element model updating. The present contribution focuses on improving the robustness and identifiability of non-equilibrium branches in generalized Maxwell-type models. To this end, two extensions of the original formulation are introduced. First, the spline representation is reformulated in terms of curvature-based variables, which is especially convenient to enforce monotonicity and convexity constraints by construction through a smooth parameter mapping. Second, the adaptation of interpolation domains is decoupled from the inner parameter identification by means of a staggered, block-alternating strategy: spline coefficients are optimized for fixed domain endpoints, while the endpoints are updated in an outer loop based on smooth statistics of sampled invariants. This separation alleviates an inherent scaling ambiguity between interpolation domains and spline coefficients that can impair conditioning in viscoelastic inverse problems. The underlying constitutive model remains the finite strain viscoelasticity framework of Reese and Govindjee. The proposed identification strategy is assessed for homogeneous uniaxial loading-unloading tests, which facilitates the study of identifiability and robustness of non-equilibrium branches.2026-06-02T15:58:04ZSimon WiesheierPaul SteinmannMiguel Angel Moreno-Mateoshttp://arxiv.org/abs/2606.03704v1Dynamic Objective Selection with Safeguards and LLM Oversight for Financial Decision-Making2026-06-02T14:22:07ZFinancial decision-making tasks such as stock recommendation and portfolio allocation typically estimate future return and risk and then select trades or allocations for an investor, and the chosen optimization objective often determines realized performance. However, because market conditions evolve over time, a fixed objective can be suboptimal across regimes, while regime-switching pipelines that rely on latent regime estimates can be noisy or delayed and frequent switching can increase turnover and operational instability. In this paper, we propose DOSS (Dynamic Objective Selection with Safeguards), a learning-based selector that directly chooses the decision-relevant objective function at each time point from interpretable statistical summaries of recent returns, selecting among a small set of candidates (e.g., return-seeking, loss-averse, and risk-adjusted) without introducing intermediate regime variables. DOSS formulates objective selection as a classification problem over objectives and performs sequential updates with a rolling window to make forward-looking selections without temporal leakage, while also outputting a confidence score for each proposal. To mitigate misselection and excessive switching in deployment, DOSS applies confidence-aware gating with a fail-safe that overrides low-confidence proposals to a conservative default and enforces explicit controls tied to switching frequency. We further integrate governance by positioning a Large Language Model (LLM) as an oversight component rather than a generator of new objectives: the LLM is restricted to accept a proposed objective or override it to a predefined safe default, with deterministic rule-based constraints triggering overrides when needed.2026-06-02T14:22:07ZAccpeted to The 2nd Workskop on Advances in Financial AI Workshop: Towards Agentic and Responsible Systems at ICLR 2026Keigo SakuraiTakahiro OgawaMiki HaseyamaAnjyu AnanKei Nakagawahttp://arxiv.org/abs/2606.03548v1Cost of Manipulation in AMM-Based Oracles2026-06-02T12:10:23ZWe study the robustness of AMM-based on-chain price oracles to strategic manipulation. An attacker trades against constant product automated market makers (CPMMs) to distort an on-chain oracle, arbitrageurs restore cross-pool and cross-venue consistency, and an oracle designer chooses how to aggregate pool quotes.
Taking an efficient-market-hypothesis (EMH) view of the off-chain "true" price, we define the \emph{cost of manipulation} as the minimal mark-to-market loss that an attacker must incur to move the oracle by a given multiplicative factor. For independent CPMMs, we derive closed-form single-pool manipulation formulas and solve the attacker-designer game for weighted means and weighted medians, showing that liquidity weights maximize the minimum cost of manipulation within these classes for weighted medians (for any distortion level) and, for weighted means, locally as the distortion tends to zero. For larger distortions, weighted means become more fragile: optimal weights can depend on the target distortion and no single choice is uniformly optimal across distortion levels. In a frictionless CPMM model with cross-pool arbitrage, the manipulation cost depends only on the total quote depth and coincides across symmetric aggregators.
We extend this framework to multi-asset star architectures, confirming that liquidity weights remain optimal in the same sense. Finally, we bridge theory and practice by incorporating dwell times and rate limits, providing a quantitative yardstick to size oracles against the explicit economic costs of attack.2026-06-02T12:10:23ZPublished at DeFi Workshop of FC'26Sebastian MüllerNordine MoumeniAdel Messaoudihttp://arxiv.org/abs/2606.03515v1A Voxel-Based Quantum Computing Method (VBQC) for Solid Mechanics Problem2026-06-02T11:37:49ZQuantum computing presents a promising method to overcome the efficiency and memory constraints in large-scale mechanical problems, with numerous successful applications demonstrated in fluid mechanics. However, solid mechanics problems usually require irregular grids for spatial discretization, due to the Lagrange formulations and complex boundaries, which makes the quantum simulation of the system matrix, e.g., the mass or stiffness matrix which is often referred to as the Hamiltonian in quantum computing, difficult to be effectively conducted. This study proposes a voxel-based quantum computing method (VBQC) for the quantum simulation of Hamiltonians in solid mechanics. VBQC applies voxel grids to discretize the spatial domain, thereby enabling the system matrix to exhibit the tridiagonal fractal property. Based on this property, the system matrix can be decomposed into three groups of fundamental matrices, $\mathbf{k}_{n}$, $\mathbf{c}_{n}$, and $\mathbf{q}_{n}$. This decomposition process is referred to as the KCQ decomposition. By integrating the KCQ decomposition with the quantum Fourier transform and the quantum multiplexer, VBQC enables efficient quantum simulation of Hamiltonians in solid mechanics. Three specific solid problems with different dimensions and numbers of variables are applied to preliminarily verify the correctness of the proposed VBQC for solid mechanics problems.2026-06-02T11:37:49ZFeng WuYuxiang YangLi ZhuChen LiYansong GuoXu Guohttp://arxiv.org/abs/2606.03422v1HonestAffinity: Leak-Aware Evaluation of Protein and Pocket Priors for Binding Affinity Prediction2026-06-02T10:08:22ZSequence-based deep learning offers a scalable alternative to structure-based scoring for protein-ligand binding affinity prediction. However, progress is hard to interpret when architectural priors are evaluated on canonical PDBbind-style splits that leak similarity classes across folds. We present HonestAffinity, a compact 1D-input predictor to isolate two priors under a leak-aware protocol: frozen ESM-2 (650M) protein embeddings and a learned binary pocket-position marker. We evaluate a multi-scale convolutional/Transformer template in three variants: HonestAffinity-Pocket, HonestAffinity-NoPocket, and HonestAffinity-Pocket-NoESM. All three train on 11,513 LP-PDBBind complexes in ~3 GPU-hours. We benchmark against five baselines on the LP-PDBBind 3-tier no-leak hold-out, CASF-2016, and a CASF-2016 non-train subset. Our central finding is a split-conditioned reversal rather than a uniformly best prior: HonestAffinity-Pocket achieves the best mean Pearson R on validation and CASF-2016 splits, whereas HonestAffinity-Pocket-NoESM achieves the best mean Pearson R on every strict LP no-leak tier (test_cl1-cl3). Both the pocket marker and ESM-2 input improve performance on familiar splits but reduce Pearson R on strict no-leak tiers. We argue models should report paired canonical and leak-proof ablations, and that deployment-regime-matched variants better describe these reversals than a single default. Code and scripts are linked in the footnote; checkpoints will be released upon acceptance.2026-06-02T10:08:22ZJunhao WeiBaili LuZhenhong PengWanyan LiZhirong HuangYanxiao LiYifu ZhaoDexing YaoHaochen LiXudong YeSio-Kei ImYapeng WangXu Yanghttp://arxiv.org/abs/2606.03282v1GROSS: German Rail Open-Source SUMO Scenario2026-06-02T07:46:33ZMicroscopic simulation enables reproducible evaluation in intelligent transportation systems, yet most open SUMO scenarios and toolchains remain road-traffic centric, leaving rail underrepresented despite its importance for public transport and its sensitivity to network-wide disruptions. We present the German Rail Open-Source Scenario (GROSS), an open pipeline that combines OpenStreetMap railway infrastructure with GTFS schedules to generate nation-scale rail scenarios for SUMO (Simulation of Urban MObility). Existing conversions often rely on geometry-only stop-to-track matching and inconsistent platform/track assignments, which can create routing anomalies and unstable simulations dominated by teleportation artefacts. GROSS addresses this with topology-aware stop mapping via a hierarchical station model, followed by station-level routing with validation and targeted repair. Across multiple German regions, GROSS reduces average teleportations per vehicle by a factor of 1.7--76.8$\times$, shortens delays compared to the vanilla SUMO pipeline, and it enables end-to-end generation of a Germany-wide scenario with 35\,925 trips for comparisons with operator-reported delay statistics. While the remaining long delays highlight limitations in available timetable metadata and rail dispatch modeling, GROSS lowers the barrier to building scalable, fully open rail simulations and to studying delay propagation at country scale.2026-06-02T07:46:33ZJuri PenellDamian Dailisanhttp://arxiv.org/abs/2606.03255v1Multi-Agent Framework Leveraging Knowledge Graphs for Virtual Commissioning Models2026-06-02T07:16:15ZVirtual commissioning models (VCMs) of discrete manufacturing systems are used to validate automation behavior before physical deployment, but creating and maintaining them remains labor-intensive. Relevant engineering information is distributed across programmable logic controller (PLC) engineering projects, such as Siemens TIA Portal, and kinematic simulation models, such as Siemens NX Mechatronics Concept Designer (NX MCD), where it is stored in incompatible, tool-specific data structures. In practice, IEC 61131-3-based PLC programs and variables are engineered separately from rigid-body and kinematic simulation objects such as parts, joints, sensors, and actuators. As a result, understanding system behavior, generating simulation components, and mapping PLC variables to corresponding simulation objects require cross-domain expertise and remain largely manual. This paper presents a knowledge-graph-grounded multi-agent framework for semi-automated VCM development. A deterministic setup process extracts structured data from Siemens TIA Portal and Siemens NX MCD and transforms both sources into graph-based representations within a shared graph database. The framework uses a hierarchical multi-agent architecture to support three task classes in early-stage VCM development: system understanding, simulation component generation, and cross-domain signal mapping. It provides grounded natural-language access to engineering knowledge, template-guided generation of executable NX Open journal scripts, and ranked mapping suggestions between PLC variables and NX MCD simulation objects. Evaluation on a laboratory-scale discrete manufacturing system shows that the approach reduces manual cross-domain interpretation effort and makes recurring VCM engineering tasks more actionable.2026-06-02T07:16:15ZMax DiekmannJonas NitzlerJan FischerHans-Jürgen PfistererDirk Hartmannhttp://arxiv.org/abs/2606.03210v1Critical evaluation of PINN for FWD inverse analysis and differentiable FEM as an alternative2026-06-02T06:08:55ZAutomatic-differentiation-based inverse analysis methods, including physics-informed neural networks (PINNs) and differentiable programming, have recently shown great promise due to their ability to compute accurate gradients and convergence efficiency. However, their applicability to falling weight deflectometer (FWD) backcalculation remains unexplored. This study critically evaluates PINN-based inverse analysis for a multilayer pavement system and investigates differentiable finite element method (DiffFEM) as an alternative based on a synthetic benchmark. The standard PINN does not recover layer moduli because of the sharp domain discontinuities inherent to layered pavement systems. Although we use an extended PINN with domain decomposition (XPINN), which shows better performance on discontinuous domains, its performance remains highly sensitive to loss weighting and network architecture, and degrades under measurement noise. By contrast, DiffFEM consistently achieves more accurate, stable, and computationally efficient inversion results. These results indicate that DiffFEM, which enforces the governing physics as a hard constraint, yields better accuracy, robustness, and computational efficiency than PINN-based approaches, in which the governing physics is imposed as a soft constraint through the loss function. More broadly, the findings suggest that the choice between PINN- and DiffFEM-based inverse analysis needs careful consideration, with DiffFEM offering practical advantages when an efficient and robust differentiable forward solver is available.2026-06-02T06:08:55ZYongjin ChoiHyeonbin MoonSeunghwa Ryuhttp://arxiv.org/abs/2510.22491v3LAMP: Data-Efficient Linear Affine Weight-Space Models for Parameter-Controlled 3D Shape Generation and Extrapolation2026-06-02T04:10:50ZGenerating high-fidelity 3D geometries under explicit parameter constraints is central to engineering design, yet current methods often require large datasets and fail to provide reliable control beyond the training distribution. We introduce LAMP, a data-efficient framework for controllable and interpretable 3D generation that aligns signed distance function (SDF) decoders by overfitting each exemplar from a shared initialization, then generates new designs by solving a parameter-constrained affine mixing problem in the aligned weight space. To improve reliability, we propose a linearity-mismatch safety metric that detects when mixed decoders leave the valid local regime. We evaluate LAMP on DrivAerNet++, BlendedNet, and additional industry-level vehicle families, including sports cars, SUVs, and convertibles. LAMP enables controlled interpolation with as few as 50 samples, safe extrapolation up to 100% beyond training ranges, and performance-guided optimization under fixed parameters, outperforming conditional autoencoder and Deep Network Interpolation (DNI) baselines in extrapolation, data efficiency, and parameter fidelity. Our results demonstrate that LAMP advances controllable, data-efficient, and safe 3D generation for design exploration, dataset generation, and performance-driven optimization.2025-10-26T02:12:20ZGhadi NehmeYanxia ZhangDule ShuMatt KlenkFaez Ahmedhttp://arxiv.org/abs/2511.13899v2A Factorized Low-Rank RNN Framework for Uncovering Independent Neural Latent Dynamics and Connectivity2026-06-02T03:01:03ZLow-rank recurrent neural networks (lrRNNs) are a class of models that uncover low-dimensional latent dynamics underlying neural population activity. Although their functional connectivity is low-rank, it lacks independence interpretations, making it difficult to assign distinct computational roles to different latent dimensions. To address this, we propose the Factored Recurrent Neural Network (FacRNN), a generative lrRNN framework that assumes group-wise independence among latent dynamics while allowing flexible within-group entanglement. These independent latent groups allow latent dynamics to evolve separately, but are internally rich for complex computation. We reformulate the lrRNN under a variational autoencoder (VAE) framework, enabling us to introduce a partial correlation penalty that encourages independence between groups of latent dimensions. Experiments on synthetic, monkey M1, and mouse voltage imaging data show that FacRNN consistently improves the disentanglement and interpretability of learned neural latent trajectories in low-dimensional space and low-rank connectivity over baseline lrRNNs that do not encourage group-wise independence.2025-11-17T20:49:58ZChengrui LiYunmiao WangYule WangWeihan LiDieter JaegerAnqi Wuhttp://arxiv.org/abs/2501.18850v3Beyond Pairwise Interactions: Equivariant Hypergraph Diffusion for Crystal Structure Prediction2026-06-01T23:27:52ZCrystal Structure Prediction (CSP) remains a fundamental challenge with significant implications for materials discovery and the advancement of various scientific disciplines. Recent advances have demonstrated that generative models, particularly diffusion models, are especially promising for CSP. However, traditional graph-based representations, where atomic bonds are modeled as pairwise graph edges, fail to capture the intricate high-order interactions essential for accurately describing crystal structures. To address this limitation, we propose leveraging hypergraphs to represent crystal structures, enabling more expressive modeling of multi-way atomic interactions. Hypergraphs naturally encode complex high-order relationships and respect key symmetries -- such as permutation and periodic translation invariance -- that are crucial for characterizing crystalline materials. Building on this representation, we propose the \textbf{E}quivariant \textbf{H}ypergraph \textbf{Diff}usion Model (\textbf{EH-Diff}), a generative framework designed to exploit the symmetry-preserving properties of hypergraphs. EH-Diff provides an efficient and accurate method for predicting crystal structures, with rigorous theoretical guarantees on invariance preservation. Empirically, we conduct extensive experiments on four benchmark datasets, and the results demonstrate that EH-Diff outperforms state-of-the-art CSP methods even with a single diffusion sample.2025-01-31T02:14:42Z12 pages, 4 figuresYang LiuChuan ZhouShuai ZhangXiaotong WuPeng ZhangXixun LinShirui PanZhao Lihttp://arxiv.org/abs/2606.02935v1CAD-to-CT Registration of Cylindrical Objects via Ellipse-Based Axis Estimation2026-06-01T22:33:40ZAccurate registration of CAD models to CT scans is essential for establishing ground truth geometry in volumetric imaging. Obtaining reliable object masks is of growing importance in machine learning settings; as recent architectures grow more capable, huge datasets are required to fully utilise their capabilities. Traditional intensity-based methods fail when CT grayscale values lack calibration references, while point-based algorithms (e.g., ICP, RANSAC) require feature correspondence unavailable between idealized CAD geometry and noisy volumetric CT data.
We propose a two-stage geometric registration method for cylindrical objects (ionization chambers) that takes advantage of the distinctive geometric features of the objects. First, we estimate the 3D rotation axis by detecting elliptical cross-sections across CT slices, fitting ellipses to edge-detected contours, and performing PCA on the fitted ellipse centers after RANSAC outlier removal. Second, we voxelize the CAD model, orient it along the detected axis, and maximize volumetric overlap with the CT scan through translational adjustment.
This approach achieves robust registration with tilt and orientation errors below $0.1^\circ$ without intensity calibration or feature matching. Once registered, the aligned CAD model provides ground truth geometry for applications including machine learning-based object localization and automated analysis in industrial CT workflows.2026-06-01T22:33:40ZAleksander OgonowskiMikołaj MrozowskiDaniel WięcekArkadiusz ĆwiekKonrad KlimaszewskiRafał MożdżonekAdam PadeeLech RaczyńskiPiotr WasiukWojciech WiślickiMichał MatusiakSławomir Wronka