https://arxiv.org/api/mO+R4JYT3Zp0Qjtau+ZmPX7VtUU 2026-05-15T23:42:41Z 28341 0 15 http://arxiv.org/abs/2605.15179v1 Eradicating Negative Transfer in Multi-Physics Foundation Models via Sparse Mixture-of-Experts Routing 2026-05-14T17:58:15Z

Scaling Scientific Machine Learning (SciML) toward universal foundation models is bottlenecked by negative transfer: the simultaneous co-training of disparate partial differential equation (PDE) regimes can induce gradient conflict, unstable optimization, and plasticity loss in dense neural operators. In particular, broadband open-channel fluid dynamics and boundary-dominated porous media flows impose incompatible spectral and geometric demands on a single dense parameter path. We introduce Shodh-MoE, a sparse-activated latent transformer architecture for multi-physics transport. Shodh-MoE operates on compressed 16^3 physical latents produced by a physics-informed autoencoder with an intra-tokenizer Helmholtz-style velocity parameterization, restricting decoded states to divergence-free velocity manifolds. The model guarantees exact mass conservation, achieving a physically verifiable velocity divergence of ~2.8 x 10^-10 (evaluated post-hoc in FP64) on 128^3 grids. A Top-1 soft-semantic router dynamically assigns localized latent patches to expert subnetworks, enabling specialized parameter paths for distinct physical mechanisms while preserving shared experts for universal symmetries. In a 20,000-step distributed pretraining run over mixed three-dimensional physical tensors, routing telemetry shows autonomous domain bifurcation: held-out validation tokens from the open-channel domain route exclusively to Expert 0, while porous-media tokens route exclusively to Expert 1. The model converges simultaneously across both regimes, achieving latent validation MSEs of 2.46 x 10^-5 and 9.76 x 10^-6, and decoded physical MSEs of 2.48 x 10^-6 and 1.76 x 10^-6. These results support sparse expert routing as a practical architectural mechanism for mitigating multi-physics interference in universal neural operators.

2026-05-14T17:58:15Z 5 pages, 4 figures Ellwil Sharma Arastu Sharma http://arxiv.org/abs/2512.19634v2 Influence of Magnetic Order on Proximity-Induced Superconductivity in Mn Layers on Nb(110) from First Principles 2026-05-14T17:32:00Z

We investigate the influence of magnetic order on the proximity-induced superconducting state in the Mn layers of a Mn-Nb(110) heterostructure by using a first-principles method. For this study, we use the recently developed Bogoliubov-de Gennes (BdG) solver for superconducting heterostructures [Csire et al., Phys. Rev. B 97, 024514 (2018)] within the first-principles calculations based on multiple scattering theory and the screened Korringa-Kohn-Rostoker (SKKR) Green's function method. In our calculations, we first study the normal-state density of states (DOS) in the single- and double-Mn-layer heterostructures, and calculate the induced magnetic moments in the Nb layers. Next, we compute the momentum-resolved spectral functions in the superconducting state for the heterostructure with a single Mn layer, and find bands crossing the Fermi level within the superconducting (SC) gap. We also study the SC state DOS in the single- and double-Mn-layer heterostructures and compare some of our results with experimental findings, revealing secondary gaps, plateau-like regions, and central V-shaped in-gap states within the bulk SC Nb gap that are magnetic-order-dependent. Finally, we compute the singlet and internally antisymmetric triplet (IAT) order parameters for each layer for both heterostructures, and find an order of magnitude difference in the induced singlet part of the SC order parameter in the Mn layer/s between the FM and AFM cases in favor of the AFM pairing with the maximum still being only 4.44% of the bulk Nb singlet order parameter value. We also find a negligible induced triplet part, yet comparable to the induced singlet values, indicating some singlet-triplet mixing in the Mn layer/s.

2025-12-22T18:10:06Z Phys. Rev. B 113, 174508 (Published 14 May, 2026) Sohair ElMeligy Balázs Újfalussy Kyungwha Park 10.1103/dbz9-7qp7 http://arxiv.org/abs/2605.15089v1 Adaptive homotopy continuation for robust dispersion curve computation in viscoelastic waveguides: guaranteed branch identity continuity 2026-05-14T17:10:12Z

This paper presents the first systematic application of a material homotopy continuation framework for efficient, automated computation of dispersion curves in viscoelastic waveguides of arbitrary cross-section. A material homotopy continuously maps the original lossy problem to an auxiliary lossless one via an attenuation parameter s in [0,1], addressing the core challenges of the non-Hermitian eigenvalue problem. Grounded in analytic perturbation theory, the method guarantees branch identity continuity--a one-to-one correspondence between solutions at s=0 and s=1--provided the real-parameter path does not cross any exceptional points. Under a Type I exceptional point topology, physical mode labels established at the elastic stage remain valid at the viscoelastic stage without post-processing, yielding the characteristic real-part veering with imaginary-part crossing. The decoupling strategy performs reliable mode tracking in the Hermitian regime via adaptive wavenumber refinement, then propagates a sparse set of key solutions to the target viscoelastic state through predictor-corrector homotopy continuation. Numerical examples across symmetric and unsymmetric laminates validate the framework's robustness and efficiency, with the majority of cases verified at a loss factor of approximately 0.003 and a single symmetric laminate providing additional support at 0.02. For a challenging unsymmetric laminate at a loss factor of 0.05, the method still produces numerically accurate solutions; two complementary diagnostic signatures--an extremely sharp imaginary-part crossing and a discernible discrepancy between spectral group velocity and energy flux velocity--warn of potential label mismatch and guide further analysis.

2026-05-14T17:10:12Z 43 pages, 11 figures Dong Xiao Zahra Sharif Khodaei M. H. Aliabadi http://arxiv.org/abs/2605.15073v1 Fast contracted Clebsch--Gordan tensor products for equivariant graph neural networks 2026-05-14T16:59:00Z

We present an $\mathcal{O}(L^3)$ algorithm for evaluating contracted Clebsch--Gordan tensor products in $\mathrm{O}(3)$-equivariant machine learning potentials at fixed Canonical Polyadic (CP) rank. Mapping the angular integral to a structured Gauss--Legendre and Fourier tensor-product grid decouples the radial channel contractions from the angular transforms. The antisymmetric parity-odd Clebsch--Gordan channels, unreachable by the symmetric pointwise product on a scalar $S^2$ grid, are recovered through the surface-curl pairing $\hat r \cdot [\nabla_{S^2} A \times \nabla_{S^2} B]$, the spherical Poisson bracket, which supplies the $L=1$ angular momentum on the grid while preserving rotational equivariance. The construction extends to parity-aware equivariant message passing in atomic-cluster-expansion-style architectures and is verified by direct numerical quadrature. The full uncontracted Clebsch--Gordan tensor product remains subject to the $\mathcal{O}(L^4)$ output-size lower bound. A benchmark shows wall-clock scaling empirically as $L^2$ across the practical $l_{\max}$ range. For the on-site contraction this is pre-asymptotic, giving way to $L^3$ at large $l_{\max}$. For message passing it is structural and the runtime is memory-bandwidth bound on $L^2$-sized grid tensors.

2026-05-14T16:59:00Z Anton Bochkarev Yury Lysogorskiy Ralf Drautz http://arxiv.org/abs/2504.03990v2 Parametric Operator Inference to Simulate the Purging Process in Semiconductor Manufacturing 2026-05-14T16:53:46Z

This work presents the application of parametric Operator Inference (OpInf) -- a nonintrusive reduced-order modeling (ROM) technique that learns a low-dimensional representation of a high-fidelity model -- to the numerical model of the purging process in semiconductor manufacturing. Leveraging the data-driven nature of the OpInf framework, we aim to forecast the flow field within a plasma-enhanced chemical vapor deposition (PECVD) chamber using computational fluid dynamics (CFD) simulation data. Our model simplifies the system by excluding plasma dynamics and chemical reactions, while still capturing the key features of the purging flow behavior. The parametric OpInf framework learns nine ROMs based on varying argon mass flow rates at the inlet and different outlet pressures. It then interpolates these ROMs to predict the system's behavior for 25 parameter combinations, including 16 scenarios that are not seen in training. The parametric OpInf ROMs, trained on 36\% of the data and tested on 64\%, demonstrate accuracy across the entire parameter domain, with a maximum error of 9.32\%. Furthermore, the ROM achieves an approximate 142-fold speedup in online computations compared to the full-order model CFD simulation. These OpInf ROMs may be used for fast and accurate predictions of the purging flow in the PECVD chamber, which could facilitate effective particle contamination control in semiconductor manufacturing.

2025-04-04T23:08:38Z 18 pages, 11 figures Seunghyon Kang Hyeonghun Kim Boris Kramer http://arxiv.org/abs/2605.14987v1 A Monte Carlo positronium decay source model with multiple annihilation channels in GATE 2026-05-14T15:49:19Z

Positronium-based imaging requires realistic modelling of positronium (Ps) decay in matter. We introduce a modular Ps decay model implemented in GATE 9.4 and GATE 10, enabling the definition of an arbitrary number of decay channels characterised by lifetime, branching fraction, annihilation multiplicity (2g/3g), and optional prompt photon emission. The model is validated through analytical and numerical benchmarks, including lifetime distributions, branching fraction consistency, photon kinematics, and prompt photon emission. Its practical applicability is demonstrated using simulations of mixed annihilation scenarios and the NEMA IEC phantom with a large field-of-view PET system. The proposed model accurately reproduces input lifetime distributions as weighted sums of exponential components and correctly samples decay channel fractions. Simulated two- and three-photon annihilation kinematics are consistent with theoretical expectations. Complex mixtures of decay channels, including varying 3g-to-2g ratios and multi-component ortho-positronium lifetimes, are correctly modelled, with observable signatures reflected in both temporal and energy distributions. Phantom simulations demonstrate the capability to generate realistic positronium-sensitive datasets. This work provides the first general-purpose, multi-channel positronium decay model integrated into GATE, enabling realistic simulations of positronium behaviour in complex media. The model supports the development and optimisation of positronium-based imaging techniques, including PLI and multi-photon PET, and applies to medical imaging, industrial tomography, and fundamental physics studies. Its public availability and compatibility with standard GATE workflows make it a valuable tool for the broader research community.

2026-05-14T15:49:19Z 24 pages, 12 figures Wojciech Krzemien Mateusz Bala Kamil Dulski Wojciech Zdeb Aurélien Coussat Beatrix C. Hiesmayr Konrad Klimaszewski Michał Obara Lech Raczyński Roman Y. Shopa http://arxiv.org/abs/2602.11626v2 ArGEnT: Arbitrary Geometry-encoded Transformer for Operator Learning 2026-05-14T15:46:56Z

Learning solution operators for systems with complex, varying geometries and parametric physical settings is a central challenge in scientific machine learning. In many-query regimes such as design optimization, control and inverse problems, surrogate modeling must generalize across geometries while allowing flexible evaluation at arbitrary spatial locations. In this work, we propose Arbitrary Geometry-encoded Transformer (ArGEnT), a geometry-aware attention-based architecture for operator learning on arbitrary domains. ArGEnT employs Transformer attention mechanisms to encode geometric information directly from point-cloud representations with three variants-self-attention, cross-attention, and hybrid-attention-that incorporates different strategies for incorporating geometric features. By integrating ArGEnT into DeepONet as the trunk network, we develop a surrogate modeling framework capable of learning operator mappings that depend on both geometric and non-geometric inputs without the need to explicitly parametrize geometry as a branch network input. Evaluation on benchmark problems spanning fluid dynamics, solid mechanics and electrochemical systems, we demonstrate significantly improved prediction accuracy and generalization performance compared with the standard DeepONet and other existing geometry-aware saurrogates. In particular, the cross-attention transformer variant enables accurate geometry-conditioned predictions with reduced reliance on signed distance functions. By combining flexible geometry encoding with operator-learning capabilities, ArGEnT provides a scalable surrogate modeling framework for optimization, uncertainty quantification, and data-driven modeling of complex physical systems.

2026-02-12T06:22:59Z 69 pages, 21 figures, 10 tables Wenqian Chen Yucheng Fu Michael Penwarden Pratanu Roy Panos Stinis http://arxiv.org/abs/2605.11253v2 Low-rank compression of two-electron reduced density matrices 2026-05-14T15:43:33Z

Two-body reduced density matrices (2RDMs) encode the essential two-electron physics of electronic states, but their quartic storage cost poses a major limitation in practical workflows. We investigate a simple protocol to compress both transition and non-transition 2RDMs into a lower-rank representation that preserves their wedge-product structure and physical symmetries under truncation. The resulting decomposition couples Coulomb and exchange channels through a common set of low-rank factors, yielding a more compact rank-sparse representation than single-channel factorizations. For correlated states, the effective rank scales linearly with system size, achieving a $\sim99$\% compression for the coupled-cluster 2RDM of octane while retaining chemical accuracy. We apply this to the recently introduced {\em ab initio} eigenvector continuation workflows, where many-body wave functions are interpolated across nuclear geometries with mean-field cost. Here, 2RDMs between training states act as projectors into a subspace but their memory scaling limits applications to larger systems. The compression scheme reduces the memory cost from quartic to quadratic for a fixed error per electron. Metrics to systematically control the decomposition are investigated, enabling statistically resolved structural, dynamical and spectroscopic observables from nonadiabatic molecular dynamics simulations of photoexcited H$_{28}$ chains, interpolating from compressed near-exact DMRG training data. This establishes these structure-preserving compressed intermediates for practical correlated electronic structure workflows.

2026-05-11T21:25:53Z Kemal Atalar Hugh G. A. Burton Andreas Grüneis George H. Booth http://arxiv.org/abs/2605.14861v1 Lévy-like flights and fractal geometry of finite point sets 2026-05-14T14:07:09Z

We study Lévy-like and truncated Lévy-like flights with step probability distribution of the form $r^{-1+ν}$ for negative, positive, and zero $ν$, focusing on the appearance of fractal geometry characteristics in the generated point sets. Forming ensembles of such point sets with fixed multiplicity, we develop simulation techniques leading to the desired value of correlation dimension in a vast continuous interval of scales. In particular, we demonstrate the possibility to produce ensembles of data sets with a low number of points with the needed properties. Furthermore, we show that the positive $ν$ distributions, apart from a region near the upper scale limit, show fractal behaviour that extends to infinitesimally low scales. As an example, we apply our findings to producing simulations relevant to the search for critical fluctuations, related to QCD critical endpoint, in heavy-ion collision experiments.

2026-05-14T14:07:09Z 35 pages, 16 figures Konstantinos Chalas F. K. Diakonos A. S. Kapoyannis http://arxiv.org/abs/2605.14745v1 Functional and Density-Driven Errors in Density Functional Theory: Quantum Monte Carlo Benchmarks for Solids 2026-05-14T12:13:01Z

We introduce a systematic analysis of density functional approximation errors in solids by separating functional-driven from density-driven contributions using quantum Monte Carlo densities of silicon, sodium chloride, and copper as reference. Typically, functional errors dominate, but we identify important exceptions where density-driven errors exceed functional errors by factors of 2-3, notably for SOGGA11 and τ-HCTH in the semiconductor and the insulator. Material dependence is striking: 63% of functionals show error cancellation in silicon versus 18% in copper, and only five functionals surpass LDA accuracy for metallic copper even with exact densities. For silicon and sodium chloride, GILL or BECKE exchange combined with PBE, PW91, or P86 correlation achieves near-exact xc energies on QMC densities, while copper requires specialized functionals like PBEsol or PBELYP. High-quality densities consistently reduce density-driven errors across all systems. Historical analysis reveals that 1990s GGA functionals outperform many modern meta-GGAs, contradicting expectations of systematic improvement along Jacob's ladder. These results provide practical guidance for functional selection and highlight implications for machine learning potential development, where material-dependent error cancellation may compromise transferability.

2026-05-14T12:13:01Z Ayoub Aouina Nicolas Tancogne-Dejean Silvana Botti http://arxiv.org/abs/2605.14646v1 N-Graphdiyne as a Tunable Platform for Stabilizing Light Metals toward High-Capacity Reversible Hydrogen Storage 2026-05-14T10:03:41Z

Hydrogen (H2) is a promising carbon-neutral energy carrier. However, its deployment is limited by the lack of lightweight, reversible storage media that operate under practical conditions. Here, we establish nitrogen-doped graphdiyne (N-GDY) as a programmable two-dimensional platform for stabilizing dispersed light-metal dopants and enabling high-capacity physisorption of molecular H2. The computational package involves density functional theory (DFT) combined with ab initio molecular dynamics (AIMD) and Langmuir-based statistical thermodynamic modeling. The results revealed that N-sites of N-GDY bind up to five Li, Na, K, and Ca atoms per primitive cell with binding energies of -2.27, -1.57, -1.80, and -2.13 eV, respectively, exceeding their respective bulk cohesive energies. AIMD simulations at 400 K further confirm the structural robustness of the decorated frameworks and the absence of metal aggregation. The polarised metal centres activate reversible H2 adsorption through electrostatic and dispersion interactions, with average adsorption energies falling within the optimal window (-0.15 to -0.35 eV per H2). Sequential adsorption analysis reveals uptake of up to 25 H2 molecules per primitive cell, achieving intrinsic gravimetric capacities of 13.08, 10.82, 9.23, and 9.15 wt% for Li-, Na-, K-, and Ca-functionalized systems, respectively. Thermodynamic analysis indicates favorable adsorption-desorption behavior under near-ambient conditions, with Li- and Ca-functionalized systems exceeding the 6.5 wt% U.S. Department of Energy's ultimate system-level target when considering intrinsic material capacity. These results identify N-GDY as a chemically tunable scaffold for dispersing lightweight metals and provide a mechanistic design strategy for achieving high-capacity, reversible hydrogen storage in porous two-dimensional materials.

2026-05-14T10:03:41Z Wael Othman Biomedical Engineering and Biotechnology, Khalifa University, Abu Dhabi, United Arab Emirates Healthcare Engineering Innovation Group Ibrahim Alghoul Physics Department, United Arab Emirates University, Al Ain, United Arab Emirates Water Research Center, United Arab Emirates University, Al Ain, United Arab Emirates K-F. Aguey-Zinsou5 Physics Department, United Arab Emirates University, Al Ain, United Arab Emirates Water Research Center, United Arab Emirates University, Al Ain, United Arab Emirates Nacir Tit Physics Department, United Arab Emirates University, Al Ain, United Arab Emirates Water Research Center, United Arab Emirates University, Al Ain, United Arab Emirates Tanveer Hussain School of Science and Technology, University of New England, Armidale, New South Wales, Australia http://arxiv.org/abs/2605.14527v1 Lang2MLIP: End-to-End Language-to-Machine Learning Interatomic Potential Development with Autonomous Agentic Workflows 2026-05-14T08:10:42Z

Developing machine learning interatomic potentials (MLIPs) for complex materials systems remains challenging because it requires expertise in atomistic simulations, machine learning, and workflow design, as well as iterative active learning procedures. Existing automated pipelines typically assume a fixed sequence of stages or depend on domain experts, which limits their adaptability to heterogeneous materials systems where the optimal curriculum is not known in advance. To lower the barrier to developing MLIPs for non-experts, we propose Lang2MLIP, a multi-agent framework that takes natural-language input and formulates end-to-end MLIP development as a sequential decision-making problem solved by large language models (LLMs). At each step, a decision-making agent observes the current dataset, model, evaluation results, and execution log, and then automatically selects an appropriate action to improve the model. This removes the need for a predefined pipeline and enables the agent to self-correct by revisiting earlier subsystems when new failures arise. We evaluate this approach on a solid electrolyte interphase (SEI) system with multiple components and interfaces. These results suggest that LLM-based multi-agent systems are a promising direction for automating MLIP development and making it more accessible to non-experts.

2026-05-14T08:10:42Z 31 pages, 12 figures Wenwen Li Yuki Orimo Nontawat Charoenphakdee http://arxiv.org/abs/2605.14471v1 High-Pressure Crystal Structure Database 2026-05-14T07:05:46Z

High-pressure research is a productive route to new structures and emergent properties. However, crucial high-pressure structural information remains highly fragmented across individual publications and heterogeneous computational repositories. This fragmentation creates a major bottleneck for data-driven materials design. To bridge this gap, we introduce the High-Pressure Crystal Structure Database (HPCSD), a traceable, pressure-resolved repository that integrates experimental and theoretical high-pressure structures. HPCSD is constructed from two complementary data streams: elemental high-pressure phases and a searchable configuration space of stable and metastable phases generated via CALYPSO crystal structure prediction. To ensure rigorous comparability, all retained structures underwent re-optimization under a unified density functional theory (DFT) framework , with continuous enthalpy curves systematically generated specifically for the elemental phases across their stability fields. The initial release encompasses 77,346 consistently evaluated structural entries spanning 89 elements. An analysis reveals that pressure-induced polymorphism is ubiquitous and exhibits pronounced family-dependent trends. Structural diversity is strongly influenced by an element's electronic adaptability , with the greatest structural complexity emerging at intermediate rather than highest pressures. By providing standardized, reusable, and rigorously evaluated high-pressure structural data, HPCSD establishes a robust infrastructure to accelerate experimental phase identification, facilitate cross-study thermodynamic comparisons, and support the development of machine-learning interatomic potentials and generative models for high-pressure systems.

2026-05-14T07:05:46Z 6 pages, 2 figures Zhenyu Wang Qingchang Wang Junwen Duan Heng Ge Xiaoshan Luo Pengyue Gao Wei Zhang Jian Lv Yanchao Wang Yanming Ma http://arxiv.org/abs/2605.14397v1 Three dimensional simulation of fluid-driven frictional and tensile ruptures on existing discontinuities 2026-05-14T05:25:28Z

We present an implicit, fully-coupled hydro-mechanical solver for the three dimensional simulation of fluid-driven rupture propagation along existing discontinuities. The solver handles simultaneously frictional slip (shear failure) and tensile opening (hydraulic fracture) along arbitrary intersecting fractures and faults in a linearly elastic and impermeable rock matrix. The spatial discretization combines a collocation displacement discontinuity boundary element method for quasi-static elasticity with a Galerkin finite element method for nonlinear pore-fluid diffusion along the discontinuities. Frictional and tensile failure are governed by a poro-elastoplastic cohesive zone like interface law with slip-weakening friction, dilatancy, and tensile strength degradation, integrated via an elastic predictor-plastic corrector scheme. The strong nonlinear coupling between mechanical deformation and fracture permeability is handled via adaptive implicit time-stepping. Efficient block preconditioning of the coupled tangent system, leveraging hierarchical matrix representations of the boundary element operator, is essential to achieve robustness across the full range of fracture behaviors. Accuracy and convergence are demonstrated against a comprehensive suite of analytical and semi-analytical solutions of increasing complexity: fluid-driven frictional ruptures under constant and slip-weakening friction, dilatant ruptures with permeability changes, and penny shaped hydraulic fractures spanning the viscosity-to-toughness transition. The solver is further assessed on two multi-fracture configurations: injection into three intersecting fractures, and a height-confined hydraulic fracture intersecting a strike-slip fault. The proposed framework simultaneously captures frictional slip, dilatancy, permeability evolution, and tensile opening.

2026-05-14T05:25:28Z Brice Lecampion Sylvain Brisson Antareep Sarma Ankit Gupta Alexis Sáez Regina Fakhretdinova http://arxiv.org/abs/2605.14370v1 Deciphering Neural Reparameterized Full-Waveform Inversion with Neural Sensitivity Kernel and Wave Tangent Kernel 2026-05-14T04:49:23Z

Full-waveform inversion (FWI) estimates unknown parameters in the wave equation from limited boundary measurements. Recent advances in neural reparameterized FWI (NeurFWI) demonstrate that representing the parameters using a neural network can reduce the reliance on the high-quality initial model and wavefield data, at the cost of slow high-resolution convergence. However, its underlying theoretical mechanism remains unclear. In this study, we establish the neural sensitivity kernel (NSK) and the wave tangent kernel (WTK) to analyze their convergence behavior from both model and data domains. These theoretical frameworks show that the neural tangent kernel (NTK) induced by neural representation adaptively modulates the original sensitivity and wave tangent kernels. This modulation leads to several key outcomes, i.e., the spectral filtering effect, the gradient wavenumber modulation, and the wave frequency bias, connecting the convergence behavior of NeurFWI with the eigen-structures of NSK and WTK. Building on these insights, we propose several enhanced NeurFWI methods with tailored eigen-structures in NSK and WTK to improve inversion performances and efficiency. We numerically validate these theoretical claims and the proposed methods in seismic exploration, and firstly extend their application to medical imaging.

2026-05-14T04:49:23Z Ruihua Chen Yisi Luo Bangyu Wu Xile Zhao Deyu Meng