https://arxiv.org/api/L07C19X2xf8RFL5VyBfy0WnyHUI2026-06-18T16:36:03Z2748739015http://arxiv.org/abs/2605.14408v1Strain-Enhanced Hydrogen Evolution, Electrical, Optical, and Thermoelectric Properties of the Multifunctional 2D CrSi2N4 Monolayer2026-05-14T05:50:46ZFirst-principles density functional theory (DFT) is employed to evaluate the structural, electronic, optical, thermoelectric, and electrocatalytic properties of monolayer CrSi2N4. Its symmetric N-Si-N-Cr-N-Si-N septuple-layer structure exhibits dynamic, thermal (300 K), and mechanical stability, supported by a -8.76 eV/atom cohesive energy. PBE and HSE06 functionals reveal an indirect bandgap of 0.58 eV and 2.16 eV, respectively, driven by localized Cr-3d and N-2p states. The monolayer features 15.57 static dielectric constant and maximum absorption coefficients of 0.9 X 10^6 cm-1 (visible) and 1.4 X 10^6 cm-1 (deep-UV). Semiclassical Boltzmann calculations predict an outstanding room-temperature n-type thermoelectric power factor of 3.5 x mW/mK2. For hydrogen evolution (HER), the basal plane yields a baseline hydrogen adsorption free energy (ΔGH) of 1.05 eV at the N-site. Applying +5% expansive biaxial strain improves HER kinetics, reducing ΔGH to 0.46 eV. Thus, CrSi2N4 is a resilient, tuneable candidate for waste-heat recovery, photodetectors, and sustainable electrocatalysis.2026-05-14T05:50:46ZRao Uzair AhmadFahd Sikandar KhanNasir Javedhttp://arxiv.org/abs/2605.14287v1A quantum chemistry dataset containing ground-state and conical-intersection structures of 260k molecules2026-05-14T02:39:30ZConical intersections play central roles in photoinduced reactions. However, comprehensive conical-intersection datasets that could advance our understanding of excited-state reaction processes remain scarce. To address this gap, we constructed a quantum chemistry dataset containing ground-state and conical-intersection structures of small molecules (up to ten heavy atoms: C, N, O, F). Ground-state geometries were optimized at the semi-empirical OM2 level, with single-point energies calculated at the OM2/MRCI level. Conical-intersection geometries and energies were also computed at the OM2/MRCI level. This dataset is designed to enable a deep integration of photochemistry with machine learning, bridging the gap between photochemical insight and data-driven approaches.2026-05-14T02:39:30ZJiahui ZhangYifei ZhuChuqiao FengYingjin MaChao XuZhenggang Lanhttp://arxiv.org/abs/2605.14154v1TSAgent: An Agentic Workflow for Autonomous Transition State Search2026-05-13T22:08:24ZIdentifying transition states (TSs) on potential energy surfaces is a central computational bottleneck in mechanistic studies of catalytic materials. A TS search is not a single calculation but a long-horizon, multi-step workflow of atomistic simulations with delayed, asynchronous feedback and heterogeneous failure modes that require a joint multimodal analysis of scalar convergence diagnostics and atomic geometries along the reaction path. To address this challenge, we propose TSAgent, an agentic workflow that automates TS search directly at the density functional theory (DFT) level of quantum chemical accuracy. TSAgent operates through a persistent plan-execute-analyze-replan loop, continuously adapting its strategy based on convergence diagnostics and geometric feedback without human intervention. We evaluate TSAgent on a diverse 100-example subset of the OC20NEB heterogeneous catalysis benchmark, where it successfully locates TSs with 83% accuracy. In a direct comparison against expert DFT practitioners on 10 held-out examples, TSAgent achieves a 70% success rate compared to a human-expert average of 73 +/- 12%. Finally, TSAgent independently reproduces Bronsted-Evans-Polanyi scaling relationships for NH3 dissociation on metal and single-atom alloy surfaces from a published heterogeneous catalysis study, demonstrating that its utility extends beyond curated benchmarks to real scientific investigations.2026-05-13T22:08:24ZVarun MadhavanAnkit MathankerDean M. SweeneyOluwatosin A. OhiroYixin WangBryan R. Goldsmithhttp://arxiv.org/abs/2511.07686v2Kolmogorov-Arnold Chemical Reaction Neural Networks for learning pressure-dependent kinetic rate laws2026-05-13T19:09:01ZChemical Reaction Neural Networks (CRNNs) have emerged as an interpretable machine learning framework for discovering reaction kinetics directly from data, while strictly adhering to the Arrhenius and mass action laws. However, standard CRNNs cannot represent pressure-dependent or mixture-based rate behavior, which is critical in many combustion and chemical systems and typically requires empirical falloff formulations such as Troe or SRI, or data-based interpolation or polynomial fits such as PLOG or Chebyshev Polynomials. Here, we develop Kolmogorov-Arnold Chemical Reaction Neural Networks (KA-CRNNs) that generalize CRNNs by modeling each kinetic parameter as a learnable function of third-body concentrations using Kolmogorov-Arnold activations. This structure maintains the Arrhenius and mass action interpretability and physical constraints of a vanilla CRNN while enabling assumption-free inference of global and collider-specific pressure effects directly from data. Two proof-of-concept reaction studies are presented to highlight the capability of KA-CRNNs to accurately reproduce pressure-dependent and collider-specific kinetics across a range of temperatures, pressures, and bath gas mixtures, extracting meaningful and generalizable models from sparse training data and significantly outperforming interpolative approaches (2.88x reduction in MSE). The framework establishes a foundation for data-driven discovery of extended kinetic behaviors in complex reacting systems, advancing interpretable and physics-constrained approaches for chemical model inference.2025-11-10T23:08:54Z12 pages, 8 figuresBenjamin C. KoenigSili Denghttp://arxiv.org/abs/2602.05793v2Generalized Path Reweighting and History-Dependent Free Energies2026-05-13T18:38:35ZTransition interface sampling (TIS) and replica exchange TIS (RETIS) are powerful methods for computing rates of rare events inaccessible to straightforward molecular dynamics (MD) simulations. Path reweighting extends their output, enabling the evaluation of diverse thermodynamic and kinetic quantities, including reaction prediction metrics, activation barriers, committor functions, and free energies. The recently developed Infinity-RETIS algorithm boosts parallel efficiency through asynchronous replica exchanges in the infinite-swap limit, eliminating the wall-time bottlenecks of conventional RETIS. This approach introduces fractional samples and biased sampling distributions, requiring a generalized path reweighting framework, for which we derive expressions demonstrating how exact dynamic and thermodynamic variables can be computed. We then focus on a special class of free energy surfaces defined by history-dependent conditions, whose values are influenced by kinetic factors such as particle mass and friction, unlike standard unconditional free energy surfaces. Even with suboptimal reaction coordinates, these conditional free energies can reveal kinetically relevant barriers that may be misrepresented by standard unconditional free energies, thereby providing a rigorous and versatile tool for characterizing complex molecular transitions.2026-02-05T15:48:32Z15 pages, 5 figuresTitus S. van ErpDaniel T. ZhangElias WilsSina SafaeiAn Ghyselshttp://arxiv.org/abs/2605.13826v1Reducing cross-sample prediction churn in scientific machine learning2026-05-13T17:50:57ZScientific machine learning reports predictive performance. It does not report whether the same prediction would survive a different draw of training data. Across $9$ chemistry benchmarks, two classifiers trained on independent bootstraps of the same training set agree on aggregate accuracy to within $1.3\text{--}4.2$ percentage points but disagree on the class label of $8.0\text{--}21.8\%$ of test molecules. We call this gap \emph{cross-sample prediction churn}. The standard parameter-side techniques (deep ensembles, MC dropout, stochastic weight averaging) do not reduce this gap; two data-side methods do. The first is $K$-bootstrap bagging, which cuts the rate $40\text{--}54\%$ on every dataset at no accuracy cost ($K{\times}$-ERM compute). The second is \emph{twin-bootstrap}, our proposal: two networks trained jointly on independent bootstraps with a sym-KL consistency loss between their predictions, which at matched $2{\times}$-ERM compute reduces churn a further median $45\%$ beyond bagging-$K{=}2$. Cross-sample prediction churn deserves a column alongside predictive performance in scientific-ML benchmark reports, because without it the parameter-side and data-side methods are indistinguishable on the metric they actually differ on.2026-05-13T17:50:57ZGordan PrastaloKevin Maik Jablonkahttp://arxiv.org/abs/2602.12382v2Fast Generation of Pipek-Mezey Wannier Functions via the Co-Iterative Augmented Hessian Method2026-05-13T15:13:51ZWe report a $k$-point extension of the second-order co-iterative augmented Hessian (CIAH) algorithm, termed $k$-CIAH, for Pipek-Mezey (PM) localization of Wannier functions (WFs). By exploiting an efficient evaluation of the Hessian-vector product, $k$-CIAH achieves $O(N_k^2 n^3)$ scaling in both CPU time and memory, matching that of previously reported first-order $k$-space approaches while improving upon the $O(N_k^3 n^3)$ scaling of $Γ$-point CIAH, where $N_k$ denotes the number of $k$-points sampling the first Brillouin zone and $n$ characterizes the unit-cell size. Benchmark calculations on a diverse set of solids -- including insulators, semiconductors, metals, and surfaces -- demonstrate the fast and robust convergence of $k$-CIAH-based PMWF optimization, which yields an overall computational efficiency approximately 2-3--fold higher than first-order $k$-space methods and orders of magnitude higher than $Γ$-point CIAH for localizing 1000-5000 orbitals. The quality of the resulting PMWFs is further validated by accurate electronic band structures obtained via PMWF-based Wannier interpolation.2026-02-12T20:24:02ZGengzhi YangHong-Zhou Yehttp://arxiv.org/abs/2601.00131v2Random phase approximation-based local natural orbital coupled cluster theory2026-05-13T14:58:56ZPractical applications of fragment embedding and closely related local correlation methods critically depend on a judicious choice of a low-level theory to define the local embedding subspace and to capture long-range electrostatic and correlation effects outside the embedding region. Second-order Møller-Plesset perturbation theory (MP2) is by far the most widely used correlated low-level theory; however, its applicability becomes questionable in systems where MP2 is known to fail either quantitatively or qualitatively. In this work, we present the random phase approximation (RPA) as a promising alternative low-level theory to MP2 within the local natural orbital-based coupled-cluster (LNO-CC) framework. We demonstrate that RPA-based LNO-CC closely matches the performance of its MP2-based counterpart for systems with sizable energy gaps, while delivering significantly faster convergence toward the canonical coupled-cluster limit for metallic systems, particularly as the thermodynamic limit is approached. These results highlight the critical role of the low-level theory in fragment embedding and local correlation methods and identify RPA as a compelling alternative to the commonly used MP2.2025-12-31T22:23:48ZRuiheng SongXiliang GongAamy BakryHong-Zhou Yehttp://arxiv.org/abs/2605.23971v1Physics-Guided Concentration Inference from Resistance Transients in a Mixed-Phase SnO-SnO$_2$ Carbon Monoxide Sensor with p-n Switching2026-05-13T10:37:32ZThis work presents a physics-guided machine-learning framework for carbon monoxide concentration inference from experimentally measured resistance transients of a mixed-phase SnO-SnO$_2$ material gas sensor exhibiting temperature-dependent p-n switching behavior. Cycle-level transient responses are represented through physically interpretable descriptors and complemented by compact fast Fourier transform (FFT) and discrete wavelet transform (DWT)-based summaries. Using leakage-aware grouped cross-validation, we study both multi-class concentration classification and continuous concentration regression for the p-type and n-type sensing regimes separately. Across both regimes, fused features provide the strongest overall performance, while the physics-guided descriptor block remains highly competitive, indicating that the dominant concentration information is already encoded in physically meaningful transient dynamics. The p-type branch shows the best concentration-class discrimination, with the fused Random Forest classifier reaching approximately $96.5\%$ accuracy, whereas the n-type branch yields the best quantitative concentration estimation, with the fused Random Forest regressor achieving an MAE$\approx 1.48$ ppm and an R$^2$ $\approx 0.992$. These results reveal a clear dual-regime behavior: p-type sensing is particularly favorable for classification, whereas n-type sensing is more favorable for high-fidelity regression. More broadly, the study demonstrates that leakage-aware, cycle-level, physics-guided machine learning can extend conventional gas-sensing analysis beyond single-response metrics while preserving physical interpretability2026-05-13T10:37:32Z15 pages, 14 figuresSani BiswasPreetam SinghAmit Kumar Gangwarhttp://arxiv.org/abs/2605.13305v1MPINeuralODE: Multiple-Initial-Condition Physics-Informed Neural ODEs for Globally Consistent Dynamical System Learning2026-05-13T10:18:18ZNeural ordinary differential equations (Neural ODEs) often fit training trajectories while generalizing poorly to unseen initial conditions and long horizons. We propose MPINeuralODE, which combines a soft physics-informed residual with a Multiple-Initial-Condition (MIC) multiple-shooting curriculum whose ingredients are structurally complementary: the physics term anchors the vector-field magnitude on the support that MIC enlarges. We evaluate along three axes: out-of-sample error, long-horizon stability, and Hamiltonian drift, which together expose whether the learned dynamics recover the underlying vector field. On Lotka-Volterra, MPINeuralODE achieves the lowest out-of-sample and long-horizon MSE among data-driven methods, with a 26% reduction over the baseline Neural ODE, while essentially matching the PINN ablation on Hamiltonian drift.2026-05-13T10:18:18ZLake YangAntonio Malpica-MoralesFrank Ioannis Papadakis WoodSerafim Kalliadasishttp://arxiv.org/abs/2605.10312v2FusionRCG: Orchestrating Recursive Computation Graphs across GPU Memory Hierarchies2026-05-13T10:03:04ZEvaluating high-dimensional integrals via deep hierarchical recurrences is a dominant cost in quantum chemistry. While CPUs manage these efficiently, GPUs suffer a critical mismatch: limited per-thread memory is quickly overwhelmed by an explosion of simultaneously live intermediate variables. As recurrence scales, this forces massive data spilling to global memory, collapsing performance into a severe memory-bound regime. We present FusionRCG, a framework that jointly optimizes computation graph structure and GPU memory mapping. Exploiting the inherent topological flexibility of recurrence graphs, using electron repulsion integrals as an example, we contribute: (1) liveness-aware graph orchestration to minimize peak live intermediates; (2) algebraic dimensionality reduction via stepwise Cartesian-to-spherical fusion, shrinking intermediate footprints by up to $7.7\times$; and (3) an adaptive multi-tier kernel architecture routing graphs across the memory hierarchy. Evaluated on NVIDIA A100 GPUs, FusionRCG achieves up to $3.09\times$ end-to-end SCF speedup over GPU4PySCF and maintains $75\%$ parallel efficiency at 64~GPUs, successfully rescuing these workloads from memory-bound limits.2026-05-11T10:12:25ZYihong ZhangXinran WeiJunshi ChenFusong JuWei HuJinlong YangHuanhuan Xiahttp://arxiv.org/abs/2605.10363v2Accelerating Locality-Driven Integration in Quantum Chemistry with Block-Structured Matrix Multiplication2026-05-13T09:59:52ZLocality-driven integration is a pervasive computational pattern in quantum chemistry, arising whenever spatially localized basis functions interact through numerical quadrature or integral screening. The dominant matrix multiplications in these tasks exhibit dynamic, structured sparsity driven by spatial locality, posing significant challenges for both dense batched kernels and generic sparse formats on GPUs. We present KerneLDI, a GPU-oriented framework that addresses this regime by co-designing data layout, screening logic, and matrix-computation operators to realize block-structured matrix multiplication for locality-driven integration. KerneLDI reorganizes operand matrices into a unified block-filtered representation that retains only spatially relevant blocks, and executes the resulting contractions with customized dense block multipliers that adapt proven dense-matmul optimizations to retained block pairs. We develop and evaluate KerneLDI on exchange--correlation (EXC) integration in Kohn--Sham density functional theory, a representative and computationally critical instance of this pattern. Across diverse molecular systems, KerneLDI preserves numerical accuracy while delivering up to 10$\times$ speedup for EXC evaluation over a dense GPU baseline, scales favorably with increasing system size and multi-GPU parallelism, accelerates end-to-end self-consistent field calculations, and yields nearly 6$\times$ throughput improvement for ab initio molecular dynamics.2026-05-11T11:08:19ZXinran WeiYan PanFusong JuZehao ZhouYihong ZhangLin HuangJianwei ZhuJia ZhangHuanhuan XiaBin ShaoTao Qinhttp://arxiv.org/abs/2605.13164v1Helium Bubbles in Liquid Lead Lithium Solutions: Pressure Inhomogeneities at Interfaces and Non Ideal Mixture Effects2026-05-13T08:28:00ZThe extremely low solubility of helium in liquid metals may lead to rapid supersaturation, promoting spontaneous formation of helium bubbles by nucleation. Once nucleated, the stability of these bubbles is governed by the properties of the helium liquid metal interface. In particular, interfacial tension between the immiscible phases controls bubble interactions and induces local pressure inhomogeneities. This work is motivated by the need of a better understanding of helium bubble formation in liquid Pb Li alloys, which are of particular relevance for the design of breeding blankets in the future nuclear fusion reactors. We employ classical molecular dynamics simulations to investigate helium segregation in a range of lead lithium systems, including the limiting cases of pure lead and pure lithium. Changes in local pressure are evaluated from direct mechanical calculations, enabling the characterization of interfacial properties. Interfacial tension and radius of the bubble are subsequently determined across multiple thermodynamic conditions, spanning temperatures starting near the melting points of the constituent metals up to 1021 K. The impact of curvature and composition of the alloy on the interfacial behaviour are also investigated.2026-05-13T08:28:00Z14 pages, 8 figures. Including "Supplementary Information" at the bottom of the manuscriptEdgar Alvarez-GaleraJordi MartiLluis Batethttp://arxiv.org/abs/2605.13060v1Rotational energy levels in the ground vibrational state of methane with kHz-level accuracy from comb-referenced double-resonance and Lamb-dip spectroscopies2026-05-13T06:33:29ZMethane is a key spherical-top molecule, yet restrictive selection rules for one-photon transitions have prevented determination of its ground state (GS) energies with state-of-the-art kHz-level accuracy. We report the GS rotational energy level differences with kHz-level accuracy from two frequency-comb-referenced sub-Doppler methods: optical-optical double-resonance spectroscopy in the $Λ$-type configuration, and Lamb-dip spectroscopy of allowed and forbidden transitions. A Hamiltonian fit to the data yields GS term values with rotational numbers up to $\it{J}$ = 12 with kHz level accuracy.2026-05-13T06:33:29ZVinicius Silva de OliveiraIsak SilanderHiroyuki SasadaSho OkuboHajima InabaKevin K. LehmannAleksandra Foltynowiczhttp://arxiv.org/abs/2605.12823v1Hessian Matching for Machine-Learned Coarse-Grained Molecular Dynamics2026-05-12T23:46:38ZCoarse-grained (CG) molecular dynamics enables simulations of atomic systems such as biomolecules at timescales inaccessible to all-atom (AA) methods, but existing CG neural potentials trained via force matching capture only the gradient of the free-energy surface, leaving its curvature unconstrained. We introduce a framework that augments force matching with stochastic Hessian-vector product (HVP) matching, instilling second-order curvature information into CG potentials without constructing the full Hessian. We derive a decomposition of the target CG Hessian into a model-independent projected AA Hessian, precomputed once before training, and a model-dependent covariance correction computed online at negligible cost. We construct an unbiased stochastic estimator of the Hessian-matching objective by using random probe vectors. We evaluate our method by comparing against force matching on a benchmark of nine fast-folding proteins unseen during training. HVP matching outperforms plain force matching on 8 of 9 proteins on slow-mode metrics, with reductions of up to 85% in the Kullback--Leibler divergence between the CG and reference distributions along the slowest collective mode of the largest protein. Our results demonstrate that higher-order physical supervision is a practical path to more accurate and transferable CG potentials for biomolecular simulation.2026-05-12T23:46:38Z15 pages, 4 figures, 1 tableSanya MurdeshwarSanjit ShashiKevin BachelorWilliam NoidAshwin LokapallyRazvan Marinescu