https://arxiv.org/api/L07C19X2xf8RFL5VyBfy0WnyHUI 2026-06-18T16:36:03Z 27487 390 15 http://arxiv.org/abs/2605.14408v1 Strain-Enhanced Hydrogen Evolution, Electrical, Optical, and Thermoelectric Properties of the Multifunctional 2D CrSi2N4 Monolayer 2026-05-14T05:50:46Z

First-principles density functional theory (DFT) is employed to evaluate the structural, electronic, optical, thermoelectric, and electrocatalytic properties of monolayer CrSi2N4. Its symmetric N-Si-N-Cr-N-Si-N septuple-layer structure exhibits dynamic, thermal (300 K), and mechanical stability, supported by a -8.76 eV/atom cohesive energy. PBE and HSE06 functionals reveal an indirect bandgap of 0.58 eV and 2.16 eV, respectively, driven by localized Cr-3d and N-2p states. The monolayer features 15.57 static dielectric constant and maximum absorption coefficients of 0.9 X 10^6 cm-1 (visible) and 1.4 X 10^6 cm-1 (deep-UV). Semiclassical Boltzmann calculations predict an outstanding room-temperature n-type thermoelectric power factor of 3.5 x mW/mK2. For hydrogen evolution (HER), the basal plane yields a baseline hydrogen adsorption free energy (ΔGH) of 1.05 eV at the N-site. Applying +5% expansive biaxial strain improves HER kinetics, reducing ΔGH to 0.46 eV. Thus, CrSi2N4 is a resilient, tuneable candidate for waste-heat recovery, photodetectors, and sustainable electrocatalysis.

2026-05-14T05:50:46Z Rao Uzair Ahmad Fahd Sikandar Khan Nasir Javed http://arxiv.org/abs/2605.14287v1 A quantum chemistry dataset containing ground-state and conical-intersection structures of 260k molecules 2026-05-14T02:39:30Z

Conical intersections play central roles in photoinduced reactions. However, comprehensive conical-intersection datasets that could advance our understanding of excited-state reaction processes remain scarce. To address this gap, we constructed a quantum chemistry dataset containing ground-state and conical-intersection structures of small molecules (up to ten heavy atoms: C, N, O, F). Ground-state geometries were optimized at the semi-empirical OM2 level, with single-point energies calculated at the OM2/MRCI level. Conical-intersection geometries and energies were also computed at the OM2/MRCI level. This dataset is designed to enable a deep integration of photochemistry with machine learning, bridging the gap between photochemical insight and data-driven approaches.

2026-05-14T02:39:30Z Jiahui Zhang Yifei Zhu Chuqiao Feng Yingjin Ma Chao Xu Zhenggang Lan http://arxiv.org/abs/2605.14154v1 TSAgent: An Agentic Workflow for Autonomous Transition State Search 2026-05-13T22:08:24Z

Identifying transition states (TSs) on potential energy surfaces is a central computational bottleneck in mechanistic studies of catalytic materials. A TS search is not a single calculation but a long-horizon, multi-step workflow of atomistic simulations with delayed, asynchronous feedback and heterogeneous failure modes that require a joint multimodal analysis of scalar convergence diagnostics and atomic geometries along the reaction path. To address this challenge, we propose TSAgent, an agentic workflow that automates TS search directly at the density functional theory (DFT) level of quantum chemical accuracy. TSAgent operates through a persistent plan-execute-analyze-replan loop, continuously adapting its strategy based on convergence diagnostics and geometric feedback without human intervention. We evaluate TSAgent on a diverse 100-example subset of the OC20NEB heterogeneous catalysis benchmark, where it successfully locates TSs with 83% accuracy. In a direct comparison against expert DFT practitioners on 10 held-out examples, TSAgent achieves a 70% success rate compared to a human-expert average of 73 +/- 12%. Finally, TSAgent independently reproduces Bronsted-Evans-Polanyi scaling relationships for NH3 dissociation on metal and single-atom alloy surfaces from a published heterogeneous catalysis study, demonstrating that its utility extends beyond curated benchmarks to real scientific investigations.

2026-05-13T22:08:24Z Varun Madhavan Ankit Mathanker Dean M. Sweeney Oluwatosin A. Ohiro Yixin Wang Bryan R. Goldsmith http://arxiv.org/abs/2511.07686v2 Kolmogorov-Arnold Chemical Reaction Neural Networks for learning pressure-dependent kinetic rate laws 2026-05-13T19:09:01Z

Chemical Reaction Neural Networks (CRNNs) have emerged as an interpretable machine learning framework for discovering reaction kinetics directly from data, while strictly adhering to the Arrhenius and mass action laws. However, standard CRNNs cannot represent pressure-dependent or mixture-based rate behavior, which is critical in many combustion and chemical systems and typically requires empirical falloff formulations such as Troe or SRI, or data-based interpolation or polynomial fits such as PLOG or Chebyshev Polynomials. Here, we develop Kolmogorov-Arnold Chemical Reaction Neural Networks (KA-CRNNs) that generalize CRNNs by modeling each kinetic parameter as a learnable function of third-body concentrations using Kolmogorov-Arnold activations. This structure maintains the Arrhenius and mass action interpretability and physical constraints of a vanilla CRNN while enabling assumption-free inference of global and collider-specific pressure effects directly from data. Two proof-of-concept reaction studies are presented to highlight the capability of KA-CRNNs to accurately reproduce pressure-dependent and collider-specific kinetics across a range of temperatures, pressures, and bath gas mixtures, extracting meaningful and generalizable models from sparse training data and significantly outperforming interpolative approaches (2.88x reduction in MSE). The framework establishes a foundation for data-driven discovery of extended kinetic behaviors in complex reacting systems, advancing interpretable and physics-constrained approaches for chemical model inference.

2025-11-10T23:08:54Z 12 pages, 8 figures Benjamin C. Koenig Sili Deng http://arxiv.org/abs/2602.05793v2 Generalized Path Reweighting and History-Dependent Free Energies 2026-05-13T18:38:35Z

Transition interface sampling (TIS) and replica exchange TIS (RETIS) are powerful methods for computing rates of rare events inaccessible to straightforward molecular dynamics (MD) simulations. Path reweighting extends their output, enabling the evaluation of diverse thermodynamic and kinetic quantities, including reaction prediction metrics, activation barriers, committor functions, and free energies. The recently developed Infinity-RETIS algorithm boosts parallel efficiency through asynchronous replica exchanges in the infinite-swap limit, eliminating the wall-time bottlenecks of conventional RETIS. This approach introduces fractional samples and biased sampling distributions, requiring a generalized path reweighting framework, for which we derive expressions demonstrating how exact dynamic and thermodynamic variables can be computed. We then focus on a special class of free energy surfaces defined by history-dependent conditions, whose values are influenced by kinetic factors such as particle mass and friction, unlike standard unconditional free energy surfaces. Even with suboptimal reaction coordinates, these conditional free energies can reveal kinetically relevant barriers that may be misrepresented by standard unconditional free energies, thereby providing a rigorous and versatile tool for characterizing complex molecular transitions.

2026-02-05T15:48:32Z 15 pages, 5 figures Titus S. van Erp Daniel T. Zhang Elias Wils Sina Safaei An Ghysels http://arxiv.org/abs/2605.13826v1 Reducing cross-sample prediction churn in scientific machine learning 2026-05-13T17:50:57Z

Scientific machine learning reports predictive performance. It does not report whether the same prediction would survive a different draw of training data. Across $9$ chemistry benchmarks, two classifiers trained on independent bootstraps of the same training set agree on aggregate accuracy to within $1.3\text{--}4.2$ percentage points but disagree on the class label of $8.0\text{--}21.8\%$ of test molecules. We call this gap \emph{cross-sample prediction churn}. The standard parameter-side techniques (deep ensembles, MC dropout, stochastic weight averaging) do not reduce this gap; two data-side methods do. The first is $K$-bootstrap bagging, which cuts the rate $40\text{--}54\%$ on every dataset at no accuracy cost ($K{\times}$-ERM compute). The second is \emph{twin-bootstrap}, our proposal: two networks trained jointly on independent bootstraps with a sym-KL consistency loss between their predictions, which at matched $2{\times}$-ERM compute reduces churn a further median $45\%$ beyond bagging-$K{=}2$. Cross-sample prediction churn deserves a column alongside predictive performance in scientific-ML benchmark reports, because without it the parameter-side and data-side methods are indistinguishable on the metric they actually differ on.

2026-05-13T17:50:57Z Gordan Prastalo Kevin Maik Jablonka http://arxiv.org/abs/2602.12382v2 Fast Generation of Pipek-Mezey Wannier Functions via the Co-Iterative Augmented Hessian Method 2026-05-13T15:13:51Z

We report a $k$-point extension of the second-order co-iterative augmented Hessian (CIAH) algorithm, termed $k$-CIAH, for Pipek-Mezey (PM) localization of Wannier functions (WFs). By exploiting an efficient evaluation of the Hessian-vector product, $k$-CIAH achieves $O(N_k^2 n^3)$ scaling in both CPU time and memory, matching that of previously reported first-order $k$-space approaches while improving upon the $O(N_k^3 n^3)$ scaling of $Γ$-point CIAH, where $N_k$ denotes the number of $k$-points sampling the first Brillouin zone and $n$ characterizes the unit-cell size. Benchmark calculations on a diverse set of solids -- including insulators, semiconductors, metals, and surfaces -- demonstrate the fast and robust convergence of $k$-CIAH-based PMWF optimization, which yields an overall computational efficiency approximately 2-3--fold higher than first-order $k$-space methods and orders of magnitude higher than $Γ$-point CIAH for localizing 1000-5000 orbitals. The quality of the resulting PMWFs is further validated by accurate electronic band structures obtained via PMWF-based Wannier interpolation.

2026-02-12T20:24:02Z Gengzhi Yang Hong-Zhou Ye http://arxiv.org/abs/2601.00131v2 Random phase approximation-based local natural orbital coupled cluster theory 2026-05-13T14:58:56Z

Practical applications of fragment embedding and closely related local correlation methods critically depend on a judicious choice of a low-level theory to define the local embedding subspace and to capture long-range electrostatic and correlation effects outside the embedding region. Second-order Møller-Plesset perturbation theory (MP2) is by far the most widely used correlated low-level theory; however, its applicability becomes questionable in systems where MP2 is known to fail either quantitatively or qualitatively. In this work, we present the random phase approximation (RPA) as a promising alternative low-level theory to MP2 within the local natural orbital-based coupled-cluster (LNO-CC) framework. We demonstrate that RPA-based LNO-CC closely matches the performance of its MP2-based counterpart for systems with sizable energy gaps, while delivering significantly faster convergence toward the canonical coupled-cluster limit for metallic systems, particularly as the thermodynamic limit is approached. These results highlight the critical role of the low-level theory in fragment embedding and local correlation methods and identify RPA as a compelling alternative to the commonly used MP2.

2025-12-31T22:23:48Z Ruiheng Song Xiliang Gong Aamy Bakry Hong-Zhou Ye http://arxiv.org/abs/2605.23971v1 Physics-Guided Concentration Inference from Resistance Transients in a Mixed-Phase SnO-SnO$_2$ Carbon Monoxide Sensor with p-n Switching 2026-05-13T10:37:32Z

This work presents a physics-guided machine-learning framework for carbon monoxide concentration inference from experimentally measured resistance transients of a mixed-phase SnO-SnO$_2$ material gas sensor exhibiting temperature-dependent p-n switching behavior. Cycle-level transient responses are represented through physically interpretable descriptors and complemented by compact fast Fourier transform (FFT) and discrete wavelet transform (DWT)-based summaries. Using leakage-aware grouped cross-validation, we study both multi-class concentration classification and continuous concentration regression for the p-type and n-type sensing regimes separately. Across both regimes, fused features provide the strongest overall performance, while the physics-guided descriptor block remains highly competitive, indicating that the dominant concentration information is already encoded in physically meaningful transient dynamics. The p-type branch shows the best concentration-class discrimination, with the fused Random Forest classifier reaching approximately $96.5\%$ accuracy, whereas the n-type branch yields the best quantitative concentration estimation, with the fused Random Forest regressor achieving an MAE$\approx 1.48$ ppm and an R$^2$ $\approx 0.992$. These results reveal a clear dual-regime behavior: p-type sensing is particularly favorable for classification, whereas n-type sensing is more favorable for high-fidelity regression. More broadly, the study demonstrates that leakage-aware, cycle-level, physics-guided machine learning can extend conventional gas-sensing analysis beyond single-response metrics while preserving physical interpretability

2026-05-13T10:37:32Z 15 pages, 14 figures Sani Biswas Preetam Singh Amit Kumar Gangwar http://arxiv.org/abs/2605.13305v1 MPINeuralODE: Multiple-Initial-Condition Physics-Informed Neural ODEs for Globally Consistent Dynamical System Learning 2026-05-13T10:18:18Z

Neural ordinary differential equations (Neural ODEs) often fit training trajectories while generalizing poorly to unseen initial conditions and long horizons. We propose MPINeuralODE, which combines a soft physics-informed residual with a Multiple-Initial-Condition (MIC) multiple-shooting curriculum whose ingredients are structurally complementary: the physics term anchors the vector-field magnitude on the support that MIC enlarges. We evaluate along three axes: out-of-sample error, long-horizon stability, and Hamiltonian drift, which together expose whether the learned dynamics recover the underlying vector field. On Lotka-Volterra, MPINeuralODE achieves the lowest out-of-sample and long-horizon MSE among data-driven methods, with a 26% reduction over the baseline Neural ODE, while essentially matching the PINN ablation on Hamiltonian drift.

2026-05-13T10:18:18Z Lake Yang Antonio Malpica-Morales Frank Ioannis Papadakis Wood Serafim Kalliadasis http://arxiv.org/abs/2605.10312v2 FusionRCG: Orchestrating Recursive Computation Graphs across GPU Memory Hierarchies 2026-05-13T10:03:04Z

Evaluating high-dimensional integrals via deep hierarchical recurrences is a dominant cost in quantum chemistry. While CPUs manage these efficiently, GPUs suffer a critical mismatch: limited per-thread memory is quickly overwhelmed by an explosion of simultaneously live intermediate variables. As recurrence scales, this forces massive data spilling to global memory, collapsing performance into a severe memory-bound regime. We present FusionRCG, a framework that jointly optimizes computation graph structure and GPU memory mapping. Exploiting the inherent topological flexibility of recurrence graphs, using electron repulsion integrals as an example, we contribute: (1) liveness-aware graph orchestration to minimize peak live intermediates; (2) algebraic dimensionality reduction via stepwise Cartesian-to-spherical fusion, shrinking intermediate footprints by up to $7.7\times$; and (3) an adaptive multi-tier kernel architecture routing graphs across the memory hierarchy. Evaluated on NVIDIA A100 GPUs, FusionRCG achieves up to $3.09\times$ end-to-end SCF speedup over GPU4PySCF and maintains $75\%$ parallel efficiency at 64~GPUs, successfully rescuing these workloads from memory-bound limits.

2026-05-11T10:12:25Z Yihong Zhang Xinran Wei Junshi Chen Fusong Ju Wei Hu Jinlong Yang Huanhuan Xia http://arxiv.org/abs/2605.10363v2 Accelerating Locality-Driven Integration in Quantum Chemistry with Block-Structured Matrix Multiplication 2026-05-13T09:59:52Z

Locality-driven integration is a pervasive computational pattern in quantum chemistry, arising whenever spatially localized basis functions interact through numerical quadrature or integral screening. The dominant matrix multiplications in these tasks exhibit dynamic, structured sparsity driven by spatial locality, posing significant challenges for both dense batched kernels and generic sparse formats on GPUs. We present KerneLDI, a GPU-oriented framework that addresses this regime by co-designing data layout, screening logic, and matrix-computation operators to realize block-structured matrix multiplication for locality-driven integration. KerneLDI reorganizes operand matrices into a unified block-filtered representation that retains only spatially relevant blocks, and executes the resulting contractions with customized dense block multipliers that adapt proven dense-matmul optimizations to retained block pairs. We develop and evaluate KerneLDI on exchange--correlation (EXC) integration in Kohn--Sham density functional theory, a representative and computationally critical instance of this pattern. Across diverse molecular systems, KerneLDI preserves numerical accuracy while delivering up to 10$\times$ speedup for EXC evaluation over a dense GPU baseline, scales favorably with increasing system size and multi-GPU parallelism, accelerates end-to-end self-consistent field calculations, and yields nearly 6$\times$ throughput improvement for ab initio molecular dynamics.

2026-05-11T11:08:19Z Xinran Wei Yan Pan Fusong Ju Zehao Zhou Yihong Zhang Lin Huang Jianwei Zhu Jia Zhang Huanhuan Xia Bin Shao Tao Qin http://arxiv.org/abs/2605.13164v1 Helium Bubbles in Liquid Lead Lithium Solutions: Pressure Inhomogeneities at Interfaces and Non Ideal Mixture Effects 2026-05-13T08:28:00Z

The extremely low solubility of helium in liquid metals may lead to rapid supersaturation, promoting spontaneous formation of helium bubbles by nucleation. Once nucleated, the stability of these bubbles is governed by the properties of the helium liquid metal interface. In particular, interfacial tension between the immiscible phases controls bubble interactions and induces local pressure inhomogeneities. This work is motivated by the need of a better understanding of helium bubble formation in liquid Pb Li alloys, which are of particular relevance for the design of breeding blankets in the future nuclear fusion reactors. We employ classical molecular dynamics simulations to investigate helium segregation in a range of lead lithium systems, including the limiting cases of pure lead and pure lithium. Changes in local pressure are evaluated from direct mechanical calculations, enabling the characterization of interfacial properties. Interfacial tension and radius of the bubble are subsequently determined across multiple thermodynamic conditions, spanning temperatures starting near the melting points of the constituent metals up to 1021 K. The impact of curvature and composition of the alloy on the interfacial behaviour are also investigated.

2026-05-13T08:28:00Z 14 pages, 8 figures. Including "Supplementary Information" at the bottom of the manuscript Edgar Alvarez-Galera Jordi Marti Lluis Batet http://arxiv.org/abs/2605.13060v1 Rotational energy levels in the ground vibrational state of methane with kHz-level accuracy from comb-referenced double-resonance and Lamb-dip spectroscopies 2026-05-13T06:33:29Z

Methane is a key spherical-top molecule, yet restrictive selection rules for one-photon transitions have prevented determination of its ground state (GS) energies with state-of-the-art kHz-level accuracy. We report the GS rotational energy level differences with kHz-level accuracy from two frequency-comb-referenced sub-Doppler methods: optical-optical double-resonance spectroscopy in the $Λ$-type configuration, and Lamb-dip spectroscopy of allowed and forbidden transitions. A Hamiltonian fit to the data yields GS term values with rotational numbers up to $\it{J}$ = 12 with kHz level accuracy.

2026-05-13T06:33:29Z Vinicius Silva de Oliveira Isak Silander Hiroyuki Sasada Sho Okubo Hajima Inaba Kevin K. Lehmann Aleksandra Foltynowicz http://arxiv.org/abs/2605.12823v1 Hessian Matching for Machine-Learned Coarse-Grained Molecular Dynamics 2026-05-12T23:46:38Z

Coarse-grained (CG) molecular dynamics enables simulations of atomic systems such as biomolecules at timescales inaccessible to all-atom (AA) methods, but existing CG neural potentials trained via force matching capture only the gradient of the free-energy surface, leaving its curvature unconstrained. We introduce a framework that augments force matching with stochastic Hessian-vector product (HVP) matching, instilling second-order curvature information into CG potentials without constructing the full Hessian. We derive a decomposition of the target CG Hessian into a model-independent projected AA Hessian, precomputed once before training, and a model-dependent covariance correction computed online at negligible cost. We construct an unbiased stochastic estimator of the Hessian-matching objective by using random probe vectors. We evaluate our method by comparing against force matching on a benchmark of nine fast-folding proteins unseen during training. HVP matching outperforms plain force matching on 8 of 9 proteins on slow-mode metrics, with reductions of up to 85% in the Kullback--Leibler divergence between the CG and reference distributions along the slowest collective mode of the largest protein. Our results demonstrate that higher-order physical supervision is a practical path to more accurate and transferable CG potentials for biomolecular simulation.

2026-05-12T23:46:38Z 15 pages, 4 figures, 1 table Sanya Murdeshwar Sanjit Shashi Kevin Bachelor William Noid Ashwin Lokapally Razvan Marinescu