https://arxiv.org/api/HluDC8eacuQ58ymMhq8y5Y1NLZs 2026-06-14T02:18:30Z 27442 240 15 http://arxiv.org/abs/2601.09845v2 Quantum-Accurate Conformational Stabilities and Vibrational Dynamics in Molecules and Proteins with Machine-Learned Force Fields 2026-05-22T11:18:49Z Biomolecular thermodynamics and spectroscopy depend on relative conformer energies, local curvatures, and collective dipole fluctuations on the potential-energy surface. Conventional molecular mechanics force fields enable large-scale simulations, but their fixed functional forms can misrepresent infrared intensities, mode character, and environment-dependent vibrational response. Here we assess general-purpose machine-learned force fields across small molecules, finite-temperature infrared spectra, gas-phase peptides, and monomeric, oligomeric, and solvated protein assemblies. To enable this analysis, we introduce QVib, a dataset of 293 molecules and 1365 conformers, together with peptide amide-band benchmarks and p53 oligomerization-domain models, to evaluate vibrational transferability from DFT references to experimental spectra. Across these systems, machine-learned force fields substantially improve over molecular mechanics in reproducing DFT-level forces, vibrational frequencies, densities of states, mode eigenvectors, conformational energetics, and experimental infrared spectra. Among models with explicit long-range electrostatics, SO3LR provides the most favourable accuracy-cost balance for the biomolecular systems considered. These results show that machine-learned force-field dynamics can recover collective, environment-dependent vibrational landscapes at near-DFT fidelity, enabling spectroscopically validated biomolecular simulations at force-field-like cost. 2026-01-14T20:05:22Z 24 pages, 6 figures, Supplementary information (19 figures) Sergio Suárez-Dou Miguel Gallegos Kyunghoon Han Florian N. Brünig Joshua T. 
Berryman Alexandre Tkatchenko http://arxiv.org/abs/2504.06673v3 Are Molecules Magical? Non-Stabilizerness in Molecular Bonding 2026-05-22T08:50:45Z Isolated atoms as well as molecules at equilibrium are presumed to be simple from the point of view of quantum computational complexity. Here we show that the process of chemical bond formation is accompanied by a marked increase in the quantum complexity of the electronic ground state. By studying the hydrogen dimer H$_{2}$ as a prototypical example, we demonstrate that when two hydrogen atoms form a bond, a specific measure of quantum complexity exhibits a pronounced peak that closely follows the behavior of the binding energy. This measure of quantum complexity, known as magic in the quantum information literature, reflects how difficult it is to simulate the state using classical methods. We show that the observations for H$_{2}$ also hold for a collection of other dimers, including the weakly bonded diatomic helium dimer He$_{2}$. This observation suggests that regions of strong bonding formation or breaking are also regions of enhanced intrinsic quantum complexity. This insight suggests a connection of quantum information measures to chemical reactivity and advocates the use of stretched molecules as a quantum computational resource. 2025-04-09T08:14:27Z 11 pages, 5 figures. Generalisation to other dimers. Comparison to other quantum information theoretic and quantum chemistry metrics Matthieu Sarkis Alexandre Tkatchenko http://arxiv.org/abs/2601.21703v2 Fewest-Switches Surface Hopping with Combined Deep Learning Potential and Long Short-Term Memory Network Propagator for Simulating Realistic Photochemical Processes 2026-05-22T06:15:44Z Fewest-switches surface hopping (FSSH) is the most popular method for simulating photochemical processes of molecular systems. Recently, we have constructed long short-term memory (LSTM) networks as a propagator for electronic subsystems in FSSH dynamics simulations. 
The collective results on Tully's three models have been reproduced satisfactorily. In the present work, we develop an extended LSTM-FSSH framework to simulate realistic photochemical reactions. The input features of LSTM as well as the training procedure are redesigned to represent high-dimensional nuclear degrees of freedom in an effective way. Equivariant neural networks are integrated with LSTM to build adiabatic potential energy surfaces in ground and excited states. Photoisomerizations of $\mathrm{CH_2NH}$ and azobenzene are simulated, showing that our newly proposed LSTM-FSSH method reproduces excited-state lifetimes and product yields accurately when compared with conventional FSSH reference simulations. Only 10 reference trajectories are required for training LSTM networks, and then a trajectory ensemble can be generated with very efficient LSTM-FSSH dynamics simulations to obtain collective results. 2026-01-29T13:32:56Z SI included Zhenxing Zhu Diandong Tang Lin Shen Wei-Hai Fang 10.1021/acs.jctc.6c00170 http://arxiv.org/abs/2605.22990v1 Drift-React: One-step Generation of Reaction Pathways via SE(3) Drifting Fields 2026-05-21T19:40:45Z Mapping reaction pathways and transition states (TS) is fundamental to chemistry but computationally expensive at scale. The minimum energy pathway (MEP) dictates reaction rates and mechanisms, yet recovering it via electronic-structure methods requires thousands of costly force evaluations. Recent generative models accelerate TS identification but rely on iterative inference and only predict isolated saddle-point snapshots, missing the continuous reaction trajectory. We introduce Drift-React, an $\mathrm{SE}(3)$-equivariant generative framework that predicts complete reaction pathways in a single forward pass from only reactant and product geometries. 
By shifting distribution evolution to training via a Sinkhorn-weighted drifting field, Drift-React eliminates both the iterative force evaluations of NEB-style methods and the sequential ODE/SDE integration of diffusion and flow matching models. Evaluated on the Transition1x and Halo8 datasets, our one-step model generates physically consistent MEPs that accurately capture energetic bottlenecks and enable arbitrary-resolution sampling along the reaction coordinate. For isolated TS prediction, Drift-React matches the sub-Ångström accuracy of state-of-the-art iterative models while delivering orders-of-magnitude acceleration, clearing a major computational bottleneck for large-scale reaction network exploration. 2026-05-21T19:40:45Z Rémi Schlama Philippe Schwaller http://arxiv.org/abs/2605.22977v1 Absorbing Many-Body Correlations into Core-Optimized Orbitals 2026-05-21T19:15:57Z The cost of simulating quantum many-body systems - on classical or quantum hardware - scales with the number of variational parameters, so progress at fixed computational budget hinges on more parameter-efficient ansätze. Configuration Interaction (CI) is widely dismissed as parameter-heavy; we show this verdict is an artifact of the orbital basis. Co-optimizing the orbital basis with a sparse CI wavefunction - a method we call Core-Optimized Orbitals (COO) - absorbs a large fraction of the dynamical correlation directly into the single-particle basis, cutting the determinant count by several orders of magnitude beyond the already compact TrimCI ansatz on which it builds. On [Fe$_4$S$_4$] (54e, 36o), a billion-determinant TrimCI+COO wavefunction reaches accuracy that would require $3\!\times\!10^{14}$ determinants in a localized basis. At matched accuracy, it is $8\times$ more compact than the largest unrestricted-DMRG benchmark ($25\times$ with PT2). 
Across the iron-sulfur series - from [Fe$_2$S$_2$] (30e,20o) to the P-cluster (114e,73o) - TrimCI+COO is $10$-$100\times$ more compact than SU(2)-adapted DMRG with entanglement-minimized orbitals at matched accuracy. A tunable Hubbard-on-graph model factorizes the advantage into an orbital-basis gain and an ansatz gain, the latter capturing multi-center entanglement that resists MPS localization. COO therefore changes the picture of CI efficiency: sparse CI with optimized orbitals can outperform state-of-the-art tensor networks on strongly correlated multi-center systems. 2026-05-21T19:15:57Z main text: 6 pages, 5 figures Hao Zhang Matthew Otten http://arxiv.org/abs/2605.05658v2 Quantum-classical solvation hydrodynamics: a Hamiltonian modeling framework 2026-05-21T17:13:12Z We propose a mixed quantum-classical hydrodynamic framework to model short-time inertial effects in the non-adiabatic evolution of a quantum solute coupled to a classical polar solvent. Drawing upon the work of Burghardt and Bagchi [Chem. Phys. 329 (2006), 343], we employ the Hamiltonian approach to incorporate consistent backreaction and preserve quantum decoherence beyond standard Ehrenfest dynamics. The solvent is treated as an ideal polar fluid and the quantum solute state is coupled to both the position and molecular orientation coordinates of the liquid. This approach retains essential solute-solvent correlations while significantly reducing the computational complexity of previous approaches. We further incorporate dissipative terms to capture both inertial effects and polarization relaxation. After establishing the general setting for non-local dielectric continua, the Marcus local approximation is integrated into the model thereby extending traditional solvation theory to account for collective fluid sloshing on fast timescales. 2026-05-07T04:20:49Z 31 pages, two appendices. Various improvements. 
Comments welcome François Gay-Balmaz Cesare Tronci http://arxiv.org/abs/2605.22698v1 Machine Learning Interatomic Potentials: Advancing Open-Source Software for Efficient and Scalable Molecular Simulation 2026-05-21T16:40:40Z Machine learning interatomic potentials (MLIPs) enable atomistic simulations with near ab initio accuracy at significantly reduced computational cost, but their broader adoption is often limited by fragmented tooling, limited scalability, and inflexible software design. We present mlip v2, a new generation of the mlip library that advances efficient and scalable molecular simulation through a unified and extensible framework. The new release features a targeted API redesign with improved modularity and control, enabling flexible customization of training, data processing, and simulation workflows. It further integrates a new high-performance backend for equivariant operations, e3j, significantly accelerating model inference and simulations. In addition, the framework introduces a range of entirely new capabilities, including the eSEN architecture with a Mixture-of-Experts formulation for scalable training on large and diverse datasets, improved handling of electrostatics through more physically grounded charge modeling and long-range interaction treatment, and advanced simulation features such as NPT ensembles and nudged elastic band methods. Together, these extensions significantly broaden the scope of MLIP applications, enabling efficient modeling of complex, reactive, and out-of-equilibrium systems, and bridging the gap between ML research and practical molecular simulation applications. The library is available on GitHub and on PyPI under the Apache license 2.0. 
2026-05-21T16:40:40Z 29 pages, 7 figures Christoph Brunken Titouan Cormier Lucien Walewski Marco Carobene Yessine Khanfir Zachary Weller-Davies Miguel Bragança Armand Picard Adrien Pichard Leon Wehrhan Heloise Chomet Eszter Varga-Umbrich Marie Bluntzer Massimo Bortone Valentin Heyraud Silvia Acosta-Gutiérrez Jules Tilly Olivier Peltre http://arxiv.org/abs/2601.12188v2 Accurate starting points for one-shot $G_0W_0$ and Bethe-Salpeter Equation calculations via effective tuning of range-separated hybrid functionals 2026-05-21T16:35:26Z The accuracy of one-shot $G_0W_0$ and Bethe-Salpeter equation (BSE) calculations depends strongly on the underlying starting-point eigensystem, which is commonly obtained from a mean-field density-functional approximation. Range-separated hybrid (RSH) functionals provide a particularly effective starting point; however, conventional optimally tuned RSH procedures often require costly, system-specific, multi-step optimizations of the range-separation parameter $ω$. In this work, we show that a recently proposed effective tuning protocol [Singh \textit{et al.}, Journal of Physical Chemistry Letters, 16, 32, 8198-8208, (2025)] for RSH functionals can serve as an efficient alternative for determining the $ω$ used in $G_0W_0$ and BSE calculations. This simplified tuning scheme yields range-separation parameters that are effectively equivalent to those obtained from more elaborate tuning strategies, while avoiding their substantial computational overhead. The resulting tuned RSH eigensystems provide reliable starting points for many-body perturbation theory. In particular, one-shot $G_0W_0$ calculations based on effectively tuned RSH orbitals reproduce reference ionization potentials with high accuracy, while subsequent BSE calculations yield quantitatively reliable neutral excitation energies, optical absorption spectra, and excitonic properties for a diverse set of molecular systems and clusters. 
These results demonstrate that effective RSH tuning offers a practical and broadly applicable route to accurate quasiparticle and excited-state calculations, combining the accuracy of optimally tuned starting points with the low computational cost required for routine applications of $G_0W_0$ and BSE. 2026-01-17T22:38:05Z Aditi Singh Subrata Jana Szymon Śmiga http://arxiv.org/abs/2605.22584v1 On the Regularity and Interpolation of Coupled Cluster Amplitudes in Canonical Orbital Basis 2026-05-21T14:55:39Z Arguably the most widely used approaches for obtaining highly accurate molecular ground-state energies are coupled cluster methods. Despite introducing two layers of approximation, a linear and a nonlinear one, coupled cluster methods remain computationally intensive, with the complexity scaling as $O(\mathrm{poly}(N))$, where $N$ is the number of electrons. Moreover, the method must be applied over a large set of different nuclear coordinates in order to study certain chemical phenomena. Therefore, in this work, we investigate the regularity of single-reference coupled cluster amplitudes with respect to nuclear coordinate displacements, with the aim of enabling interpolation or extrapolation approaches that rely on only a limited number of reference geometries. We show that, in theory, under certain non-degeneracy assumptions at the Hartree-Fock and coupled cluster levels of theory, the amplitudes are real analytic. Furthermore, we analyze the artifacts that arise in practical calculations that use canonical orbitals, which hinder this high degree of regularity, and suggest strategies to mitigate these issues. Finally, we validate our findings through numerical experiments by interpolating the amplitudes and comparing the performance of the interpolants with that of the exact amplitudes. 
2026-05-21T14:55:39Z 29 pages, 4 figures Jonas Beck Benjamin Stamm http://arxiv.org/abs/2605.22543v1 pANO-F12: An atomic natural orbital-inspired route to more compact basis sets for F12 explicitly correlated methods 2026-05-21T14:27:32Z Explicitly correlated methods such as MP2-F12 and CCSD(F12*) exhibit much faster basis set convergence (asymptotically $\propto L^{-7}$, with $L$ the highest angular momentum) than orbital-only approaches. Yet it has been pointed out that cc-pVnZ-F12 basis sets themselves are substantially larger than the corresponding cc-pVnZ, and specifically that cc-pVDZ-F12 is the size of cc-pVTZ. One way to generate compact basis sets in an orbital-only context is to use Atomic Natural Orbital (ANO) basis sets [J. Almlöf and P. R. Taylor, JCP 86, 4070 (1987)]. However, obtaining the required first-order reduced density matrix while properly accounting for the F12 geminal is problematic. In this work, we show that an energy minimization-based contraction process under linear independence constraints yields `pseudo-ANO' (pANO) basis sets that are functionally equivalent in quality. Subsequently, we apply this recipe to obtain pANO-F12 basis sets from the same elements, then validate them for several thermochemical benchmarks and for the hypersensitive out-of-plane vibrations of benzene. We show that, unlike cc-pVnZ-F12, pANO-F12 exhibits the familiar shell structure seen in cc-pVnZ and ANO basis sets, and that pANO-F12 offers a route to more compact F12 basis sets more amenable to medium-sized systems, especially in conjunction with localized pair natural orbital approaches. Overall, the pANO approach is most beneficial for the smaller double- and triple-zeta basis sets, offering either superior performance to cc-pVnZ-F12 at the same cost, or similar performance at lower cost. 2026-05-21T14:27:32Z 33 pages draft; basis sets part of download package Vladimir Fishman Jan M. L. 
Martin http://arxiv.org/abs/2601.03394v3 Frontier Orbital Engineering in Heteroatom-Doped Prototypical Organic Dyes for Dye-Sensitized Solar Cells 2026-05-21T14:01:56Z The computational design of heteroatom-doped organic dyes for dye-sensitized solar cells (DSSCs) remains challenging, as predictive methods must accurately describe long-range charge-transfer (CT) excitations while remaining computationally efficient for systematic materials screening. In this work, we investigate the electronic structure and excited-state properties using the range-separated hybrid functional LC-$ω$PBE in conjunction with linear-response time-dependent density functional theory (TDDFT) within the Tamm-Dancoff approximation (TDA). We employ a simplified, physically motivated, effective tuning protocol ($ω_{eff}$) to enable the rapid and reliable screening of electronic properties of organic dyes. Charge-transfer excitation energies and frontier orbital alignment, the key factors governing light absorption and electron injection in DSSCs, are analyzed through targeted heteroatom (N, O, and B) incorporation into donor-$π$-acceptor (D-$π$-A) organic dyes. A library of 27 mono-, di-, and tri-doped prototypical organic dyes is designed based on a carbazole donor and a cyanoacrylic acid acceptor through targeted doping at three positions of the $π$-bridge or linker. Distinct design trends emerge: electron-rich nitrogen and oxygen dopants increase the HOMO-LUMO gap and blue-shift CT excitations, with nitrogen exhibiting the strongest effect, whereas electron-deficient boron substitution narrows the gap and induces pronounced red shifts. Notably, the BBN-doped dye exhibits the smallest gap and lowest excitation energy, highlighting boron-rich motifs as promising candidates for enhanced solar light harvesting. 
Overall, this study establishes transferable heteroatom-doping guidelines and introduces an efficient, reliable, and cost-effective tuned DFT-TDDFT framework for high-throughput computational discovery and optimization of DSSC sensitizers. 2026-01-06T20:14:09Z Aditi Singh Ram Dhari Pandey Subrata Jana Prasanjit Samal Paweł Tecmer Szymon Śmiga http://arxiv.org/abs/2605.22459v1 Reduced Dynamical Maps in Finite Temperature Vibronic Coupling Models via Choi Matrices: Numerical Methods and Applications 2026-05-21T13:23:35Z We present a streamlined implementation of a computational framework for constructing and analyzing reduced dynamical maps for complex system--bath models at finite temperature. The methodology is based on three established ingredients of quantum dynamics: the Choi--Jamiołkowski isomorphism for the representation of quantum channels, thermofield (TFD) purification of thermal environments, and tensor-train (TT) propagation of the resulting enlarged pure state. The reduced map is obtained from a single unitary propagation in a thermofield-doubled Hilbert space and represented in matrix form through the Choi--Jamiołkowski isomorphism. The TFD evolution is implemented in the TT representation, enabling efficient propagation of high-dimensional purified thermal states. We illustrate the methodology for exciton transfer in the Fenna--Matthews--Olson complex with site-dependent structured spectral densities represented by discretized bosonic environments. The resulting maps are used to analyze decoherence, relaxation, and finite-memory effects, and to assess the crossover to an effectively time-local description. The proposed approach provides a route to compute reduced propagators and to post-process them into memory kernels, transfer tensors, and effective kinetic rate descriptions for complex molecular systems. 2026-05-21T13:23:35Z The following article has been accepted by Journal of Chemical Physics. 
After it is published, it will be found at https://pubs.aip.org/aip/jcp Raffaele Borrelli Hideaki Takahashi 10.1063/5.0332266 http://arxiv.org/abs/2605.22394v1 Dynamic electron correlation energy for multireference wavefunction methods from one- and two-electron reduced density matrices 2026-05-21T12:28:15Z Efficiently recovering dynamic correlation in strongly correlated systems without incurring prohibitive computational costs remains a central challenge in quantum chemistry. In this Perspective, we review and benchmark methods capable of recovering dynamic correlation for multireference wave functions exclusively from low-order reduced density matrices and densities. These approaches require at most the two-electron reduced density matrix of the reference wave function and fall into two categories: density functional theory (DFT)-based methods and purely ab initio multireference adiabatic connection (AC) methods. The former include MC-srDFT, which recovers dynamic correlation through a short-range exchange-correlation functional depending on the charge and spin densities, as well as MC-PDFT and MC-srPDFT, which employ translated functionals that additionally depend on the on-top pair density. Within the post-CASSCF framework, we perform a direct, head-to-head benchmark of these approaches under identical computational settings (including active spaces and basis sets) against challenging multireference problems, including singlet-triplet gaps in organic biradicals, excitation energies, and spin-state splittings in iron complexes. Among the DFT-based methods, MC-srPDFT emerges as the most accurate, underscoring the benefit of incorporating the on-top pair density. However, all considered DFT-based methods fail to provide reliable spin-state energetics for transition-metal complexes. Conversely, linearized AC0 rivals or outperforms more computationally expensive second-order perturbation theory approaches across all benchmark sets. 
We discuss these findings in the context of alternative formulations and existing literature, highlighting critical limitations and identifying promising directions for the future development of scalable multireference methods. 2026-05-21T12:28:15Z Michał Hapka Aleksandra Tucholska Katarzyna Pernal http://arxiv.org/abs/2605.22367v1 Benchmarking machine-learned interatomic potentials for molecular infrared spectroscopy 2026-05-21T12:00:21Z Machine learning has transformed the field of atomistic simulations by enabling the development of interatomic potentials that are computationally efficient and highly accurate. These advances have opened the door to modeling molecular vibrations and predicting infrared spectra with near ab-initio accuracy at a fraction of the computational cost. Among these approaches, message-passing neural networks (MPNNs) have emerged as a particularly powerful class of models for representing complex atomic interactions. In this study, we benchmark five MPNN architectures, SchNet, FieldSchNet, SO3Net, PaiNN, and MACE, for predicting infrared spectra of small organic molecules. SchNet and FieldSchNet are invariant models, while SO3Net, PaiNN, and MACE are equivariant, explicitly accounting for rotational symmetries in molecular representations. We evaluate their performance in terms of computational efficiency, accuracy, and robustness. All models accurately predict properties, such as energies, forces, and dipole moments, required for infrared spectra calculations. They also capture harmonic frequencies and infrared spectra derived from molecular dynamics with high fidelity for molecules in the training set. However, SchNet and FieldSchNet show limited transferability to unseen systems, while SO3Net, PaiNN, and MACE generalize more effectively. In terms of computational efficiency, SchNet is the most efficient and FieldSchNet enables field-dependent response modeling but with higher cost. 
PaiNN achieves the best balance between accuracy and efficiency, MACE provides the highest spectral accuracy and transferability, and SO3Net performs between PaiNN and MACE. 2026-05-21T12:00:21Z 20 pages, 3 figures, 7 tables Nitik Bhatia Ondrej Krejci Patrick Rinke http://arxiv.org/abs/2605.19874v2 FNO-CCSDTQ(5)$_Λ$ as an economical alternative for connected quintuple excitations contributions in coupled cluster thermochemistry 2026-05-21T11:08:24Z Contributions from connected quintuple excitations in coupled cluster theory can reach the 0.5 kcal/mol range, important enough to matter in accurate computational thermochemistry, yet the very steep $\propto N^{12}$ CPU time scaling impedes routine evaluation. We show that for the differential contribution of quintuples, convergence of a frozen natural orbital (FNO) expansion with respect to the NO cutoff is rapid enough to make FNO-CCSDTQ(5)$_Λ$ with cutoffs of 0.0025 or 0.001 viable alternatives. A naive extrapolation to zero cutoff from \{0.005,0.0025\} works surprisingly well as a low-cost option. Interestingly, FNO convergence is definitely slower for second-row than for first-row compounds. 2026-05-19T14:05:18Z to be submitted, 7 pages in AIP two-column format [minor updates] Gregory H. Jones Aditya Barman Margarita Shepelenko Jan M. L. Martin
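The last abstract above mentions a naive extrapolation to zero NO cutoff from the pair \{0.005, 0.0025\}. The abstract does not state the functional form, so the following is only a minimal sketch assuming a two-point linear extrapolation in the cutoff. The function name `extrapolate_to_zero_cutoff` and the numerical contributions are hypothetical illustrations, not values or code from the paper.

```python
def extrapolate_to_zero_cutoff(cutoffs, values):
    """Two-point linear extrapolation of a quantity to NO cutoff = 0.

    Assumption: the quantity varies linearly with the cutoff between the
    two points; the abstract does not specify the form actually used.
    """
    (c1, c2), (e1, e2) = cutoffs, values
    if c1 == c2:
        raise ValueError("the two cutoffs must differ")
    slope = (e1 - e2) / (c1 - c2)
    # The value at cutoff = 0 is the intercept of the line through both points.
    return e1 - slope * c1


# Hypothetical quintuples contributions in kcal/mol (NOT taken from the paper).
cutoffs = (0.005, 0.0025)
contributions = (0.30, 0.35)
estimate = extrapolate_to_zero_cutoff(cutoffs, contributions)
print(f"extrapolated contribution at zero cutoff: {estimate:.3f} kcal/mol")
```

Because the second cutoff is half the first, the linear form reduces to `2 * e2 - e1`, which is a quick check on any implementation of this sketch.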