https://arxiv.org/api/e5xKsstlZuaFDZpcAdBd+2A82Vs2026-06-21T19:26:22Z2749970515http://arxiv.org/abs/2604.14904v1Frozen density embedding with pCCD electron densities2026-04-16T11:47:49ZThe pair-coupled-cluster doubles (pCCD) method has emerged as a viable approach for quantum-chemical studies of strongly correlated systems. Despite its lower formal scaling (O(N$^4$)) compared to other versions of coupled cluster (CC) theory, applications to large chemical structures are still expensive. Fragmentation and embedding strategies offer a viable approach in such cases. In this work, we present a simple and efficient density-embedding scheme based on pCCD electron densities. The main computational benefit arises from the fact that pCCD response $Λ$-equations are much cheaper to compute than those of standard CC methods, providing easy access to one-electron properties. The pCCD densities of the individual subsystems are used to generate static embedding potentials that capture the environment's effect on the embedded system. The individual fragment energies are then iteratively converged in a self-consistent fashion. We demonstrate the reliable performance of this scheme with the estimation of dipole moments of the weakly bound CO2$\cdots$Rg (Rg = He, Ne, Ar, and Kr) complexes and with the modeling of vertical excitations of some microsolvated molecules.2026-04-16T11:47:49ZRahul ChakrabortyPaweł Tecmerhttp://arxiv.org/abs/2604.14873v1Highly coarse-grained polarisable water models for mesoscopic simulations2026-04-16T11:02:21ZModelling micro- and meso-scopic scale thermodynamic and transport properties of soft condensed matter hinges upon its representation. This is especially relevant for polar solvents such as water, since these require effective representation of their dielectric nature as driven by molecular charge distributions and molecular network structuring. The dielectric nature of a medium leads to complex phenomena such as local polarisability response and restructuring near interfaces in reaction to changes in local charge distributions. Inclusion of such phenomena when using larger-than-atomistic techniques such as coarse-grained molecular dynamics (CG-MD) and dissipative particle dynamics (DPD) is still an open question, to which we provide a novel way to consider and justify the necessary and suitable coarse-graining level, enabling us to compare new polar CG models' performance against that of an underlying atomistic model. We polarise our previous non-polar nDPD water model to prepare it for use in simulations of liquid electrolytes as well as solvated organic membranes and measure its fitness to serve as a dielectric medium by comparing its properties to those of the TIP3P water model, while simultaneously observing changes to properties already represented well by the non-polar model.2026-04-16T11:02:21ZMichael A. SeatonBenjamin T. SpeakeIlian T. Todorovhttp://arxiv.org/abs/2604.14848v1Emergence of Open Chemical Reaction Network Thermodynamics within Closed Systems2026-04-16T10:34:16ZWe address a fundamental question: under which conditions do the dynamics and thermodynamics of open chemical reaction networks (CRNs), grounded on the notion of idealized chemostats that exchange selected species, emerge from underlying closed CRNs? While open CRNs provide the standard framework to describe out-of-equilibrium chemical systems, real systems are finite and ultimately relax to equilibrium, leaving the status of this description conceptually unresolved. Here we show that open-CRN behavior arises as an asymptotic regime of closed CRNs when two minimal and physically transparent conditions are met: a time-scale separation, whereby fast reactions effectively act as exchange mechanisms, and an abundance separation, whereby a subset of species behaves as chemostats with diverging chemical capacity. In this regime, both the stochastic dynamics and the thermodynamic structure \ -- including local detailed balance, entropy production, and free-energy balance \ -- emerge to leading order from the underlying closed CRN. Our results apply to arbitrary stoichiometries. They show that chemostats need not be introduced as external idealizations, but instead arise as emergent thermodynamic structures within closed systems, providing a unified and physically grounded foundation for the nonequilibrium thermodynamics of CRNs.2026-04-16T10:34:16Z26 pages, 5 figuresBenedikt RemleinMassimiliano EspositoFrancesco Avanzinihttp://arxiv.org/abs/2604.14784v1Interfacial Electric Fields in Water Nanodroplets are Weakly Dependent on Curvature and pH2026-04-16T08:42:28ZThe origin of enhanced reactivity in aqueous microdroplets remains debated, with interfacial electric fields (IEFs) often invoked as catalytic drivers. Here, we provide a quantum-mechanical, spatially resolved characterization of the electric field at air-water interfaces by combining deep-learning molecular dynamics with \emph{ab initio} re-sampling. Across planar interfaces and nanodroplets of varying curvature and charge state, we find an outward-oriented field of $\sim 1.0$--$1.2$ V/Å along the intrinsic surface normal. Crucially, its magnitude scales linearly with the average number of hydrogen bonds per interfacial molecule, directly tying the field to the local hydrogen-bond network. Despite its large magnitude and contrary to common expectations, we find that curvature and pH exert only a minor influence on the IEF, becoming negligible at experimentally relevant droplet sizes and pH. Consequently, the reactivity differences observed in $μ$m-sized droplets cannot be ascribed to variations in the IEF, which changes by a factor of only $\sim10^{-5}$ between $3$ and $40μ$m-sized droplets. Moreover, the IEF is localized inside the interfacial region and rapidly vanishes within a few Å. This strong spatial confinement renders the IEF strongly tied to the local electronic structure, identifying it as a local property of the air-water boundary rather than an independent physical driver of ``on-water'' catalysis.2026-04-16T08:42:28ZGabriele AmanteFortunata PanzeraGabriele CentiJing XieAli HassanaliA. Marco SaittaGiuseppe Cassonehttp://arxiv.org/abs/2603.10523v2First-Principles Electronegativity Scale from the Atomic Mean Inner Potential2026-04-16T08:39:56ZElectronegativity is a cornerstone of chemical intuition, essential for rationalizing bonding, reactivity, and material properties. However, prevailing scales remain empirically derived, often relying on parameterized models or composite physical quantities. In this work, we introduce a universal electronegativity scale founded on the atomic mean inner potential (AMIP), also known as the average Coulomb potential, a fundamental, quantum-mechanical property accessible through both first-principles computation and electron-scattering experiments. Our scale, denoted $χ_{\mathrm{AMIP},p}$, is an analytic function of just three ground-state atomic descriptors and carries explicit physical units. It demonstrates excellent agreement with established scales and successfully classifies bonding types across 358 compounds, including adherence to the metalloid ``Si rule". Beyond replicating known trends, $χ_{\mathrm{AMIP,1/2}}$ proves to be a powerful predictive tool, accurately determining Lewis acid strengths for over 14,000 coordination environments ($R^2=0.93$) and $γ$-ray annihilation spectral widths for 36 elements ($R^2=0.97$), outperforming previous methods. By linking electronegativity directly to a measurable quantum property, this work provides a unified and predictive descriptor for electronic structure and chemical behavior across the periodic table.2026-03-11T08:28:10Z36 pages, 9 figures, 3 tables. Additional data for Zn, Cd, and Hg are providedFrontiers of Physics, 21(11), 114201 (2026)Jin-Cheng Zheng10.15302/frontphys.2026.114201http://arxiv.org/abs/2511.13402v3Molecular mechanism of heterogeneous ice nucleation on potassium feldspar2026-04-16T07:40:27ZMineral dust aerosols strongly influence Earth's climate by acting as ice-nucleating particles (INPs). Feldspar minerals, particularly K-feldspar, are recognized as dominant INPs, and a previous study attributed this behavior to (100) surfaces exposed at defects. Using machine-learning molecular dynamics simulations, we systematically investigate ice nucleation on multiple K-feldspar surfaces. We identify the (110) surface, exposed at defects such as steps, as the most active plane for ice formation. This surface uniquely structures interfacial water into an arrangement resembling that on the (110) surface of cubic ice, providing an optimal template for nucleation. Using advanced sampling, we directly observe the formation of clusters with cubic-ice structure and their orientation agrees with experiment. These results provide a molecular-level explanation of how ice forms in our planet's atmosphere.2025-11-17T14:17:10Z23 pages, 4 figures, and supplementary materialsWanqi ZhouPablo M. Piaggihttp://arxiv.org/abs/2602.12109v3A critical assessment of bonding descriptors for predicting materials properties2026-04-16T07:21:01ZMost machine learning models for materials science rely on descriptors based on materials compositions and structures, even though the chemical bond has been proven to be a valuable concept for predicting materials properties. Over the years, various theoretical frameworks have been developed to characterize bonding in solid-state materials. However, integrating bonding information from these frameworks into machine learning pipelines at scale has been limited by the lack of a systematically generated and validated database. Recent advances in high-throughput bonding analysis workflows have addressed this issue, and our previously computed Quantum-Chemical Bonding Database for Solid-State Materials was extended to include approximately 13,000 materials. This database is then used to derive a new set of quantum-chemical bonding descriptors. A systematic assessment is performed using statistical significance tests to evaluate how the inclusion of these descriptors influences the performance of machine-learning models that otherwise rely solely on structure- and composition-derived features. Models are built to predict elastic, vibrational, and thermodynamic properties typically associated with chemical bonding in materials. The results demonstrate that incorporating quantum-chemical bonding descriptors not only improves predictive performance but also helps identify intuitive expressions for properties such as the projected force constant and lattice thermal conductivity via symbolic regression.2026-02-12T16:00:12ZAakash Ashok NaikNidal DhamraitKatharina UeltzenChristina ErturalPhilipp BennerGian-Marco RignaneseJanine Georgehttp://arxiv.org/abs/2604.19812v1An efficient method based on the evolutionary center algorithm for optimizing chemical-diffusive models for flame acceleration and DDT2026-04-16T02:34:07ZThis paper presents an efficient method based on Evolutionary Center Algorithm (ECA) for accurately and efficiently determining the optimal reaction and diffusion parameters for Chemical-Diffusive Models (CDM) to simulate flame acceleration (FA) and deflagration-to-detonation transition (DDT). The proposed method leverages the global search capability of the ECA and the local optimization strength of the Nelder-Mead (NM) algorithm. The hybrid approach (ECA-NM) can efficiently optimize CDM parameters that are capable of accurately reproducing the major properties of combustion waves. The CDMs for premixed flames and detonations of hydrogen in air or oxygen were developed using the present ECA-NM method and validated against canonical tests of combustion waves and previous experiments of FA and DDT. The results show that the major flame and detonation properties calculated using the developed CDMs match those obtained from detailed chemical reaction mechanisms over a wide range of equivalence ratio. The simulated FA and DDT in a channel also agree qualitatively and quantitatively with experiments in terms of complex flame instabilities (e.g., tulip and distorted tulip flames), flame displacement speed, and detonation occurrence. In addition, detailed comparisons to the traditional genetic algorithm demonstrate that the developed ECA-NM method diminishes the global error by four orders of magnitude while reducing the computational cost by two orders of magnitude. This work provides a significantly efficient method for developing chemical-diffusive models that allows quantitative multi-scale simulations of transient flames and detonations in complex scenarios.2026-04-16T02:34:07ZManuscript with 13 figures, 7 tables, and appendixHuahua XiaoXu ZhangMingbin ZhaoCongling Shihttp://arxiv.org/abs/2511.13677v2Open-shell frozen natural orbital approach for quantum eigensolvers2026-04-15T19:20:03ZWe present an open-shell frozen natural orbital (FNO) approach, which utilizes the second-order Z-averaged perturbation theory (ZAPT2), to reduce the restricted opten-shell Hartree-Fock virtual space size with controllable accuracy. Our ZAPT2 frozen natural orbital (ZAPT-FNO) selection scheme significantly outperforms the canonical molecular orbital virtual space truncation scheme based on Hartree-Fock orbital energies, especially when using large multiple-polarized and augmented basis sets. We demonstrate that the ZAPT-FNO-selected virtual orbitals lead to a systematic convergence of the correlation energies, but more importantly to the singlet-triplet T$_1$-S$_ 0$ energy gaps with respect to the complete active space (CAS) [occupied + virtual] size. We confirm our findings by simulating T$_1$-S$_ 0$ gaps in H$_2$O$_2$ and O$_2$ molecules using the traditional complete active space configuration interaction (CASCI) approach, as well as in stretched CH$_2$, for which we also employed the iterative qubit coupled cluster (iQCC) method as a quantum eigensolver. Finally, we applied the iQCC method with ZAPT-FNO-selected active space to the phosphorescent Ir(ppy)$_3$ complex with 260 electrons, where extended basis sets are required to achieve chemical (ca. 1 m$E_h$) accuracy. In this case, CASCI results are not available; however, the iQCC-computed T$_1$-S$_ 0$ gaps show robust convergence with enlarging basis set and CAS size, approaching the experimental value. Thus, the ZAPT-FNO method is very promising for improving the accuracy of quantum chemical modelling in a resource-efficient manner, and opens the door to simulating open-shell states of large materials within realistic active space sizes and without compromising on basis-set quality.2025-11-17T18:32:03Z16 pages, 7 figures, 5 tablesJ. Chem. Phys. 164, 154105 (2026)Angela F. HarperXiaobing LiuScott N. GeninIlya G. Ryabinkin10.1063/5.0312011http://arxiv.org/abs/2604.14115v1Configuration interaction extension of AGP for incorporating inter-geminal correlations2026-04-15T17:37:59ZIn this paper, we develop a class of antisymmetrized geminal power configuration interaction (AGP-CI) wave functions that extend the AGP framework by incorporating inter-geminal correlations through a CI expansion. To make these wavefunctions computationally tractable, we evaluate them by rewriting the AGP-CI ansatz as a linear combination of AGPs (LC-AGP), for which overlaps and Hamiltonian matrix elements can be computed with standard AGP machinery. Motivated by border-rank decompositions, we further reorganize this ansatz into a compact linear combination of AGPs depending on a small deformation parameter $τ$, which controls how closely the truncated expansion approximates the full AGP-CI state. Benchmark applications to the Hubbard model and to the small molecules H$_2$O and N$_2$ demonstrate that the proposed wavefunctions achieve consistently high accuracy and outperform the LC-AGP, particularly for systems with more electrons and in strongly correlated regimes.2026-04-15T17:37:59Z28 pages, 11 figuresAiri KawasakiFei GaoGustavo E. Scuseriahttp://arxiv.org/abs/2601.02294v5Coupling between thermochemical contributions of subvalence correlation and of higher-order post-CCSD(T) correlation effects -- a step toward `W5 theory'2026-04-15T14:10:09ZWe consider the thermochemical impact of post-CCSD(T) contributions to the total atomization energy (TAE, the sum of all bond energies) of first- and second-row molecules, and specifically their coupling with the subvalence correlation contribution. In particular, we find large contributions from (Q) when there are several neighboring second-row atoms. Otherwise, both higher-order triples $T_3$--(T) and connected quadruples (Q) are important in systems with strong static correlation. Reoptimization of the reference geometry for core-valence correlation increases the calculated TAE across the board, most pronouncedly so for second-row compounds with neighboring second-row atoms. %just slightly increases the calculated TAE for all species, but more pronouncedly so if strong static correlation is present, as well as for second-row compounds, again especially with neighboring second-row atoms. We present a first proposal for a `W5 theory' protocol and compare computed TAEs for the W4-08 benchmark with prior reference values. For some key second-row species, the new values represent nontrivial revisions. Our predicted TAE$_0$ values (TAE at 0 K) agree well with the ATcT (active thermochemical tables) values, including for the very recent expansion of the ATcT network to boron, silicon, and sulfur compounds.2026-01-05T17:30:59ZJ. Phys. Chem. A (John F. Stanton memorial issue), Open Access CC:BYJ. Phys. Chem. A 130, 2943-2955 (2026)Aditya BarmanGregory H. JonesKaila E. WeflenMargarita ShepelenkoJan M. L. Martin10.1021/acs.jpca.6c00467http://arxiv.org/abs/2604.13753v1Critical point search and linear response theory for computing electronic excitation energies of molecular systems. Part II. CASSCF2026-04-15T11:40:49ZThe computation of excited states within the Complete Active Space Self-Consistent Field (CASSCF) framework remains a significant challenge in quantum chemistry, both theoretically and algorithmically. In this work, we extend the Kähler manifold formalism introduced in Part I of this series to the CASSCF theory, and draw a geometrical connection from the time-dependent CASSCF equations to state-specific and linear response methodologies for excited states. This is achieved by first investigating the underlying CASSCF manifold and identifying its Kähler structure, which is complicated by the nontrivial coupling of CI and orbital degrees of freedom. Building on these theoretical findings, we derive the CASSCF linear response equations in a straightforward manner, and develop a robust state-specific method that relies solely on first-order derivatives of the CASSCF energy functional. Numerical results on representative molecular systems-water, formaldehyde, and ethylene-demonstrate the effectiveness of the proposed state-specific method, while revealing the difficulty of reliable identification of excited states due to nonlinearity induced by the CASSCF theory.2026-04-15T11:40:49Z16 pages, 2 figures, 2 tablesLaura GrazioliYukuan HuTommaso NottoliFilippo LippariniEric Cancèshttp://arxiv.org/abs/2502.05909v3Towards a Universal Foundation Model for Protein Dynamics: A Multi-Chain Tree-Structured Framework with Transformer Propagators2026-04-15T11:07:43ZSimulating large-scale protein dynamics using traditional all-atom molecular dynamics (MD) remains computationally prohibitive. We present a unified, universal framework for coarse-grained molecular dynamics (CG-MD) that achieves high-fidelity structural reconstruction and generalizes across diverse protein systems. Central to our approach is a hierarchical, tree-structured protein representation (TSCG) that maps Cartesian coordinates into a minimal set of interpretable collective variables. We extend this representation to accommodate multi-chain assemblies, demonstrating sub-angstrom precision in reconstructing full-atom structures from coarse-grained nodes. To model temporal evolution, we formulate protein dynamics as stochastic differential equations (SDEs), utilizing a Transformer-based architecture as a universal propagator. By representing collective variables as language-like sequences, our model transcends the limitations of protein-specific networks, generalizing to arbitrary sequence lengths and multi-chain configurations. The framework achieves an acceleration of over 10,000 to 20,000 times compared to traditional MD, generating microsecond-long trajectories within minutes. Our results show that the generated trajectories maintain statistical consistency with all-atom MD in RMSD profiles and structural ensembles. This universal model provides a salable solution for high-throughput protein simulation, offering a significant leap toward a foundation model for molecular dynamics.2025-02-09T14:08:23Z14 pages, 10 figuresJinzhen Zhuhttp://arxiv.org/abs/2604.13249v1Free energy differences and coexistence of clathrate structures II and H via lattice-switch Monte Carlo2026-04-14T19:24:09ZWe introduce a simulation technique to compute the free energy difference between two hydrate structures of different stoichiometry connected to a reservoir of gas molecules at a prescribed pressure. The method permits the determination of coexistence parameters for the system when the two hydrate structures have the same number of water molecules $N_w$. The approach is based on performing isobaric Lattice Switch Monte Carlo simulations to measure free energy differences between the hydrate structures when they are either fully occupied by gas molecules, or fully empty. This measurement is combined with thermodynamic integration within an ensemble in which the number of guest molecules $N_g$ can fluctuate under the control of a chemical potential $μ_g$. We analyze the properties of the resulting constant-$N_w,μ_g,P,T$ ensemble and show how it can be used to calculate coexistence points via a thermodynamic cycle. Applying the method to argon and methane structures, we find coexistence pressures that are in good agreement overall with the available experimental data.2026-04-14T19:24:09Z19 pages, 18 figuresOlivia S. MoroNigel B. WildingVincent Balleneggerhttp://arxiv.org/abs/2604.12914v1Efficient Implementation of Relativistic Coupled Cluster Linear Response Theory in Combination with Perturbation Sensitive Natural Spinors and Cholesky Decomposition Treatment of Two-electron Integrals2026-04-14T15:58:57ZWe present an efficient implementation of the low-cost linear-response coupled-cluster singles and doubles (LR-CCSD) method for computing static and frequency-dependent polarizabilities in systems with significant relativistic and electron-correlation effects. The approach employs X2C-based Hamiltonians (X2CAMF and X2CMP) and incorporates Cholesky decomposition to reduce memory requirements. In the current implementation, costly three- and four-external index integrals are generated on the fly, eliminating the need for their storage. Benchmark results indicate that the X2CMP Hamiltonian provides more consistent performance than X2CAMF, particularly for large and highly augmented basis sets. The proposed FNS++CD-X2CMP-LR-CCSD method shows excellent agreement with four-component reference values across a wide range of systems. Additionally, different strategies for constructing the FNS++ basis were assessed, and an averaged density approach was found to offer a favorable balance between accuracy and computational cost. On average, about 73% of the virtual spinor space is removed, demonstrating the efficiency and consistency of the FNS++ density-based truncation approach. The present implementation enables accurate and scalable relativistic response calculations for large molecular systems, as demonstrated by the calculation of the static polarizability of the Uranium Hexafluoride complex with a triple-zeta basis set more than 1400 basis functions.2026-04-14T15:58:57ZSudipta ChakrabortyMuskan BegomXubo WangAchintya Kumar Dutta