https://arxiv.org/api/qpjUwfy4f1893B51XFHYmru7V9E 2026-06-10T09:54:19Z 26559 105 15 http://arxiv.org/abs/2604.09412v2 Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks 2026-05-29T11:52:09Z We study the population loss landscape of two-layer ReLU networks of the form $\sum_{k=1}^K \mathrm{ReLU}(w_k^\top x)$ in a realisable teacher-student setting with Gaussian covariates. We show that local minima admit an exact low-dimensional representation in terms of summary statistics, yielding a sharp and interpretable characterisation of the landscape. We further establish a direct link with one-pass SGD: local minima correspond to attractive fixed points of the dynamics in summary statistics space. This perspective reveals a hierarchical organisation of minima into discrete families and shows how overparameterisation changes their stability and reachability under gradient-based dynamics. In this overparameterised regime, global minima become increasingly accessible, attracting the dynamics and reducing convergence to spurious solutions. Overall, our results reveal intrinsic limitations of common simplifying assumptions, which may miss essential features of the loss landscape even in minimal neural network models. 2026-04-10T15:26:00Z 29 pages, 18 figures. Accepted as a conference paper at ICML 2026 Jie Huang Bruno Loureiro Stefano Sarao Mannelli http://arxiv.org/abs/2308.15532v3 Information Bounds on phase transitions in disordered systems 2026-05-29T07:18:54Z Information theory, rooted in computer science, and many-body physics, have traditionally been studied as (almost) independent fields. Only recently has this paradigm started to shift, with many-body physics being studied and characterized using tools developed in information theory. In our work, we introduce a new perspective on this connection, and study phase transitions in models with randomness, such as localization in disordered systems, or random quantum circuits with measurements. Utilizing information-based arguments regarding probability distribution differentiation, we bound critical exponents in such phase transitions (specifically, those controlling the correlation or localization lengths). We benchmark our method and rederive the well-known Harris criterion, bounding critical exponents in the Anderson localization transition for noninteracting particles, as well as classical disordered spin systems. We then move on to apply our method to many-body localization. While in real space our critical exponent bound agrees with recent consensus, we find that, somewhat surprisingly, numerical results on Fock-space localization for limited-sized systems do not obey our bounds, indicating that the simulation results might not hold asymptotically (similarly to what is now believed to have occurred in the real-space problem). We also apply our approach to random quantum circuits with random measurements, for which we can derive bounds transcending recent mappings to percolation problems. 2023-08-29T18:00:07Z 9 pages, 2 figures, comments are welcome Noa Feldman Niv Davidson Moshe Goldstein 10.21468/SciPostPhys.20.5.146 http://arxiv.org/abs/2508.07707v2 Observation and Modulation of the Quantum Mpemba Effect on a Superconducting Quantum Processor 2026-05-29T06:17:07Z In non-equilibrium quantum systems, the quantum Mpemba effect (QME) emerges as a counterintuitive phenomenon: systems exhibiting greater initial symmetry breaking restore symmetry faster. It has been attracting broad interest in studying QME dynamics and potential applications in quantum information science. While theoretical exploration of QME has surged, experimental studies, specifically on its flexible modulation, remain limited. Here, we report the observation and modulation of QME using a superconducting processor featuring an all-to-all connected, tunable-coupling architecture that enables precise control from short- to long-range interactions. This platform allows independent manipulation of coupling regimes, on-site potentials, and initial states, enabling us to elucidate their roles in QME. To quantify symmetry restoration, we employ entanglement asymmetry (EA), derived from the reconstructed density matrix via quantum state tomography, as a sensitive probe. In strong short-range coupling regimes, EA crossovers during quenches from tilted Néel states confirm the presence of QME. In contrast, in intermediate coupling regimes, synchronized EA and entanglement entropy dynamics reveal the suppression of QME. Remarkably, QME reemerges with the introduction of on-site linear potentials or quenches from tilted ferromagnetic states, the latter proving robust against on-site disorder. Our study demonstrates flexible QME modulation on a superconducting platform with multiple controllable parameters, shedding light on quantum many-body non-equilibrium dynamics and opening avenues for quantum information applications. 2025-08-11T07:35:26Z Figures modified and new discussions added. Final version published in PRL Yueshan Xu Cai-Ping Fang Bing-Jie Chen Ming-Chuan Wang Zi-Yong Ge Yun-Hao Shi Yu Liu Cheng-Lin Deng Kui Zhao Zheng-He Liu Tian-Ming Li Hao Li Ziting Wang Gui-Han Liang Da'er Feng Xueyi Guo Xu-Yang Gu Yang He Hao-Tian Liu Zheng-Yang Mei Yongxi Xiao Yu Yan Yi-Han Yu Wei-Ping Yuan Jia-Chi Zhang Zheng-An Wang Gangqin Liu Xiaohui Song Ye Tian Yu-Ran Zhang Shi-Xin Zhang Kaixuan Huang Zhongcheng Xiang Dongning Zheng Kai Xu Heng Fan 10.1103/951q-j8kq http://arxiv.org/abs/2605.30822v1 Using graph neural networks to predict many-body interactions in amorphous materials 2026-05-29T04:14:19Z Many-body interactions govern the complex behavior of many amorphous materials, from metallic glasses to biological tissues, yet are often replaced by pairwise additive frameworks for computational efficiency. Here, we use classical density functional theory (DFT) to study a model soft glass of solvent-free polymer-grafted nanoparticles (PGNs), where the absence of solvent forces grafted chains to uniformly fill the interstitial space, generating strong angular-dependent many-body interactions between the cores. We show that NequIP, an equivariant message-passing graph neural network (GNN), learns the high-dimensional, rugged potential energy landscape of the system and reproduces classical DFT energies across a range of PGN design parameters at four orders of magnitude lower cost. Systematic analysis of GNN hyperparameters offers physical insights into the range, anisotropy, and effective body order of interactions. GNN-driven Monte Carlo simulations reveal locally favored icosahedral-like structures at equilibrium, and strikingly, recover equilibrium structures in agreement with experiments, despite the network being trained only on high-energy, out-of-equilibrium configurations. 2026-05-29T04:14:19Z Mehryar Jannesari Ghomsheh Donald L. Koch Sarah Hormozi http://arxiv.org/abs/2605.29969v1 Prototype-Guided Latent Alignment for Data-Efficient Fine-Tuning of Molecular Foundation Models 2026-05-28T14:07:48Z Machine learning interatomic potentials (MLIPs) have transformed materials discovery by leveraging graph neural networks (GNNs) to predict material properties with near density functional theory (DFT) accuracy. While large-scale pretrained foundation models offer transferable baseline representations, they frequently struggle to generalise to out-of-distribution (OOD) target systems -- a common challenge in modelling complex or chemically diverse materials. Fine-tuning is the standard remedy, but the high cost of generating DFT-labelled configurations confines adaptation to data-scarce regimes, where over-parameterised GNNs amplify overfitting and degrade target-domain performance. To address this, we propose a prototype-based alignment approach for data-efficient fine-tuning of MLIPs. Our method identifies local structural similarities between the source and target domains by grouping atoms with analogous chemical environments based on their latent representations. Each target-domain atom's energy contribution is aligned to its source-domain prototype, introducing an inductive bias that anchors fine-tuned representations to the pretrained structure, encouraging effective reuse of learned interactions and improving generalisation without restrictive assumptions on the target chemistry. We evaluate our method on the rMD17 benchmark using equivariant MACE and invariant SchNet across varying data budgets, and extend evaluation to the MACE-OFF foundation models on the SPICE dataset. Our approach consistently improves predictive accuracy in the low-data regime, reducing energy MAE by up to 18% over standard fine-tuning baselines. 2026-05-28T14:07:48Z 17 pages, 3 figures Rushikesh Pawar Harshit Rawat Ayush Kumar Phani Motamarri http://arxiv.org/abs/2605.29885v1 Open Problem: Separating Geometric and Algorithmic Compression via Cayley-Table Completion 2026-05-28T13:10:04Z Modern statistical learning theory and deep learning characterize generalization primarily in terms of continuous capacity control (e.g., norm-based regularization, margin maximization, low-rank bias). While highly successful in continuous domains, deep learning consistently fails to extrapolate exact algorithmic or discrete algebraic rules, reflecting a missing inductive bias toward algorithmic complexity minimization. We propose the Cayley-table completion as the canonical testbed for this missing bias, serving as the discrete algebraic counterpart to matrix completion. Just as matrix factorization combined with weight decay yields an implicit geometric bias toward low linear rank, recent results demonstrate that operator-valued tensor factorizations paired with a flatness prior yield an implicit algorithmic bias toward exact discrete associativity. We pose the open problem of establishing formal exact recovery bounds for Cayley-table completion, and challenge the community to generalize continuous flatness priors to autonomously discover broader discrete algorithmic axioms without combinatorial search. 2026-05-28T13:10:04Z 6 pages. Submitted to the Conference on Learning Theory (COLT) 2026 Open Problem track Dongsung Huh http://arxiv.org/abs/2605.29875v1 Estimates of ground state energies for the quantum SK and 2D-EA model, using deGennes-Suzuki-Kubo mean-field annealing dynamics 2026-05-28T12:59:09Z We perform a large scale simulation of quantum annealing in the Sherrington-Kirkpatrick (SK) spin glass up to a system size $N=40000$ to estimate its ground state energy using the deGennes-Suzuki-Kubo mean-field Ising dynamics, extending the earlier results (reported in Eur. Phys. J. B {\bf 98}, 226 (2025)). Here we numerically solve the deGennes-Suzuki-Kubo annealing dynamics to obtain the spin configurations and subsequently the ground state energy for a given system size at the end of the annealing (to the desired quantum system at the corresponding values of the transverse field), starting from a quantum paramagnetic state. The method shows high efficiency, with an overall algorithmic cost of $O(N^3)$ in estimating the energy of the ground state. We later extend this method to study the ground state energy of the Edwards-Anderson (EA) spin glass on a square lattice. 2026-05-28T12:59:09Z 6 pages, 5 figures Soumyaditya Das Soumyajyoti Biswas Bikas K. Chakrabarti http://arxiv.org/abs/2605.29871v1 Enhanced Density Fluctuations Near a Disordered Chiral Topological Transition 2026-05-28T12:55:33Z The universal statistics of density fluctuations of localized quantum states may offer unprecedented opportunities to probe and understand quantum transport in connection with dimensionality, coherence, symmetry and disorder. To date, the possible role of topological phase transitions in the fluctuation statistics is not studied yet. Using a Su-Schrieffer-Heeger chain subject to off-diagonal disorder (so that chiral symmetry is preserved), this work investigates how a disorder driven topological phase transition impacts on the spatial fluctuations of the logarithmic wave-packet density $\ln P(r)$ at distance $r$ from the initial excitation. Away from the transition, in both topological and trivial localized phases, the standard deviation follows the conventional one-dimensional scaling $σ[\ln P(r)]\sim r^θ$ with $θ\simeq 1/2$. Near the transition, however, the fluctuation growth is enhanced: the fitted exponent $θ$ increases above $1/2$ in a nonmonotonic manner before returning close to $1/2$ at criticality. We interpret this behavior from the energy-resolved density of states and localization length. Near the transition, several energy sectors carry appreciable spectral weight and exhibit competitive decay rates, preventing a single localization scale from dominating the accessible wave-packet tail and thereby enhancing the fluctuations of $\ln P(r)$. Our results establish wave-packet fluctuation statistics as a dynamical diagnostic of disordered chiral topological transitions and motivate broader studies of fluctuation phenomena in disordered topological quantum systems. 2026-05-28T12:55:33Z Hai-Tao Ding Sen Mu Leong-Chuan Kwek Gabriel Lemarié Jiangbin Gong http://arxiv.org/abs/2605.29745v1 Geometry and localization: Probing Localization Landscape Theory on the Bethe Lattice 2026-05-28T10:42:54Z The Localization Landscape Theory (LLT) offers a classical analogy for understanding Anderson localization through an effective confining potential, whose percolation threshold has been proposed to mark the mobility edge. While this correspondence shows striking numerical agreement in three dimensions, its theoretical foundations remain an open question. In this work, we extend the analysis of the LLT on the Bethe lattice presented in~\cite{Tonetti2026}. In this setting in both the Anderson localization transition and the LLT percolation problem admit exact solutions. Our analysis reveals that the two transitions are distinct, with markedly different critical behaviors. Notably, the LLT percolation transition falls into the standard mean-field universality class, in sharp contrast with the unconventional critical behavior of the Anderson transition on the Bethe lattice. Nonetheless, the LLT framework reproduces several exact results, capturing nontrivial features of the very low-disorder regime: it predicts the position of the isolated eigenvalue, the minimal disorder at which both the LLT percolation curve and the mobility edge first appear, and the Aizenman--Warzel lower bound for localization. We also study the dependence of the LLT percolation threshold on the energy shift, evaluate the LLT prediction for the Density of States, and derive several results on the statistical properties of the variables controlling the problem. Finally, we develop an extreme-value analysis showing that the LLT prediction for the Density of States overestimates the amplitude of the tails close to the boundary of the continuous spectrum. These findings provide an exact analytical benchmark showing that, despite its geometric appeal, the LLT does not generally reproduce the quantum critical properties of Anderson localization, while still offering a powerful tool to understand its very low-disorder regime. 2026-05-28T10:42:54Z 50 pages, 13 figures. arXiv admin note: substantial text overlap with arXiv:2512.04037 Lorenzo Tonetti Leticia F. Cugliandolo Marco Tarzia http://arxiv.org/abs/2605.29684v1 Kernel Renormalization in Bayesian Deep Neural Networks: the Equivalent Wishart Ansatz in the Proportional Regime 2026-05-28T09:49:54Z The scaling limit where both the size of the training set $P$ and the width $N$ of a deep neural network grow at the same rate, the so-called proportional-width regime, has been intensely studied for shallow, single-hidden-layer networks. However, extending these non-perturbative results from shallow architectures to deep non-linear networks has proven very challenging. Here we present an effective approximate approach to predict the generalization performance of Bayesian multi-layer perceptrons (MLPs) of fixed depth $L$ on arbitrary high-dimensional data. We propose an equivalent Wishart Ansatz to capture the dominant stochastic fluctuations of the hierarchical empirical kernels of MLPs. This allows us to perform a large deviation analysis for the partition function of MLPs in the proportional limit, expressed in terms of a renormalized NNGP kernel. In this description, even strong representation learning in the proportional limit is encoded in at most $L$ scalar order parameters, determined self-consistently. Extending the approach to convolutional architectures (CNNs), we identify a hierarchical local kernel renormalization mechanism, which allows to quantify more complex data-dependent transformations of the large-width kernel in CNNs due to finite-width effects. We test our effective theory against sampling experiments from the Bayesian posterior of finite deep neural networks with depths $L \sim O(10)$ and $P\sim O(10^3)$ on classic benchmark datasets, finding overall very good agreement together with two distinct types of systematic deviations. 2026-05-28T09:49:54Z 45 pages, 21 figures Paolo Baglioni Christian Keup Vincenzo Zimbardo Rosalba Pacelli Alessandro Vezzani Raffaella Burioni Pietro Rotondo http://arxiv.org/abs/2602.06791v2 Rare Event Analysis of Large Language Models 2026-05-28T09:05:59Z Being probabilistic models, during inference large language models (LLMs) display rare events: behaviour that is far from typical but highly significant. By definition all rare events are hard to see, but the enormous scale of LLM usage means that events completely unobserved during development are likely to become prominent in deployment. Here we present an end-to-end framework for the systematic analysis of rare events in LLMs. We provide a practical implementation spanning theory, efficient generation strategies, probability estimation and error analysis, which we illustrate with concrete examples. We outline extensions and applications to other models and contexts, highlighting the generality of the concepts and techniques presented here. 2026-02-06T15:50:36Z ICML 2026 Oral Spotlight Jake McAllister Dorman Edward Gillman Dominic C. Rose Jamie F. Mair Juan P. Garrahan http://arxiv.org/abs/2511.21299v2 Discovery and recovery of crystalline materials with property-conditioned transformers 2026-05-28T08:19:39Z Generative models have recently shown great promise for accelerating the design and discovery of new functional materials. Conditional generation enhances this capacity by allowing inverse design, where specific desired properties can be requested during the generation process. However, conditioning of transformer-based approaches, in particular, is constrained by discrete tokenisation schemes and the risk of catastrophic forgetting during fine-tuning. This work introduces CrystaLLM-π (property injection), a conditional autoregressive framework that integrates continuous property representations directly into the transformer's attention mechanism. Two architectures, Property-Key-Value (PKV) Prefix attention and PKV Residual attention, are presented. These methods bypass inefficient sequence-level tokenisation and preserve foundational knowledge from unsupervised pre-training on Crystallographic Information Files (CIFs) as textual input. We establish the efficacy of these mechanisms through systematic robustness studies and evaluate the framework's versatility across two distinct tasks. First, for structure recovery, the model processes high-dimensional, heterogeneous X-ray diffraction patterns, achieving structural accuracy competitive with specialised models and demonstrating applications to experimental structure recovery and polymorph differentiation. Second, for materials discovery, the model is fine-tuned on a specialised photovoltaic dataset to generate novel, stable candidates validated by Density Functional Theory (DFT). It implicitly learns to target optimal band gap regions for high photovoltaic efficiency, demonstrating a capability to map complex structure-property relationships. CrystaLLM-π provides a unified, flexible, and computationally efficient framework for inverse materials design. 2025-11-26T11:42:28Z Cyprien Bone Matthew Walker Bradley A. A. Martin Kuangdai Leng Luis M. Antunes Ricardo Grau-Crespo Amil Aligayev Javier Dominguez Keith T. Butler http://arxiv.org/abs/2602.20256v2 Spectral Decimation of Quantum Many-Body Hamiltonians 2026-05-27T21:35:29Z We develop a systematic theory of spectral decimation for quantum many-body Hamiltonians and show that it provides a quantitative probe of emergent symmetries in statistically mixed spectra. Building on an analytical description of statistical mixtures, we derive an explicit expression for the size of a characteristic symmetry sector (CSS), defined as the largest subsequence of levels exhibiting non-Poissonian correlations. The CSS dimension is shown to be the size-biased average of the underlying symmetry sectors, establishing a direct link between spectral statistics and Hilbert-space structure. We apply this framework to two paradigmatic settings: Hilbert-space fragmentation and disorder-induced many-body localization (MBL). In fragmented systems, the CSS reproduces the mixture prediction and isolates correlated subsectors even when the full spectrum appears nearly Poissonian. In the disordered Heisenberg chain, spectral decimation reveals the gradual emergence of integrability through a shrinking CSS, whose statistics exhibit signatures consistent with local integrals of motion. We introduce a characteristic symmetry entropy (CSE) as a finite-size scaling observable and extract, within accessible system sizes, the crossover exponents. Our results establish spectral decimation as a controlled, unbiased and computationally inexpensive diagnostic of hidden structure in many-body spectra, capable of distinguishing between chaotic dynamics, statistical mixtures, and emergent integrability. 2026-02-23T19:00:05Z v2 ;16+7 pages; 5+3 figures Feng He Arthur Hutsalyuk Giuseppe Mussardo Andrea Stampiggi 10.1103/4dct-rs4q http://arxiv.org/abs/2605.28949v1 Order-disorder trade-off in dirty quantum systems 2026-05-27T18:00:06Z We prove a trade-off theorem for order and disorder parameters in one-dimensional quantum spin systems with quenched disorder. For a disordered ensemble with exact Ising symmetry and average translation symmetry, any gapped ensemble must have one and only one of the following: an $O(1)$ order parameter or an $O(1)$ disorder parameter with even parity, both of the Edwards-Anderson type. The result extends to nearly gapped ensembles that accommodate Griffiths-type rare-region effects. These results offer a powerful and rigorous framework to understand the disorder effects beyond perturbative approaches. As applications, we (1) establish the existence of string order parameters for SPT phases; (2) derive a Lieb-Schultz-Mattis-type constraint for disordered ensembles, which requires a nearly gapped ensemble to spontaneously break the symmetry; and (3) discuss similar trade-off relations for disordered fermion chains, leading to an improved understanding of certain "intrinsically disordered" topological phases. 2026-05-27T18:00:06Z 24 pages Jinmin Yi Chong Wang http://arxiv.org/abs/2605.28929v1 Improving CFT Operators Using Machine Learning 2026-05-27T18:00:00Z Finite-size effects limit the accuracy with which conformal data can be extracted from lattice simulations of critical systems. While action improvement suppresses some corrections to scaling, it does not address operator-dependent effects arising from imperfect lattice representations of continuum conformal fields. In this work, we propose a data-driven method for improving lattice operators themselves, constructing estimators with enhanced overlap with the corresponding primary operators of the continuum conformal field theory. We identify improved lattice representations of leading spin and energy operators in three two-dimensional critical systems: the Ising model, the q = 3 Potts model, and the dilute q = 3 Potts model. In all cases, the resulting operators exhibit reduced corrections to scaling and yield more accurate estimates of scaling dimensions compared to conventional lattice choices. The code and analysis workflows used to produce these results are made available in an accompanying GitHub repository. 2026-05-27T18:00:00Z Lior Oppenheim Snir Gazit Zohar Ringel