https://arxiv.org/api/qpjUwfy4f1893B51XFHYmru7V9E
2026-06-10T09:54:19Z
2655910515

http://arxiv.org/abs/2604.09412v2
Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks
2026-05-29T11:52:09Z
We study the population loss landscape of two-layer ReLU networks of the form $\sum_{k=1}^K \mathrm{ReLU}(w_k^\top x)$ in a realisable teacher-student setting with Gaussian covariates. We show that local minima admit an exact low-dimensional representation in terms of summary statistics, yielding a sharp and interpretable characterisation of the landscape. We further establish a direct link with one-pass SGD: local minima correspond to attractive fixed points of the dynamics in summary statistics space. This perspective reveals a hierarchical organisation of minima into discrete families and shows how overparameterisation changes their stability and reachability under gradient-based dynamics. In this overparameterised regime, global minima become increasingly accessible, attracting the dynamics and reducing convergence to spurious solutions. Overall, our results reveal intrinsic limitations of common simplifying assumptions, which may miss essential features of the loss landscape even in minimal neural network models.
2026-04-10T15:26:00Z
29 pages, 18 figures. Accepted as a conference paper at ICML 2026
Jie Huang, Bruno Loureiro, Stefano Sarao Mannelli

http://arxiv.org/abs/2308.15532v3
Information Bounds on phase transitions in disordered systems
2026-05-29T07:18:54Z
Information theory, rooted in computer science, and many-body physics, have traditionally been studied as (almost) independent fields. Only recently has this paradigm started to shift, with many-body physics being studied and characterized using tools developed in information theory.
In our work, we introduce a new perspective on this connection, and study phase transitions in models with randomness, such as localization in disordered systems, or random quantum circuits with measurements. Utilizing information-based arguments regarding probability distribution differentiation, we bound critical exponents in such phase transitions (specifically, those controlling the correlation or localization lengths). We benchmark our method and rederive the well-known Harris criterion, bounding critical exponents in the Anderson localization transition for noninteracting particles, as well as classical disordered spin systems. We then move on to apply our method to many-body localization. While in real space our critical exponent bound agrees with recent consensus, we find that, somewhat surprisingly, numerical results on Fock-space localization for limited-sized systems do not obey our bounds, indicating that the simulation results might not hold asymptotically (similarly to what is now believed to have occurred in the real-space problem). We also apply our approach to random quantum circuits with random measurements, for which we can derive bounds transcending recent mappings to percolation problems.
2023-08-29T18:00:07Z
9 pages, 2 figures, comments are welcome
Noa Feldman, Niv Davidson, Moshe Goldstein
10.21468/SciPostPhys.20.5.146

http://arxiv.org/abs/2508.07707v2
Observation and Modulation of the Quantum Mpemba Effect on a Superconducting Quantum Processor
2026-05-29T06:17:07Z
In non-equilibrium quantum systems, the quantum Mpemba effect (QME) emerges as a counterintuitive phenomenon: systems exhibiting greater initial symmetry breaking restore symmetry faster. It has been attracting broad interest in studying QME dynamics and potential applications in quantum information science. While theoretical exploration of QME has surged, experimental studies, specifically on its flexible modulation, remain limited.
Here, we report the observation and modulation of QME using a superconducting processor featuring an all-to-all connected, tunable-coupling architecture that enables precise control from short- to long-range interactions. This platform allows independent manipulation of coupling regimes, on-site potentials, and initial states, enabling us to elucidate their roles in QME. To quantify symmetry restoration, we employ entanglement asymmetry (EA), derived from the reconstructed density matrix via quantum state tomography, as a sensitive probe. In strong short-range coupling regimes, EA crossovers during quenches from tilted Néel states confirm the presence of QME. In contrast, in intermediate coupling regimes, synchronized EA and entanglement entropy dynamics reveal the suppression of QME. Remarkably, QME reemerges with the introduction of on-site linear potentials or quenches from tilted ferromagnetic states, the latter proving robust against on-site disorder. Our study demonstrates flexible QME modulation on a superconducting platform with multiple controllable parameters, shedding light on quantum many-body non-equilibrium dynamics and opening avenues for quantum information applications.
2025-08-11T07:35:26Z
Figures modified and new discussions added.
Final version published in PRL
Yueshan Xu, Cai-Ping Fang, Bing-Jie Chen, Ming-Chuan Wang, Zi-Yong Ge, Yun-Hao Shi, Yu Liu, Cheng-Lin Deng, Kui Zhao, Zheng-He Liu, Tian-Ming Li, Hao Li, Ziting Wang, Gui-Han Liang, Da'er Feng, Xueyi Guo, Xu-Yang Gu, Yang He, Hao-Tian Liu, Zheng-Yang Mei, Yongxi Xiao, Yu Yan, Yi-Han Yu, Wei-Ping Yuan, Jia-Chi Zhang, Zheng-An Wang, Gangqin Liu, Xiaohui Song, Ye Tian, Yu-Ran Zhang, Shi-Xin Zhang, Kaixuan Huang, Zhongcheng Xiang, Dongning Zheng, Kai Xu, Heng Fan
10.1103/951q-j8kq

http://arxiv.org/abs/2605.30822v1
Using graph neural networks to predict many-body interactions in amorphous materials
2026-05-29T04:14:19Z
Many-body interactions govern the complex behavior of many amorphous materials, from metallic glasses to biological tissues, yet are often replaced by pairwise additive frameworks for computational efficiency. Here, we use classical density functional theory (DFT) to study a model soft glass of solvent-free polymer-grafted nanoparticles (PGNs), where the absence of solvent forces grafted chains to uniformly fill the interstitial space, generating strong angular-dependent many-body interactions between the cores. We show that NequIP, an equivariant message-passing graph neural network (GNN), learns the high-dimensional, rugged potential energy landscape of the system and reproduces classical DFT energies across a range of PGN design parameters at four orders of magnitude lower cost. Systematic analysis of GNN hyperparameters offers physical insights into the range, anisotropy, and effective body order of interactions. GNN-driven Monte Carlo simulations reveal locally favored icosahedral-like structures at equilibrium, and strikingly, recover equilibrium structures in agreement with experiments, despite the network being trained only on high-energy, out-of-equilibrium configurations.
2026-05-29T04:14:19Z
Mehryar Jannesari Ghomsheh, Donald L. Koch, Sarah Hormozi

http://arxiv.org/abs/2605.29969v1
Prototype-Guided Latent Alignment for Data-Efficient Fine-Tuning of Molecular Foundation Models
2026-05-28T14:07:48Z
Machine learning interatomic potentials (MLIPs) have transformed materials discovery by leveraging graph neural networks (GNNs) to predict material properties with near density functional theory (DFT) accuracy. While large-scale pretrained foundation models offer transferable baseline representations, they frequently struggle to generalise to out-of-distribution (OOD) target systems -- a common challenge in modelling complex or chemically diverse materials. Fine-tuning is the standard remedy, but the high cost of generating DFT-labelled configurations confines adaptation to data-scarce regimes, where over-parameterised GNNs amplify overfitting and degrade target-domain performance. To address this, we propose a prototype-based alignment approach for data-efficient fine-tuning of MLIPs. Our method identifies local structural similarities between the source and target domains by grouping atoms with analogous chemical environments based on their latent representations. Each target-domain atom's energy contribution is aligned to its source-domain prototype, introducing an inductive bias that anchors fine-tuned representations to the pretrained structure, encouraging effective reuse of learned interactions and improving generalisation without restrictive assumptions on the target chemistry. We evaluate our method on the rMD17 benchmark using equivariant MACE and invariant SchNet across varying data budgets, and extend evaluation to the MACE-OFF foundation models on the SPICE dataset.
Our approach consistently improves predictive accuracy in the low-data regime, reducing energy MAE by up to 18% over standard fine-tuning baselines.
2026-05-28T14:07:48Z
17 pages, 3 figures
Rushikesh Pawar, Harshit Rawat, Ayush Kumar, Phani Motamarri

http://arxiv.org/abs/2605.29885v1
Open Problem: Separating Geometric and Algorithmic Compression via Cayley-Table Completion
2026-05-28T13:10:04Z
Modern statistical learning theory and deep learning characterize generalization primarily in terms of continuous capacity control (e.g., norm-based regularization, margin maximization, low-rank bias). While highly successful in continuous domains, deep learning consistently fails to extrapolate exact algorithmic or discrete algebraic rules, reflecting a missing inductive bias toward algorithmic complexity minimization. We propose the Cayley-table completion as the canonical testbed for this missing bias, serving as the discrete algebraic counterpart to matrix completion. Just as matrix factorization combined with weight decay yields an implicit geometric bias toward low linear rank, recent results demonstrate that operator-valued tensor factorizations paired with a flatness prior yield an implicit algorithmic bias toward exact discrete associativity. We pose the open problem of establishing formal exact recovery bounds for Cayley-table completion, and challenge the community to generalize continuous flatness priors to autonomously discover broader discrete algorithmic axioms without combinatorial search.
2026-05-28T13:10:04Z
6 pages.
Submitted to the Conference on Learning Theory (COLT) 2026 Open Problem track
Dongsung Huh

http://arxiv.org/abs/2605.29875v1
Estimates of ground state energies for the quantum SK and 2D-EA model, using deGennes-Suzuki-Kubo mean-field annealing dynamics
2026-05-28T12:59:09Z
We perform a large scale simulation of quantum annealing in the Sherrington-Kirkpatrick (SK) spin glass up to a system size $N=40000$ to estimate its ground state energy using the deGennes-Suzuki-Kubo mean-field Ising dynamics, extending the earlier results (reported in Eur. Phys. J. B {\bf 98}, 226 (2025)). Here we numerically solve the deGennes-Suzuki-Kubo annealing dynamics to obtain the spin configurations and subsequently the ground state energy for a given system size at the end of the annealing (to the desired quantum system at the corresponding values of the transverse field), starting from a quantum paramagnetic state. The method shows high efficiency, with an overall algorithmic cost of $O(N^3)$ in estimating the energy of the ground state. We later extend this method to study the ground state energy of the Edwards-Anderson (EA) spin glass on a square lattice.
2026-05-28T12:59:09Z
6 pages, 5 figures
Soumyaditya Das, Soumyajyoti Biswas, Bikas K. Chakrabarti

http://arxiv.org/abs/2605.29871v1
Enhanced Density Fluctuations Near a Disordered Chiral Topological Transition
2026-05-28T12:55:33Z
The universal statistics of density fluctuations of localized quantum states may offer unprecedented opportunities to probe and understand quantum transport in connection with dimensionality, coherence, symmetry and disorder. To date, the possible role of topological phase transitions in the fluctuation statistics has not yet been studied.
Using a Su-Schrieffer-Heeger chain subject to off-diagonal disorder (so that chiral symmetry is preserved), this work investigates how a disorder driven topological phase transition impacts on the spatial fluctuations of the logarithmic wave-packet density $\ln P(r)$ at distance $r$ from the initial excitation. Away from the transition, in both topological and trivial localized phases, the standard deviation follows the conventional one-dimensional scaling $σ[\ln P(r)]\sim r^θ$ with $θ\simeq 1/2$. Near the transition, however, the fluctuation growth is enhanced: the fitted exponent $θ$ increases above $1/2$ in a nonmonotonic manner before returning close to $1/2$ at criticality. We interpret this behavior from the energy-resolved density of states and localization length. Near the transition, several energy sectors carry appreciable spectral weight and exhibit competitive decay rates, preventing a single localization scale from dominating the accessible wave-packet tail and thereby enhancing the fluctuations of $\ln P(r)$. Our results establish wave-packet fluctuation statistics as a dynamical diagnostic of disordered chiral topological transitions and motivate broader studies of fluctuation phenomena in disordered topological quantum systems.
2026-05-28T12:55:33Z
Hai-Tao Ding, Sen Mu, Leong-Chuan Kwek, Gabriel Lemarié, Jiangbin Gong

http://arxiv.org/abs/2605.29745v1
Geometry and localization: Probing Localization Landscape Theory on the Bethe Lattice
2026-05-28T10:42:54Z
The Localization Landscape Theory (LLT) offers a classical analogy for understanding Anderson localization through an effective confining potential, whose percolation threshold has been proposed to mark the mobility edge. While this correspondence shows striking numerical agreement in three dimensions, its theoretical foundations remain an open question. In this work, we extend the analysis of the LLT on the Bethe lattice presented in~\cite{Tonetti2026}.
In this setting, both the Anderson localization transition and the LLT percolation problem admit exact solutions. Our analysis reveals that the two transitions are distinct, with markedly different critical behaviors. Notably, the LLT percolation transition falls into the standard mean-field universality class, in sharp contrast with the unconventional critical behavior of the Anderson transition on the Bethe lattice. Nonetheless, the LLT framework reproduces several exact results, capturing nontrivial features of the very low-disorder regime: it predicts the position of the isolated eigenvalue, the minimal disorder at which both the LLT percolation curve and the mobility edge first appear, and the Aizenman--Warzel lower bound for localization. We also study the dependence of the LLT percolation threshold on the energy shift, evaluate the LLT prediction for the Density of States, and derive several results on the statistical properties of the variables controlling the problem. Finally, we develop an extreme-value analysis showing that the LLT prediction for the Density of States overestimates the amplitude of the tails close to the boundary of the continuous spectrum. These findings provide an exact analytical benchmark showing that, despite its geometric appeal, the LLT does not generally reproduce the quantum critical properties of Anderson localization, while still offering a powerful tool to understand its very low-disorder regime.
2026-05-28T10:42:54Z
50 pages, 13 figures. arXiv admin note: substantial text overlap with arXiv:2512.04037
Lorenzo Tonetti, Leticia F. Cugliandolo, Marco Tarzia

http://arxiv.org/abs/2605.29684v1
Kernel Renormalization in Bayesian Deep Neural Networks: the Equivalent Wishart Ansatz in the Proportional Regime
2026-05-28T09:49:54Z
The scaling limit where both the size of the training set $P$ and the width $N$ of a deep neural network grow at the same rate, the so-called proportional-width regime, has been intensely studied for shallow, single-hidden-layer networks. However, extending these non-perturbative results from shallow architectures to deep non-linear networks has proven very challenging. Here we present an effective approximate approach to predict the generalization performance of Bayesian multi-layer perceptrons (MLPs) of fixed depth $L$ on arbitrary high-dimensional data. We propose an equivalent Wishart Ansatz to capture the dominant stochastic fluctuations of the hierarchical empirical kernels of MLPs. This allows us to perform a large deviation analysis for the partition function of MLPs in the proportional limit, expressed in terms of a renormalized NNGP kernel. In this description, even strong representation learning in the proportional limit is encoded in at most $L$ scalar order parameters, determined self-consistently. Extending the approach to convolutional architectures (CNNs), we identify a hierarchical local kernel renormalization mechanism, which allows us to quantify more complex data-dependent transformations of the large-width kernel in CNNs due to finite-width effects.
We test our effective theory against sampling experiments from the Bayesian posterior of finite deep neural networks with depths $L \sim O(10)$ and $P\sim O(10^3)$ on classic benchmark datasets, finding overall very good agreement together with two distinct types of systematic deviations.
2026-05-28T09:49:54Z
45 pages, 21 figures
Paolo Baglioni, Christian Keup, Vincenzo Zimbardo, Rosalba Pacelli, Alessandro Vezzani, Raffaella Burioni, Pietro Rotondo

http://arxiv.org/abs/2602.06791v2
Rare Event Analysis of Large Language Models
2026-05-28T09:05:59Z
Being probabilistic models, during inference large language models (LLMs) display rare events: behaviour that is far from typical but highly significant. By definition all rare events are hard to see, but the enormous scale of LLM usage means that events completely unobserved during development are likely to become prominent in deployment. Here we present an end-to-end framework for the systematic analysis of rare events in LLMs. We provide a practical implementation spanning theory, efficient generation strategies, probability estimation and error analysis, which we illustrate with concrete examples. We outline extensions and applications to other models and contexts, highlighting the generality of the concepts and techniques presented here.
2026-02-06T15:50:36Z
ICML 2026 Oral Spotlight
Jake McAllister Dorman, Edward Gillman, Dominic C. Rose, Jamie F. Mair, Juan P. Garrahan

http://arxiv.org/abs/2511.21299v2
Discovery and recovery of crystalline materials with property-conditioned transformers
2026-05-28T08:19:39Z
Generative models have recently shown great promise for accelerating the design and discovery of new functional materials. Conditional generation enhances this capacity by allowing inverse design, where specific desired properties can be requested during the generation process.
However, conditioning of transformer-based approaches, in particular, is constrained by discrete tokenisation schemes and the risk of catastrophic forgetting during fine-tuning. This work introduces CrystaLLM-π (property injection), a conditional autoregressive framework that integrates continuous property representations directly into the transformer's attention mechanism. Two architectures, Property-Key-Value (PKV) Prefix attention and PKV Residual attention, are presented. These methods bypass inefficient sequence-level tokenisation and preserve foundational knowledge from unsupervised pre-training on Crystallographic Information Files (CIFs) as textual input. We establish the efficacy of these mechanisms through systematic robustness studies and evaluate the framework's versatility across two distinct tasks. First, for structure recovery, the model processes high-dimensional, heterogeneous X-ray diffraction patterns, achieving structural accuracy competitive with specialised models and demonstrating applications to experimental structure recovery and polymorph differentiation. Second, for materials discovery, the model is fine-tuned on a specialised photovoltaic dataset to generate novel, stable candidates validated by Density Functional Theory (DFT). It implicitly learns to target optimal band gap regions for high photovoltaic efficiency, demonstrating a capability to map complex structure-property relationships. CrystaLLM-π provides a unified, flexible, and computationally efficient framework for inverse materials design.
2025-11-26T11:42:28Z
Cyprien Bone, Matthew Walker, Bradley A. A. Martin, Kuangdai Leng, Luis M. Antunes, Ricardo Grau-Crespo, Amil Aligayev, Javier Dominguez, Keith T. Butler

http://arxiv.org/abs/2602.20256v2
Spectral Decimation of Quantum Many-Body Hamiltonians
2026-05-27T21:35:29Z
We develop a systematic theory of spectral decimation for quantum many-body Hamiltonians and show that it provides a quantitative probe of emergent symmetries in statistically mixed spectra. Building on an analytical description of statistical mixtures, we derive an explicit expression for the size of a characteristic symmetry sector (CSS), defined as the largest subsequence of levels exhibiting non-Poissonian correlations. The CSS dimension is shown to be the size-biased average of the underlying symmetry sectors, establishing a direct link between spectral statistics and Hilbert-space structure. We apply this framework to two paradigmatic settings: Hilbert-space fragmentation and disorder-induced many-body localization (MBL). In fragmented systems, the CSS reproduces the mixture prediction and isolates correlated subsectors even when the full spectrum appears nearly Poissonian. In the disordered Heisenberg chain, spectral decimation reveals the gradual emergence of integrability through a shrinking CSS, whose statistics exhibit signatures consistent with local integrals of motion. We introduce a characteristic symmetry entropy (CSE) as a finite-size scaling observable and extract, within accessible system sizes, the crossover exponents. Our results establish spectral decimation as a controlled, unbiased and computationally inexpensive diagnostic of hidden structure in many-body spectra, capable of distinguishing between chaotic dynamics, statistical mixtures, and emergent integrability.
2026-02-23T19:00:05Z
v2; 16+7 pages; 5+3 figures
Feng He, Arthur Hutsalyuk, Giuseppe Mussardo, Andrea Stampiggi
10.1103/4dct-rs4q

http://arxiv.org/abs/2605.28949v1
Order-disorder trade-off in dirty quantum systems
2026-05-27T18:00:06Z
We prove a trade-off theorem for order and disorder parameters in one-dimensional quantum spin systems with quenched disorder.
For a disordered ensemble with exact Ising symmetry and average translation symmetry, any gapped ensemble must have one and only one of the following: an $O(1)$ order parameter or an $O(1)$ disorder parameter with even parity, both of the Edwards-Anderson type. The result extends to nearly gapped ensembles that accommodate Griffiths-type rare-region effects. These results offer a powerful and rigorous framework to understand the disorder effects beyond perturbative approaches. As applications, we (1) establish the existence of string order parameters for SPT phases; (2) derive a Lieb-Schultz-Mattis-type constraint for disordered ensembles, which requires a nearly gapped ensemble to spontaneously break the symmetry; and (3) discuss similar trade-off relations for disordered fermion chains, leading to an improved understanding of certain "intrinsically disordered" topological phases.
2026-05-27T18:00:06Z
24 pages
Jinmin Yi, Chong Wang

http://arxiv.org/abs/2605.28929v1
Improving CFT Operators Using Machine Learning
2026-05-27T18:00:00Z
Finite-size effects limit the accuracy with which conformal data can be extracted from lattice simulations of critical systems. While action improvement suppresses some corrections to scaling, it does not address operator-dependent effects arising from imperfect lattice representations of continuum conformal fields. In this work, we propose a data-driven method for improving lattice operators themselves, constructing estimators with enhanced overlap with the corresponding primary operators of the continuum conformal field theory. We identify improved lattice representations of leading spin and energy operators in three two-dimensional critical systems: the Ising model, the q = 3 Potts model, and the dilute q = 3 Potts model. In all cases, the resulting operators exhibit reduced corrections to scaling and yield more accurate estimates of scaling dimensions compared to conventional lattice choices.
The code and analysis workflows used to produce these results are made available in an accompanying GitHub repository.
2026-05-27T18:00:00Z
Lior Oppenheim, Snir Gazit, Zohar Ringel