https://arxiv.org/api/yseN2YdASly7l5ArhcREOs8Qsn02026-06-13T10:42:29Z265721515http://arxiv.org/abs/2603.12901v2A theory of learning data statistics in diffusion models, from easy to hard2026-06-10T13:28:42ZWhile diffusion models have emerged as a powerful class of generative models, their learning dynamics remain poorly understood. We address this issue first by empirically showing that standard diffusion models trained on natural images exhibit a distributional simplicity bias, learning simple, pair-wise input statistics before specializing to higher-order correlations. We reproduce this behaviour in simple denoisers trained on a minimal data model, the mixed cumulant model, where we precisely control both pair-wise and higher-order correlations of the inputs. We identify a scalar invariant of the model that governs the sample complexity of learning pair-wise and higher-order correlations that we call the diffusion information exponent, in analogy to related invariants in different learning paradigms. Using this invariant, we prove that the denoiser learns simple, pair-wise statistics of the inputs at linear sample complexity, while more complex higher-order statistics, such as the fourth cumulant, require at least cubic sample complexity. We also prove that the sample complexity of learning the fourth cumulant is linear if pair-wise and higher-order statistics share a correlated latent structure. Our work describes a key mechanism for how diffusion models can learn distributions of increasing complexity.2026-03-13T11:07:01ZICML 2026Lorenzo BardoneClaudia MergerSebastian Goldthttp://arxiv.org/abs/2606.12058v1Phase Transitions in Attention: A Bayesian Theory of Copy Head Emergence2026-06-10T13:26:56ZAttention is the key mechanism underlying in-context learning in transformers, and attention patterns have been observed empirically to emerge abruptly during training. We present a Bayesian theory of feature learning in attention; we then focus on how the copy subcircuit in the first layer of an induction head is learned by analyzing a single-layer softmax attention network trained on a copy task. We derive a closed-form posterior over the attention matrix and reduce it to a low-dimensional order parameter space. This reduction reveals a phase transition in the amount of training data, which we verify using both Bayesian sampling and standard training with Adam. We contrast our results with linear attention and find that softmax attention exhibits a \emph{first-order phase transition} while in linear attention an initial \emph{second-order phase transition} is followed by a smooth, continuous evolution toward the structured attention pattern (\emph{crossover}). Our work provides a first-principles theoretical account of the abrupt emergence of the copy subcircuit, reminiscent of the one observed in training large language models.2026-06-10T13:26:56ZItay LavieKirsten FischerAndrey LekovFrederic Van MaeleZohar RingelMoritz Heliashttp://arxiv.org/abs/2606.07274v2Topological Anderson insulators and reentrant topological transitions in a quasiperiodic long-range Su-Schrieffer-Heeger model2026-06-10T12:17:43ZWe study a one-dimensional long-range Su-Schrieffer-Heeger model with third-nearest-neighbor hopping and subject to quasiperiodic disorder. In the clean limit, the model hosts phases characterized by winding numbers $W=-1,0,1$ and $2$. The introduction of quasiperiodic disorder profoundly modifies the phase diagram and induces a series of topological phase transitions. Owing to the competition between topological dimerization and localization, topological Anderson insulating (TAI) phases with different winding numbers emerge and can persist even when the spectral gap becomes nearly closed in the strong-disorder regime. In addition, we uncover multiple reentrant topological phase transitions induced by varying either the quasiperiodic disorder strength or the hopping amplitudes. Remarkably, the system exhibits staircase-like topological Anderson transitions, where the real-space winding number evolves through successive quantized steps with increasing disorder strength. Our results demonstrate that the interplay between long-range hopping and quasiperiodic disorder generates a rich landscape of disorder-induced topological phases and reentrant topological transition phenomena.2026-06-05T13:50:19Z9 pages, 6 figuresFang-Ming MengQi-Bo Zenghttp://arxiv.org/abs/2606.11965v1Exact distribution of the output of a deep-layered machine2026-06-10T11:43:45ZDeep-layered machines, in which each node computes a Boolean function of all nodes below it, underpin deep learning and digital computation. Yet the statistics of their global output function remain poorly understood. We derive the exact finite-depth distribution of the output of a machine with width $k$ and depth $n$. The distribution depends only on the Hamming weight of the output, and as $n$ increases favors functions with low and high Hamming weights. But this bias peaks at a crossover depth proportional to $2^k$ before collapsing onto the constant functions true and false.2026-06-10T11:43:45ZThomas M. A. Finkhttp://arxiv.org/abs/2606.11950v1Perspective: The Physics of Active Solids -- From Hamiltonians to Active Matter Models2026-06-10T11:24:49ZThe physics of active matter, wherein constituent particles consume energy to generate autonomous motion, has revolutionized non-equilibrium statistical mechanics. While a large body of work has successfully elucidated the behavior of dilute active systems, the dense regime -- characterized by ``active glasses and active solids'' -- presents profound challenges that defy conventional theoretical frameworks. Recent observations reveal two striking features in these dense systems: an apparent enhancement of Mermin-Wagner-Hohenberg (MWH) fluctuations leading to anomalous long-wavelength density fluctuations, and a remarkable correspondence between activity-induced annealing and annealing via oscillatory shear. In this perspective article, we propose a novel approach toward a deeper understanding of dense active matter: by developing active Hamiltonian models as equilibrium reference frameworks, we map out pathways toward non-equilibrium active systems. This strategy allows us to elucidate both the correspondence between driven and active systems and the enhanced MWH fluctuations, which likely arise from a strong coupling between spatially random active forces and long-wavelength density (phonon) modes. We outline a comprehensive roadmap employing complementary approaches, including the active Hamiltonian formalism, comparative studies of oscillatory shear in active and passive solids, and investigations of chiral active matter. Establishing this activity-oscillatory shear correspondence across diverse systems is essential to demonstrate its universality, reveal the underlying large-scale emergent physics, and place our hypothesis on a firmer theoretical ground.2026-06-10T11:24:49ZAntik BhattacharyaJürgen HorbachSmarajit Karmakarhttp://arxiv.org/abs/2606.07327v2Six Open Questions in Machine-Learned Interatomic Potential Foundation Models2026-06-10T08:53:54ZMachine-learned interatomic potentials (MLIPs) have had a profound impact on molecular modelling in recent years, promising to resolve the long-standing tension between the scale and accuracy of simulations. There has been a proliferation of new models and designs, and recently the paradigm of ``foundational'' MLIPs has become prevalent. Broadly speaking, foundation models are trained on large diverse datasets and promise to work well for new systems with minimal updates required. However, in such a new and fast moving field, there are many unanswered questions. In this article, we set out to articulate and explore what we see as the most important among these questions. We start by developing a working definition for foundational MLIPs and use this definition to frame the subsequent open questions. Despite the rapid progress in the field of MLIP models, we believe that these are fundamental questions which will continue to define cutting edge research in MLIPs in the years to come.2026-06-05T14:45:06ZIsabel CreedTim ReinIngvars VitenburgsWojciech G. StarkViktor EllingssonAhmed Y. IsmailGuangyu LiuYuchen LouBradley A. A. MartinCyprien BoneMatthew A. H. WalkerMueen TajShirui WangKelvin WongRuiqi WuPrakriti KayasthaBingqing ChengAditi KrishnapriyanMichele CeriottiMarcel F. LangerJarvist Moore FrostAlex M. GanoseVenkat KapilKeith T. Butlerhttp://arxiv.org/abs/2404.10119v2Modeling scattering matrix containing evanescent modes for wavefront shaping applications in disordered media2026-06-09T20:49:25ZWe developed an open-source scalar wave transport model to estimate the generalized scattering matrix (S matrix) of a disordered medium in the diffusion regime. The term generalized refers to the incorporation of evanescent wave field modes alongside propagating modes in the estimation of the S matrix. To achieve this, we employed the scalar Kirchhoff-Helmholtz boundary integral formulation together with the Green's function perturbation method, thereby extending the conventional Fisher-Lee relations to include evanescent modes. The estimated S matrix, which satisfies the generalized unitarity and reciprocity relations, is modeled for a 2D disordered waveguide. The generalized transmission matrix contained within the S matrix is utilized to estimate the optimal phase-conjugate wavefront for focusing onto an evanescent mode. The phenomenon of a universal transmission value of 2/3 for such an optimal phase conjugate wavefront is demonstrated in the context of evanescent wave mode focusing through a diffusive disorder. The presented code framework may be of interest to wavefront shaping researchers for visualizing and estimating wave transport properties in general.2024-04-15T20:17:50ZManuscript accepted in Physical Review ResearchMichael RajuBaptiste JayetStefan Andersson-Engelshttp://arxiv.org/abs/2606.11319v1Learning from almost nothing: How neural networks survive heavy input corruption2026-06-09T18:02:09ZLearning from imperfect data is a central theme in machine learning, connecting practical questions of robustness to fundamental questions of learnability. Here we examine attribute noise: learning from corrupted inputs while keeping the labels intact, a setting that has received considerably less analytical attention than its label-noise counterpart. We consider two types of corruption models: additive noise and replacement noise. Through experiments with multi-layer perceptrons (MLPs) on corrupted classification datasets, we find that neural networks remain robust, maintaining well-above-chance accuracy even when inputs are >90% corrupted -- far beyond human recognition. To understand this robustness, we analyze infinite-width networks in the heavy-corruption regime using a mean-field-inspired approach and derive a leading-order decision rule for the classification outcome: the network implements a prototype rule, the nearest-class-mean, assigning each test point to the class whose training-set average it most closely resembles. This leading-order decision rule is universal across a broad range of MLP architectures, holding for any depth, as well as a wide class of activation functions and noise distributions. The same centroid mechanism closely matches finite-width network behavior in our experiments and provides an interpretable and analytically tractable account of why learning can succeed even when individual training examples carry almost no signal.2026-06-09T18:02:09Z26 pages, 10 figuresJustin TahmassebpurAsadullah BhuiyanHyejin KimOmri Lesserhttp://arxiv.org/abs/2606.11302v1Ferromagnetism from the geometry of localized wavefunctions in moiré systems2026-06-09T18:00:03ZWe present a mechanism for ferromagnetism in narrow bands consisting of Anderson-localized states. We exploit single-particle localization to derive a controlled theory of exchange interactions within the narrow band. For quasiperiodic systems with a half-filled moiré band, we show that the critical interaction strength for ferromagnetism is highly sensitive to the geometry of real-space overlaps between localized orbitals: we find well-defined resonances at which ferromagnetism sets in for interaction energies that are far lower than the gap to other bands. Near these resonances, all the approximations in our theory are controlled, so our critical point predictions are quantitative. We show examples both in one and two dimensions. Our work identifies a route to ferromagnetism based on the geometry of real-space wavefunctions, distinct from previously found mechanisms based on the quantum geometry of Bloch bands.2026-06-09T18:00:03ZMiguel GonçalvesSarang Gopalakrishnanhttp://arxiv.org/abs/2606.11064v1The UZH protocol: Separating errors and constructing improved CP2K basis sets and pseudopotentials2026-06-09T16:24:46ZReliable density-functional simulations require numerical settings whose residual errors are smaller than the chemical and materials trends being interpreted. In CP2K/Quickstep, this requirement is complicated by the joint use of atom-centered Gaussian basis sets and norm-conserving pseudopotentials: a code-to-code discrepancy usually contains both contributions. We present the UZH protocol, a closed-loop CP2K workflow that calibrates molecularly optimized Gaussian basis sets on small molecules, validates the resulting settings in unary-crystal equation-of-state benchmarks, identifies whether the limiting approximation is the Gaussian basis or the pseudopotential. The diagnosis is then used to revise the parameter files. The central diagnostic is a three-way comparison between production CP2K-GTH-UZH calculations, SIRIUS calculations using the same Goedecker--Teter--Hutter pseudopotential in a systematic plane-wave representation, and all-electron full-potential linearized augmented-plane-wave SIRIUS references. This construction decomposes the practical CP2K error into a Gaussian-basis component and a pseudopotential component. The protocol distinguishes basis-limited noble-gas and heavy-element cases from pseudopotential-limited transition-metal cases, guides targeted revisions with the CP2K basis and pseudopotential optimizers, and produces improved MOLOPT basis sets and GTH pseudopotentials as explicit outputs of the workflow. The UZH protocol is therefore constructive: it does not merely measure or reduce errors a posteriori, but allows turning verification outliers into validated CP2K parameter files for simulations across molecules and condensed phases.2026-06-09T16:24:46ZHossein MirhosseiniTiziano M. A. MüllerMatthias KrackThomas D. KühneJürg Hutterhttp://arxiv.org/abs/2606.10915v1Local density of states distribution and multifractal eigenvectors of weighted random networks via the cavity approach2026-06-09T14:24:49ZWe study the local density of states (LDoS) distribution of a general class of weighted Erdős-Rényi graphs. Using the cavity method, we obtain a good approximation to the full LDoS distribution and compact expressions for its power-law tails, which we show to have exponent $3$ in the extended phase. We deduce that the eigenvectors in the continuous part of the spectrum are extended but (weakly) multifractal, and we extract expressions for the associated fractal dimensions and the singularity spectrum. We also demonstrate that the inverse participation ratio in this multifractal phase exhibits an unusual logarithmic scaling with system size, which is neither fully-extended nor localised by the usual definitions. Finally, we verify that some symmetry properties (derived from the non-linear sigma model), which have been shown to hold for many systems exhibiting multifractality, also hold in our case, both for the LDoS distribution and the singularity spectrum.2026-06-09T14:24:49Z14 pages, 5 figuresJoseph W. BaronTim Rogershttp://arxiv.org/abs/2603.24171v2Algorithms for generating planar networks simulating hierarchical patterns of cracks formed during film drying2026-06-09T12:51:27ZHierarchical crack patterns that arise during the drying of thin films of colloidal dispersions or polymer solutions on a solid substrate are of interest both from a fundamental standpoint and in the context of the creation of transparent electrodes for optoelectronics. This paper analyzes the morphology of such patterns based on image processing of real-world samples. Graph theory is used to extract chains of edges and analyze the network topology. A method based on the hierarchy of connections is applied to classify cracks by generation. The limitations of existing classification approaches related to the discreteness of the time scale and the use of only a part of the entire pattern are discussed. Three approaches are used to generate artificial hierarchical networks: random uniform partitioning, recursive Voronoi partitioning, and a crack growth simulation model, each modified to reproduce the hierarchical structure. A comparison was made of the geometric characteristics (distribution of crack angles, edge lengths, cell areas, and circularity coefficient) and topological properties (distribution of the number of cell sides) of real and simulated networks. It was shown that the simulation model best reproduces the key features of real cracks, including the characteristic right angles of their connections.2026-03-25T10:41:55Z17 pages, 12 figures, 52 refs, Supplemental MaterialYuri Yu. TarasevichAndrei V. EserkepovAndrei S. Burmistrovhttp://arxiv.org/abs/2606.10585v1Anomalous mobility edges and extended-localized transition in a quasiperiodic emitter-cavity array2026-06-09T08:50:48ZThe manipulation of localization in quasiperiodic systems by mobility edges or localization transition holds significant physical importance. In this letter, we demonstrated that the dissipation can induce the emergence of anomalous mobility edges and extended-localized transition in emitter-cavity arrays controlled by quasiperiodic potentials. Specifically, we observe that the localization properties of emitters is governed by the nature of quantum bound states, either discrete or embedded in continuum, providing a unified mechanism linking the emitter-photon bound physics to quasiperiodic criticality. Depending on the bound state discrete or continuumlike, the induced effective excitation hopping exhibits either exponentially decaying or sinusoidally oscillating, giving rise to the formation of localized or critical states, respectively. Through a generalized duality transformation, we analytically determine the anomalous mobility edges and the critical strength of potential, enabling the construction of a full phase diagram. The study reveals that the physical characteristics of cavity exert a significant influence on excitation localization. Therefore, the manipulation of excitation localization can be achieved solely by adjusting the cavity fields.2026-06-09T08:50:48Z8+9 pages, 3+4 figures. comments are welcomeH. T. CuiH. Z. ShenM. QinX. X. Yihttp://arxiv.org/abs/2606.10349v1Magnetic HIP-NN for spin dynamics in disordered itinerant magnets2026-06-09T02:57:37ZWe present a magnetic extension of the Hierarchically Interacting Particle Neural Network (HIP-NN) that enables large-scale simulations of electron-mediated spin dynamics in disordered itinerant magnets. The resulting magnetic HIP-NN (mHIP-NN) incorporates rotationally invariant spin correlations directly into hierarchical message-passing layers, enabling the network to learn emergent magnetic energy landscapes and effective local fields from coupled geometric-spin environments while preserving spin-rotation symmetry. As a benchmark application, we consider structurally disordered itinerant $s$-$d$ exchange models in which the effective magnetic forces arise dynamically from the instantaneous electronic structure and are computationally prohibitive to evaluate using conventional exact-diagonalization-based approaches. We show that mHIP-NN accurately reproduces the local torques governing Landau-Lifshitz-Gilbert dynamics and faithfully captures the nonequilibrium evolution of spatial spin correlations following thermal quenches. Our results establish symmetry-aware hierarchical message-passing networks as an efficient and scalable framework for large-scale simulations of frustrated itinerant spin systems and nonequilibrium magnetic dynamics. More broadly, because the learned energy functional remains fully differentiable with respect to both atomic coordinates and spin variables, the framework also provides a natural foundation for spin-dependent interatomic potentials and coupled atom-spin dynamics.2026-06-09T02:57:37Z12 pages, 5 figuresSupriyo GhoshYunhao FanSheng ZhangKipton BarrosGia-Wei Chernhttp://arxiv.org/abs/2606.10222v1Multifractal Signatures of Ageing and Dementia Development: A Multifractal Space-Filling Curve Analysis2026-06-08T22:22:46ZMultifractality is an effective formalism for quantifying the nonlinear, scale-free properties of complex data. In this study, we propose a novel and efficient methodology, termed Multifractal Space-filling Curve Analysis (MFSCA), for quantifying the correlation structure of multidimensional data. Within this framework, the original multidimensional data - while preserving both local and long-range organisational properties - are projected onto a one-dimensional representation using a fractal space-filling curve. The resulting one-dimensional signal is then analysed using multifractal algorithms. We demonstrate the utility of the method using both artificially generated multifractal structures and real data. In particular, we apply MFSCA to analyse magnetic resonance imaging (MRI) data from Alzheimer patients at different stages of dementia. Based on the results, we estimate the multifractal profiles of the brain for healthy subjects of different ages as well as for dementia patients. The analysis reveals that the spatial organization of brain structures, as measured by the degree of multifractality, progressively weakens with age and the development of dementia. A transition from multifractality to monofractality is observed both in control groups, when comparing the Young Control and Elderly Control groups, and among dementia subjects of similar age but at different stages of the disease, namely early dementia and mild cognitive impairment. Thus, from the perspective of multiscaling properties, the heterogeneous characteristics of spatial brain organization deteriorate under worsening conditions, leading to a homogeneous and weakly correlated structure. These findings not only effectively capture key aspects of brain organisation, but also demonstrate that the multifractality of MRI data can serve as a marker of structural brain changes.2026-06-08T22:22:46ZMarta LotkaJacek GrelaZbigniew DrogoszJeremi K. OchabPaweł Oświęcimka