http://arxiv.org/abs/2603.12034v1
Vector spin glasses with Mattis interaction II: non-convex high-temperature models
2026-03-12T15:11:05Z
This paper constitutes the second part of a two-paper series devoted to the systematic study of vector spin glass models whose energy function involves a spin glass part and a general Mattis interaction part.
In this paper, we focus on models whose spin glass part does not satisfy the usual convexity assumption. In this case, the Parisi formula breaks down, and there are no known methods to fully identify the limit free energy. It was suggested in [arXiv:1906.08471] that the limit free energy may be described using the unique solution of a partial differential equation of Hamilton--Jacobi type. In the present paper, we prove the validity of this conjecture in the high-temperature regime and provide an explicit representation for the free energy in terms of critical points.
Using the duality between the free energy and large deviation principles, one can then easily deduce from the previous result a large deviation principle for the mean magnetization as well as a representation for the free energy of spin glass models with additional Mattis interaction at high temperature.
In the companion paper, we establish similar results at all temperatures for models whose spin glass part is assumed to satisfy the usual convexity assumption.
2026-03-12T15:11:05Z
41 pages
Hong-Bin Chen, Victor Issa

http://arxiv.org/abs/2603.12033v1
Vector spin glasses with Mattis interaction I: the convex case
2026-03-12T15:10:58Z
This paper constitutes the first part of a two-paper series devoted to the systematic study of vector spin glass models whose energy function involves a spin glass part and a general Mattis interaction part.
In this paper, we focus on models whose spin glass part satisfies the usual convexity assumption. We identify the limit free energy via a Parisi-type formula and prove a large deviation principle for the mean magnetization. The proof is remarkably simple and short compared to previous approaches; it relies on treating the Mattis interaction as a parameter of the model.
In the companion paper, we establish similar results in the high-temperature regime for models whose spin glass part is not assumed to satisfy the usual convexity assumption.
2026-03-12T15:10:58Z
16 pages
Hong-Bin Chen, Victor Issa

http://arxiv.org/abs/2512.06297v2
Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks
2026-03-12T14:47:33Z
Modern neural networks exhibit a striking property: basins of attraction in the loss landscape are often connected by low-loss paths, yet optimization dynamics generally remain confined to a single convex basin and rarely explore intermediate points. We resolve this paradox by identifying entropic barriers arising from the interplay between curvature variations along these paths and noise in optimization dynamics. Empirically, we find that curvature systematically rises away from minima, producing effective forces that bias noisy dynamics back toward the endpoints - even when the loss remains nearly flat. These barriers persist longer than energetic barriers, shaping the late-time localization of solutions in parameter space. Our results highlight the role of curvature-induced entropic forces in governing both connectivity and confinement in deep learning landscapes.
2025-12-06T04:50:32Z
ICLR 2026
Luca Di Carlo, Chase Goddard, David J. Schwab

http://arxiv.org/abs/2603.11688v1
Machine Learning of Topological Insulator and Anderson Insulator in One-Dimensional Extended Su-Schrieffer-Heeger Chain
2026-03-12T08:56:13Z
We study disorder effects in the extended Su-Schrieffer-Heeger (SSH) model using a convolutional neural network (CNN) trained on reduced correlation matrices (RCMs) of disorder-free systems to predict winding number phase diagrams in systems with off-diagonal and diagonal disorder. The trained CNN model generalizes to systems with chiral-symmetry-preserving off-diagonal disorder but fails for systems with chiral-symmetry-breaking diagonal disorder.
Using principal component analysis (PCA) of the RCM feature space, we demonstrate that disorder-free and symmetry-preserving systems share overlapping feature manifolds, whereas symmetry-breaking disorder causes them to diverge. Inverse participation ratio (IPR) and energy spectrum analysis further demonstrate that off-diagonal disorder preserves topological edge states, whereas diagonal disorder drives a transition to an Anderson insulator. Our results position machine learning not merely as a classifier, but as a sensitive probe for the symmetry-protected nature of quantum matter.
2026-03-12T08:56:13Z
8 pages, 9 figures
Zhekai Yin (Department of Physics, Xiamen University Malaysia, Sepang, Selangor, Malaysia)
C. K. Ong (Department of Physics, Xiamen University Malaysia, Sepang, Selangor, Malaysia; Key Laboratory for Magnetism and Magnetic Materials of the Ministry of Education, Lanzhou University, Lanzhou, China)

http://arxiv.org/abs/2407.15558v5
Few-Shot Neuromorphic Vision in a Nonlinear Photonic Network Laser
2026-03-12T08:27:31Z
With the growing prevalence of AI, demand increases for hardware that mimics the brain's ability to extract structure from limited data. In the retina, ganglion cells detect features from sparse inputs via lateral inhibition, where neurons antagonistically suppress activity of neighbouring cells. Biological neurons exhibit diverse heterogeneous nonlinear responses, linked to robust learning and strong performance in low-data regimes.
Here, we introduce a retinally-inspired photonic computing system where spatially-competing lasing modes in a random network laser act as heterogeneous, inhibitively-coupled neurons - enabling feature detection, few-shot classification, and segmentation.
This silicon-compatible scheme harnesses heterogeneous excitatory and inhibitory nonlinear physical dynamics which give rise to emergent photonic computing behaviour, including parallel feature detection and strong performance when training data is scarce. We report 98.05% and 87.85% accuracy on MNIST and Fashion-MNIST, and 90.12% on BreaKHis cancer diagnosis - outperforming software CNNs including EfficientNetV2 and the vision transformer ViT in few-shot and class-imbalanced regimes with training sets of up to several hundred images. We demonstrate combined segmentation and classification on the HAM10k skin lesion dataset, achieving DICE and Jaccard scores of 84.49% and 74.80%. These results demonstrate the potential of random lasing networks as nonlinear photonic learning systems, and highlight the ability of heterogeneous nonlinear dynamics to support strong learning in challenging low-data scenarios.
2024-07-22T11:42:34Z
Wai Kit Ng, Jakub Dranczewski, Anna Fischer, T V Raziman, Dhruv Saxena, Tobias Farchy, Kilian Stenning, Jonathan Peters, Heinz Schmid, Will R Branford, Mauricio Barahona, Kirsten Moselund, Riccardo Sapienza, Jack C. Gartside

http://arxiv.org/abs/2603.11326v1
Unraveling anomalous relaxation effects in the thermodynamic limit
2026-03-11T21:39:37Z
We address two central open problems in the theory of anomalous Mpemba-like relaxations: their extension beyond one spatial dimension and their consistent formulation in the thermodynamic limit. Our framework is the antiferromagnetic Ising model on a square lattice under an externally applied magnetic field, which enables us to work in the presence of a phase transition. The rich phase diagram contains two control parameters: temperature and magnetic field. We demonstrate that the standard assumption of relaxation dominated by a single leading exponential is inconsistent for intensive observables exhibiting standard fluctuations. Instead, as the system size increases, a continuous spectrum of time scales emerges.
Nevertheless, we make the ansatz that, in the vicinity of the phase transition, the spectral projector onto the slowest time scales can be effectively characterized in terms of an equilibrium thermodynamic quantity: the susceptibility associated with the order parameter of the metastable phase. Combined with the richness of the phase diagram, this ansatz yields qualitative and semi-quantitative predictions for optimal protocols leading to a variety of anomalous relaxation phenomena involving simultaneous variations of temperature and magnetic field. These include direct and inverse Mpemba effects, cooling-heating asymmetries, and faster heating induced by precooling. Careful Monte Carlo simulations validate our theoretical predictions. Furthermore, minimal post-optimization suffices to convert our analytically guided protocols into fully optimal ones that display anomalous relaxations in their most pronounced form.
2026-03-11T21:39:37Z
Emilio Pomares, Víctor Martín-Mayor, Antonio Lasanta, Gabriel Álvarez

http://arxiv.org/abs/2602.14928v3
From Classical to Quantum: Extending Prometheus for Unsupervised Discovery of Phase Transitions in Three Dimensions and Quantum Systems
2026-03-11T20:07:10Z
We extend the Prometheus framework for unsupervised phase transition discovery from 2D classical systems to 3D classical and quantum many-body systems, addressing scalability in higher dimensions and generalization to quantum fluctuations. For the 3D Ising model ($L \leq 32$), the framework detects the critical temperature within 0.01\% of literature values ($T_c/J = 4.511 \pm 0.005$) and extracts critical exponents with $\geq 70\%$ accuracy ($β = 0.328 \pm 0.015$, $γ = 1.24 \pm 0.06$, $ν = 0.632 \pm 0.025$), correctly identifying the 3D Ising universality class via $χ^2$ comparison ($p = 0.72$) without analytical guidance. For quantum systems, we developed quantum-aware VAE (Q-VAE) architectures using complex-valued wavefunctions and fidelity-based loss.
Applied to the transverse field Ising model, we achieve 2\% accuracy in quantum critical point detection ($h_c/J = 1.00 \pm 0.02$) and successfully discover ground state magnetization as the order parameter ($r = 0.97$). Notably, for the disordered transverse field Ising model, we detect exotic infinite-randomness criticality characterized by activated dynamical scaling $\ln ξ \sim |h - h_c|^{-ψ}$, extracting a tunneling exponent $ψ = 0.48 \pm 0.08$ consistent with theoretical predictions ($ψ = 0.5$). This demonstrates that unsupervised learning can identify qualitatively different types of critical behavior, not just locate critical points. Our systematic validation across classical thermal transitions ($T = 0$ to $T > 0$) and quantum phase transitions ($T = 0$, varying $h$) establishes that VAE-based discovery generalizes across fundamentally different physical domains, providing robust tools for exploring phase diagrams where analytical solutions are unavailable.
2026-02-16T17:06:20Z
Brandon Yee, Wilson Collins, Maximilian Rutkowski

http://arxiv.org/abs/2602.21468v3
Unsupervised Discovery of Intermediate Phase Order in the Frustrated $J_1$-$J_2$ Heisenberg Model via Prometheus Framework
2026-03-11T19:59:37Z
The spin-$1/2$ $J_1$-$J_2$ Heisenberg model on the square lattice exhibits a debated intermediate phase between Néel antiferromagnetic and stripe ordered regimes, with competing theories proposing plaquette valence bond, nematic, and quantum spin liquid ground states. We apply the Prometheus variational autoencoder framework -- previously validated on classical (2D, 3D Ising) and quantum (disordered transverse field Ising) phase transitions -- to systematically explore the $J_1$-$J_2$ phase diagram using a multi-scale approach. For $L=4$, we employ exact diagonalization with full wavefunction analysis via quantum-aware VAE.
For larger systems ($L=6, 8$), we introduce a reduced density matrix (RDM) based methodology using DMRG ground states, enabling scaling beyond the exponential barrier of full Hilbert space representation. Through dense parameter scans of $J_2/J_1 \in [0, 1]$ and comprehensive latent space analysis, we identify the structure factors $S(π,π)$ and $S(π,0)$ as the dominant order parameters discovered by the VAE, with correlations of $|r| > 0.97$. The RDM-VAE approach successfully captures the Néel-to-stripe crossover near $J_2/J_1 \approx 0.5$--$0.6$, demonstrating that local quantum correlations encoded in reduced density matrices contain sufficient information for unsupervised phase discovery. This work establishes a scalable pathway for applying machine learning to frustrated quantum systems where full wavefunction access is computationally prohibitive.
2026-02-25T00:44:51Z
Brandon Yee, Wilson Collins, Maximilian Rutkowski

http://arxiv.org/abs/2603.11263v1
Crossover to Sachdev-Ye-Kitaev criticality in an infinite-range quantum Heisenberg spin glass
2026-03-11T19:47:11Z
We study the equilibrium dynamics of an infinite-range quantum Heisenberg model with random couplings, in which local magnetic moments arise from $\mathcal{N}_f$ flavors of spinful fermions. We employ an expansion in $\mathcal{N}_f$, which controls the strength of quantum fluctuations, and self-consistently include $1/\mathcal{N}_f$ corrections to the Luttinger-Ward functional. In the large-$\mathcal{N}_f$ limit, where quantum fluctuations are weak, the high- and low-temperature phases are respectively paramagnetic and spin glass ordered, with a transition temperature independent of $\mathcal{N}_f$. For small numbers of fermionic flavors, however, quantum fluctuations substantially suppress the ordering temperature.
We show that this behavior reflects the proximity of the system to a Sachdev-Ye-Kitaev (SYK) phase, where both fermionic and spin spectral densities display critical behavior over a broad range of finite frequencies, with the latter exhibiting the scale-invariant form $χ''(ω)\sim \operatorname{sgn}(ω)$. At the lowest energies and temperatures, spin-glass dynamics ultimately take over, producing a universal sub-Ohmic dynamical spin susceptibility $χ''(ω)\sim \operatorname{sgn}(ω)\sqrt{|ω|}$. Our results establish a minimal framework for understanding dynamical crossovers between SYK criticality and spin-glass ordering.
2026-03-11T19:47:11Z
14 pages, 9 figures
Hossein Hosseinabadi, Subir Sachdev, Jamir Marino

http://arxiv.org/abs/2603.11189v1
DysonNet: Constant-Time Local Updates for Neural Quantum States
2026-03-11T18:01:04Z
Neural quantum states (NQS) provide a flexible variational framework for many-body wavefunctions, but suffer from high computational cost and limited interpretability. We introduce DysonNet, a broad class of NQS that couples strictly local nonlinearities through global linear layers. This structure is analogous to a truncated Dyson series which gives an intuitive interpretation of local wavefunction updates as scattering from static impurities. By resumming the scattering series, single-spin-flip updates can be computed in $\mathcal{O}(1)$ time, independent of system size, using an algorithm we call ABACUS. Implementing DysonNet with the state-space model S4, we obtain up to $230\times$ speedups over Vision-Transformers for computing the local estimator. This corresponds to an asymptotic $\mathcal{O}(N^2)$ improvement in training-time scaling, reaching $\mathcal{O}(N \log^2 N)$ total training complexity in area-law phases. Benchmarks on the 1D long-range Ising model and frustrated $J_1$-$J_2$ chains show that DysonNet matches state-of-the-art NQS accuracy while removing the dominant local-update overhead.
More broadly, our results suggest a route to scalable NQS architectures where physical interpretability directly enables computational efficiency.
2026-03-11T18:01:04Z
26 pages, 7 figures
Lucas Winter, Andreas Nunnenkamp

http://arxiv.org/abs/2603.11161v1
Algorithmic Capture, Computational Complexity, and Inductive Bias of Infinite Transformers
2026-03-11T18:00:00Z
We formally define Algorithmic Capture (i.e., ``grokking'' an algorithm) as the ability of a neural network to generalize to arbitrary problem sizes ($T$) with controllable error and minimal sample adaptation, distinguishing true algorithmic learning from statistical interpolation. By analyzing infinite-width transformers in both the lazy and rich regimes, we derive upper bounds on the inference-time computational complexity of the functions these networks can learn. We show that despite their universal expressivity, transformers possess an inductive bias towards low-complexity algorithms within the Efficient Polynomial Time Heuristic Scheme (EPTHS) class. This bias effectively prevents them from capturing higher-complexity algorithms, while allowing success on simpler tasks like search, copy, and sort.
2026-03-11T18:00:00Z
Orit Davidovich, Zohar Ringel

http://arxiv.org/abs/2603.11032v1
Uncovering statistical structure in large-scale neural activity with Restricted Boltzmann Machines
2026-03-11T17:55:45Z
Large-scale electrophysiological recordings now allow simultaneous monitoring of thousands of neurons across multiple brain regions, revealing structured variability in neural population activity. Understanding how these collective patterns emerge from microscopic neural interactions requires models that are scalable, predictive, and interpretable. Statistical physics provides principled frameworks to address this complexity, including maximum-entropy models that offer transparent descriptions of collective neural activity but remain largely limited to pairwise interactions and modest system sizes.
Here, we use Restricted Boltzmann Machines (RBMs) to model the activity of $\sim1500$-$2000$ simultaneously recorded neurons from the Allen Institute Visual Behavior Neuropixels dataset, spanning multiple cortical and subcortical regions of the mouse brain. RBMs extend the maximum-entropy framework through latent variables, enabling the capture of higher-order dependencies while allowing explicit extraction of effective interaction networks. Recent advances in efficient Markov Chain sampling and training enable accurate learning of these models at this scale. RBMs reproduce the complex statistics of neural recordings with high accuracy. Generated samples match empirical pairwise and higher-order correlations, as well as global statistics such as the distribution of population activity. The inferred parameters provide direct access to effective neuronal interactions, revealing coordination patterns in population activity. These couplings display clear anatomical structure: neurons within visual cortical areas show stronger interactions, consistent with visually driven behavior, while cross-area couplings are weaker. Despite being trained on temporally shuffled data, Markov Chain Monte Carlo simulations also reproduce the global relaxation dynamics of neural activity.
2026-03-11T17:55:45Z
First draft, comments are welcome
Nicolas Béreux, Giovanni Catania, Aurélien Decelle, Francesca Mignacco, Alfonso de Jesús Navas Gómez, Beatriz Seoane

http://arxiv.org/abs/2603.10956v1
Linear Readout of Neural Manifolds with Continuous Variables
2026-03-11T16:45:14Z
Brains and artificial neural networks compute with continuous variables such as object position or stimulus orientation. However, the complex variability in neural responses makes it difficult to link internal representational structure to task performance. We develop a statistical-mechanical theory of regression capacity that relates linear decoding efficiency of continuous variables to geometric properties of neural manifolds.
Our theory handles complex neural variability and applies to real data, revealing increasing capacity for decoding object position and size along the monkey visual stream.
2026-03-11T16:45:14Z
Will Slatton, Chi-Ning Chou, SueYeon Chung

http://arxiv.org/abs/2603.10815v1
Dissipation- versus Chaos-Induced Relaxation in Non-Markovian Quantum Many-Body Systems
2026-03-11T14:24:21Z
In interacting quantum many-body systems, relaxation toward equilibrium reflects a competition between internal chaotic dynamics and environmental dissipation. While conventional Markovian baths typically produce exponential decay, non-Markovian dissipation can give rise to more intricate behavior, including algebraic relaxation. We study an open Sachdev-Ye-Kitaev (SYK) model coupled to a pseudogapped fermionic bath, using the Keldysh formalism to compute steady-state correlations in the large-$N$ limit. Our results uncover a rich dynamical phase diagram, with regimes of bath-driven power-law relaxation, chaos-driven exponential decay, and an intermediate pre-relaxation phase where exponential decay crosses over to algebraic decay. These findings demonstrate that non-Markovian environments can qualitatively reshape relaxation mechanisms in strongly correlated quantum many-body systems.
2026-03-11T14:24:21Z
12 pages, 7 figures
Gabriel Almeida, Pedro Ribeiro, Masudul Haque, Lucas Sá

http://arxiv.org/abs/2602.00716v3
Emergence of Distortions in High-Dimensional Guided Diffusion Models
2026-03-11T14:06:17Z
Classifier-free guidance (CFG) is the de facto standard for conditional sampling in diffusion models, yet it often leads to a loss of diversity in generated samples. We formalize this phenomenon as generative distortion, defined as the mismatch between the CFG-induced sampling distribution and the true conditional distribution.
Considering Gaussian mixtures and their exact scores, and leveraging tools from statistical physics, we characterize the onset of distortion in a high-dimensional regime as a function of the number of classes. Our analysis reveals that distortions emerge through a phase transition in the effective potential governing the guided dynamics. In particular, our dynamical mean-field analysis shows that distortion persists when the number of modes grows exponentially with dimension, but vanishes in the sub-exponential regime. Consistent with prior finite-dimensional results, we further demonstrate that vanilla CFG shifts the mean and shrinks the variance of the conditional distribution. We show that standard CFG schedules are fundamentally incapable of preventing variance shrinkage. Finally, we propose a theoretically motivated guidance schedule featuring a negative-guidance window, which mitigates loss of diversity while preserving class separability.
2026-01-31T13:19:45Z
29 pages, 16 figures
Enrico Ventura, Beatrice Achilli, Luca Ambrogioni, Carlo Lucibello
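
The mean shift and variance shrinkage mentioned in the last abstract (2602.00716v3) can be illustrated with a small numerical sketch. This is not the authors' analysis. It assumes a one-dimensional, two-class symmetric Gaussian mixture with unit variance and equal class weights, and it uses the common heuristic that guidance of strength w targets a density proportional to p(x|c)^(1+w) p(x)^(-w), which need not coincide with the distribution a guided diffusion sampler actually produces. The function name `tilted_moments` and all of its parameters are hypothetical.

```python
import math


def tilted_moments(mu=1.0, w=2.0, lo=-12.0, hi=12.0, n=4801):
    """Mean and variance of the density proportional to p(x|c)^(1+w) * p(x)^(-w).

    Assumed toy model (not from the paper): p(x|c=+1) = N(mu, 1),
    p(x|c=-1) = N(-mu, 1), equal class weights. Then
    p(x|c)^(1+w) * p(x)^(-w) is proportional to N(x; mu, 1) * sigmoid(2*mu*x)^w,
    because p(x|c)/p(x) = 2 * p(c|x) and p(c=+1|x) = sigmoid(2*mu*x).
    """
    dx = (hi - lo) / (n - 1)
    z = m1 = m2 = 0.0
    for i in range(n):
        x = lo + i * dx
        # log of the unnormalised conditional density N(x; mu, 1)
        log_cond = -0.5 * (x - mu) ** 2
        # numerically stable log sigmoid(2*mu*x) = -softplus(-2*mu*x)
        t = -2.0 * mu * x
        log_post = -(max(t, 0.0) + math.log1p(math.exp(-abs(t))))
        weight = math.exp(log_cond + w * log_post)
        z += weight
        m1 += weight * x
        m2 += weight * x * x
    mean = m1 / z
    var = m2 / z - mean * mean
    return mean, var


if __name__ == "__main__":
    for w in (0.0, 1.0, 2.0, 4.0):
        mean, var = tilted_moments(mu=1.0, w=w)
        print(f"w={w:.1f}  mean={mean:.4f}  var={var:.4f}")
```

With w = 0 the sketch recovers the conditional distribution itself (mean mu, unit variance). For w > 0 the extra factor is increasing and log-concave in x, so under these assumptions the mean moves away from the other class and the variance drops below one.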