https://arxiv.org/api/K3crogWN6kSAKlgvTh4Kvpy74hE 2026-03-24T09:50:02Z 11816 135 15 http://arxiv.org/abs/2506.06750v2 Accuracy-Efficiency Trade-Offs in Spiking Neural Networks: A Lempel-Ziv Complexity Perspective on Learning Rules 2026-03-01T20:24:40Z Training spiking neural networks (SNNs) remains challenging due to temporal dynamics, non-differentiability of spike events, and sparse event-driven activations. This paper studies how the choice of learning paradigm (unsupervised, supervised, and hybrid) affects classification performance and computational cost in temporal pattern recognition. Building on our earlier study [Rudnicka et al., 2026], we use Lempel-Ziv complexity (LZC) as a compact, decision-relevant descriptor of spike-train temporal organization to quantify how different learning rules reshape class-conditional temporal structure. The pipeline combines a leaky integrate-and-fire (LIF) SNN with an LZC-based decision rule. We evaluate learning rules on synthetic sources with controlled temporal statistics (Bernoulli, two-state Markov, and Poisson spike processes) and on two-class subsets of MNIST and N-MNIST. Across datasets, gradient-based learning achieves the highest accuracy but at high computational cost, whereas bio-inspired rules (e.g., Tempotron and SpikeProp) offer favorable accuracy--efficiency trade-offs. These results highlight that selecting a learning rule should be guided by application constraints and the desired balance between separability and computational overhead. 2025-06-07T10:43:09Z Zofia Rudnicka Janusz Szczepanski Agnieszka Pregowska http://arxiv.org/abs/2603.01184v1 Scaling of learning time for high dimensional inputs 2026-03-01T16:51:18Z Representation learning from complex data typically involves models with a large number of parameters, which in turn require large amounts of data samples. In neural network models, model complexity grows with the number of inputs to each neuron, with a trade-off between model expressivity and learning time. A precise characterization of this trade-off would help explain the connectivity and learning times observed in artificial and biological networks. We present a theoretical analysis of how learning time depends on input dimensionality for a Hebbian learning model performing independent component analysis. Based on the geometry of high-dimensional spaces, we show that the learning dynamics reduce to a unidimensional problem, with learning times dependent only on initial conditions. For higher input dimensions, initial parameters have smaller learning gradients and larger learning times. We find that learning times have supralinear scaling, becoming quickly prohibitive for high input dimensions. These results reveal a fundamental limitation for learning in high dimensions and help elucidate how the optimal design of neural networks depends on data complexity. Our approach outlines a new framework for analyzing learning dynamics and model complexity in neural network models. 2026-03-01T16:51:18Z 14 pages, 5 figures Carlos Stein Brito http://arxiv.org/abs/2603.03362v1 Metric-Topology Factorization: A Computational Framework for Hippocampal-Neocortical Intelligence 2026-03-01T13:19:40Z The brain achieves stability and plasticity in a topologically complex, shifting world through Metric-Topology Factorization (MTF), separating discrete topological indexing for context selection from continuous metric condensation for local inference. Semantically rich environments defy single globally contractive geometries, causing obstructions under shifts, so intelligence factorizes these: the hippocampus provides sparse signatures indexing manifold identity, while the neocortex untangles geometry hierarchically. In the ventral stream, a dynamic-programming-like process quotients symmetries (e.g., translation, scale), transforming non-convex sensory mazes into separable bowls. Offline replay and consolidation amortize transformations for rapid task switching. Dreaming in REM involves stochastic hippocampal traversal to expose and regularize latent structures. Consciousness arises from resolving topological uncertainty into stable embeddings, with awareness for unamortized states. Evolutionarily, transitions like sensorimotor control to language expand topological complexity, demanding advanced indexing-metric separation. Intelligence emerges via recalibrating context-specific geometries, converting global navigation into local dynamics, not deeper search. 2026-03-01T13:19:40Z Xin Li http://arxiv.org/abs/2510.18516v2 Decoding Dynamic Visual Experience from Calcium Imaging via Cell-Pattern-Aware Pretraining 2026-03-01T10:15:48Z Neural recordings exhibit a distinctive form of heterogeneity rooted in differences in cell types, intrinsic circuit dynamics, and stochastic stimulus-response variability that goes beyond ordinary dataset variability, mixing statistically regular neurons with highly stochastic, stimulus-contingent ones within the same dataset. This heterogeneity poses a challenge for self-supervised learning (SSL) -- learnable statistical regularity -- thereby destabilizing representation learning and limiting reliable scaling. We introduce POYO-CAP (Cell-pattern Aware Pretraining), a biologically grounded hybrid pretraining strategy that first trains with masked reconstruction plus lightweight auxiliary supervision on statistically regular neurons -- identified via skewness and kurtosis -- and then fine-tunes on more stochastic populations. On the Allen Brain Observatory dataset, this curriculum yields 12--13\% relative improvements over from-scratch training and enables smooth, monotonic scaling with model size, whereas baselines trained on mixed populations plateau or destabilize. By making statistical predictability an explicit data-selection criterion, POYO-CAP turns neural heterogeneity into a scalable learning advantage for robust neural decoding. 2025-10-21T10:57:52Z Sangyoon Bae Mehdi Azabou Blake Richards Jiook Cha http://arxiv.org/abs/2502.01334v2 Deep generative computed perfusion-deficit mapping of ischaemic stroke 2026-03-01T09:07:21Z Focal deficits in ischaemic stroke result from impaired perfusion downstream of a critical vascular occlusion. While parenchymal lesions are traditionally used to predict clinical deficits, the underlying pattern of disrupted perfusion provides information upstream of the lesion, potentially yielding earlier predictive and localizing signals. Such perfusion maps can be derived from routine CT angiography (CTA) widely deployed in clinical practice. Analysing computed perfusion maps from 1,393 CTA-imaged-patients with acute ischaemic stroke, we use deep generative inference to localise neural substrates of NIHSS sub-scores. We show that our approach replicates known lesion-deficit relations without knowledge of the lesion itself and reveals novel neural dependents. The high achieved anatomical fidelity suggests acute CTA-derived computed perfusion maps may be of substantial clinical-and-scientific value in rich phenotyping of acute stroke. Using only hyperacute imaging, deep generative inference could power highly expressive models of functional anatomical relations in ischaemic stroke within the pre-interventional window. 2025-02-03T13:14:31Z Chayanin Tangwiriyasakul Pedro Borges Guilherme Pombo Stefano Moriconi Michael S. Elmalem Paul Wright Yee-Haur Mah Jane Rondina Sebastien Ourselin Parashkev Nachev M. Jorge Cardoso http://arxiv.org/abs/2510.18808v2 Does Feedback Alignment Work at Biological Timescales? 2026-02-28T21:21:43Z Feedback alignment and related weight-transport-free algorithms are often proposed as biologically plausible alternatives to backpropagation, yet they are typically formulated in discrete phases with implicitly synchronized forward and error signals. We develop a continuous-time model of feedback-alignment-type learning in which neural activities and synaptic weights evolve together under coupled first-order dynamics with distinct propagation, plasticity, and decay time constants. We show that learning is governed by the temporal overlap between presynaptic drive and a locally projected error signal, providing an analytic explanation for robustness to moderate timing mismatch and for failure when mismatch eliminates overlap. Our results show that in order for feedback-alignment-type algorithms to work at biological timescales, they must obey the same temporal overlap principle that applies to other biological processes like eligibility traces. 2025-10-21T17:04:06Z Marc Gong Bacvanski Liu Ziyin Tomaso Poggio http://arxiv.org/abs/2510.09951v3 Emergence of Spatial Representation in an Actor-Critic Agent with Hippocampus-Inspired Sequence Generator 2026-02-28T16:57:12Z Sequential firing of hippocampal place cells is often attributed to sequential sensory drive along a trajectory, and has also been attributed to planning and other cognitive functions. Here, we propose a mechanistic and parsimonious interpretation to complement these ideas: hippocampal sequences arise from intrinsic recurrent circuitry that propagates transient input over long horizons, acting as a temporal memory buffer that is especially useful when reliable sensory evidence is sparse. We implement this idea with a minimal sequence generator inspired by neurobiology and pair it with an actor-critic learner for egocentric visual navigation. Our agent reliably solves a continuous maze without explicit geometric cues, with performance depending on the length of the recurrent sequence. Crucially, the model outperforms LSTM cores under sparse input conditions (16 channels, $\sim2.5\%$ activity), but not under dense input, revealing a strong interaction between representational sparsity and memory architecture. Through learning, units develop localized place fields, distance-dependent spatial kernels, and task-dependent remapping, while inputs to the sequence generator orthogonalize and spatial information increases across layers. These phenomena align with neurobiological data and are causal to performance. Together, our results show that sparse input synergizes with sequence-generating dynamics, providing both a mechanistic account of place cell sequences in the mammalian hippocampus and a simple inductive bias for reinforcement learning based on sparse egocentric inputs in navigation tasks. 2025-10-11T01:38:23Z Accepted at ICLR 2026 Xiao-Xiong Lin Yuk-Hoi Yiu Christian Leibold http://arxiv.org/abs/2603.03358v1 Contextuality, Incompatibility, and Intra-System Entanglement of Mental Markers 2026-02-27T21:35:22Z Over the past two decades, quantum-like modeling (QLM) has emerged as a powerful framework for describing non-classical features of cognition and decision-making. Rather than assuming physical quantum processes in the brain, QLM employs the Hilbert space formalism to model contextuality, incompatibility of mental observables, and entanglement-like correlations. In this paper, we develop a quantum-informational model of mental markers within the broader I-field (information field) approach. We propose that, under conditions of information overload and limited cognitive resources, individuals primarily respond not to detailed semantic content but to compact content labels - mental markers - carrying cognitive and affective components. We formalize mental markers as structured quantum-like states and analyze the nonclassical correlations between their cognitive and affective components using the Contextuality-Incompatibility-Entanglement triad. Special attention is given to intra-system entanglement between rational (cognitive) evaluation and emotional (affective) coloring, accounting for context-dependent judgments, order effects, and affect-driven decision shifts. Illustrative examples with psychological interpretation and experimental perspectives are provided. An Appendix briefly discusses neurobiological analogues of information overload in neural networks, highlighting structural parallels with the proposed marker-based framework; coupling to the origin and diagnostics of neurological diseases is analyzed. The paper contributes to QLM by distinguishing inter-system and intra-system entanglement and by demonstrating that cognitive - affective entanglement constitutes a fundamental structural feature of mental markers in socially mediated information environments. 2026-02-27T21:35:22Z Andrei Khrennikov Felix Benninger Oded Shor http://arxiv.org/abs/2603.00213v1 Inferring brain plasticity rule under long-term stimulation with structured recurrent dynamics 2026-02-27T15:17:40Z Understanding how long-term stimulation reshapes neural circuits requires uncovering the rules of brain plasticity. While short-term synaptic modifications have been extensively characterized, the principles that drive circuit-level reorganization across hours to weeks remain unknown. Here, we formalize these principles as a latent dynamical law that governs how recurrent connectivity evolves under repeated interventions. To capture this law, we introduce the Stimulus-Evoked Evolution Recurrent dynamics (STEER) framework, a dual-timescale model that disentangles fast neural activity from slow plastic changes. STEER represents plasticity as low-dimensional latent coefficients evolving under a learnable recurrence, enabling testable inference of plasticity rules rather than absorbing them into black-box parameters. We validate STEER with four benchmarks: synthetic Lorenz systems with controlled parameter shifts, BCM-based networks with biologically grounded plasticity, a task learning setting with adaptively optimized external stimulation and longitudinal recordings from Parkinsonian rats receiving closed-loop DBS. Our results demonstrate that STEER recovers interpretable update equations, predicts network adaptation under unseen stimulation schedules, and supports the design of improved intervention protocols. By elevating long-term plasticity from a hidden confound to an identifiable dynamical object, STEER provides a data-driven foundation for both mechanistic insight and principled optimization of brain stimulation. 2026-02-27T15:17:40Z Zhichao Liang Jingzhe Lin Xinyi Li Guanyi Zhao Quanying Liu http://arxiv.org/abs/2602.13368v2 The Influence of Width Ratios on Structural Beauty in Male Faces 2026-02-27T14:19:01Z This study investigates the relationship between interocular distance relative to overall facial width (width ratio) and perceived subjective beauty in male faces. Building on the methodology of Pallett et al. (2010), who found that average proportions in female faces were rated as most attractive, the current study aimed to test this hypothesis in male faces. Faces from the Chicago Face Database (Ma et al., 2015) were morphed into average faces within three groups (with low, medium, and high width ratios), each composed of 96 or 97 individual images. These three average faces were then systematically manipulated in their width ratios across three levels in both directions, respectively, resulting in a total of 21 comparable faces. The use of multiple base faces served as a control for potential artifacts of image processing. Consequently, comparisons were restricted to within-group pairs to avoid confounding by co-varying facial features (e.g., skin tone), which precluded direct cross-condition comparisons but ensured internal validity. In a two-alternative forced-choice task, participants selected the more beautiful face from each pair. The data were analyzed using a Bayesian model which enables inference of the width ratio perceived as most beautiful. Results support the hypothesis that averageness in facial proportions correlates with higher perceived attractiveness. The study highlights the importance of controlling for image manipulation, including attempts at methodological implementation, and of considering ethnicity as a potential moderating variable. These findings offer a data-driven foundation for understanding facial aesthetics and cognitive processes of human perception, with applications in advertising, artificial face generation, and plastic surgery. 2026-02-13T13:27:47Z Theresa Tennstedt Benjamin Knopp Dominik Endres http://arxiv.org/abs/2603.03355v1 Inhibitory Cross-Talk Enables Functional Lateralization in Attention-Coupled Latent Memory 2026-02-27T08:58:51Z We present a memory-augmented transformer in which attention serves simultaneously as a retrieval, consolidation, and write-back operator. The core update, $A^\top A V W$, re-grounds retrieved values into persistent memory slots via the Gram matrix $A^\top A$, providing a principled tripartite projection: observation space $\to$ latent memory $\to$ supervised transformation. We partition the memory into lateralized left and right banks coupled through a sign-controlled cross-talk matrix $W_s$, and show that the sign of this coupling is decisive for specialization. Excitatory cross-talk ($s=+1$) causes bank-dominance collapse: one bank monopolises all inputs and $\mathcal{P}_{ct} \to 0.5$, despite lowering task loss. Inhibitory cross-talk ($s=-1$), motivated by the net inhibitory effect of callosal projections in human cortex, actively suppresses contralateral bank activation and achieves saturated specialization ($\mathcal{D}_{sep} = \pm 1.00$, $\mathcal{P}_{ct} \approx 0$). On a controlled symbolic benchmark combining an episodic bijection cipher (requiring associative recall) with a strict arithmetic progression (requiring rule extraction), the inhibitory model reduces cipher-domain loss by $124{\times}$ over the baseline while matching it on the arithmetic domain, confirming that persistent lateralized memory is necessary for episodic recall but not for rule-based prediction. 2026-02-27T08:58:51Z 10 pages, 3 figures, conference style Hong Jeong http://arxiv.org/abs/2507.09513v2 Animal behavioral analysis and neural encoding with transformer-based self-supervised pretraining 2026-02-27T04:57:04Z The brain can only be fully understood through the lens of the behavior it generates -- a guiding principle in modern neuroscience research that nevertheless presents significant technical challenges. Many studies capture behavior with cameras, but video analysis approaches typically rely on specialized models requiring extensive labeled data. We address this limitation with BEAST(BEhavioral Analysis via Self-supervised pretraining of Transformers), a novel and scalable framework that pretrains experiment-specific vision transformers for diverse neuro-behavior analyses. BEAST combines masked autoencoding with temporal contrastive learning to effectively leverage unlabeled video data. Through comprehensive evaluation across multiple species, we demonstrate improved performance in three critical neuro-behavioral tasks: extracting behavioral features that correlate with neural activity, and pose estimation and action segmentation in both the single- and multi-animal settings. Our method establishes a powerful and versatile backbone model that accelerates behavioral analysis in scenarios where labeled data remains scarce. 2025-07-13T06:43:05Z Yanchen Wang Han Yu Ari Blau Yizi Zhang The International Brain Laboratory Liam Paninski Cole Hurwitz Matt Whiteway http://arxiv.org/abs/2510.06091v2 Learning Mixtures of Linear Dynamical Systems via Hybrid Tensor-EM Method 2026-02-27T00:29:40Z Mixtures of linear dynamical systems (MoLDS) provide a path to model time-series data that exhibit diverse temporal dynamics across trajectories. However, its application remains challenging in complex and noisy settings, limiting its effectiveness for neural data analysis. Tensor-based moment methods can provide global identifiability guarantees for MoLDS, but their performance degrades under noise and complexity. Commonly used expectation-maximization (EM) methods offer flexibility in fitting latent models but are highly sensitive to initialization and prone to poor local minima. Here, we propose a tensor-based method that provides identifiability guarantees for learning MoLDS, which is followed by EM updates to combine the strengths of both approaches. The novelty in our approach lies in the construction of moment tensors using the input-output data to recover globally consistent estimates of mixture weights and system parameters. These estimates can then be refined through a Kalman EM algorithm, with closed-form updates for all LDS parameters. We validate our framework on synthetic benchmarks and real-world datasets. On synthetic data, the proposed Tensor-EM method achieves more reliable recovery and improved robustness compared to either pure tensor or randomly initialized EM methods. We then analyze neural recordings from the primate somatosensory cortex while a non-human primate performs reaches in different directions. Our method successfully models and clusters different conditions as separate subsystems, consistent with supervised single-LDS fits for each condition. Finally, we apply this approach to another neural dataset where monkeys perform a sequential reaching task. These results demonstrate that MoLDS provides an effective framework for modeling complex neural data, and that Tensor-EM is a reliable approach to MoLDS learning for these applications. 2025-10-07T16:17:52Z 24 pages, 14 figures Lulu Gong Shreya Saxena http://arxiv.org/abs/2407.06195v2 Spectral-Stimulus Information for Self-Supervised Stimulus Encoding 2026-02-26T20:35:19Z Mammalian spatial navigation relies on specialized neurons, such as place and grid cells, which encode position based on self-motion and environmental cues. While extensive research has explored the computational role of grid cells, the principles underlying efficient place cell coding remain less understood. Existing spatial information rate measures primarily assess single-neuron encoding, limiting insights into population-level representations, while, the role of correlation in neural coding remains a subject of considerable debate. To address this, we introduce novel, correlation-aware information-theoretic measures that quantify the encoding efficiency of multiple neurons, including the joint stimulus information rate for neuron pairs and the spectral-stimulus information for arbitrary sized populations. The spectral-stimulus information, defined as the leading eigenvalue of the stimulus information matrix, is maximized when neurons exhibit localized, non-overlapping firing fields, mirroring place cell and head direction cell activity. We apply these measures to neural data recorded in mice and monkeys, elucidating differences in encoding efficiency across neuronal pairs and populations. Then, we demonstrate that these measures can be used to train recurrent neural networks (RNNs) via self-supervised learning, leading to the emergence of place cells and head direction cells. Our findings highlight how neural populations collectively encode stimuli, offering a more comprehensive framework for understanding stimulus encoding and optimizing artificial navigation systems in novel environments. 2024-06-10T15:03:54Z Jared Deighton Wyatt Mackey Ioannis Schizas David L. Boothe Vasileios Maroulas http://arxiv.org/abs/2602.23274v1 Exploiting network topology in brain-scale simulations of spiking neural networks 2026-02-26T17:49:13Z Simulation code for conventional supercomputers serves as a reference for neuromorphic computing systems. The present bottleneck of distributed large-scale spiking neuronal network simulations is the communication between compute nodes. Communication speed seems limited by the interconnect between the nodes and the software library orchestrating the data transfer. Profiling reveals, however, that the variability of the time required by the compute nodes between communication calls is large. The bottleneck is in fact the waiting time for the slowest node. A statistical model explains total simulation time on the basis of the distribution of computation times between communication calls. A fundamental cure is to avoid communication calls because this requires fewer synchronizations and reduces the variability of computation times across compute nodes. The organization of the mammalian brain into areas lends itself to such an optimization strategy. Connections between neurons within an area have short delays, but the delays of the long-range connections across areas are an order of magnitude longer. This suggests a structure-aware mapping of areas to compute nodes allowing for a partition into more frequent communication between nodes simulating a particular area and less frequent global communication. We demonstrate a substantial performance gain on a real-world example. This work proposes a local-global hybrid communication architecture for large-scale neuronal network simulations as a first step in mapping the structure of the brain to the structure of a supercomputer. It challenges the long-standing belief that the bottleneck of simulation is synchronization inherent in the collective calls of standard communication libraries. We provide guidelines for the energy efficient simulation of neuronal networks on conventional computing systems and raise the bar for neuromorphic systems. 2026-02-26T17:49:13Z Melissa Lober Markus Diesmann Susanne Kunkel