https://arxiv.org/api/nxnSKSUyPNNhTtTsaluZf10os9M 2026-06-22T22:41:12Z 12181 525 15 http://arxiv.org/abs/2601.09320v2 Mapping Connectomic Structure to Function(s) in Cerebellar-like Networks using Kernel Regression 2026-03-19T22:27:48Z

Cerebellar-like networks, in which input activity patterns are separated by projection to a much higher-dimensional space before classification, are a recurring neurobiological motif, present in the cerebellum, dentate gyrus, insect olfactory system, and electrosensory system of the electric fish. Their relatively well-understood design presents a promising test-case for probing principles of biological learning. The circuits' expansive projections have long been modelled as random, enabling effective general purpose pattern separation. However, electron-microscopy studies have discovered interesting hints of structure in both the fly mushroom body and mouse cerebellum. Recent numerical work suggested that this non-random connectivity enables the circuit to prioritise learning of some, presumably natural, tasks over others. Here, rather than numerical results, we present a robust mathematical link between the observed connectivity patterns and the cerebellar circuit's learning ability. In particular, we extend a simplified kernel regression model of the system and use recent machine learning theory results to relate connectivity to learning. We find that the reported structure in the projection weights shapes the network's inductive bias in intuitive ways: functions are easier to learn if they depend on inputs that are oversampled, or on collections of neurons that tend to connect to the same hidden layer neurons. Our approach is analytically tractable and pleasingly simple, and we hope it continues to serve as a model for understanding the functional implications of other processing motifs in cerebellar-like networks.

2026-01-14T09:41:46Z 12 pages, 7 figures William Dorrell Peter E. Latham http://arxiv.org/abs/2603.19425v1 Curvature Sensitive Cells in the Modular Structures of The Visual Cortex 2026-03-19T19:36:54Z

We propose a model of the functional architecture of curvature-sensitive cells in the primary visual cortex. The model accounts for the modular and hierarchical organization of the cortex, the horizontal connectivity, and the shape of receptive profiles of these cells as Gabor-type filters. We construct a canonical affine subbundle of the cotangent bundle of the manifold of oriented contact elements of the retina as a geometric model for these cells, and show that this subbundle carries an Engel structure related to that of the Cartan prolongation. On an open submanifold of the Cartan prolongation, we identify generators of the Engel distribution whose iterated Lie brackets span the Lie algebra of SIM(2). The identification of sim(2) as the Lie algebra of these generators determines SIM(2) as the natural symmetry group for curvature-sensitive cells. Finally, we characterize the receptive profiles of curvature-sensitive cells as minima of a SIM(2)-adapted uncertainty principle applied to the generators of the Engel structure.

2026-03-19T19:36:54Z Giovanna Citti Vasiliki Liontou http://arxiv.org/abs/2603.19139v1 Hierarchical Latent Structure Learning through Online Inference 2026-03-19T16:57:36Z

Learning systems must balance generalization across experiences with discrimination of task-relevant details. Effective learning therefore requires representations that support both. Online latent-cause models support incremental inference but assume flat partitions, whereas hierarchical Bayesian models capture multilevel structure but typically require offline inference. We introduce the Hierarchical Online Learning of Multiscale Experience Structure (HOLMES) model, a computational framework for hierarchical latent structure learning through online inference. HOLMES combines a variation on the nested Chinese Restaurant Process prior with sequential Monte Carlo inference to perform tractable trial-by-trial inference over hierarchical latent representations without explicit supervision over the latent structure. In simulations, HOLMES matched the predictive performance of flat models while learning more compact representations that supported one-shot transfer to higher-level latent categories. In a context-dependent task with nested temporal structure, HOLMES also improved outcome prediction relative to flat models. These results provide a tractable computational framework for discovering hierarchical structure in sequential data.

2026-03-19T16:57:36Z 4 figures, 5 supplementary figures Ines Aitsahalia Kiyohito Iigaya http://arxiv.org/abs/2603.22311v1 Ca2+ transient detection and segmentation with the Astronomically motivated algorithm for Background Estimation And Transient Segmentation (Astro-BEATS) 2026-03-19T14:55:15Z

Fluorescence-based Ca$^{2+}$-imaging is a powerful tool for studying localized neuronal activity, including miniature Synaptic Calcium Transients, providing real-time insights into synaptic activity. These transients induce only subtle changes in the fluorescence signal, often barely above baseline, which poses a significant challenge for automated synaptic transient detection and segmentation. Detecting astronomical transients similarly requires efficient algorithms that will remain robust over a large field of view with varying noise properties. We leverage techniques used in astronomical transient detection for miniature Synaptic Calcium Transient detection in fluorescence microscopy. We present Astro-BEATS, an automatic miniature Synaptic Calcium Transient segmentation algorithm that incorporates image estimation and source-finding techniques used in astronomy and designed for Ca$^{2+}$-imaging videos. Astro-BEATS outperforms current threshold-based approaches for synaptic Ca$^{2+}$ transient detection and segmentation. The produced segmentation masks can be used to train a supervised deep learning algorithm for improved synaptic Ca$^{2+}$ transient detection in Ca$^{2+}$-imaging data. The speed of Astro-BEATS and its applicability to previously unseen datasets without re-optimization makes it particularly useful for generating training datasets for deep learning-based approaches.

2026-03-19T14:55:15Z 29 pages, 4 figures, 12 supplementary pages, 5 supplementary figures Bolin Fan Anthony Bilodeau Frederic Beaupre Theresa Wiesner Christian Gagne Flavie Lavoie-Cardinal Renee Hlozek 10.64898/2026.03.13.711411 http://arxiv.org/abs/2601.06134v2 DeeperBrain: A Neuro-Grounded EEG Foundation Model Towards Universal BCI 2026-03-19T06:41:01Z

Electroencephalography (EEG) foundation models hold significant promise for universal Brain-Computer Interfaces (BCIs). However, existing approaches often rely on end-to-end fine-tuning and exhibit limited efficacy under frozen-probing protocols, lacking the intrinsic universality required for broad generalization. This limitation stems from adapting general-purpose sequence architectures that overlook the biophysical and dynamical principles of neural activity. To bridge this gap, we propose DeeperBrain, a neuro-grounded foundation model integrating domain-specific inductive biases into its model design and learning objectives. Architecturally, DeeperBrain incorporates a volume conduction-aware channel encoding to model spatial mixing via 3D geometry, and a neurodynamics-aware temporal encoding capturing slow adaptations using oscillatory and exponential bases. For pretraining, we introduce a dual-objective strategy combining Masked EEG Reconstruction (MER) for local fidelity and Neurodynamics Statistics Prediction (NSP). NSP enforces alignment with macroscopic brain states by predicting interpretable order parameters, including spectral power, functional connectivity, cross-frequency coupling, and dynamic complexity. Extensive experiments demonstrate that DeeperBrain achieves state-of-the-art or highly competitive performance under end-to-end fine-tuning. Crucially, it maintains superior efficacy under a rigorous frozen-probing protocol, verifying that embedding neuroscientific first principles endows learned representations with the intrinsic universality essential for universal BCI. The code will be publicly available.

2026-01-05T05:31:45Z Preprint Jiquan Wang Sha Zhao Yangxuan Zhou Yiming Kang Shijian Li Gang Pan http://arxiv.org/abs/2603.18475v1 Resolving the Blow-Up: A Time-Dilated Numerical Framework for Multiple Firing Events in Mean-Field Neuronal Networks 2026-03-19T04:17:47Z

In large-scale excitatory neuronal networks, rapid synchronization manifests as {multiple firing events (MFEs)}, mathematically characterized by a finite-time blow-up of the neuronal firing rate in the mean-field Fokker-Planck equation. Standard numerical methods struggle to resolve this singularity due to the divergent boundary flux and the instantaneous nature of the population voltage reset. In this work, we propose a robust {multiscale numerical framework based on time dilation}. By transforming the governing equation into a dilated timescale proportional to the firing activity, we desingularize the blow-up, effectively stretching the instantaneous synchronization event into a resolved mesoscopic process. This approach is shown to be physically consistent with the {microscopic cascade mechanism} underlying MFEs and the system's inherent fragility. To implement this numerically, we develop a hybrid scheme that utilizes a {mesh-independent flux criterion} to switch between timescales and a semi-analytical ``moving Gaussian'' method to accurately evolve the post-blowup Dirac mass. Numerical benchmarks demonstrate that our solver not only captures steady states with high accuracy but also efficiently reproduces periodic MFEs, matching Monte Carlo simulations without the severe time-step restrictions associated with particle cascades.

2026-03-19T04:17:47Z Xu'an Dou Louis Tao Zhe Xue Zhennan Zhou http://arxiv.org/abs/1704.01148v10 The Quantification Horizon Theory of Consciousness 2026-03-18T18:54:41Z

To make nature mathematically tractable, the scientific model of the world omits qualia--colors, sounds, tastes, sensations--leaving only what admits of numerical characterization. The "hard problem" of consciousness--the enigma of why and how physical processing gives rise to felt experience--remains unsolved. The Quantification Horizon Theory of Consciousness (QHT) proposes that this enigma reflects a structural limitation of mathematical description: quantitative models capture only quantifiable features of reality; qualia are left out. Yet despite this limitation, QHT argues that such models can account for the unquantifiable--not by explaining it, but by registering its presence, in the form of a signpost. There are specific features of information geometry--compression singularities--that intuitively correspond to the hallmark properties of consciousness and could serve as precisely such signposts. QHT proposes that these singularities mark a quantification horizon--a boundary beyond which quantitative description cannot reach. On this proposal, qualia lie beyond the horizon. From this basis, the theory derives ineffability, privacy, and subjectivity as structural consequences and proposes structural accounts of unity and causal efficacy. The theory proposes substrate-independent dynamical criteria for determining which systems are plausible candidates for consciousness, avoids panpsychism, makes testable predictions, and offers concrete implications for artificial intelligence and artificial consciousness.

2017-04-04T18:32:58Z T. R. Le http://arxiv.org/abs/2603.17947v1 Unified Policy Value Decomposition for Rapid Adaptation 2026-03-18T17:19:56Z

Rapid adaptation in complex control systems remains a central challenge in reinforcement learning. We introduce a framework in which policy and value functions share a low-dimensional coefficient vector - a goal embedding - that captures task identity and enables immediate adaptation to novel tasks without retraining representations. During pretraining, we jointly learn structured value bases and compatible policy bases through a bilinear actor-critic decomposition. The critic factorizes as Q = sum_k G_k(g) y_k(s,a), where G_k(g) is a goal-conditioned coefficient vector and y_k(s,a) are learned value basis functions. This multiplicative gating - where a context signal scales a set of state-dependent bases - is reminiscent of gain modulation observed in Layer 5 pyramidal neurons, where top-down inputs modulate the gain of sensory-driven responses without altering their tuning. Building on Successor Features, we extend the decomposition to the actor, which composes a set of primitive policies weighted by the same coefficients G_k(g). At test time the bases are frozen and G_k(g) is estimated zero-shot via a single forward pass, enabling immediate adaptation to novel tasks without any gradient update. We train a Soft Actor-Critic agent on the MuJoCo Ant environment under a multi-directional locomotion objective, requiring the agent to walk in eight directions specified as continuous goal vectors. The bilinear structure allows each policy head to specialize to a subset of directions, while the shared coefficient layer generalizes across them, accommodating novel directions by interpolating in goal embedding space. Our results suggest that shared low-dimensional goal embeddings offer a general mechanism for rapid, structured adaptation in high-dimensional control, and highlight a potentially biologically plausible principle for efficient transfer in complex reinforcement learning systems.

2026-03-18T17:19:56Z Cristiano Capone Luca Falorsi Andrea Ciardiello Luca Manneschi http://arxiv.org/abs/2603.17676v1 Inhibitory normalization of error signals improves learning in neural circuits 2026-03-18T12:54:31Z

Normalization is a critical operation in neural circuits. In the brain, there is evidence that normalization is implemented via inhibitory interneurons and allows neural populations to adjust to changes in the distribution of their inputs. In artificial neural networks (ANNs), normalization is used to improve learning in tasks that involve complex input distributions. However, it is unclear whether inhibition-mediated normalization in biological neural circuits also improves learning. Here, we explore this possibility using ANNs with separate excitatory and inhibitory populations trained on an image recognition task with variable luminosity. We find that inhibition-mediated normalization does not improve learning if normalization is applied only during inference. However, when this normalization is extended to include back-propagated errors, performance improves significantly. These results suggest that if inhibition-mediated normalization improves learning in the brain, it additionally requires the normalization of learning signals.

2026-03-18T12:54:31Z 28 pages, 7 figures. Submitted to Neural Computation Roy Henha Eyono Daniel Levenstein Arna Ghosh Jonathan Cornford Blake Richards http://arxiv.org/abs/2410.03657v3 Low-dimensional model for adaptive networks of spiking neurons 2026-03-18T11:28:29Z

We investigate a large ensemble of Quadratic Integrate-and-Fire (QIF) neurons with heterogeneous input currents and adaptation variables. Our analysis reveals that for a specific class of adaptation, termed quadratic spike-frequency adaptation (QSFA), the high-dimensional system can be exactly reduced to a low-dimensional system of ordinary differential equations, which describes the dynamics of three mean-field variables: the population's firing rate, the mean membrane potential, and a mean adaptation variable. The resulting low-dimensional firing rate equations (FRE) uncover a key generic feature of heterogeneous networks with spike frequency adaptation: Both the center and the width of the distribution of the neurons' firing frequencies are reduced, and this largely promotes the emergence of collective synchronization in the network. Our findings are further supported by the bifurcation analysis of the FRE, which accurately captures the collective dynamics of the spiking neuron network, including phenomena such as collective oscillations, bursting, and macroscopic chaos.

2024-10-04T17:58:45Z Physical Review E 111, 014422 (2015) Bastian Pietras Pau Clusella Ernest Montbrió 10.1103/PhysRevE.111.014422 http://arxiv.org/abs/2407.14708v4 Modeling flexible behavior with remapping-based hippocampal sequence learning 2026-03-18T10:45:24Z

Animals flexibly change their behavior depending on context. It is reported that the hippocampus is one of the most prominent regions for contextual behaviors, and its sequential activity shows context dependency. However, how such context-dependent sequential activity is established through reorganization of neuronal activity (remapping) is unclear. To better understand the formation of hippocampal activity and its contribution to context-dependent flexible behavior, we present a novel biologically plausible reinforcement learning model. In this model, Context selector promotes the formation of context-dependent sequential activity and allows for flexible switching of behavior in multiple contexts. This model reproduces a variety of findings from neural activity, optogenetic inactivation, human fMRI, and clinical research. Furthermore, our model predicts that imbalances in the ratio between sensory and contextual representations in Context selector account for schizophrenia (SZ) and autism spectrum disorder (ASD)-like behaviors.

2024-07-20T00:00:15Z Yoshiki Ito Taro Toyoizumi 10.7554/eLife.106506. http://arxiv.org/abs/2603.17392v1 Agentic Cognitive Profiling: Realigning Automated Alzheimer's Disease Detection with Clinical Construct Validity 2026-03-18T06:15:35Z

Automated Alzheimer's Disease (AD) screening has predominantly followed the inductive paradigm of pattern recognition, which directly maps the input signal to the outcome label. This paradigm sacrifices construct validity of clinical protocol for statistical shortcuts. This paper proposes Agentic Cognitive Profiling (ACP), an agentic framework that realigns automated screening with clinical protocol logic across multiple cognitive domains. Rather than learning opaque mappings from transcripts to labels, the framework decomposes standardized assessments into atomic cognitive tasks and orchestrates specialized LLM agents to extract verifiable scoring primitives. Central to our design is decoupling semantic understanding from measurement by delegating all quantification to deterministic function calling, thereby mitigating hallucination and restoring construct validity. Unlike popular datasets that typically comprise around a hundred participants under a single task, we evaluate on a clinically-annotated corpus of 402 participants across eight structured cognitive tasks spanning multiple cognitive domains. The framework achieves 90.5% score match rate in task examination and 85.3% accuracy in AD prediction, surpassing popular baselines while generating interpretable cognitive profiles grounded in behavioral evidence. This work demonstrates that construct validity and predictive performance need not be traded off, charting a path toward AD screening systems that explain rather than merely predict.

2026-03-18T06:15:35Z Jiawen Kang Kun Li Dongrui Han Jinchao Li Junan Li Lingwei Meng Xixin Wu Helen Meng http://arxiv.org/abs/2508.08435v5 Fast weight programming and linear transformers: from machine learning to neurobiology 2026-03-18T02:06:05Z

Recent advances in artificial neural networks for machine learning, and language modeling in particular, have established a family of recurrent neural network (RNN) architectures that, unlike conventional RNNs with vector-form hidden states, use two-dimensional (2D) matrix-form hidden states. Such 2D-state RNNs, known as Fast Weight Programmers (FWPs), can be interpreted as a neural network whose synaptic weights (called fast weights) dynamically change over time as a function of input observations, and serve as short-term memory storage; corresponding synaptic weight modifications are controlled or programmed by another network (the programmer) whose parameters are trained (e.g., by gradient descent). In this Primer, we review the technical foundations of FWPs, their computational characteristics, and their connections to transformers and state space models. We also discuss connections between FWPs and models of synaptic plasticity in the brain, suggesting a convergence of natural and artificial intelligence.

2025-08-11T19:50:03Z Accepted to TMLR 2025 Kazuki Irie Samuel J. Gershman http://arxiv.org/abs/2507.11027v4 Functionalist Emotion Modeling in Biomimetic Reinforcement Learning 2026-03-18T00:58:08Z

We explore a functionalist approach to emotion by employing an ansatz -- an initial set of assumptions -- that a hypothetical concept generation model incorporates unproven but biologically plausible traits. From these traits, we mathematically construct a theoretical reinforcement learning framework grounded in functionalist principles and examine how the resulting utility function aligns with emotional valence in biological systems. Our focus is on structuring the functionalist perspective through a conceptual network, particularly emphasizing the construction of the utility function, not to provide an exhaustive explanation of emotions. The primary emphasis is not of planning or action execution, but such factors are addressed when pertinent. Finally, we apply the framework to psychological phenomena such as humor, psychopathy, and advertising, demonstrating its breadth of explanatory power.

2025-07-15T06:48:59Z Louis Wang http://arxiv.org/abs/2603.26707v1 The Cognitive Divergence: AI Context Windows, Human Attention Decline, and the Delegation Feedback Loop 2026-03-17T18:53:45Z

This paper documents and theorises a self-reinforcing dynamic between two measurable trends: the exponential expansion of large language model (LLM) context windows and the secular contraction of human sustained-attention capacity. We term the resulting asymmetry the Cognitive Divergence. AI context windows have grown from 512 tokens in 2017 to 2,000,000 tokens by 2026 (factor ~3,906; fitted lambda = 0.59/yr; doubling time ~14 months). Over the same period, human Effective Context Span (ECS) -- a token-equivalent measure derived from validated reading-rate meta-analysis (Brysbaert, 2019) and an empirically motivated Comprehension Scaling Factor -- has declined from approximately 16,000 tokens (2004 baseline) to an estimated 1,800 tokens (2026, extrapolated from longitudinal behavioural data ending 2020 (Mark, 2023); see Section 9 for uncertainty discussion). The AI-to-human ratio grew from near parity at the ChatGPT launch (November 2022) to 556--1,111x raw and 56--111x quality-adjusted, after accounting for retrieval degradation (Liu et al., 2024; Chroma, 2025). Beyond documenting this divergence, the paper introduces the Delegation Feedback Loop hypothesis: as AI capability grows, the cognitive threshold at which humans delegate to AI falls, extending to tasks of negligible demand; the resulting reduction in cognitive practice may further attenuate the capacities already documented as declining (Gerlich, 2025; Kim et al., 2026; Kosmyna et al., 2025). Neither trend reverses spontaneously. The paper characterises the divergence statistically, reviews neurobiological mechanisms across eight peer-reviewed neuroimaging studies, presents empirical evidence bearing on the delegation threshold, and proposes a research agenda centred on a validated ECS psychometric instrument and longitudinal study of AI-mediated cognitive change.

2026-03-17T18:53:45Z 28 pages, 1 figure, 5 tables. Preprint, not peer reviewed Netanel Eliav Machine Human Intelligence Lab