https://arxiv.org/api/wmShhiCwInQPC0uIZeyjhnHkEPk2026-03-22T13:27:11Z118039015http://arxiv.org/abs/2603.04688v1Why the Brain Consolidates: Predictive Forgetting for Optimal Generalisation2026-03-05T00:03:05ZStandard accounts of memory consolidation emphasise the stabilisation of stored representations, but struggle to explain representational drift, semanticisation, or the necessity of offline replay. Here we propose that high-capacity neocortical networks optimise stored representations for generalisation by reducing complexity via predictive forgetting, i.e. the selective retention of experienced information that predicts future outcomes or experience. We show that predictive forgetting formally improves information-theoretic generalisation bounds on stored representations. Under high-fidelity encoding constraints, such compression is generally unattainable in a single pass; high-capacity networks therefore benefit from temporally separated, iterative refinement of stored traces without re-accessing sensory input. We demonstrate this capacity dependence with simulations in autoencoder-based neocortical models, biologically plausible predictive coding circuits, and Transformer-based language models, and derive quantitative predictions for consolidation-dependent changes in neural representational geometry. These results identify a computational role for off-line consolidation beyond stabilisation, showing that outcome-conditioned compression optimises the retention-generalisation trade-off.2026-03-05T00:03:05Z25 pages, 6 figuresZafeirios FountasAdnan OomerjeeHaitham Bou-AmmarJun WangNeil Burgesshttp://arxiv.org/abs/2603.04622v1INTENSE: Detecting and disentangling neuronal selectivity in calcium imaging data2026-03-04T21:31:29ZNeurons encode information about the environment through their activity. As animals explore the environment, neurons rapidly acquire selectivity for distinct features of the external world; characterizing how these selectivity patterns emerge, reorganize, and overlap is key to linking neural activity to behavior and cognition. Calcium imaging in freely behaving animals can record large neuronal populations, but quantifying neuron-behavior selectivity directly from continuous fluorescence is challenging because both signals are temporally autocorrelated and calcium kinetics introduce time lags.
Here we present INTENSE (INformation-Theoretic Evaluation of Neuronal SElectivity), an open-source framework that uses mutual information to detect neuron-behavior associations from raw calcium fluorescence data. INTENSE controls false discoveries using circular-shift permutation testing that preserves temporal structure and optimizes temporal delays to account for indicator kinetics and prospective/retrospective encoding. To separate genuine mixed selectivity from associations driven by behavioral covariance, INTENSE applies conditional mutual information-based disentanglement.
We validated INTENSE on synthetic datasets, demonstrating robust detection across diverse signal-to-noise ratios and reliability conditions, whereas methods lacking temporal controls show poor performance. Applied to CA1 miniscope recordings in mice freely exploring an open field, INTENSE reveals robust selectivity to multiple variables (place, head direction, object interaction, locomotion) and refines mixed-selectivity estimates by distinguishing redundant from genuinely multi-variable encoding. Together, INTENSE enables high-throughput, information-theoretic selectivity mapping with principled control of temporal structure and behavioral covariance, bridging large-scale recordings to circuit-level hypotheses.2026-03-04T21:31:29ZNikita PospelovViktor PlusninOlga RogozhnikovaAnna IvanovaVladimir SotskovKsenia ToropovaOlga IvashkinaVladik AvetisovKonstantin Anokhinhttp://arxiv.org/abs/2603.04149v1Topological Origin of the Diversity of Timescales in Recurrent Neural Circuits2026-03-04T15:02:03ZStructural and functional heterogeneity are hallmarks of cortical circuits, from broad degree distributions in the mouse connectome to diverse intrinsic neuronal timescales. Yet a mechanistic link between connectivity heterogeneity and functional diversity is lacking. To bridge this gap, we introduce a random recurrent network in which connectivity is generated by a configuration model with tunable degree heterogeneity and synaptic weights exhibiting varying levels of correlation. Using generating-functional methods, we derive a heterogeneous dynamical mean-field theory (hDMFT) with degree-conditioned stochastic dynamics. The theory shows that the interaction of partial symmetry in the weights and degree heterogeneity induces a non-Markovian memory term in the form of an emergent self-coupling whose strength scales with degree and produces a broad distribution of activity timescales. We obtain analytic stability criteria demonstrating that degree heterogeneity lowers the critical gain and localizes unstable modes onto hubs. The resulting rich dynamical landscape includes silent, chaotic, and multistable regimes, which we uncover via spectral, replica, and Lyapunov exponent analyses. We highlight the computational benefits of the observed timescale heterogeneity by revealing that, under an external input drive featuring a broadband spectrum, slow hub neurons act as integrators, demixing slow input components. Finally, instantiating the model with the empirically measured topology from the MICrONS cubic-millimeter mouse connectome explains the broad range of single-neuron timescales and their positive correlation with in-degree observed in resting-state recordings. Our results provide a mechanistic link between connectome topology, neural dynamics, and computation, identifying hubs in partially symmetric networks as a natural substrate for multiplexed processing across timescales.2026-03-04T15:02:03ZMarco ZenariLuca TaffarelloLuca MazzucatoAmos MaritanSamir Suweishttp://arxiv.org/abs/2511.14555v3DecNefSimulator: A Modular, Interpretable Framework for Decoded Neurofeedback Simulation Using Generative Models2026-03-04T10:53:26ZDecoded Neurofeedback (DecNef) is a flourishing non-invasive approach to brain modulation with wide-ranging applications in neuromedicine and cognitive neuroscience. However, progress in DecNef research remains constrained by subject-dependent learning variability, reliance on indirect measures to quantify progress, and the high cost and time demands of experimentation.
We present DecNefSimulator, a modular and interpretable simulation framework that formalizes DecNef as a machine learning problem. Beyond providing a virtual laboratory, DecNefSimulator enables researchers to model, analyze and understand neurofeedback dynamics. Using latent variable generative models as simulated participants, DecNefSimulator allows direct observation of internal cognitive states and systematic evaluation of how different protocol designs and subject characteristics influence learning.
We demonstrate how this approach can (i) reproduce empirical phenomena of DecNef learning, (ii) identify conditions under which DecNef feedback fails to induce learning, and (iii) guide the design of more robust and reliable DecNef protocols in silico before human implementation.
In summary, DecNefSimulator bridges computational modeling and cognitive neuroscience, offering a principled foundation for methodological innovation, robust protocol design, and ultimately, a deeper understanding of DecNef-based brain modulation.2025-11-18T14:58:59ZAlexander OlzaRoberto SantanaDavid Sotohttp://arxiv.org/abs/2603.03870v1Two-phase quadratic integrate-and-fire neurons: Exact low-dimensional description for ensembles of finite-voltage neurons2026-03-04T09:23:29ZWe introduce a two-phase quadratic integrate-and-fire (QIF) neuron whose membrane potential evolves according to two alternating Riccati equations within finite bounds. This simple extension removes the unphysical voltage divergence of the standard QIF model while producing realistic spike waveforms. Despite this modification, the system retains an exact low-dimensional description in the thermodynamic limit, governed by a single complex Riccati equation. Expressions for collective quantities such as the firing rate and mean voltage remain compact and analytically tractable. Because the formalism preserves the mathematical structure of the standard QIF ensemble, it inherits its many generalizations and can serve as a drop-in replacement in existing mean-field frameworks, providing a more biologically plausible yet still exactly solvable neuronal model.2026-03-04T09:23:29Z6 pages, 2 figuresPhysical Review Research 8, L012049 (2026)Rok Cestnik10.1103/tq4x-8ny1http://arxiv.org/abs/2603.03864v1Performance of Conventional EEG Biomarkers Across Different Clinical Phases of Major Depressive Disorder: A Comprehensive Evaluation2026-03-04T09:19:00ZWhile EEG features differentiate Major Depressive Disorder (MDD) from healthy controls (HC), their clinical utility as biomarkers depends on a monotonic trajectory across the disease spectrum, from the acute (AC) phase to the maintenance (MA) phase and finally to the healthy baseline. However, the progression of the MA phase remains poorly understood in traditional marker analysis. Analyzing EEG data from 74 individuals (24 AC, 23 MA, and 27 HC), this study provides a comprehensive evaluation of classic ERP and resting-state indices across AC, MA, and HC groups. Our results demonstrate that almost no conventional metrics strictly satisfy the criterion of monotonic progression, likely due to profound inter-individual heterogeneity. These findings highlight the inherent limitations of group-level feature extraction and provide critical insights for developing future paradigms and algorithms to identify neurobiological markers with genuine clinical utility.2026-03-04T09:19:00Z74 subjects, 3 groups, 3 conditionsFeng YanXuteng WangShuyu YangYue ZhaoXiaobin WongZhiren Wanghttp://arxiv.org/abs/2603.03476v1Stringology-Based Motif Discovery from EEG Signals: an ADHD Case Study2026-03-03T19:44:55ZWe propose a novel computational framework for analyzing electroencephalography (EEG) time series using methods from stringology, the study of efficient algorithms for string processing, to systematically identify and characterize recurrent temporal patterns in neural signals. The primary aim is to introduce quantitative measures to understand neural signal dynamics, with the present findings serving as a proof-of-concept. The framework adapts order-preserving matching (OPM) and Cartesian tree matching (CTM) to detect temporal motifs that preserve relative ordering and hierarchical structure while remaining invariant to amplitude scaling. This approach provides a temporally precise representation of EEG dynamics that complements traditional spectral and global complexity analyses. To evaluate its utility, we applied the framework to multichannel EEG recordings from individuals with attention-deficit/hyperactivity disorder (ADHD) and matched controls using a publicly available dataset. Highly recurrent, group-specific motifs were extracted and quantified using both OPM and CTM. The ADHD group exhibited significantly higher motif frequencies, suggesting increased repetitiveness in neural activity. OPM analysis revealed shorter motif lengths and greater gradient instability in ADHD, reflected in larger mean and maximal inter-sample amplitude changes. CTM analysis further demonstrated reduced hierarchical complexity in ADHD, characterized by shallower tree structures and fewer hierarchical levels despite comparable motif lengths. These findings suggest that ADHD-related EEG alterations involve systematic differences in the structure, stability, and hierarchical organization of recurrent temporal patterns. The proposed stringology-based motif framework provides a complementary computational tool with potential applications for objective biomarker development in neurodevelopmental disorders.2026-03-03T19:44:55ZAnat DahanSamah Ghazawihttp://arxiv.org/abs/2603.01387v2An Information-Theoretic Framework For Optimizing Experimental Design To Distinguish Probabilistic Neural Codes2026-03-03T19:38:38ZThe Bayesian brain hypothesis has been a leading theory in understanding perceptual decision-making under uncertainty. While extensive psychophysical evidence supports the notion of the brain performing Bayesian computations, how uncertainty information is encoded in sensory neural populations remains elusive. Specifically, two competing hypotheses propose that early sensory populations encode either the likelihood function (exemplified by probabilistic population codes) or the posterior distribution (exemplified by neural sampling codes) over the stimulus, with the key distinction lying in whether stimulus priors would modulate the neural responses. However, experimentally differentiating these two hypotheses has remained challenging, as it is unclear what task design would effectively distinguish the two. In this work, we present an information-theoretic framework for optimizing the task stimulus distribution that would maximally differentiate competing probabilistic neural codes. To quantify how distinguishable the two probabilistic coding hypotheses are under a given task design, we derive the information gap--the expected performance difference when likelihood versus posterior decoders are applied to neural populations--by evaluating the Kullback-Leibler divergence between the true posterior and a task-marginalized surrogate posterior. Through extensive simulations, we demonstrate that the information gap accurately predicts decoder performance differences across diverse task settings. Critically, maximizing the information gap yields stimulus distributions that optimally differentiate likelihood and posterior coding hypotheses. Our framework enables principled, theory-driven experimental designs with maximal discriminative power to differentiate probabilistic neural codes, advancing our understanding of how neural populations represent and process sensory uncertainty.2026-03-02T02:32:08ZAccepted to The Fourteenth International Conference on Learning Representations (ICLR 2026)Po-Chen KuoEdgar Y. Walkerhttp://arxiv.org/abs/2603.16897v1EEG-Based Brain-LLM Interface for Human Preference Aligned Generation2026-03-03T19:00:11ZLarge language models (LLMs) are becoming an increasingly important component of human--computer interaction, enabling users to coordinate a wide range of intelligent agents through natural language. While language-based interfaces are powerful and flexible, they implicitly assume that users can reliably produce explicit linguistic input, an assumption that may not hold for users with speech or motor impairments, e.g., Amyotrophic Lateral Sclerosis (ALS). In this work, we investigate whether neural signals can be used as an alternative input to LLMs, particularly to support those socially marginalized or underserved users. We build a simple brain-LLM interface, which uses EEG signals to guide image generation models at test time. Specifically, we first train a classifier to estimate user satisfaction from EEG signals. Its predictions are then incorporated into a test-time scaling (TTS) framework that dynamically adapts model inference using neural feedback collected during user evaluation. The experiments show that EEG can predict user satisfaction, suggesting that neural activity carries information on real-time preference inference. These findings provide a first step toward integrating neural feedback into adaptive language-model inference, and hopefully open up new possibilities for future research on adaptive LLM interaction.2026-03-03T19:00:11Z15 pages, 9 figuresJunzi ZhangJianing ShenWeijie TuYi ZhangHailin ZhangTom GedeonBin JiangYue Yaohttp://arxiv.org/abs/2603.03414v1Cognitive Dark Matter: Measuring What AI Misses2026-03-03T18:41:47ZWe propose that the jagged intelligence landscape of modern AI systems arises from a missing training signal that we call "cognitive dark matter" (CDM): brain functions that meaningfully shape behavior yet are hard to infer from behavior alone. We identify key CDM domains-metacognition, cognitive flexibility, episodic memory, lifelong learning, abductive reasoning, social and common-sense reasoning, and emotional intelligence-and present evidence that current AI benchmarks and large-scale neuroscience datasets are both heavily skewed toward already-mastered capabilities, with CDM-loaded functions largely unmeasured. We then outline a research program centered on three complementary data types designed to surface CDM for model training: (i) latent variables from large-scale cognitive models, (ii) process-tracing data such as eye-tracking and think-aloud protocols, and (iii) paired neural-behavioral data. These data will enable AI training on cognitive process rather than behavioral outcome alone, producing models with more general, less jagged intelligence. As a dual benefit, the same data will advance our understanding of human intelligence itself.2026-03-03T18:41:47Z11 pages, 3 figuresPatrick J. MineaultThomas L. GriffithsSean Escolahttp://arxiv.org/abs/2506.01303v3Dynamic Manifold Hopfield Networks for Context-Dependent Associative Memory2026-03-03T16:00:20ZNeural population activity in cortical and hippocampal circuits can be flexibly reorganized by context, suggesting that cognition relies on dynamic manifolds rather than static representations. However, how such dynamic organization can be realized mechanistically within a unified dynamical system remains unclear. Continuous Hopfield networks provide a classical attractor framework in which neural dynamics follow gradient descent on a fixed energy landscape, constraining retrieval within a static attractor manifold geometry. Extending this approach, we introduce Dynamic Manifold Hopfield Networks (DMHN), continuous dynamical models in which contextual modulation dynamically reshapes attractor geometry, transforming a static attractor manifold into a context-dependent family of neural manifolds. In DMHN, network interactions are learned in a data-driven manner, to intrinsically deform the geometry of its attractor manifold across cues without explicit context-specific parameterization. As a result, in associative retrieval, DMHN achieve substantially higher capacity and robustness than classical and modern Hopfield networks: when storing $2N$ patterns in a network of $N$ neurons, DMHN attain reliable retrieval with an average accuracy of 64%, compared with 1% and 13% for classical and modern variants, respectively. Together, these results establish dynamic reorganization of attractor manifold geometry as a principled mechanism for context-dependent remapping in neural associative memory.2025-06-02T04:24:36ZChong LiTaiping ZengXiangyang XueJianfeng Fenghttp://arxiv.org/abs/2603.03037v1Zigzag Persistence of Neural Responses to Time-Varying Stimuli2026-03-03T14:27:41ZWe use topological data analysis to study neural population activity in the Sensorium 2023 dataset, which records responses from thousands of mouse visual cortex neurons to diverse video stimuli. For each video, we build frame-by-frame cubical complexes from neuronal activity and apply zigzag persistent homology to capture how topological structure evolves over time. These dynamics are summarized with persistence landscapes, providing a compact vectorized representation of temporal features. We focus on one-dimensional topological features-loops in the data-that reflect coordinated, cyclical patterns of neural co-activation. To test their informativeness, we compare repeated trials of different videos by clustering their resulting topological neural representations. Our results show that these topological descriptors reliably distinguish neural responses to distinct stimuli. This work highlights a connection between evolving neuronal activity and interpretable topological signatures, advancing the use of topological data analysis for uncovering neural coding in complex dynamical systems.2026-03-03T14:27:41Z4+7 pages, 7 figures, accepted as proceedings of the Geometry, Topology and Machine Learning Workshop (GTML) 2025Yuri GardinazziAlessio AnsuiniEugenio PiasiniFabio AnselmiMatteo Biagettihttp://arxiv.org/abs/2509.00555v2Integrated information and predictive processing theories of consciousness: An adversarial collaborative review2026-03-03T13:36:42ZAs neuroscientific theories of consciousness continue to proliferate, the need to assess their similarities and differences - as well as their predictive and explanatory power - becomes ever more pressing. Recently, a number of structured adversarial collaborations have been devised to test the competing predictions of several candidate theories of consciousness. In this review, we compare and contrast three theories being investigated in one such adversarial collaboration: Integrated Information Theory, Neurorepresentationalism, and Active Inference. We begin by presenting the core claims of each theory, before comparing them in terms of the phenomena they seek to explain, the sorts of explanations they avail, and the methodological strategies they endorse. We then consider some of the inherent challenges of theory-testing, and how adversarial collaboration addresses some of these difficulties. The stage is then set for the empirical work to come: first, we outline the key hypotheses to be tested across a series of multi-site experiments; second, we discuss the kinds of observations that would support or challenge each theory; third, we consider how these theories might assimilate or accommodate such observations. Finally, we show how data harvested across disparate experiments (and their replicates) may be formally integrated to provide a quantitative measure of the evidential support accrued under each theory. Besides orienting the reader to the theoretical foundations of our collaboration, this review aims to provide valuable meta-scientific insights into the mechanics of adversarial collaboration and theory-testing in general - including the way theories may be evaluated in terms of the scientific progress they deliver.2025-08-30T16:41:13ZAndrew W. CorcoranAndrew M. HaunReinder DormanGiulio TononiKarl J. FristonCyriel M. A. Pennartz TWCF :INTREPID Consortiumhttp://arxiv.org/abs/2407.01656v5Absolute abstraction: a renormalisation group approach2026-03-03T10:38:00ZAbstraction is the process of extracting the essential features from raw data while ignoring irrelevant details. It is well known that abstraction emerges with depth in neural networks, where deep layers capture abstract characteristics of data by combining lower level features encoded in shallow layers (e.g. edges). Yet we argue that depth alone is not enough to develop truly abstract representations. We advocate that the level of abstraction crucially depends on how broad the training set is. We address the issue within a renormalisation group approach where a representation is expanded to encompass a broader set of data. We take the unique fixed point of this transformation -- the Hierarchical Feature Model -- as a candidate for a representation which is absolutely abstract. This theoretical picture is tested in numerical experiments based on Deep Belief Networks and auto-encoders trained on data of different breadth. These show that representations in neural networks approach the Hierarchical Feature Model as the data get broader and as depth increases, in agreement with theoretical predictions.2024-07-01T14:13:11Z35 pages, 6 figuresCarlo Orientale CaputoElias SeiffertEnrico FrausinMatteo Marsilihttp://arxiv.org/abs/2603.02491v1What Capable Agents Must Know: Selection Theorems for Robust Decision-Making under Uncertainty2026-03-03T00:47:58ZAs artificial agents become increasingly capable, what internal structure is *necessary* for an agent to act competently under uncertainty? Classical results show that optimal control can be *implemented* using belief states or world models, but not that such representations are required. We prove quantitative "selection theorems" showing that low *average-case regret* on structured families of action-conditioned prediction tasks forces an agent to implement a predictive, structured internal state. Our results cover stochastic policies, partial observability, and evaluation under task distributions, without assuming optimality, determinism, or access to an explicit model. Technically, we reduce predictive modeling to binary "betting" decisions and show that regret bounds limit probability mass on suboptimal bets, enforcing the predictive distinctions needed to separate high-margin outcomes. In fully observed settings, this yields approximate recovery of the interventional transition kernel; under partial observability, it implies necessity of belief-like memory and predictive state, addressing an open question in prior world-model recovery work.2026-03-03T00:47:58Z18 pagesAran Nayebi