https://arxiv.org/api/JRMm4dtEhUMbA4C2vsMe2G5U4W42026-06-21T14:10:24Z1218110515http://arxiv.org/abs/2606.03481v1Short-Term Synaptic Plasticity Stabilizes Goal-Conditioned Dynamics in a PFC-Inspired Reservoir Model for Multistep Goal-Directed Action Planning2026-06-02T10:59:46ZThe prefrontal cortex (PFC) maintains goal information for action planning, but how recurrent circuits preserve it in an action-usable form over behavioral timescales remains unclear. Here we ask whether short-term synaptic plasticity (STP) can stabilize goal information as action-usable, goal-conditioned dynamics. We incorporated STP into a PFC-inspired reservoir computing model with basal-ganglia-inspired temporal-difference readout learning, and evaluated paired models with and without STP across 100 independently generated networks in a multistep goal-directed action-selection task with delayed execution. Goal identity was highly decodable during the delay even without STP, so STP was not required to form a linearly readable goal representation. Under state noise, however, success without STP fell from 75.8% to 49.5%, whereas the model with STP remained essentially unchanged (91.8% without noise versus 89.2% under noise; paired Cohen's dz=1.31). Time-resolved decoding, state-space separability, and action-value-difference analyses showed that STP preserved goal information as action-relevant goal-conditioned dynamics available at later action opportunities. Gain-matched and STP-state perturbation controls argued against a simple fixed recurrent-scaling explanation and supported online, history-dependent synaptic modulation. Effective-connectivity analyses showed delay-period goal-specific patterning that increased toward the later part of the trial with STP, where it should be read as goal- and task-state-conditioned patterning; effective connectivity without STP was time-invariant. A grid search identified a facilitation-dominant range of STP time constants associated with high success rates. These results suggest that STP supports robust goal-conditioned dynamics through dynamic modulation of goal-dependent effective recurrent connectivity.2026-06-02T10:59:46Z68 pages, 33 figures, 3 tables; includes supplementary material; submitted to Neural NetworksJin NakamuraYuichi Katorihttp://arxiv.org/abs/2606.03471v1A formal definition and meta-model for a machine theory of mind2026-06-02T10:48:59ZThis paper proposes, for the first time, a rigorous formal definition of the concept of Machine Theory of Mind, based on principles supported by evidence from cognitive psychology, neuroscience and artificial intelligence, and uses the above as a lens to examine state-of-the-art and current efforts in the field, driving a potential agenda for further research there able to "crack" the problem. It also advances a general holistic meta-model for Machine Theory of Mind, and examines the state of the art when it comes to empirically benchmarking such models.2026-06-02T10:48:59Z48 pages, 2 figuresFabio Cuzzolinhttp://arxiv.org/abs/2508.04983v3Kinetic energy in random recurrent neural networks2026-06-02T07:17:09ZHigh-dimensional chaotic dynamics can emerge in a large random recurrent neural network when the synaptic gain crosses a threshold. Recent works showed that the kinetic energy of neural activity links the chaotic dynamics and the supporting unstable fixed points (equilibria) in the phase space. Here, we investigate the kinetic-energy-centric properties of random recurrent neural networks by combining dynamical mean-field theory with extensive numerical simulations. We find that the average kinetic energy shifts continuously from zero to a positive value at the known critical value of coupling variance (synaptic gain) and exhibits a cubic scaling behavior near the critical point from above. This scaling behavior is supported by numerical simulations and provides a quantitative characterization of how fast the dynamics change during the onset of chaos as well as how far the chaotic dynamics are away from the unstable fixed points. The steady-state activity distribution is further calculated by the theory and compared with simulations on finite-size systems from the kinetic-energy optimization perspective as well. The activity distribution is also analyzed in a geometric angle, revealing that although the original chaotic dynamics and the gradient dynamics of the kinetic energy are arranged in a shell-like structure, they are well separated in the polar direction. The trajectory length on the chaotic manifold can be derived from the stationary kinetic energy, and the associated stationary behavior is analyzed as well.2025-08-07T02:28:51Z30 pages, 8 figures, revised manuscript to PRELi-Ru ZhangHaiping Huanghttp://arxiv.org/abs/2606.03118v1Learning to See via Epiretinal Implant Stimulation in silico with Model-Based Deep Reinforcement Learning2026-06-02T04:03:43ZObjective: Diseases such as age-related macular degeneration and retinitis pigmentosa cause the degradation of the photoreceptor layer. One approach to restore vision is to electrically stimulate the surviving retinal ganglion cells with a microelectrode array such as epiretinal implants. Epiretinal implants are known to generate visible anisotropic shapes elongated along the axon fascicles of neighboring retinal ganglion cells. Recent work has demonstrated that to obtain isotropic pixel-like shapes, it is possible to map axon fascicles and avoid stimulating them by inactivating electrodes or lowering stimulation current levels. Avoiding axon fascicle stimulation aims to remove brushstroke-like shapes in favor of a more reduced set of pixel-like shapes. Approach: In this study, we propose the use of isotropic and anisotropic shapes to render intelligible images on the retina of a virtual patient in a reinforcement learning environment named rlretina. The environment formalizes the task as using brushstrokes in a stroke-based rendering task. Main Results: We train a deep reinforcement learning agent that learns to assemble isotropic and anisotropic shapes to form an image. We investigate which error-based or perception-based metrics is adequate to reward the agent. The agent is trained in a model-based data generation fashion using the psychophysically validated axon map model to render images as perceived by different virtual patients. We show that the agent can generate more intelligible images compared to the naive method in different virtual patients. Significance: This work shares a new way to address epiretinal stimulation that constitutes a first step towards improving visual acuity in artificially-restored vision using anisotropic phosphenes.2026-06-02T04:03:43Z18 pages, 6 figures. Published version: Biomed. Phys. Eng. Express 10, 025006 (2024)Biomed. Phys. Eng. Express 10 (2024) 025006Jacob LavoieMarwan BesrourWilliam LemaireJean RouatRéjean FontaineEric Plourde10.1088/2057-1976/acf1a5http://arxiv.org/abs/2511.13899v2A Factorized Low-Rank RNN Framework for Uncovering Independent Neural Latent Dynamics and Connectivity2026-06-02T03:01:03ZLow-rank recurrent neural networks (lrRNNs) are a class of models that uncover low-dimensional latent dynamics underlying neural population activity. Although their functional connectivity is low-rank, it lacks independence interpretations, making it difficult to assign distinct computational roles to different latent dimensions. To address this, we propose the Factored Recurrent Neural Network (FacRNN), a generative lrRNN framework that assumes group-wise independence among latent dynamics while allowing flexible within-group entanglement. These independent latent groups allow latent dynamics to evolve separately, but are internally rich for complex computation. We reformulate the lrRNN under a variational autoencoder (VAE) framework, enabling us to introduce a partial correlation penalty that encourages independence between groups of latent dimensions. Experiments on synthetic, monkey M1, and mouse voltage imaging data show that FacRNN consistently improves the disentanglement and interpretability of learned neural latent trajectories in low-dimensional space and low-rank connectivity over baseline lrRNNs that do not encourage group-wise independence.2025-11-17T20:49:58ZChengrui LiYunmiao WangYule WangWeihan LiDieter JaegerAnqi Wuhttp://arxiv.org/abs/2606.02937v1BEAST3D: Animal behavioral analysis and neural encoding from multi-view video via Gaussian splatting2026-06-01T22:34:14ZMulti-view video recordings are increasingly used to capture the 3D movements of animals in experimental settings, yet extracting rich 3D representations from these recordings remains challenging. Supervised pose estimation requires extensive manual annotation, while general-purpose 3D reconstruction models trained on generic scene datasets fail on the specialized imagery and sparse-view setting of laboratory experiments. We address these limitations with BEAST3D, a self-supervised pretraining framework that learns 3D visual representations from unlabeled, calibrated multi-view video. BEAST3D uses a vision transformer to predict 3D Gaussian splats that reconstruct held-out views through differentiable rendering, while simultaneously segmenting the animal from the background. BEAST3D reconstructs 3D structure with as few as four views by conditioning directly on known camera parameters--unlike general-purpose models, which must estimate camera geometry from dense overlapping viewpoints that are seldom available in lab settings. Through comprehensive evaluation across four species, we demonstrate that BEAST3D produces rich, viewpoint-invariant features that transfer effectively to three downstream tasks: novel view synthesis, which validates the quality of the learned 3D representations; multi-view pose estimation, which provides the sparse keypoint trajectories widely used in behavioral analysis; and neural encoding, which relates 3D behavioral features to simultaneously recorded neural activity. BEAST3D thus establishes a versatile framework for behavioral analysis that leverages 3D structure in modern multi-view laboratory recordings.2026-06-01T22:34:14ZYanchen WangLenny AharonWangshu ZhuKyle DaruwallaLinghua ZhangJiaru ZouSelmaan ChettihHelen HouLiam PaninskiMatthew R Whitewayhttp://arxiv.org/abs/2602.18690v2Neural Fields as World Models2026-06-01T18:47:07ZHumans rehearse possible futures offline, as in mental practice and perhaps dreaming, suggesting that world models may support task learning away from the environment. Standard machine learning world models compress visual input into latent vectors, discarding the spatial structure that characterizes sensory cortex. We propose isomorphic world models: architectures that preserve sensory topology, so physics prediction becomes geometric propagation rather than abstract state transition. We implement this idea with motor-gated neural fields, where activity evolves through local lateral connectivity and motor commands multiplicatively modulate specific channels. Across three experiments, the same architecture learns ballistic prediction without ``teleporting,'' improves a catching policy offline by propagating task error through a frozen learned world model, and develops body-selective motor channels without body labels. These results provide preliminary evidence that physical prediction, offline task learning, and body-linked representation share a common computational substrate: action-conditional prediction within a spatial map.2026-02-21T01:52:43Z6 pages, 6 figures. Annual Meeting of the Cognitive Science Society (CogSci 2026)Joshua Nunleyhttp://arxiv.org/abs/2606.02392v1Topology as Logic: Structural Role Geometry Across Formal, Software, Biological, and Prebiotic Systems2026-06-01T15:42:49ZWe ask whether dependency topology correlates with functional load-bearing organization as recoverable geometry -- not as a metaphor, but as a measurable structural property detectable by multilayer network analysis. Across seven independent substrates, we show that hub persistence and rank divergence under the Functional Proximity Law recover operational organization that domain experts describe as logic: axiomatic load-bearing structure in formal mathematics, control and contract structure in legacy software, conserved hub grammar across approx. 600 million years of neural evolution, catalytic role organization in a published prebiotic autocatalytic network, carry-path dominance in a 4-bit digital circuit, betweenness persistence in the ISCAS85 c432 standard benchmark (n=196), and a directional formal-systems replication in the Coq Corelib (n=17). A key methodological finding: degree-based hub persistence is weak between physical wiring and simulation state-correlation layers (r=0.21 in c432), while betweenness-based persistence is stronger (r=0.77 in the 4-bit ALU post-hoc; r=0.34 in c432). The ISCAS85 pre-registered primary hypothesis was CONFIRMED (degree r=0.426, p=0.002, Spearman r=0.551). The formal-systems claim is supported by two proof-assistant corpora: Lean 4 mathlib4 (CONFIRMED, r=0.777, p=0.004) and Coq Corelib (PARTIAL, direction confirmed, r=0.288, p=0.287, n=17, underpowered). All seven experiments were pre-registered before analysis.2026-06-01T15:42:49Z7 pages, 1 table. Version 1. Seven pre-registered experiments across digital circuits, formal mathematics (Lean 4 and Coq), legacy COBOL, neural connectomics (C. elegans to Drosophila, ~600 Myr), and prebiotic chemistry. Companion paper: arXiv:2604.23639. Pre-registration: https://github.com/vladi160/preregistrations. Zenodo: https://doi.org/10.5281/zenodo.20489745Vladi Ivanovhttp://arxiv.org/abs/2606.02385v1How Optimality Structures Sparse Dictionaries: A Theory for Understanding SAE Representations2026-06-01T15:34:34ZSparse Autoencoders (SAEs) have found success parsing neural representations into interpretable concepts, providing a basis for understanding and control. However, what exactly SAEs extract, and, correspondingly, the scientific conclusions we can draw from them, are not obvious. Empirically, the proof is in the pudding: SAEs learn interpretable features. Theoretically, we lack a clear account of what properties a 'concept' must satisfy for an SAE to extract it. There has been extensive identifiability work studying the conditions under which sparse coding recovers ground-truth features; however, these approaches tends to focus on simple data-generating models (e.g. sparse independent features) which poorly approximate the internet-swallowing language-model representations on which SAEs are trained. Here, avoiding data-generating models, we ask simply what properties any dictionary learning optimum must satisfy. Concretely, we extend local optimality analyses (Gribonval & Schnass, 2010) to the nonnegative joint-optimisation problem that vanilla SAEs approximate, and derive constraints relating optimal SAE features to their distributions. We use these constraints to explain a range of observed SAE behaviours - hierarchical splitting & absorption, the structure of residuals, and dense antipodal features - each reflecting how L1+nonnegativity interact with data to structure optimal dictionaries. Finally, we construct a novel large-dictionary convex problem and explore the wide atom-per-datapoint limit. In sum, we hope to tease model assumptions from unexpected observations, letting us learn more from SAEs' successes and provide principles for designing their successors.2026-06-01T15:34:34Z27 pages, 5 figuresWilliam Dorrellhttp://arxiv.org/abs/2606.02305v1Mapping Whisper Representations to Human ECoG Responses with Interpretable Time-Resolved Neural Encoding2026-06-01T14:25:36ZUnderstanding how speech foundation models relate to human cortical activity is a key challenge for computational neuroscience. Here, we investigate how internal representations from Whisper predict intracranial ECoG responses during naturalistic speech perception. We introduce a time-resolved neural encoder that combines speech embeddings with a recurrent temporal model and soft attention, allowing us to examine layer-wise brain alignment. Intermediate Whisper layers provide the strongest correspondence with neural activity, supporting a hierarchical match between model representations and cortical speech processing. Comparisons with baselines show that high-resolution ECoG responses benefit from temporally structured modelling beyond linear mappings from the same speech representations. In addition, attention maps reveal temporally local alignment between speech embeddings and neural responses, while a phonemic interpretability analysis identifies anatomically coherent phoneme-category organization among encoding-informative electrodes. Together, these results suggest that speech foundation models offer a useful framework for studying time-resolved cortical speech representations.2026-06-01T14:25:36ZPresented at ICLR 2026 Workshop on Representational Alignment (Re-Align)Matteo CiferriTommaso BoccatoMichal OlakMatteo FerranteNicola Toschihttp://arxiv.org/abs/2606.02121v1What biology can, and cannot, tell us about conscious AI2026-06-01T11:51:42ZProgress in AI is turning machine consciousness from a philosophical curiosity into a societal issue, and has led to criticism of the widespread computational functionalism framework. Biological Naturalism (BN) claims that biology, not computation, is crucial for consciousness. We discuss which forms of BN are empirically testable. For Type-A-BN, biology intrinsically matters for consciousness, without affording unique information processing capabilities. We argue, similarly to the unfolding argument, that this dissociates consciousness from behaviour, making Type-A-BN untestable. For Type-B-BN, biology matters because it affords unique information processing capabilities. Type-B-BN is testable, and not incompatible with computational functionalism. Both face the same task: relating consciousness to information processing. Biology can act as a guide on this quest, but not as a solution.2026-06-01T11:51:42ZUlysse KlatzmannAdrien Doerighttp://arxiv.org/abs/2606.02099v1Unveiling the shared grey matter signature between Alzheimer's and Parkinson's Disease2026-06-01T11:30:29ZINTRODUCTION. This study presents the first quantification of vertex-level grey-matter associations between Alzheimer's disease (AD) and Parkinson's disease (PD) using highresolution brain maps aggregated from large MRI datasets. The aim is to identify shared neuroanatomical signatures between the two diseases. METHODS. Leveraging a novel statistical framework (SumR2 regression), adapted from genetic correlation analysis, we estimated the shared neuroanatomical signature (grey-matter correlation: rGM) between AD and PD. RESULTS. A significant positive brain-wide grey-matter correlation (rGM=0.24, 95%CI 0.20-0.28) was observed between AD and PD. This correlation was further observed across disease stages and replicated using UK Biobank data. We located 9 vertex-wise clusters (106 vertices) that contribute to the significant rGM, highlighting reduced thickness in the bilateral putamen and right accumbens as associated with both AD and PD. DISCUSSION. Our findings suggest that shared neuroanatomical features emerge early in neurodegeneration and have implications for early screening, disease monitoring, and targeted interventions. from the Parkinson's Progression Markers Initiative (PPMI) database (www.ppmi-info.org/access-2026-06-01T11:30:29ZVishaak GangasandraUQFinn BlainUQElise DelzantARAMISMichelle LuptonARAMIS, UQ, QUTMiguel RenteríaARAMIS, UQ, QUTSarah MedlandARAMIS, UQ, QUTBaptiste Couvy-duchesneARAMIS, UQ, QUThttp://arxiv.org/abs/2604.04958v3CalM: A Self-Supervised Foundation Model for Population Dynamics in Calcium Imaging Data2026-06-01T08:09:36ZRecent work suggests that large-scale, multi-animal modeling can significantly improve neural recording analysis. However, for functional calcium traces, existing approaches remain task-specific, limiting transfer across common neuroscience objectives. To address this challenge, we propose \textbf{CalM}, a self-supervised neural foundation model trained solely on neuronal calcium traces and adaptable to multiple downstream tasks, including forecasting and decoding. Our key contribution is a pretraining framework, composed of a high-performance tokenizer mapping single-neuron traces into a shared discrete vocabulary, and a dual-axis autoregressive transformer modeling dependencies along both the neural and the temporal axis. We evaluate CalM on a large-scale, multi-animal, multi-session dataset. On the neural population dynamics forecasting task, CalM achieves competitive performance against strong specialized baselines after pretraining. With a task-specific head, CalM further adapts to the behavior decoding task and achieves superior results compared with supervised decoding models. Moreover, linear analyses of CalM representations reveal interpretable functional structures beyond predictive accuracy. Taken together, we propose a novel and effective self-supervised pretraining paradigm for foundation models based on calcium traces, paving the way for scalable pretraining and broad applications in functional neural analysis. Code is released at https://github.com/TSuXinH/CalM.2026-04-03T13:46:41ZICML accepted versionXinhong XuYimeng ZhangQichen QianYuanlong Zhanghttp://arxiv.org/abs/2606.01841v1The Neuromorphic Supremacy2026-06-01T07:52:40ZLive neural systems demonstrate remarkable capabilities to learn new behavior and patterns from mere few examples and are known to operate robustly under severe sensory noise. These capabilities, however, remain largely out of reach for modern artificial neural networks, including deep learning models. We show that this gap can be bridged by embedding novel genuine neuromorphic circuits into conventional artificial neural network architectures. These circuits comprise astrocytic modulation and spiking dynamics inherent to biological neural structures. Tested across standard benchmarks representing tasks of varying complexity, the hybrid models achieve high accuracy from few training examples per class and sustain high performance under occlusion and impulse noise that cause performance collapse in standard models without neuromorphic adaptation. We term this phenomenon neuromorphic supremacy - a regime in which architectures grounded in neurobiology decisively outperform classical deep learning, pointing toward a principled foundation for perception in embodied AI systems operating in noisy, data-scarce environments.2026-06-01T07:52:40ZYuliya TsybinaIvan Y. TyukinAlexander N. GorbanVictor KazantsevDianhui WangSusanna Gordleevahttp://arxiv.org/abs/2511.04047v7Considering a generative mechanism of consciousness from the perspective of inter-level causation2026-06-01T06:30:15ZWhy do some physical systems possess consciousness, while others do not? We view consciousness not as a subjective experience, but rather as a physical event accompanying experience. Is this a question of physics? Or is it a question of the theory of causation? Physics and the theory of causation serve different descriptive purposes. To describe a causal model, we introduce an asymmetric relation between cause and effect that is necessary for describing causality, but not physical laws. We propose that the generation of consciousness is determined by a system's internal causal mechanisms, rather than by a system's functions (i.e., physically determined input-output relations). To explain these intrinsic causes, we focus on whole-to-parts causality. Traditionally, whole-to-parts causality is considered an emergent phenomenon rather than a mechanism. We devise a method for explicitly implementing these mechanisms in a causal model by examining how causes originating at higher levels are transmitted to lower levels within a system. We then propose a dual-laws model (DLM), which features distinct dynamical laws at higher and lower levels. Finally, we discuss the generation of functional consciousness and its causality based on the DLM.2025-11-06T04:34:52ZYoshiyuki OhmuraYasuo Kuniyoshi