https://arxiv.org/api/3dc0lULI8YQs0kF8sVLPZnJ4viA 2026-03-22T12:04:09Z 11803 75 15 http://arxiv.org/abs/2603.07369v1 Task learning increases information redundancy of neural responses in macaque visual cortex 2026-03-07T22:41:26Z

How does the brain optimize sensory information for decision-making in new tasks? One hypothesis suggests learning reduces redundancy in neural representations to improve efficiency, while another, based on Bayesian inference, predicts learning increases redundancy by distributing information across neurons. We tested these hypotheses by tracking population responses in macaque cortical area V4 as monkeys learned visual discrimination tasks. We found strong support for the Bayesian predictions: task learning increased redundancy in neural responses over weeks of training and within single trials. This redundancy did not reduce information but instead increased the information carried by individual neurons. These insights suggest sensory processing in the brain reflects a generative rather than discriminative inference process.

2026-03-07T22:41:26Z published in Science, accepted manuscript prior to editing, main text: 33 pages, 5 figures, 39 supplementary pages, 22 supplementary figures, 7 supplementary tables Science, 391(6789), 1029-1035 (2026) Shizhao Liu Anton Pletenev Ralf M. Haefner Adam C. Snyder 10.1126/science.adw7707 http://arxiv.org/abs/2603.07364v1 Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface 2026-03-07T22:17:38Z

The standard engineering approach when facing uncertainty is modelling. Mixing data from a well-calibrated model with real recordings has led to breakthroughs in many applications of AI, from computer vision to autonomous driving. This type of model-based data augmentation is now beginning to show promising results in biosignal processing as well. However, while these simulated data are necessary, they are not sufficient for virtual neurophysiological experiments. Simply generating neural signals that reproduce a predetermined motor behaviour does not capture the flexibility, variability, and causal structure required to probe neural mechanisms during control tasks. In this study, we present an in silico neuromechanical model that combines a fully forward musculoskeletal simulation, reinforcement learning, and sequential, online electromyography synthesis. This framework provides not only synchronised kinematics, dynamics, and corresponding neural activity, but also explicitly models feedback and feedforward control in a virtual participant. In this way, online control problems can be represented, as the simulated human adapts its behaviour via a learned RL policy in response to a neural interface. For example, the virtual user can learn hand movements robust to perturbations or the control of a virtual gesture decoder. We illustrate the approach using a gesturing task within a biomechanical hand model, and lay the groundwork for using this technique to evaluate neural controllers, augment training datasets, and generate synthetic data for neurological conditions.

2026-03-07T22:17:38Z Balint K. Hodossy Dario Farina http://arxiv.org/abs/2603.07275v1 Polarization-wave propagation as a biophysical mechanism of visual cognition 2026-03-07T16:31:20Z

Recent experimental studies indicate that visual cognition is accompanied by slowly propagating biophysical travelling waves in cortical tissue. Here we propose polarization waves as a coherent physical framework for visual cognition. We first compute the propagation of scalar potential fields generated by impressed ionic currents in the primary visual cortex using a telegraph-type model and extract the velocity of the moving potential ridge. By exploiting the linear convolution structure, we then demonstrate that the scalar potential field and the polarization wave, arising from slowly oscillating neuronal dipoles, propagate with identical velocities. Remarkably, this velocity coincides with the independently predicted propagation speed of the cognitively inferred modulated wave (~1.5 cm/s). Because ionic influx entering a single optic-nerve channel integrates signals from more than a hundred photoreceptors, the resulting polarization field necessarily spans a distribution of wave numbers. We show that amplitudes of such multi-k polarization waves undergo dispersive spreading in time, which possibly suppresses cross-channel interference in visual perception.

2026-03-07T16:31:20Z 24 pages for main manuscript including figures, 4 figures, 11 pages for supplementary information, 35 references cited Hyun Myung Jang Youngwoo Jang Hyeon Han http://arxiv.org/abs/2603.07217v1 A Miniature Brain Transformer: Thalamic Gating, Hippocampal Lateralization, Amygdaloid Salience, and Prefrontal Working Memory in Attention-Coupled Latent Memory 2026-03-07T13:53:01Z

We present a miniature brain transformer architecture that extends the attention-coupled latent memory framework with four additional brain-region analogues: a thalamic relay, an amygdaloid salience module, a prefrontal working-memory (PFC) buffer, and a cerebellar fast-path, all coupled by inhibitory callosal cross-talk between lateralized hippocampal banks. We evaluate on a two-domain benchmark -- MQAR (Multi-Query Associative Recall; episodic domain) and modular arithmetic (+1 mod 10; rule-based domain) -- using a seven-variant additive ablation. The central empirical finding is a surprise: inhibitory callosal coupling alone never lateralizes the banks (variants 1-5 maintain D_sep ~ 0.25 and P_ct ~ 0.25 for all 30 epochs). Functional lateralization requires the synergy of PFC and inhibition: only when the PFC buffer is added (variant 6) does a sharp, discontinuous phase transition fire -- at epoch 11 for the PFC-only variant and epoch 10 for the full model -- collapsing P_ct from 0.25 to ~0.002 and more than doubling D_sep from 0.251 to 0.501 in a single gradient step. The PFC buffer acts as a symmetry-breaker: its slowly drifting domain context creates the initial asymmetry that the inhibitory feedback loop then amplifies irreversibly. The cerebellar fast-path accelerates the transition by one epoch (epoch 10 vs. epoch 11) with no asymptotic change, confirming its convergence-acceleration role. The result constitutes a novel, falsifiable prediction -- no lateralization without working memory context -- and a principled, neurobiologically motivated blueprint for hierarchical persistent memory in sequence models.

2026-03-07T13:53:01Z 18 pages, 3 figures, 6 tables Hong Jeong http://arxiv.org/abs/2603.06816v1 "Dark Triad" Model Organisms of Misalignment: Narrow Fine-Tuning Mirrors Human Antisocial Behavior 2026-03-06T19:23:21Z

The alignment problem refers to the challenge of ensuring that powerful intelligences remain compatible with human preferences and values as their capabilities increase. Current large language models (LLMs) show misaligned behaviors, such as strategic deception, manipulation, and reward-seeking, that can arise despite safety training. Gaining a mechanistic understanding of these failures requires empirical approaches that can isolate behavioral patterns in controlled settings. We propose that biological misalignment precedes artificial misalignment, and leverage the Dark Triad of personality (narcissism, psychopathy, and Machiavellianism) as a psychologically grounded framework for constructing model organisms of misalignment. In Study 1, we establish comprehensive behavioral profiles of Dark Triad traits in a human population (N = 318), identifying affective dissonance as a central empathic deficit connecting the traits, as well as trait-specific patterns in moral reasoning and deceptive behavior. In Study 2, we demonstrate that dark personas can be reliably induced in frontier LLMs through minimal fine-tuning on validated psychometric instruments. Narrow training datasets as small as 36 psychometric items resulted in significant shifts across behavioral measures that closely mirrored human antisocial profiles. Critically, models generalized beyond training items, demonstrating out-of-context reasoning rather than memorization. These findings reveal latent persona structures within LLMs that can be readily activated through narrow interventions, positioning the Dark Triad as a validated framework for inducing, detecting, and understanding misalignment across both biological and artificial intelligence.

2026-03-06T19:23:21Z 38 pages, 17 figures Roshni Lulla Fiona Collins Sanaya Parekh Thilo Hagendorff Jonas Kaplan http://arxiv.org/abs/2603.06557v1 Causal Interpretation of Neural Network Computations with Contribution Decomposition 2026-03-06T18:46:06Z

Understanding how neural networks transform inputs into outputs is crucial for interpreting and manipulating their behavior. Most existing approaches analyze internal representations by identifying hidden-layer activation patterns correlated with human-interpretable concepts. Here we take a direct approach to examine how hidden neurons act to drive network outputs. We introduce CODEC (Contribution Decomposition), a method that uses sparse autoencoders to decompose network behavior into sparse motifs of hidden-neuron contributions, revealing causal processes that cannot be determined by analyzing activations alone. Applying CODEC to benchmark image-classification networks, we find that contributions grow in sparsity and dimensionality across layers and, unexpectedly, that they progressively decorrelate positive and negative effects on network outputs. We further show that decomposing contributions into sparse modes enables greater control and interpretation of intermediate layers, supporting both causal manipulations of network output and human-interpretable visualizations of distinct image components that combine to drive that output. Finally, by analyzing state-of-the-art models of neural activity in the vertebrate retina, we demonstrate that CODEC uncovers combinatorial actions of model interneurons and identifies the sources of dynamic receptive fields. Overall, CODEC provides a rich and interpretable framework for understanding how nonlinear computations evolve across hierarchical layers, establishing contribution modes as an informative unit of analysis for mechanistic insights into artificial neural networks.

2026-03-06T18:46:06Z 32 pages, 19 figures. ICLR 2026 poster Joshua Brendan Melander Zaki Alaoui Shenghua Liu Surya Ganguli Stephen A. Baccus http://arxiv.org/abs/2603.18028v1 Clinically Meaningful Explainability for NeuroAI: An ethical, technical, and clinical perspective 2026-03-06T10:16:31Z

While explainable AI (XAI) is often heralded as a means to enhance transparency and trustworthiness in closed-loop neurotechnology for psychiatric and neurological conditions, its real-world prevalence remains low. Moreover, empirical evidence suggests that the type of explanations provided by current XAI methods often fails to align with clinicians' end-user needs. In this viewpoint, we argue that clinically meaningful explainability (CME) is essential for AI-enabled closed-loop medical neurotechnology and must be addressed from an ethical, technical, and clinical perspective. Instead of exhaustive technical detail, clinicians prioritize clinically relevant, actionable explanations, such as clear representations of input-output relationships and feature importance. Full technical transparency, although theoretically desirable, often proves irrelevant or even overwhelming in practice, as it may lead to informational overload. Therefore, we advocate for CME in the neurotechnology domain: prioritizing actionable clarity over technical completeness and designing interface visualizations that intuitively map AI outputs and key features into clinically meaningful formats. To this end, we introduce a reference architecture called NeuroXplain, which translates CME into actionable technical design recommendations for any future neurostimulation device. Our aim is to inform stakeholders working in neurotechnology and regulatory framework development to ensure that explainability fulfills the right needs for the right stakeholders and ultimately leads to better patient treatment and care.

2026-03-06T10:16:31Z 20 pages, 2 figures Laura Schopp Ambra DImperio Jalal Etesami Marcello Ienca http://arxiv.org/abs/2603.05612v1 Behavior-dLDS: A decomposed linear dynamical systems model for neural activity partially constrained by behavior 2026-03-05T19:11:42Z

Brain-wide recordings of large-scale networks of neurons now provide an unprecedented view into how the brain drives behavior. However, brain activity contains both information directly related to behavior and the potential for many internal computations. Moreover, observable behavior is executed not only by the brain, but also by the spinal cord and peripheral nervous system. Behavior is a coarse-grained product of neural activity, and we thus take the view that it can be best represented by lower-dimensional latent neural dynamics. Capturing this indirect relationship while disambiguating behavior-generating networks from internal computations running in parallel requires new modeling approaches that can embody the parallel and distributed nature of large-scale neural populations. We thus present behavior-decomposed linear dynamical systems (b-dLDS) to disentangle simultaneously recorded subsystems and identify how the latent neural subsystems relate to behavior. We demonstrate the ability of b-dLDS to decouple behavioral vs. internal computations on controlled, simulated data, showing improvements over a state-of-the-art model that uses behavior to supervise all dynamics. We then show that b-dLDS can further scale up to tens of thousands of neurons by applying our model to large-scale recording of a zebrafish hindbrain during the complex positional homeostasis behavior, wherein b-dLDS highlights behavior-related dynamic connectivity networks.

2026-03-05T19:11:42Z Eva Yezerets En Yang Misha B. Ahrens Adam S. Charles http://arxiv.org/abs/2603.05418v1 The Spatial and Temporal Resolution of Motor Intention in Multi-Target Prediction 2026-03-05T17:40:30Z

Reaching for, grasping, and manipulating objects are essential motor functions in everyday life. Decoding human motor intentions is a central challenge for rehabilitation and assistive technologies. This study focuses on predicting intentions by inferring movement direction and target location from multichannel electromyography (EMG) signals, and investigating how spatially and temporally accurate such information can be detected relative to movement onset. We present a computational pipeline that combines data-driven temporal segmentation with classical and deep learning classifiers in order to analyse EMG data recorded during the planning, early execution, and target contact phases of a delayed reaching task. Early intention prediction enables devices to anticipate user actions, improving responsiveness and supporting active motor recovery in adaptive rehabilitation systems. Random Forest achieves $80\%$ accuracy and Convolutional Neural Network $75\%$ accuracy across $25$ spatial targets, each separated by $14^\circ$ azimuth/altitude. Furthermore, a systematic evaluation of EMG channels, feature sets, and temporal windows demonstrates that motor intention can be efficiently decoded even with drastically reduced data. This work sheds light on the temporal and spatial evolution of motor intention, paving the way for anticipatory control in adaptive rehabilitation systems and driving advancements in computational approaches to motor neuroscience.

2026-03-05T17:40:30Z Marie Dominique Schmidt Ioannis Iossifidis http://arxiv.org/abs/2511.01870v2 CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution 2026-03-05T13:45:17Z

Studying the cellular architecture of the human cerebral cortex is critical for understanding brain organization and function. It requires investigating complex texture patterns in histological images, yet automatic methods that scale across whole brains are still lacking. Here we introduce CytoNet, a foundation model trained on 1 million unlabeled microscopic image patches from over 4,000 histological sections spanning ten postmortem human brains. Using co-localization in the cortical sheet for self-supervision, CytoNet encodes complex cellular patterns into expressive and anatomically meaningful feature representations. CytoNet supports multiple downstream applications, including area classification, laminar segmentation, quantification of microarchitectural variation, and data-driven mapping of previously uncharted areas. In addition, CytoNet captures microarchitectural signatures of macroscale functional organization, enabling decoding of functional network parcellations from cytoarchitectonic features. Together, these results establish CytoNet as a unified framework for scalable analysis of cortical microarchitecture and for linking cellular architecture to structure-function organization in the human cerebral cortex.

2025-10-21T11:39:23Z 42 pages, 10 figures, 7 tables. Extended version with functional decoding Christian Schiffer Zeynep Boztoprak Jan-Oliver Kropp Julia Thönnißen Katia Berr Hannah Spitzer Katrin Amunts Timo Dickscheid http://arxiv.org/abs/2603.03347v2 Efficient Coding Predicts Synaptic Conductance 2026-03-05T11:47:27Z

Synapses are information efficient in the sense that their natural conductance values convey as many bits per Joule as possible, but efficiency falls rapidly if the conductance is forced to deviate from its natural value (Harris et al, 2015). However, the exact manner in which efficiency falls as conductance deviates from its natural value remains unexplained. Recently, Malkin et al (2026) showed that synaptic noise is minimised given the available energy, consistent with a minimal energy boundary. This minimal energy boundary is a necessary, but not sufficient, condition for maximising information efficiency. By expressing the minimal energy boundary in terms of Shannon's information theory (Shannon, 1949), we show that synapses operate at signal-to-noise ratios which maximise information efficiency, and that this accurately predicts the decrease in efficiency values observed in Harris et al (2015) across a wide range of synaptic conductances. Crucially, the proposed model contains no free parameters because it is derived from the biophysics of the synapse. The results reported here are consistent with the general principle that neuronal systems in the brain have evolved to be as efficient as possible in terms of the number of bits per Joule.
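
As a rough illustration of why information efficiency (bits per Joule) can peak at a finite signal-to-noise ratio, the sketch below combines Shannon's Gaussian-channel capacity, 0.5 * log2(1 + SNR) bits per sample, with a purely hypothetical energy cost that grows linearly with SNR. The cost model and the names `fixed_cost` and `cost_per_snr` are assumptions made for illustration; this is not the biophysical model derived in the paper.

```python
import math

def capacity_bits(snr):
    """Shannon capacity of a Gaussian channel, in bits per sample."""
    return 0.5 * math.log2(1.0 + snr)

def efficiency(snr, fixed_cost=1.0, cost_per_snr=1.0):
    """Bits per unit energy under a HYPOTHETICAL linear energy cost.

    energy = fixed_cost + cost_per_snr * snr is an illustrative assumption,
    not a measured or derived synaptic energy budget.
    """
    energy = fixed_cost + cost_per_snr * snr
    return capacity_bits(snr) / energy

# Scan a grid of signal-to-noise ratios and pick the most efficient one.
snrs = [0.1 * k for k in range(1, 201)]
best_snr = max(snrs, key=efficiency)
print(f"most efficient SNR on the grid: {best_snr:.1f}")
```

With these default costs the efficiency is proportional to log(1 + SNR) / (1 + SNR), which is maximised at SNR = e - 1, so efficiency falls off on either side of an intermediate optimum rather than growing with signal quality.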

2026-02-25T15:51:39Z James V Stone http://arxiv.org/abs/2603.03201v2 A Dynamical Theory of Sequential Retrieval in Input-Driven Hopfield Networks 2026-03-05T11:03:58Z

Reasoning is the ability to integrate internal states and external inputs in a meaningful and semantically consistent flow. Contemporary machine learning (ML) systems increasingly rely on such sequential reasoning, from language understanding to multi-modal generation, often operating over dictionaries of prototypical patterns reminiscent of associative memory models. Understanding retrieval and sequentiality in associative memory models provides a powerful bridge to gain insight into ML reasoning. While the static retrieval properties of associative memory models are well understood, the theoretical foundations of sequential retrieval and multi-memory integration remain limited, with existing studies largely relying on numerical evidence. This work develops a dynamical theory of sequential reasoning in Hopfield networks. We consider the recently proposed input-driven plasticity (IDP) Hopfield network and analyze a two-timescale architecture coupling fast associative retrieval with slow reasoning dynamics. We derive explicit conditions for self-sustained memory transitions, including gain thresholds, escape times, and collapse regimes. Together, these results provide a principled mathematical account of sequentiality in associative memory models, bridging classical Hopfield dynamics and modern reasoning architectures.
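
For readers unfamiliar with the static retrieval that the abstract takes as its starting point, here is a minimal sketch of a classical Hopfield network with Hebbian weights recovering a stored pattern from a corrupted cue. It is the textbook model only; the input-driven plasticity (IDP) network and the two-timescale sequential dynamics analysed in the paper are not implemented here, and the function names and toy patterns are illustrative assumptions.

```python
def train_hebbian(patterns):
    """Hebbian weights W_ij = (1/N) * sum over patterns of x_i * x_j, zero diagonal."""
    n = len(patterns[0])
    w = [[0.0] * n for _ in range(n)]
    for p in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    w[i][j] += p[i] * p[j] / n
    return w

def update(w, state):
    """One synchronous update: s_i <- sign(sum_j W_ij * s_j), with sign(0) = +1."""
    return [1 if sum(wij * sj for wij, sj in zip(row, state)) >= 0 else -1
            for row in w]

def retrieve(w, cue, max_steps=10):
    """Iterate updates until the state stops changing or max_steps is reached."""
    state = list(cue)
    for _ in range(max_steps):
        nxt = update(w, state)
        if nxt == state:
            break
        state = nxt
    return state

# Two orthogonal +/-1 patterns (toy data chosen for illustration).
patterns = [
    [1, 1, 1, 1, -1, -1, -1, -1],
    [1, 1, -1, -1, 1, 1, -1, -1],
]
w = train_hebbian(patterns)

cue = list(patterns[0])
cue[0] = -cue[0]  # corrupt one element of the first pattern
recovered = retrieve(w, cue)
print(recovered == patterns[0])
```

Each stored pattern is a fixed point of the update, and a cue that differs from it in one element is pulled back to it. Sequential retrieval requires additional dynamics that move the state from one such fixed point to the next.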

2026-03-03T17:54:36Z Simone Betteti Giacomo Baggio Sandro Zampieri http://arxiv.org/abs/2601.12424v2 If Grid Cells are the Answer, What is the Question? A Review of Normative Grid Cell Theory 2026-03-05T03:43:22Z

For 20 years the beautiful structure in the grid cell code has presented an attractive puzzle: what computation do these representations subserve, and why does it manifest so curiously in neurons? The first question quickly attracted an answer: grid cells subserve path-integration, the ability to keep track of one's position as one moves about the world. Subsequent work has only solidified this link: bottom-up mechanistic models that perform path-integration match the measured neural responses, while experimental perturbations that selectively disrupt grid cell activity impair performance on path-integration dependent tasks. A more controversial area of work has been top-down normative modelling: why has the brain chosen to compute like this? Floods of ink have been spilt attempting to build a precise link between the population's objective and the measured implementation. The holy grail is a normative link with broad predictive power which generalises to other neural systems. We review this literature and argue that, despite some controversies, the literature largely agrees that grid cells can be explained as a (1) biologically plausible, (2) high-fidelity, non-linearly decodable code for position that (3) subserves path-integration. As a rare area of neuroscience with mature theoretical and experimental work, this story holds lessons for normative theories of neural computations, and on the risks and rewards of integrating task-optimised neural networks into such theorising.

2026-01-18T14:11:10Z 18 pages, 6 figures William Dorrell James C. R. Whittington http://arxiv.org/abs/2601.10482v3 Convex Efficient Coding 2026-03-05T03:34:54Z

Why do neurons encode information the way they do? Normative answers to this question model neural activity as the solution to an optimisation problem; for example, the celebrated efficient coding hypothesis frames neural activity as the optimal encoding of information under efficiency constraints. Successful normative theories have varied dramatically in complexity, from simple linear models (Atick & Redlich '90), to complex deep neural networks (Lindsay '21). What complex models gain in flexibility, they lose in tractability and often understandability. Here, we split the difference by constructing a set of tractable but flexible normative representational theories. Instead of optimising the neural activities directly, following Sengupta et al. '18, we optimise the representational similarity, a matrix formed from the dot products of each pair of neural responses. Using this, we show that a large family of interesting optimisation problems are convex. This family includes problems corresponding to linear and some non-linear neural networks, and problems from the literature not previously recognised as convex, such as modified versions of semi-nonnegative matrix factorisation or nonnegative sparse coding. We put these findings to work in three ways. First, we provide the first necessary and sufficient identifiability result for a form of semi-nonnegative matrix factorisation. Second, we show that if neural tunings are 'different enough' then they are uniquely linked to the optimal representational similarity, partially justifying the use of single neuron tuning analysis in neuroscience. Finally, we use the tractable nonlinearity of some of our problems to explain why dense retinal codes, but not sparse cortical codes, optimally split the coding of a single variable into ON & OFF channels. In sum, we identify a space of convex problems, and use them to derive neural coding results.
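
The representational similarity described above, a matrix of dot products between the population responses to each pair of stimuli, can be sketched in a few lines. The toy response matrix `Z` and the helper names are assumptions for illustration, and the paper's convex optimisation problems are not reproduced. The sketch only shows one well-known property of this matrix: it is unchanged when the neural activities are rotated, so a given similarity matrix corresponds to many different sets of single-neuron responses.

```python
import math

def gram(responses):
    """responses[i][s] is the activity of neuron i for stimulus s.

    Returns the stimulus-by-stimulus matrix of dot products between
    population response vectors.
    """
    n_stim = len(responses[0])
    return [[sum(row[a] * row[b] for row in responses) for b in range(n_stim)]
            for a in range(n_stim)]

def rotate_pair(responses, theta):
    """Rotate the activity of the first two neurons by angle theta.

    This is an orthogonal transform of the population activity.
    """
    c, s = math.cos(theta), math.sin(theta)
    r0 = [c * x - s * y for x, y in zip(responses[0], responses[1])]
    r1 = [s * x + c * y for x, y in zip(responses[0], responses[1])]
    return [r0, r1] + [list(r) for r in responses[2:]]

# Toy data: 2 neurons, 3 stimuli (illustrative values only).
Z = [[1.0, 0.0, 2.0],
     [0.5, 1.5, 0.0]]

K = gram(Z)
K_rot = gram(rotate_pair(Z, 0.7))
for row in K:
    print(row)
```

Up to floating-point error, `K_rot` equals `K` even though the individual tuning curves in the rotated population differ from those in `Z`. This is the sense in which optimising over the similarity matrix differs from optimising over activities directly.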

2026-01-15T15:05:28Z 37 pages, 4 figures Proceedings of the 14th International Conference on Learning Representations, 2026 William Dorrell Peter E. Latham James Whittington http://arxiv.org/abs/2603.04747v1 Neural geometry in the human hippocampus enables generalization across spatial position and gaze 2026-03-05T02:49:57Z

Hippocampal neurons track positions of self, others, and gaze direction. However, it is unclear how their respective neural codes differ enough to avoid confusion while allowing for abstraction. We recorded from populations of hippocampal neurons while participants performed a joystick-controlled virtual prey pursuit task involving multiple moving agents. We found that neurons have mixed selective responses that map positions of self, prey, and predator, as well as gaze. Their codes occupied mostly orthogonal subspaces, but these subspaces' geometric structure allowed them to be aligned by simple linear transformations. Moreover, their geometry supported generalization across spatial maps, such that a linear rule learned on one agent transfers to another. This scheme enables reliable individuation and abstraction across both agent identity and viewpoint. Together, these findings suggest that hippocampal spatial knowledge is structured as a family of geometrically related manifolds that can be flexibly aligned to different agents and gaze directions.

2026-03-05T02:49:57Z Assia Chericoni Chad Diao Xinyuan Yan Taha Ismail Elizabeth A. Mickiewicz Melissa Franch Ana G. Chavez Danika Paulo Eleonora Bartoli Nicole R. Provenza Seng Bum Michael Yoo Jay Hennig Joshua Jacobs Benjamin Y. Hayden Sameer A. Sheth