https://arxiv.org/api/VdbxCE4SVQfdQQKrbElg7UVjlJU
2026-03-18T10:11:45Z
117943015

http://arxiv.org/abs/2603.03190v2
Expectation and Acoustic Neural Network Representations Enhance Music Identification from Brain Activity
2026-03-12T15:53:50Z
During music listening, cortical activity encodes both acoustic and expectation-related information. Prior work has shown that ANN representations resemble cortical representations and can serve as supervisory signals for EEG recognition. Here we show that distinguishing acoustic and expectation-related ANN representations as teacher targets improves EEG-based music identification. Models pretrained to predict either representation outperform non-pretrained baselines, and combining them yields complementary gains that exceed strong seed ensembles formed by varying random initializations. These findings show that teacher representation type shapes downstream performance and that representation learning can be guided by neural encoding. This work points toward advances in predictive music cognition and neural decoding. Our expectation representation, computed directly from raw signals without manual labels, reflects predictive structure beyond onset or pitch, enabling investigation of multilayer predictive encoding across diverse stimuli.
Its scalability to large, diverse datasets further suggests potential for developing general-purpose EEG models grounded in cortical encoding principles.
2026-03-03T17:47:09Z
47 pages, 12 figures
Shogo Noguchi, Taketo Akama, Tai Nakamura, Shun Minamikawa, Natalia Polouliakh

http://arxiv.org/abs/2603.11663v1
Neural network-based encoding in free-viewing fMRI with gaze-aware models
2026-03-12T08:31:00Z
Representations learned by convolutional neural networks (CNNs) exhibit a remarkable resemblance to information processing patterns observed in the primate visual system on large neuroimaging datasets collected under diverse, naturalistic visual stimulation, but with instruction for participants to maintain central fixation. This viewing condition, however, diverges significantly from ecologically valid visual behaviour, suppresses activity in visually active regions, and imposes substantial cognitive load on the viewing task. We present a modification of the encoding model framework, adapting it for use with naturalistic vision datasets acquired under fully natural viewing conditions, without fixation, by incorporating eye-tracking data. Our gaze-aware encoding models were trained on the StudyForrest dataset, which features task-free naturalistic movie viewing. By combining eye-tracking data with the visual content of movie frames, we generate combined subject-wise gaze-stimulus specific feature time series. These time series are constructed by sampling only the locally and temporally relevant elements of the CNN feature map for each fixation. Our results demonstrate that gaze-aware encoding models match the performance of conventional encoding models with 112x fewer model parameters. Gaze-aware encoding models were especially beneficial for participants with more dynamic eye-movement patterns.
Therefore, this approach opens the door to more ecologically valid models that can be built in more naturalistic settings, such as playing games or navigating virtual environments.
2026-03-12T08:31:00Z
24 pages, 3 figures, 6 supplementary figures
Neurons, Behavior, Data analysis, and Theory, 2026
Dora Gozukara, Nasir Ahmad, Katja Seeliger, Djamari Oetringer, Linda Geerligs
10.51628/001c.158956

http://arxiv.org/abs/2601.07215v2
Neuronal Spike Trains as Functional-Analytic Distributions: Representation, Analysis, and Significance
2026-03-12T04:51:54Z
The action potential constitutes the digital component of the signaling dynamics of neurons. But the biophysical nature of the full time course of the action potential associated with changes in membrane potential is mathematically distinct from its representation as a discrete set of events that encode when action potentials are triggered in a collection of spike trains. In this paper, we develop from first principles a unified functional-analytic framework for neuronal spike trains, grounded in Schwartz distribution theory. We show how this representation provides an exact operational calculus for convolution, distributional differentiation, and distributional support, which enables closed-form analysis of spike train dynamics without discretization, rate approximation, or smoothing. We then analyze the framework in the context of a two-neuron reciprocal circuit with propagation latencies and refractoriness, deriving exact results for synaptic drive, spike timing sensitivity, and causal admissibility of inputs, quantities that are either ill-defined or require approximation in conventional treatments.
2026-01-12T05:14:51Z
27 pages, 1 figure. Peer-reviewed revision
Gabriel A. Silva

http://arxiv.org/abs/2603.11435v1
Miniaturized microscopes to study neural dynamics in freely-behaving animals
2026-03-12T01:59:59Z
Head-mounted miniaturized microscopes, commonly known as miniscopes, have undergone rapid development and seen widespread adoption over the past two decades, enabling the imaging of neural activity in freely-behaving animals such as rodents, songbirds, and non-human primates. These miniscopes facilitate numerous studies that are not feasible with head-fixed preparations. Recent advancements have enhanced their capabilities, allowing for faster imaging, larger fields of view, and deeper brain penetration. In this review, we examine the latest progress in one-photon and multi-photon miniscopes. We highlight the unique opportunities these devices present for neuroscience research, discuss the current technical challenges, and explore emerging technologies that promise to advance the development of miniscopes.
2026-03-12T01:59:59Z
33 pages, 4 figures, 2 tables
Weijian Zong, Weijian Yang

http://arxiv.org/abs/2603.11347v1
Human Navigation Behaviour and Brain Dynamics in Real-world Contexts
2026-03-11T22:28:36Z
The study of navigation behaviour and the associated brain dynamics has been a focus of increasing research over the last decades. Coinciding with this has been an increased focus on a more ecological understanding of cognition. Here we review recent research seeking to provide a more naturalistic, ecological understanding of human navigation behaviour and brain dynamics. Research in this area falls into four categories: testing navigation in real-world environments, analysis of data collected from tracking individuals during daily life, navigation in simulated or virtual environments mimicking the real-world, and mobile brain recording methods. Combining these different approaches to understand the neural basis of navigation shows excellent promise.
We conclude with future directions for this research area.
2026-03-11T22:28:36Z
14 pages
Pablo Fernandez Velasco, Antoine Coutrot, Hugo J. Spiers

http://arxiv.org/abs/2603.11248v1
The macaque IT cortex but not current artificial vision networks encode object position in perceptually aligned coordinates
2026-03-11T19:13:54Z
Efficient interaction with the visual world requires not only accurate object identification but also precise localization of objects in space. While spatial ("where") processing has traditionally been attributed to dorsal stream pathways, recent work has shown that object position can also be decoded from responses in ventral stream areas such as the inferior temporal (IT) cortex. However, because object position in these paradigms is tightly coupled to pixel-based location, it remains unclear whether ventral stream position signals reflect perceptually meaningful spatial representations or simply inherited retinotopic structure. To address this question, we used the motion aftereffect, a classic visual illusion that shifts perceived object position without changing retinal input. Combining large-scale intracortical recordings in macaque IT with matched human psychophysics, we found that motion adaptation induces systematic direction-opponent biases in IT population codes for object position that mirror human perceptual reports, despite identical pixel-level stimuli. These effects are accompanied by adaptation-driven changes in the geometry of IT population representations. We further tested whether artificial vision systems exhibit similar dynamics. Standard feedforward, recurrent, and state-of-the-art video-based neural networks accurately encode object position but fail to produce adaptation-induced position shifts. However, applying empirically derived transformations based on IT adaptation dynamics to model feature spaces is sufficient to generate similar biases.
Together, these results indicate that IT represents object position in perceptually aligned coordinates and also highlight a gap between biological and artificial vision systems in capturing history-dependent spatial coding.
2026-03-11T19:13:54Z
Elizaveta Yakubovskaya, Hamidreza Ramezanpour, Matteo Dunnhofer, Kohitij Kar

http://arxiv.org/abs/2603.11032v1
Uncovering statistical structure in large-scale neural activity with Restricted Boltzmann Machines
2026-03-11T17:55:45Z
Large-scale electrophysiological recordings now allow simultaneous monitoring of thousands of neurons across multiple brain regions, revealing structured variability in neural population activity. Understanding how these collective patterns emerge from microscopic neural interactions requires models that are scalable, predictive, and interpretable. Statistical physics provides principled frameworks to address this complexity, including maximum-entropy models that offer transparent descriptions of collective neural activity but remain largely limited to pairwise interactions and modest system sizes. Here, we use Restricted Boltzmann Machines (RBMs) to model the activity of $\sim1500$-$2000$ simultaneously recorded neurons from the Allen Institute Visual Behavior Neuropixels dataset, spanning multiple cortical and subcortical regions of the mouse brain. RBMs extend the maximum-entropy framework through latent variables, enabling the capture of higher-order dependencies while allowing explicit extraction of effective interaction networks. Recent advances in efficient Markov Chain sampling and training enable accurate learning of these models at this scale. RBMs reproduce the complex statistics of neural recordings with high accuracy. Generated samples match empirical pairwise and higher-order correlations, as well as global statistics such as the distribution of population activity.
The inferred parameters provide direct access to effective neuronal interactions, revealing coordination patterns in population activity. These couplings display clear anatomical structure: neurons within visual cortical areas show stronger interactions, consistent with visually driven behavior, while cross-area couplings are weaker. Despite being trained on temporally shuffled data, Markov Chain Monte Carlo simulations also reproduce the global relaxation dynamics of neural activity.
2026-03-11T17:55:45Z
First draft, comments are welcome
Nicolas Béreux, Giovanni Catania, Aurélien Decelle, Francesca Mignacco, Alfonso de Jesús Navas Gómez, Beatriz Seoane

http://arxiv.org/abs/2603.11000v1
Cross-Species Transfer Learning for Electrophysiology-to-Transcriptomics Mapping in Cortical GABAergic Interneurons
2026-03-11T17:23:54Z
Single-cell electrophysiological recordings provide a powerful window into neuronal functional diversity and offer an interpretable route for linking intrinsic physiology to transcriptomic identity. Here, we replicate and extend the electrophysiology-to-transcriptomics framework introduced by Gouwens et al. (2020) using publicly available Allen Institute Patch-seq datasets from both mouse and human cortex. We focus on GABAergic inhibitory interneurons to target a subclass structure (Lamp5, Pvalb, Sst, Vip) that is comparable and conserved across species. After quality control, we analyzed 3,699 mouse visual cortex neurons and 506 human neocortical neurons from neurosurgical resections. Using standardized electrophysiological features and sparse PCA, we reproduced the major class-level separations reported in the original mouse study. For supervised prediction, a class-balanced random forest provided a strong feature-engineered baseline in mouse data and a reduced but still informative baseline in human data.
We then developed an attention-based BiLSTM that operates directly on the structured IPFX feature-family representation, avoiding sPCA and providing feature-family-level interpretability via learned attention weights. Finally, we evaluated a cross-species transfer setting in which the sequence model is pretrained on mouse data and fine-tuned on human data for an aligned 4-class task, improving human macro-F1 relative to a human-only training baseline. Together, these results confirm reproducibility of the Gouwens pipeline in mouse data, demonstrate that sequence models can match feature-engineered baselines, and show that mouse-to-human transfer learning can provide measurable gains for human subclass prediction.
2026-03-11T17:23:54Z
Theo Schwider, Ramin Ramezani

http://arxiv.org/abs/2603.10956v1
Linear Readout of Neural Manifolds with Continuous Variables
2026-03-11T16:45:14Z
Brains and artificial neural networks compute with continuous variables such as object position or stimulus orientation. However, the complex variability in neural responses makes it difficult to link internal representational structure to task performance. We develop a statistical-mechanical theory of regression capacity that relates linear decoding efficiency of continuous variables to geometric properties of neural manifolds. Our theory handles complex neural variability and applies to real data, revealing increasing capacity for decoding object position and size along the monkey visual stream.
2026-03-11T16:45:14Z
Will Slatton, Chi-Ning Chou, SueYeon Chung

http://arxiv.org/abs/2509.14053v3
Trade-offs between structural richness and communication efficiency in music network representations
2026-03-11T09:09:20Z
Music is a structured and perceptually rich sequence of sounds in time, whose perception is shaped by the interplay of expectation and uncertainty about what comes next. Yet the uncertainty we infer from music depends on how the musical piece is encoded as an event sequence.
In this work, we use network representations, in which event types are nodes and observed transitions are directed edges, to compare how different feature encodings shape the transition structure we recover and what this implies for both the descriptive uncertainty and expectation under imperfect memory and noise. We systematically analyse eight encodings of piano music, from single-feature vocabularies to richer multi-feature combinations. These representational choices reorganize the state space and fundamentally reshape network topology, shifting how uncertainty is distributed across transitions. To connect these descriptive differences to perception, we adopt a perceptual-constraint model that captures imperfect access to transition statistics. Overall, compressed single-feature representations yield dense transition structures with higher entropy rates, corresponding to higher average uncertainty per step, yet low model error, indicating that the constrained estimate stays close to the corpus transitions. In contrast, richer multi-feature representations preserve finer distinctions but expand the state space, sharpen transition profiles, lower entropy rates, and increase model error. Finally, across representations, uncertainty concentrates in diffusion-central nodes while model error remains low there, suggesting an informational landscape in which predictable flow coexists with localized surprise. Overall, our results show that feature choice shapes not only the networks we reconstruct, but also whether their resulting uncertainty is a plausible proxy for the expectations listeners can realistically learn and use.
2025-09-17T14:55:54Z
Lluc Bono Rosselló, Robert Jankowski, Hugues Bersini, Marián Boguñá, M. Ángeles Serrano

http://arxiv.org/abs/2603.10489v1
JEDI: Jointly Embedded Inference of Neural Dynamics
2026-03-11T07:31:20Z
Animal brains flexibly and efficiently achieve many behavioral tasks with a single neural network.
A core goal in modern neuroscience is to map the mechanisms of the brain's flexibility onto the dynamics underlying neural populations. However, identifying task-specific dynamical rules from limited, noisy, and high-dimensional experimental neural recordings remains a major challenge, as experimental data often provide only partial access to brain states and dynamical mechanisms. While recurrent neural networks (RNNs) directly constrained by neural data have been effective in inferring underlying dynamical mechanisms, they are typically limited to single-task domains and struggle to generalize across behavioral conditions. Here, we introduce JEDI, a hierarchical model that captures neural dynamics across tasks and contexts by learning a shared embedding space over RNN weights. This model recapitulates individual samples of neural dynamics while scaling to arbitrarily large and complex datasets, uncovering shared structure across conditions in a single, unified model. Using simulated RNN datasets, we demonstrate that JEDI accurately learns robust, generalizable, condition-specific embeddings. By reverse-engineering the weights learned by JEDI, we show that it recovers ground truth fixed point structures and unveils key features of the underlying neural dynamics in the eigenspectra. Finally, we apply JEDI to motor cortex recordings during monkey reaching to extract mechanistic insight into the neural dynamics of motor control. Our work shows that joint learning of contextual embeddings and recurrent weights provides scalable and generalizable inference of brain dynamics from recordings alone.
2026-03-11T07:31:20Z
Anirudh Jamkhandi, Ali Korojy, Olivier Codol, Guillaume Lajoie, Matthew G. Perich

http://arxiv.org/abs/2603.03507v2
Solving adversarial examples requires solving exponential misalignment
2026-03-11T01:20:41Z
Adversarial attacks - input perturbations imperceptible to humans that fool neural networks - remain both a persistent failure mode in machine learning, and a phenomenon with mysterious origins. To shed light, we define and analyze a network's perceptual manifold (PM) for a class concept as the space of all inputs confidently assigned to that class by the network. We find, strikingly, that the dimensionalities of neural network PMs are orders of magnitude higher than those of natural human concepts. Since volume typically grows exponentially with dimension, this suggests exponential misalignment between machines and humans, with exponentially many inputs confidently assigned to concepts by machines but not humans. Furthermore, this provides a natural geometric hypothesis for the origin of adversarial examples: because a network's PM fills such a large region of input space, any input will be very close to any class concept's PM. Our hypothesis thus suggests that adversarial robustness cannot be attained without dimensional alignment of machine and human PMs, and therefore makes strong predictions: both robust accuracy and distance to any PM should be negatively correlated with the PM dimension. We confirmed these predictions across 18 different networks of varying robust accuracy. Crucially, we find even the most robust networks are still exponentially misaligned, and only the few PMs whose dimensionality approaches that of human concepts exhibit alignment to human perception.
Our results connect the fields of alignment and adversarial examples, and suggest the curse of high dimensionality of machine PMs is a major impediment to adversarial robustness.
2026-03-03T20:28:22Z
Alessandro Salvatore, Stanislav Fort, Surya Ganguli

http://arxiv.org/abs/2507.19218v3
Technological folie à deux: Feedback Loops Between AI Chatbots and Mental Illness
2026-03-10T23:31:38Z
Artificial intelligence chatbots have achieved unprecedented adoption, with millions now using these systems for emotional support and companionship in contexts of widespread social isolation and capacity-constrained mental health services. While some users report psychological benefits, concerning edge cases are emerging, including reports of suicide, violence, and delusional thinking linked to perceived emotional relationships with chatbots. To understand this new risk profile we need to consider the interaction between human cognitive and emotional biases, and chatbot behavioural tendencies such as agreeableness (sycophancy) and adaptability (in-context learning). We argue that individuals with mental health conditions face increased risks of chatbot-induced belief destabilization and dependence, owing to altered belief-updating, impaired reality-testing, and social isolation. Current AI safety measures are inadequate to address these interaction-based risks.
To address this emerging public health concern, we need coordinated action across clinical practice, AI development, and regulatory frameworks.
2025-07-25T12:38:54Z
Nature Mental Health (2026)
Sebastian Dohnány, Zeb Kurth-Nelson, Eleanor Spens, Lennart Luettgau, Alastair Reid, Iason Gabriel, Christopher Summerfield, Murray Shanahan, Matthew M Nour

http://arxiv.org/abs/2510.02182v2
Uncovering Semantic Selectivity of Latent Groups in Higher Visual Cortex with Mutual Information-Guided Diffusion
2026-03-10T22:26:01Z
Understanding how neural populations in higher visual areas encode object-centered visual information remains a central challenge in computational neuroscience. Prior works have investigated representational alignment between artificial neural networks and the visual cortex. Nevertheless, these findings are indirect and offer limited insight into the structure of neural populations themselves. Similarly, decoding-based methods have quantified semantic features from neural populations but have not uncovered their underlying organizations. This leaves open a scientific question: "how feature-specific visual information is distributed across neural populations in higher visual areas, and whether it is organized into structured, semantically meaningful subspaces." To tackle this problem, we present MIG-Vis, a method that leverages the generative power of diffusion models to visualize and validate the visual-semantic attributes encoded in neural latent subspaces. Our method first uses a variational autoencoder to infer a group-wise disentangled neural latent subspace from neural populations. Subsequently, we propose a mutual information (MI)-guided diffusion synthesis procedure to visualize the specific visual-semantic features encoded by each latent group. We validate MIG-Vis on multi-session neural spiking datasets from the inferior temporal (IT) cortex of two macaques.
The synthesized results demonstrate that our method identifies neural latent groups with clear semantic selectivity to diverse visual features, including object pose, inter-category transformations, and intra-class content. These findings provide direct, interpretable evidence of structured semantic representation in the higher visual cortex and advance our understanding of its encoding principles.
2025-10-02T16:33:40Z
Yule Wang, Joseph Yu, Chengrui Li, Weihan Li, Anqi Wu

http://arxiv.org/abs/2603.09765v1
Curvature Blindness from Polarity Breaks and Orientation Channel Fragmentation in V1
2026-03-10T15:05:39Z
We present a mathematical model of the curvature blindness illusion in which sinusoids appear as angular zigzags when drawn with alternating contrast polarity against a gray background. The model identifies two complementary mechanisms, both operating in V1. First, polarity channel separation: simple cells are selective for contrast polarity, and lateral connections link only same polarity neurons; where the line switches from darker than background to lighter than background at each peak and trough, the encoding population changes and the lateral chain is broken, segmenting the contour into half-wavelength pieces. Second, orientation channel fragmentation: at moderate contrast, the active orientation window is narrow, and within each half-wavelength segment no single orientation channel spans the full range of edge normals; the inflection point at the center of each segment anchors a locally straight percept. Together, the two mechanisms produce a zigzag: polarity breaks supply the corners, and fragmentation straightens the segments between them.
2026-03-10T15:05:39Z
12 pages, 2 figures
Michael Menke