https://arxiv.org/api/wJ98OS4PwH4OzKO6u9JPp8F2wMY 2026-06-21T21:22:18Z 12181 195 15 http://arxiv.org/abs/2605.00024v2 Self-organized criticality enables conscious integration through brain-body resonance 2026-05-21T09:04:21Z The "binding problem" of how distributed neural activity unifies into conscious experience has remained an open challenge since its articulation in 1890. We present evidence that conscious integration relies on self-organized criticality maintained by brain-body resonance, placing human cognition within the universality class of critical systems. Using 64-channel EEG data, we demonstrate that conventional preprocessing inadvertently eliminates the very integrative dynamics it seeks to measure. Removing physiological signals conventionally treated as "artifacts" drastically reduces the shared variance between global phase synchronization and stimulus-evoked amplitude, an effect highly specific to physiological components. We trace this to a fundamental brain-body resonance at 78 milliseconds that establishes zero-lag synchronization driven by robust bidirectional causality. Crucially, raw data exhibits heavy-tailed avalanche dynamics indicative of a near-critical regime, whereas conventionally cleaned data definitively rejects power-law distributions, signaling an artificial shift to subcriticality. Finally, we show these critical dynamics enable holographic information encoding, evidenced by a significant emergence of spatial interference patterns post-resonance. Together, these findings indicate that physiological signals actively and selectively support the coupling between large-scale neural coordination and event-related processing. 2026-04-21T22:49:29Z Ahmed Gamal Eldin http://arxiv.org/abs/2605.21356v1 A simple model of co-emergence of grid and place fields 2026-05-20T16:19:56Z Grid cells in the medial entorhinal cortex and place cells in the hippocampus together support spatial navigation. The two regions are reciprocally connected, and there is a chicken-and-egg problem for how both arise and reinforce each other during development. Current computational accounts either derive one type from the other or use network dynamics to model the emergence of one type in isolation. We introduce a unified recurrent network model that instantiates Dale's Law (every neuron is either excitatory or inhibitory), and is trained to predict the next sensory observation from masked previous sensory observations and egocentric motion. To our knowledge, this is the first single-objective model in which grid and place cells co-emerge without supervision of either type, or reliance on pre-existing spatial-cell representations. The two kinds of spatial codes coexist across 1,000 different training configurations, with their balance set by the amount of sensory noise and masking. Without retraining, the network qualitatively reproduces experimentally observed grid fragmentation in hairpin mazes, grid merging after wall removal, lattice alignment across connected rooms, locally ordered 3D fields observed in freely flying bats, as well as the developmental order in which place cells precede grid cells. We interpret these results in terms of two complementary encoding pressures within a single sensory-prediction objective: (1) correcting errors or reconstructing missing components of sensory observations, and (2) prediction of the next sensory state during navigation. Our results suggest a circuit-level account of the co-emergence of grid and place cells, and experimentally testable predictions for the two kinds of spatial codes. 2026-05-20T16:19:56Z Zhaoze Wang Genela Morris Dori Derdikman Pratik Chaudhari Vijay Balasubramanian http://arxiv.org/abs/2605.21324v1 Stimulus symmetries can confound representational similarity analyses 2026-05-20T15:51:21Z What can representational similarity matrices (RSMs) tell us about a neural code? As the popularity of these summary statistics grows, so too does the need for a more complete characterization of their properties. Here, we show that symmetries in network inputs can confound RSM-based analyses. Stimulus symmetries render many representations functionally equivalent, but these different configurations can lead to different RSMs. These different RSMs reflect qualitatively different representational geometries. We show that stochastic gradient descent or energetic regularization can generate sparse, drifting codes, leading in turn to drifting RSMs. Moreover, we demonstrate that these phenomena are present in networks trained to encode image data, where the symmetry is latent. Our results illustrate the challenges inherent in comparing nonlinear neural codes, when functionally-equivalent representations are not related by a simple rotation. 2026-05-20T15:51:21Z 40 pages Farhad Pashakhanloo Jacob A. Zavatone-Veth http://arxiv.org/abs/2512.10983v2 Compartmental-reaction diffusion framework for microscale dynamics of extracellular serotonin in brain tissue 2026-05-20T15:27:42Z Serotonin (5-hydroxytryptamine) is a major neurotransmitter whose release from densely distributed serotonergic varicosities shapes plasticity and network integration throughout the brain, yet its extracellular dynamics remain poorly understood due to the sub-micrometer and millisecond scales involved. We develop a mathematical framework that captures the coupled reaction-diffusion processes governing serotonin signaling in realistic tissue microenvironments. Formulating a two-dimensional compartmental-reaction diffusion system, we use strong localized perturbation theory to derive an asymptotically equivalent set of nonlinear integro-ODEs that preserve diffusive coupling while enabling efficient computation. We analyze period-averaged steady states, establish bounds using Jensen's inequality, obtain closed-form spike maxima and minima, and implement a fast marching-scheme solver based on sum-of-exponentials kernels. These mathematical results provide quantitative insight into how firing frequency, varicosity geometry, and uptake kinetics shape extracellular serotonin. The model reveals that varicosities form diffusively coupled microdomains capable of generating spatial "serotonin reservoirs," clarifies aspects of local versus volume transmission, and yields predictions relevant to interpreting high-resolution serotonin imaging and the actions of selective serotonin-reuptake inhibitors. 2025-12-04T20:49:56Z Merlin Pelz Skirmantas Janusonis Gregory Handy http://arxiv.org/abs/2604.01295v2 Parallelized Hierarchical Connectome: A Spatiotemporal Recurrent Framework for Spiking State-Space Models 2026-05-20T14:09:08Z This work presents the Parallelized Hierarchical Connectome (PHC), a general architectural framework that upgrades temporal-only State-Space Models (SSMs) into spatiotemporal recurrent networks. Conventional SSMs achieve parallel-scan training but are limited to temporal recurrence, lacking lateral or feedback interactions within a single timestep. PHC maps the diagonal SSM core to a shared Neuron Layer and inter-neuronal communication to a shared Synapse Layer of hierarchical regions, reconnected by a Multi-Transmission Loop iterating spatial recurrence within each temporal window, at parameter complexity Theta(D^2) versus Theta(D^2 L) of stacked SSMs. This spatiotemporal framework enables the seamless integration of neuro-physical priors typically intractable for standard SSMs, including adaptive LIF, synaptic delay, STP, Dale's Law with E/I-asymmetric topology, and STDP. The framework is instantiated as PHCSSM, the first spiking SSM that integrates all five biological priors and is evaluated on long-sequence data, achieving test accuracy competitive with state-of-the-art SSM baselines at 1,312 to 4,891 trainable parameters (1 to 4 orders of magnitude smaller than every baseline). PHCSSM further admits a sequential recurrent spiking neural network (RSNN) deployment mode that converges asymptotically to the parallel-scan training mode without artificial-neural-network-to-spiking-neural-network (ANN-to-SNN) conversion, with cross-backend reproducibility verified across four hardware backends (x86 CPU, H100 GPU, Cortex-A76, Cortex-M4F) including end-to-end deployment on the Cortex-M4F microcontroller (40 KB SRAM, 128 KB Flash). PHCSSM thereby bridges parallel-scan SSM and biologically grounded RSNN, two paradigms with previously incompatible training regimes, into a single architecture and trained weights. 2026-04-01T18:02:29Z 38 pages, 3 figures, 9 tables. Submitted to Neural Networks Po-Han Chiang http://arxiv.org/abs/2605.19070v2 Computational Auditory Periphery Models: the Return of the Rodent 2026-05-20T13:21:45Z Animal experiments have provided many insights on auditory function, notably in cases of sensorineural hearing loss (SNHL). However, it is not always clear how these findings translate to the human auditory system in clinically relevant contexts. Cross-species computational models of the auditory periphery can help bridge the gap between non-invasive human diagnostics and experimental evidence from animal studies. In this work we adapted a 1-D nonlinear cochlear transmission-line model designed for the human auditory periphery to mouse and gerbil, enabling a single computational framework for cross-species research on SNHL. Species-specific anatomical and physiological parameters - including basilar membrane (BM) length and width, stapes area, middle-ear transfer functions, and frequency range - were adjusted to match each species' auditory periphery and hearing range. Other cochlear parameters were calibrated to reproduce realistic cochlear tuning and compression. The adapted mouse and gerbil models were validated against experimental BM velocity level-growth characteristics, auditory-nerve (AN) tuning curves, and DPOAEs. Simulated AN outputs reasonably matched empirical measurements, including realistic AN thresholds and frequency selectivity. However, the discrepancy between simulations and measurements became larger for cochlear sections closer to the base or apex. Simulations of cochlear synaptopathy reproduced observed differences in recorded auditory brainstem and envelope following responses from mice and gerbils with SNHL. OHC individualization of the mouse model based on DPOAEs failed to faithfully reproduce individual measurements, although intergroup differences in OHC damage were captured. Our findings demonstrate that biophysically grounded auditory models can be translated across species while preserving realistic sound-coding properties and pathophysiological alterations. 2026-05-18T19:53:18Z Morgan Thienpont F. Deloche S. Keshishzadeh D. Kiselev J. Bourien J. -L. Puel B. N. Buran N. Bramhall S. Verhulst http://arxiv.org/abs/2605.20496v1 Platonic Representations in the Human Brain: Unsupervised Recovery of Universal Geometry 2026-05-19T21:04:15Z The Strong Platonic Representation Hypothesis suggests that representational convergence in artificial neural networks can be harnessed constructively: embeddings can be translated across models through a universal latent space without paired data. We ask whether an analogous geometry can be recovered across human brains. Using fMRI data from the Natural Scenes Dataset, we propose a self-supervised encoder that learns subject-specific embeddings from brain data alone by exploiting repeated stimulus presentations. We show that these independently learned spaces can be translated across subjects using unsupervised orthogonal rotations, without paired cross-subject samples or intermediate model representations. Synchronizing pairwise rotations into a single shared latent space further improves cross-subject retrieval, indicating that subject-specific spaces are mutually compatible with a common coordinate system. These results provide evidence for a shared neural geometry in the human visual cortex: subject-specific fMRI representations are approximately isometric across individuals and can be translated through purely geometric transformations. 2026-05-19T21:04:15Z Code available at https://github.com/memory-formation/platonic-representations-fmri Pablo Marcos-Manchón Rishi Jha Lluís Fuentemilla http://arxiv.org/abs/2506.08277v3 Task-conditioned probing of instruction-tuned multimodal LLMs: Region-specific brain alignment patterns under naturalistic stimuli 2026-05-19T18:07:46Z Recent voxel-wise multimodal brain encoding studies have shown that multimodal large language models (MLLMs) exhibit a higher degree of brain alignment compared to unimodal models. More recently, instruction-tuned multimodal (IT) models have been shown to generate task-specific representations that align strongly with brain activity, yet most prior evaluations focus on unimodal stimuli or non-instruction-tuned models under multimodal stimuli. We still lack a clear understanding of whether instruction-tuning is associated with IT-MLLMs organizing their representations around functional task demands or if they simply reflect surface semantics. To address this, we estimate brain alignment by predicting fMRI responses recorded during naturalistic movie watching (video with audio) from MLLM representations. Using instruction-specific embeddings from six video and two audio IT-MLLMs, across 13 video task instructions, we find that instruction-tuned video MLLMs show higher brain alignment than in-context learning (ICL) multimodal models (~9%), non-instruction-tuned multimodal models (~15%), and unimodal baselines (~20%). Our evaluation of MLLMs across video and audio tasks, and language-guided probing produces distinct task-specific MLLM representations that vary across brain regions. We also find that ICL models show strong semantic organization (r=0.78), while IT models show weak coupling to instruction-text semantics (r=0.14), consistent with task-conditioned subspaces associated with higher brain alignment. These findings are consistent with an association between task-specific instructions and stronger brain-MLLM alignment, and open new avenues for mapping joint information processing in both systems. We make the code publicly available [https://github.com/subbareddy248/mllm_videos]. 2025-06-09T22:48:36Z 57 pages, 39 figures Subba Reddy Oota Khushbu Pahwa Prachi Jindal Satya Sai Srinath Namburi Maneesh Singh Tanmoy Chakraborty Bapi S. Raju Manish Gupta http://arxiv.org/abs/2605.20127v1 Beyond Prediction Accuracy: Target-Space Recovery Profiles for Evaluating Model-Brain Alignment 2026-05-19T17:14:27Z Artificial vision models are often evaluated against the human visual cortex by measuring how accurately their internal representations predict brain responses. However, prediction accuracy alone does not indicate which dimensions of the target brain's response space are recovered. Here, we introduce a unified framework for evaluating both model-brain and brain-brain alignment by identifying the response dimensions recovered by prediction. Using repeated fMRI measurements, we first identify target-brain response dimensions that can be reproducibly predicted across independent trial splits. We then predict target-brain responses from either another subject's brain responses or a vision model's internal representations, and quantify how strongly each of these reproducible response dimensions is recovered. Applying this framework to a subset of the Natural Scenes Dataset, in which eight subjects viewed the same natural images during fMRI, we find that the early-to-intermediate visual-cortex responses contain a low-dimensional set of reproducible dimensions. Brain-to-brain comparisons identify which of these dimensions are consistently recoverable from other subjects' brains, providing a diagnostic human reference rather than only a scalar benchmark. In some cases, pretrained and randomly initialized models achieve similar prediction accuracy while showing distinct recovery profiles across these response dimensions. These results show that prediction accuracy alone can mask model-brain mismatches. By making explicit which reproducible brain response dimensions are recovered by prediction, our framework provides a more diagnostic evaluation of alignment between artificial vision models and the human visual cortex. 2026-05-19T17:14:27Z 34 pages, 12 figures, 5 tables Ken Nakamura Tomoya Nakai Ryuto Yashiro Ayumu Yamashita Kaoru Amano http://arxiv.org/abs/2601.10397v3 Reshaping Neural Representation via Associative, Presynaptic Short-Term Plasticity 2026-05-19T14:34:07Z Short-term synaptic plasticity (STP) is often regarded as a presynaptic filter of spikes, independent of postsynaptic activity. Recent experiments, however, indicate an associative STP that depends on pre- and postsynaptic coactivation. We develop a normative, information-theoretic theory of associative STP. Extending Fisher-information-based learning to Tsodyks-Markram synapses, we derive learning rules for baseline weight and release probability that maximize stimulus information under resource constraints. The rules split into a postsynaptic term tracking local firing and a presynaptic, phase-advanced term that selectively detects stimulus onset. For slowly varying inputs, this onset sensitivity favors anti-causal connectivity and enhances response offset during drive and reverse replay after drive removal in recurrent circuits. Linear-response analysis shows that STP yields frequency-dependent phase selectivity and that release-probability constraints tune temporal asymmetry. These results identify release-probability plasticity as a principled substrate for rapidly reconfigurable temporal coding. 2026-01-15T13:46:07Z Genki Shimizu Taro Toyoizumi http://arxiv.org/abs/2605.19816v1 Performance of low vision individuals when selecting a target with head-pointing in virtual reality 2026-05-19T13:11:35Z Purpose: To investigate psychophysically the ability of low vision individuals with central visual field loss (CFL) to perform a visually-guided pointing task in a virtual reality environment. Methods: Patients with CFL (n=25, ages = 67-90 years) and normally-sighted controls (n=26, ages = 67-85 years) had to select a target (2{\textdegree} diameter dot) with a head-contingent cursor (6{\textdegree} diameter reticle). Target selection occurred when target was validly pointed at for 1.5 seconds. Pointing was valid when target was inside an invisible pointer activation zone (PAZ) centered on reticle. Task difficulty was decreased by increasing PAZ diameter from 0.5{\textdegree} to 8{\textdegree}. Performance was assessed by measuring the time needed to select the target. The task was also performed with an array of three simultaneously-displayed cursors. Results: Selection times decreased (from 14.1 and 8.4 seconds for patients and controls respectively) with increasing PAZ diameter and reached a similar asymptote for both groups (1.4 seconds). The rate of this decrease was smaller for patients so that PAZ diameter needed for their best performance was much larger than PAZ diameter needed for controls' best performance (average: 3.48{\textdegree} vs 1.32{\textdegree}). In the three-reticle condition, both groups tended to use the cursor closer to the target. Conclusions: Patients with CFL are able to point at a 2{\textdegree} target thanks to head-pointing. Their performance can get close to controls' best performance by increasing PAZ size. Translational relevance: This research suggests guidelines to improve the accessibility of visually-guided pointing tools for human-machine interfaces designed for low vision individuals. 2026-05-19T13:11:35Z Camille Bordeau CRPN Célia Passerel CRPN Ambre Denis-Noël LPL Jean-Baptiste Melmi CRPN Marianne Vaugoyeau LNC, CRPN Carlos Aguilar UniCA, BIOVISION Iliana Huyet UniCA, BIOVISION Caroline Topart UniCA, BIOVISION François Devin UniCA, BIOVISION Frédéric Matonti UniCA, BIOVISION Pierre Kornprobst UniCA, BIOVISION Eric Castet CRPN http://arxiv.org/abs/2605.30372v1 Evolutionary Algorithm for Reservoir Learning and Yielding 2026-05-19T11:22:21Z Reservoir computing, a type of recurrent neural network, is a promising approach for temporal learning as it separates dynamic processing from the trained readout layer. However, classical Echo State Networks (ESNs) often require task-specific tuning of their architecture and hyperparameters to achieve good performance. This paper introduces EARLY (Evolutionary Algorithm for Reservoir Learning and Yielding), a framework designed to evolve both the topology and hyperparameters of multi-reservoir ESNs. Inspired by the modular organisation of the brain, EARLY encodes architectures as graph-based genomes and applies crossover, mutation, and selection to discover effective configurations. Our goal is to create both generic architectures and tasks inducing generalization. The method is evaluated on temporal learning tasks from the CogScale dataset. Results show that evolved architectures outperform those obtained with random search on several tasks and exhibit structural differences depending on task difficulty: simpler tasks yield lightweight architectures, while more complex tasks favour richer modular organisations. These findings suggest that evolutionary search can help identify reusable reservoir structures for a broader range of temporal problems. The evolved architectures are further evaluated on a cross-situational learning dataset to assess their ability to adapt to new environments. 2026-05-19T11:22:21Z GECCO '26 - The Genetic and Evolutionary Computation Conference, Jul 2026, San jos{é}, Costa Rica Julien Testu UB, Mnemosyne Pierrick Legrand ENSC, Bordeaux INP Xavier Hinaut Mnemosyne http://arxiv.org/abs/2605.19646v1 BCI-sift: An automated feature selection toolbox for Brain Computer Interface applications 2026-05-19T10:32:36Z Advancements in clinical Brain-Computer Interfaces (BCIs) depend on precise and reliable signal interpretation. However, the high-dimensional and noisy nature of data captured from both implanted and non-implanted BCIs poses significant challenges, motivating the use of feature selection algorithms. We introduce BCI-sift (BCI Systematic and Interpretable Feature Tuning), a Python-based toolbox designed to streamline the application of diverse optimization algorithms to BCI datasets for identifying the most relevant features in machine learning tasks. Our scikit-learn-compatible toolbox (github.com/UMCU-RIBS/BCI-sift) simplifies feature selection in BCI tasks by integrating advanced optimization methods. We validated the toolbox on high-density electrocorticography (HD ECoG) data from eight able-bodied participants with 64-128 electrodes implanted over the sensorimotor cortex, who repeatedly spoke 12 words. BCI-sift identified informative neural features across electrode, temporal, and frequency dimensions. The anatomical locations of electrode selections were consistent across participants and aligned with known functional organization of the sensorimotor cortex. Relevant time points clustered around speech production, and the high-frequency band was identified as most informative, in line with prior work. Feature selection improved classification accuracy compared to using all features. BCI-sift provides an accessible and versatile platform for feature selection in BCI research, enabling improved decoding performance, automated feature analysis, and enhanced interpretability. While validated on HD ECoG data, the approach is broadly applicable to other BCI modalities. By enhancing classification accuracy and interpretability, BCI-sift addresses key challenges in developing efficient and transparent BCI systems. 2026-05-19T10:32:36Z 19 pages, 12 figures Elena C Offenberg Dirk Keller Mariska J Vansteensel Zachary V Freudenburg Nick F Ramsey Julia Berezutskaya http://arxiv.org/abs/2512.00281v3 Beyond Size and Growth: Rethinking Lung Cancer Screening with AI Based Nodule Detection and Diagnosis 2026-05-19T08:50:57Z Early detection of malignant lung nodules remains constrained by size and growth based screening criteria, often delaying diagnosis. We present an integrated AI system that jointly performs nodule detection and malignancy assessment directly at the nodule level from low dose CT scans, within a unified CADe/CADx framework. Unlike conventional pipelines separating detection and diagnosis, our approach targets malignant nodules directly, redefining evaluation at the point where clinical decisions are made. To address limitations in dataset scale and explainability, the system consists of a Large Ensemble Model (LEM) combining ensembles of shallow deep learning and feature based models. It was trained and evaluated on 25,709 scans with 69,449 annotated nodules, with external validation on an independent cohort. It achieved an AUC of 0.98 internally and 0.945 externally, outperforming all growth based metrics, Lung RADS size based triage, European volume and VDT based screening criteria, radiologists, and leading AI models. The model maintains high sensitivity at low false positive rates, excels for small and early stage cancers, and enables malignancy assessment up to one year earlier than radiologists for indeterminate and slow growing nodules. This approach has the potential to streamline lung cancer screening workflows and support earlier, more actionable clinical decision making. 2025-11-29T02:17:32Z 25 pages, 8 figures, with supplementary information containing 11 figures Sylvain Bodard Pierre Baudot Benjamin Renoust Charles Voyton Gwendoline De Bie Ezequiel Geremia Van-Khoa Le Danny Francis Pierre-Henri Siot Yousra Haddou Vincent Bobin Jean-Christophe Brisset Carey C. Thomson Valerie Bourdes Benoit Huet http://arxiv.org/abs/2602.07570v2 How does longer temporal context enhance multimodal narrative video processing in the brain? 2026-05-19T05:04:40Z Understanding how humans and artificial intelligence systems process complex narrative videos is a fundamental challenge at the intersection of neuroscience and machine learning. This study investigates how the temporal context length of video clips (3--24 s clips) and the narrative-task prompting shape brain-model alignment during naturalistic movie watching. Using fMRI recordings from participants viewing full-length movies, we examine how brain regions sensitive to narrative context dynamically represent information over varying timescales and how these neural patterns align with model-derived features. We find that increasing clip duration substantially improves brain alignment for multimodal large language models (MLLMs), whereas unimodal video models show little to no gain. Further, shorter temporal windows align with perceptual and early language regions, while longer windows preferentially align higher-order integrative regions, mirrored by a layer-to-cortex hierarchy in MLLMs. Finally, experiments with four narrative-task prompts show that they elicit task-specific, region-dependent brain alignment patterns and context-dependent shifts in clip-level tuning in higher-order regions. Our work positions long-form narrative movies as a principled testbed for studying long-timescale temporal integration in long-context MLLMs and its relationship to cortical responses during narrative comprehension. 2026-02-07T14:34:00Z 22 pages, 15 figures Prachi Jindal Anant Khandelwal Manish Gupta Bapi S. Raju Subba Reddy Oota Tanmoy Chakraborty