https://arxiv.org/api/MQqHQ7bc5yNXaAH/gCrKWNTBfSQ2026-06-21T23:53:53Z1218122515http://arxiv.org/abs/2605.28854v1Large language models reorganize representational geometry during in-context learning2026-05-16T22:31:00ZLarge language models (LLMs) exhibit remarkable flexibility: they can adapt to novel tasks from in-context examples without any parameter updates, a capability known as in-context learning (ICL). Prior work on synthetic tasks has shown that ICL can implement specific algorithms, demonstrating architectural competence, and mechanistic analyses have identified key circuits that support this behavior. However, because in-context computation -- regardless of its algorithmic form -- relies on transformations in high-dimensional representation space, it remains unclear how the geometry of that space shapes ICL effectiveness. Motivated by the neuroscience view of classification as the untangling of neural representations, we hypothesize that ICL depends on the successful online untangling of task-relevant representations. To test this idea, we study how LLMs classify in-context examples whose labels are defined by the model's own internal representations with known structure. We show that ICL performance correlates systematically with the representational structure of the underlying classification task and that successful ICL is accompanied by geometric reorganization that increases online separability. We further find that LLM behavior is well described by a prototype-like algorithm that integrates evidence while reshaping representations to support classification. These findings offer a geometric account of ICL in pretrained LLMs, establish representational geometry as a mechanistic constraint on ICL, and quantify the gap between what pretrained representations afford and what in-context learning can exploit.2026-05-16T22:31:00ZHua-Dong XiongLi Ji-AnRobert C. WilsonKwonjoon LeeXue-Xin Weihttp://arxiv.org/abs/2406.14427v4Principles of frugal inference and control2026-05-16T20:46:29ZA central challenge for intelligent agents in an uncertain world is striking the right balance between utility maximization and resource use, not only for external movement but also for internal computation. Existing theories of control under uncertainty typically treat inference as cost-free, despite the substantial computational and energetic burden it imposes in both artificial and biological systems. To remedy this problem, we introduce a novel variant of the POMDP framework in which the information acquired through inference is treated as a resource that must be optimized alongside utility. Solving a local linear-Gaussian approximation of the resulting problem reveals three general principles of resource-efficient control. First, when information is costly, inference shifts from a Bayes-optimal (lossless) compression of the past to a lossy regime that strategically leaves some uncertainty unresolved to optimize resource use. Second, relaxing exact Bayesian inference creates a manifold of equivalent solutions, reflecting multiple ways to combine imperfect inference with compensatory control. This flexibility can be used to meet additional objectives or constraints without sacrificing performance on the original task. Third, beyond goal attainment, control can be leveraged to counteract estimation errors and steer the system into regimes where representation costs are lower. We empirically demonstrate that these principles generalize beyond the local linear-Gaussian approximation, enabling the solution of nonlinear control problems such as pole balancing and drone stabilization. Together, these results establish a framework for rational computation that extends existing approaches to information-constrained decision-making and offers normative insight into how brains and machines can achieve effective behavior under tight computational constraints.2024-06-20T15:50:38ZItzel Olivos-CastilloPaul SchraterXaq Pitkowhttp://arxiv.org/abs/2605.16938v1Effort as Ceiling, Not Dial: Reasoning Budget Does Not Modulate Cognitive Cost Alignment Between Humans and Large Reasoning Models2026-05-16T11:20:01ZLarge Reasoning Models (LRMs) generate chain-of-thought traces whose length tracks human reaction times across cognitive tasks, but recent debate questions whether this alignment reflects genuine computational structure or surface verbosity. We test whether the alignment varies with inference-time reasoning effort. Across GPT-OSS-20B and GPT-OSS-120B, three effort levels, and six reasoning tasks, within-task and cross-task alignment remain invariant: Bayes Factors lean toward the null, and mean alignment is numerically near-identical across conditions. A manipulation check reveals that the effort parameter sets an upper budget on generation rather than driving real-time allocation, suggesting that the allocation policy is crystallized at training time. Arithmetic complexity contrasts further show that token allocation tracks fine-grained, format-dependent human difficulty patterns, with model scale improving the match. Cognitive cost alignment between LRMs and humans appears to be a training-time achievement, robust to inference-time perturbations, supporting a compiled rather than online account of LRM problem-solving.2026-05-16T11:20:01Z8 pages, 6 figuresYueqing HuTianhong Wanghttp://arxiv.org/abs/2605.16761v1A Mathematical Characterization of Neural Activation Induced by Temporal Interference Stimulation2026-05-16T02:18:51ZTemporal Interference Stimulation (TIS) is a non-invasive neuromodulation technique in which two high-frequency sinusoidal currents with slightly different frequencies generate a low-frequency envelope that can activate deep neural structures. This study investigates the conditions under which TIS elicits action potentials in a single neuron modeled by the FitzHugh-Nagumo system. This research integrates phase-plane analysis and geometric singular perturbation to develop a mathematical framework for analyzing TIS. By combining a mathematical analysis of differential equations with computer simulations, the study elucidates how the amplitudes and beat frequency jointly determine whether the neuron remains quiescent, exhibits only transient responses, or undergoes persistent (tonic) firing.2026-05-16T02:18:51Z24 pages, 9 figuresEsteban PaduroAntoine ChailletMario Sigalottihttp://arxiv.org/abs/2508.21177v2Coherent dynamics in soft-threshold integrate-and-fire networks2026-05-15T20:45:39ZWe study bifurcations in networks of integrate-and-fire neurons with stochastic spike emission, focusing on the effects of the spatial and temporal structure of the synaptic interactions. Using a deterministic mean-field approximation of the population dynamics, we characterize spatial, temporal, and spatiotemporal patterns of macroscopic activity. In the mean-field theory, synaptic delays give rise to uniform oscillations across the population through a subcritical Hopf bifurcation of the stationary uniform equilibrium. With local excitation and long-range inhibition the network undergoes a Turing bifurcation, resulting in a localized area of sustained activity, or stationary bump. When the coupling has both delays, local inhibition, and long range excitation, the network undergoes a Turing-Hopf bifurcation leading to spatiotemporal dynamics, such as standing and traveling waves. When multiple instabilities are excited, we observe other complex spatiotemporal dynamics. We confirm all these predictions of the mean-field theory in simulations of the underlying stochastic model.2025-08-28T19:31:57Z29 pages, 8 figuresLauren ForbesJared GrossmanMontie AveryRyan GohGabriel Koch Ockerhttp://arxiv.org/abs/2605.16146v1The Complex Brain Hypothesis: Resolving the Entropy-Content Conundrum in Minimal Phenomenal Experience2026-05-15T16:26:46ZMinimal Phenomenal Experiences (MPEs) are states of consciousness in which wakefulness is preserved but phenomenal content is low or absent. The Entropic Brain Hypothesis (EBH) is a model of conscious processes that regards the entropy of spontaneous brain activity as a marker of 'phenomenal richness', exemplified by high-content psychedelic experiences (HCPEs). Yet recent human neuroimaging studies of MPEs induced by meditation -- and possibly 5-MeO-DMT -- suggest that these states, defined by their phenomenological simplicity, also show signs of increased neurophysiological entropy. This presents a conundrum for the EBH: brain entropy is elevated with increased and decreased richness of the phenomenal experience. Here, we put forward the Complex Brain Hypothesis (CBH), which proposes that the richness of experience differentiating MPEs from HCPEs is better indexed by complexity than by entropy. We argue that brain complexity is modulated by the grain of inference through which the brain resolves uncertainty: some HCPEs exemplify a fine-grained regime, in which loosened constraints amplify fluctuations into proliferating content, whereas some MPEs exemplify a coarse-grained regime, in which a simpler model dissolves variety into an experience of 'contentless' awareness. Both regimes can be associated with elevated brain entropy, but they diverge in phenomenology and perturbational signatures. By resolving the entropy-content conundrum, the CBH refines the EBH and highlights MPEs as an important test case for computational theories of consciousness.2026-05-15T16:26:46ZJonas MagoEdmundo Lopez-SolaJakub VohryzekMichael LifshitzRobin Carhart-HarrisKarl FristonShamil Chandariahttp://arxiv.org/abs/2412.12106v3The Syncytial Mesh Model: A Mesoscale Control-Field Framework for Scale-Dependent Coherence in the Brain2026-05-15T15:38:43ZThe Syncytial Mesh Model introduces a three-layered framework for large-scale brain dynamics integrating local neural circuitry, macrostructural connectivity, and a slow mesoscale control-field substrate associated with astrocytic syncytial organization. Rather than directly generating electrophysiological activity, the proposed syncytial layer modulates neuronal excitability, coherence structure, and metastable coordination across spatial scales.
The framework is formulated as a phenomenological effective theory combining neural-mass dynamics, connectome-scale coupling, and continuous-field interactions. Within this architecture, the model provides a candidate explanation for large-scale traveling-wave organization, low-frequency coherence structure, and distributed plasticity phenomena that are not straightforwardly reducible to direct local synaptic connectivity alone.
Numerical simulations of the effective field dynamics generate stable traveling-wave propagation, smooth phase-gradient organization, and low-frequency modal structure qualitatively resembling experimentally reported infra-slow and delta/theta coordination patterns. An analytic mesoscale coherence model further illustrates how scale-dependent synchronization probabilities may emerge from slow-field modulation and damping dynamics without requiring globally phase-locked neuronal oscillations.2024-11-29T10:52:48ZThis revised version clarifies the Syncytial Mesh Model as a phenomenological mesoscale control-field framework associated with astrocytic syncytial organization rather than a direct generator of electrophysiological activity. Empirical claims, references, and mathematical interpretations have been substantially refined. AI tools were used for language refinement and drafting supportAndreu Ballus Santacana10.1101/2024.11.22.624908http://arxiv.org/abs/2602.23410v3Brain-OF: An Omnifunctional Foundation Model for fMRI, EEG and MEG2026-05-15T11:39:26ZBrain foundation models have achieved remarkable advances across a wide range of neuroscience tasks. However, most existing models are limited to a single functional modality, restricting their ability to exploit complementary spatiotemporal dynamics and the collective data scale across different neuroimaging techniques. This limitation largely arises from severe semantic heterogeneity and resolution discrepancies among modalities. To address these challenges, we propose Brain-OF, an omnifunctional brain foundation model jointly pretrained on fMRI, EEG and MEG, capable of handling both unimodal and multimodal inputs within a unified framework. To reconcile heterogeneous spatiotemporal resolutions, we introduce the Any-Resolution Neural Signal Sampler, which projects diverse brain signals into a shared semantic space. To further manage semantic shifts, the Brain-OF backbone integrates DINT attention with a Sparse Mixture of Experts, where shared experts capture modality-invariant representations and routed experts specialize in modality-specific semantics. Furthermore, to explicitly internalize the characteristics of neural activity through self-supervised learning, we propose Masked Temporal-Frequency Modeling, a dual-domain pretraining objective that jointly reconstructs brain signals in both the time and frequency domains. Brain-OF is pretrained on a large-scale corpus comprising around 40 datasets and demonstrates superior performance across diverse downstream tasks, highlighting the benefits of joint multimodal integration and dual-domain pretraining.2026-02-26T15:47:13ZHanning GuoHanwen BiFarah AbdellatifAndrei GalbenusJon. N. ShahAbigail MorrisonJürgen Dammershttp://arxiv.org/abs/2605.16468v1Mechanistically Interpretable Neural Encoding Reveals Fine-Grained Functional Selectivity in Human Visual Cortex2026-05-15T11:28:10ZA central goal in understanding human vision is to uncover the visual features that drive neuronal activity. A growing body of work has used artificial neural networks as encoding models to predict cortical responses to natural images, revealing the visual content that activates category-selective regions. However, existing approaches are largely correlational and treat the encoder as a black box, leaving open which image features drive each voxel's response. We introduce Mechanistically Interpretable Neural Encoding (MINE), a framework that opens this black box by applying mechanistic-interpretability tools to localize the features within natural images that drive millimeter-scale (voxel-level) activity. MINE predicts each voxel's response using language-aligned image representations, and produces semantically interpretable descriptions of the features critical for the voxel's activation. We further generalize these per-image features into per-voxel functional profiles. To validate the per-image descriptions, we show they are sufficient to generate images that elicit voxel responses matching the responses to the original images, more accurately than images generated from random or low-attribution controls. Moreover, counterfactually inserting or removing the predicted features from images shifts activation in the expected direction, providing causal evidence. Counterfactual editing guided by the per-voxel activation profiles produces even stronger activation shifts, indicating that the profiles faithfully capture each voxel's selectivity. Finally, we apply MINE to well-studied category-selective brain regions, showing it recovers their known categorical preferences while revealing fine-grained unique voxel structure within each region. Overall, our results establish mechanistic interpretability as a path to discover and causally validate fine-grained hypotheses about neural function.2026-05-15T11:28:10Z40 pages, 28 figuresIdan Daniel GrosbardMor GevaGalit Yovelhttp://arxiv.org/abs/2605.15862v1From Observed Viability to Internal Predictive Approximation: A Single-Subject Latent-Space Analysis of Gait Dynamics Under Occlusal Constraint2026-05-15T11:23:44ZAdaptive biomechanical systems may show similar observable gait performance while differing in latent organization and longitudinal behavior. This study examines whether an observed longitudinal transformation of gait organization can be approximated within a predictive latent-space framework, without claiming clinical prediction or causal occlusal effects.
Using an exploratory single-subject design in a Parkinsonian participant, gait was recorded with instrumented insoles during two sessions separated by eleven weeks. Six occlusal observational probes were tested: natural occlusion, open-mouth disengagement, strong clenching, two vertical-dimension increases in centric relation, and one vertical-dimension increase with mandibular protrusion. Principal Component Analysis was used to construct a PC1--PC2 latent representation. A simplified supervised machine-learning model, implemented as a feed-forward neural network, was trained to approximate the observed M1--M2 transformation.
The primary analysis focused on the three centric-relation conditions and tested whether the displacement hierarchy could be reproduced. The model preserved the ordering OC3 < ONL < OC2.5. The extended six-probe analysis also preserved the global structure of the exploratory displacement pattern, with OC3 and OC3P closely grouped and the highest displacements associated with OC2.5 and open-mouth disengagement. Held-out M2 and leave-condition-out analyses showed condition-dependent approximation variability.
These findings do not establish generalizable prediction, therapeutic superiority, causal occlusal effects, or clinical viability forecasting. They support only the restricted conclusion that observed longitudinal latent transformations can be internally approximated within this single-subject dataset, providing a methodological bridge toward future multi-subject predictive viability models.2026-05-15T11:23:44Z31 pages, 1 figure, 9 tables. Exploratory single-subject study combining gait analysis, occlusal observational probes, PCA-based latent-space modeling, and supervised predictive approximationJacques RaynalPierre SlangenElsa RaynalJacques Margerithttp://arxiv.org/abs/2605.15801v1Beyond Flickering: Introducing Code-Modulated Motion Visual Evoked Potentials for Brain-Computer Interfacing2026-05-15T09:56:16ZA code-modulated motion visual evoked potential (c-MVEP) for brain-computer interfacing (BCI) is presented in this study. This paradigm uses pseudo-random sequences to visually stimulate objects using motion as an alternative to flickering. In an offline experiment of this study, EEG data were recorded and compared during sequential stimulation of a single object under four conditions: c-MVEP, code-modulated visual evoked potential (c-VEP), steady-state motion visual evoked potential (SSMVEP), and steady-state visual evoked potential (SSVEP). c-MVEP showed similar time-domain characteristics as c-VEP, and also in the frequency domain c-MVEP evoked a broadband response similar to c-VEP, with a comparable signal-to-noise ratio (SNR), albeit more focused in the lower frequency range. Both SSMVEP and SSVEP showed clear oscillatory responses at the stimulation frequency and harmonics, with a higher SNR for SSVEP than SSMVEP. The spatial distribution of c-MVEP showed the main activation at Oz and spread across multiple electrodes, whereas c-VEP showed less spreading and was more focused at Oz. Similar observations were made for SSMVEP and SSVEP. From subjective ratings, there was no clear preference for the motion-based stimulation of SSMVEP or c-MVEP over flicker-based stimulation of SSVEP or c-VEP. The online experiment of this study, evaluated a 4-class BCI with the same four conditions, testing the practical feasibility of the c-MVEP paradigm. The c-MVEP BCI reached a mean accuracy of 85.67% with an average selection time of 2.61s, which was significantly lower than c-VEP (97.81%; 1.15s) and SSVEP (93.42%; 1.94s), but significantly higher than SSMVEP (64.91%; 4.18s). Overall, this study shows the great potential of the newly proposed c-MVEP paradigm using motion stimulation for BCI applications, providing a valuable alternative to the c-VEP paradigm using flickering stimulation.2026-05-15T09:56:16ZHanneke ScheppinkRainer HerpersJordy ThielenIvan Volosyakhttp://arxiv.org/abs/2605.00026v3The $γ_c$-Peak: Covariant Recovery on Four Organic Qubit Platforms2026-05-15T09:55:11ZThe Petz recovery map (1986) provably reverses a noisy quantum channel on a reference state, but its algorithmic relevance to real, dissipation-dominated platforms has remained unclear. Using the open-source \texttt{organic-qc-bench} simulation package, we benchmark a Petz-style covariant-purification quantum error correction (CQEC) protocol across four engineered organic qubit platforms operated \emph{without any magnetic field}: a flavin-nitroxide radical-pair reservoir (P1); perchlorotriphenylmethyl radicals in a covalent organic framework (P2); the SVILC qubit [Wakaura2017] on $κ$-(BEDT-TTF)$_2$Cu[N(CN)$_2$]Br (P3, conditional on SVILC confirmation); and a Su-Schrieffer-Heeger soliton on \emph{trans}-polyacetylene (P4).
Across five quantum algorithms (QKAN, qDRIFT, control-free QPE, Shor-Regev, Bernstein-Vazirani) and two ML tasks, CQEC gains are significant ($p\!<\!10^{-5}$; Wilcoxon, Bonferroni $α\!=\!0.05/44$) for all sixteen path$\times$algorithm pairs. The central finding is the \emph{$γ_c$-peak}: the fidelity gain $ΔF$ is maximised \emph{at} the entanglement-breaking threshold $γ_c$, with $ΔF_{\rm max}\!=\!+0.303$ at $d\!=\!64$ and a linear $\log_2 d$ scaling over $d=2$-$64$ -- algorithmically confirming the prediction [Wakaura2026LQBH] that Petz recovery preserves coherence beyond this threshold. Bernstein-Vazirani also yields a $7.6$-$31\times$ provable quantum advantage at $n\!=\!3$-$5$, diarylethene-photoswitch CZ fidelities reach $F_{CZ}\!\ge\!0.987$ for P2-P4, and projected manufacturing costs are 10-40$\times$ lower with 10-200$\times$ less operating power than superconducting platforms. The $γ_c$-peak establishes Petz-style recovery as a practically relevant primitive at the dissipation-coherence boundary and identifies PTM-COF (P2) as the highest-priority experimental target.2026-04-22T09:12:12ZHikaru WakauraTaiki Tanimaehttp://arxiv.org/abs/2603.29617v2Convergent Representations of Linguistic Constructions in Human and Artificial Neural Systems2026-05-15T09:42:03ZUnderstanding how the brain processes linguistic constructions is a central challenge in cognitive neuroscience and linguistics. Recent computational studies show that artificial neural language models spontaneously develop differentiated representations of Argument Structure Constructions (ASCs), generating predictions about when and how construction-level information emerges during processing. The present study tests these predictions in human neural activity using electroencephalography (EEG). Ten native English speakers listened to 200 synthetically generated sentences across four construction types (transitive, ditransitive, caused-motion, resultative) while neural responses were recorded. Analyses using time-frequency methods, feature extraction, and machine learning classification revealed construction-specific neural signatures emerging primarily at sentence-final positions, where argument structure becomes fully disambiguated, and most prominently in the alpha band. Pairwise classification showed reliable differentiation, especially between ditransitive and resultative constructions, while other pairs overlapped. Crucially, the temporal emergence and similarity structure of these effects mirror patterns in recurrent and transformer-based language models, where constructional representations arise during integrative processing stages. These findings support the view that linguistic constructions are neurally encoded as distinct form-meaning mappings, in line with Construction Grammar, and suggest convergence between biological and artificial systems on similar representational solutions. More broadly, this convergence is consistent with the idea that learning systems discover stable regions within an underlying representational landscape - recently termed a Platonic representational space - that constrains the emergence of efficient linguistic abstractions.2026-03-31T11:37:50ZPegah RamezaniThomas KinfeAndreas MaierAchim SchillingPatrick Krausshttp://arxiv.org/abs/2512.09502v2Scalable Construction of Spiking Neural Networks using up to thousands of GPUs2026-05-15T08:11:02ZDiverse scientific and engineering research areas deal with discrete, time-stamped changes in large systems of interacting delay differential equations. Simulating such complex systems at scale on high-performance computing clusters demands efficient management of communication and memory. Inspired by the human cerebral cortex -- a sparsely connected network of $\mathcal{O}(10^{10})$ neurons, each forming $\mathcal{O}(10^{3})$--$\mathcal{O}(10^{4})$ synapses and communicating via short electrical pulses called spikes -- we study the simulation of large-scale spiking neural networks for computational neuroscience research. This work presents a novel network construction method for multi-GPU clusters and upcoming exascale supercomputers using the Message Passing Interface (MPI), where each process builds its local connectivity and prepares the data structures for efficient spike exchange across the cluster during state propagation. We demonstrate scaling performance of two cortical models using point-to-point and collective communication, respectively.2025-12-10T10:27:31ZNeuromorphic Computing and Engineering, Volume 6, Number 2, 024012, 2026Bruno GolosioGianmarco TiddiaJosé VillamarLuca PontissoLuca SergiFrancesco SimulaPooja BabuElena PastorelliAbigail MorrisonMarkus DiesmannAlessandro LonardoPier Stanislao PaolucciJohanna Senk10.1088/2634-4386/ae65d2http://arxiv.org/abs/2509.00555v3Integrated information and predictive processing theories of consciousness: An adversarial collaborative review2026-05-15T07:00:09ZAs neuroscientific theories of consciousness continue to proliferate, the need to assess their similarities and differences - as well as their predictive and explanatory power - becomes ever more pressing. Recently, a number of structured adversarial collaborations have been devised to test the competing predictions of several candidate theories of consciousness. In this review, we compare and contrast three theories being investigated in one such adversarial collaboration: Integrated Information Theory, Neurorepresentationalism, and Active Inference. We begin by presenting the core claims of each theory, before comparing them in terms of the phenomena they seek to explain, the sorts of explanations they avail, and the methodological strategies they endorse. We then consider some of the inherent challenges of theory-testing, and how adversarial collaboration addresses some of these difficulties. The stage is then set for the empirical work to come: first, we outline the key hypotheses to be tested across a series of multi-site experiments; second, we discuss the kinds of observations that would support or challenge each theory; third, we consider how these theories might assimilate or accommodate such observations. Finally, we show how data harvested across disparate experiments (and their replicates) may be formally integrated to provide a quantitative measure of the evidential support accrued under each theory. Besides orienting the reader to the theoretical foundations of our collaboration, this review aims to provide valuable meta-scientific insights into the mechanics of adversarial collaboration and theory-testing in general - including the way theories may be evaluated in terms of the scientific progress they deliver.2025-08-30T16:41:13ZNeuroscience & Biobehavioral Reviews, 187, 106742 (2026)Andrew W. CorcoranAndrew M. HaunReinder DormanGiulio TononiKarl J. FristonCyriel M. A. Pennartz TWCF :INTREPID Consortium10.1016/j.neubiorev.2026.106742