https://arxiv.org/api/ATAEFMp/U5LITkwz3tARfkhPhoU 2026-06-22T13:28:55Z 12181 405 15 http://arxiv.org/abs/2604.13574v1 From Brain Models to Executable Digital Twins: Execution Semantics and Neuro-Neuromorphic Systems 2026-04-15T07:34:46Z

Brain digital twins aim to provide faithful, individualized computational representations of brains as dynamical systems, enabling mechanistic understanding and supporting prediction of clinical interventions. Yet current approaches remain fragmented across data pipelines, model classes, temporal scales, and computing platforms, which prevents the preservation of execution semantics across the end-to-end workflow. This survey introduces physically constrained executability as a unifying perspective for comparing approaches at the level of execution: whether an execution state is persistent, which events are permitted to update it (simulation, measurement, actuation), and how strongly execution is temporally and causally coupled to neurobiological dynamics. Building on modeling and simulation theory, I propose a taxonomy of execution regimes ranging from isolated offline models to coordinated co-simulation, to continuously executing digital twins sustained by online data assimilation, and ultimately to neuro-neuromorphic physical systems in which biological and computational dynamics are co-executed under shared physical constraints. The executability concept clarifies why accuracy alone is insufficient, and motivates an agenda centered on semantic interoperability, hybrid-time correctness, evaluation protocols, scalable reproducible workflows, and safe closed-loop validation. This survey adopts a systems and runtime-oriented perspective, enabling comparison of heterogeneous approaches based on their execution semantics rather than on model form or application domain alone.

2026-04-15T07:34:46Z Alexandre Muzy ILLS http://arxiv.org/abs/2211.11346v2 Hierarchically Modular Dynamical Neural Network Relaxing in a Warped Space: Basic Model and its Characteristics 2026-04-15T07:17:28Z

We propose a hierarchically modular, dynamical neural network model whose architecture minimizes a specifically designed energy function and defines its temporal characteristics. The model has an internal and an external space that are connected with a layered internetwork that consists of a pair of forward and backward subnets composed of static neurons (with an instantaneous time-course). Dynamical neurons with large time constants in the internal space determine the overall time-course. The model offers a framework in which state variables in the network relax in a warped space, due to the cooperation between dynamic and static neurons. We assume that the system operates in either a learning or an association mode, depending on the presence or absence of feedback paths and input ports. In the learning mode, synaptic weights in the internetwork are modified by strong inputs corresponding to repetitive neuronal bursting, which represents sinusoidal or quasi-sinusoidal waves in the short-term average density of nerve impulses or in the membrane potential. A two-dimensional mapping relationship can be formed by employing signals with different frequencies based on the same mechanism as Lissajous curves. In the association mode, the speed of convergence to a goal point greatly varies with the mapping relationship of the previously trained internetwork, and owing to this property, the convergence trajectory in the two-dimensional model with the non-linear mapping internetwork cannot go straight but instead must curve. We further introduce a constrained association mode with a given target trajectory and elucidate that in the internal space, an output trajectory is generated, which is mapped from the external space according to the inverse of the mapping relationship of the forward subnet.
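
The Lissajous mechanism mentioned above can be illustrated with a minimal sketch: two sinusoids with different frequencies trace a two-dimensional curve. The function name `lissajous` and its parameters are hypothetical and chosen only for illustration; the abstract does not specify the signals or frequencies the model actually uses.

```python
import math

def lissajous(freq_x, freq_y, phase=0.0, n_points=200, duration=2.0 * math.pi):
    """Sample (x, y) = (sin(freq_x * t), sin(freq_y * t + phase)) on [0, duration].

    All parameter names are illustrative assumptions, not taken from the paper.
    """
    points = []
    for i in range(n_points):
        t = duration * i / (n_points - 1)
        points.append((math.sin(freq_x * t), math.sin(freq_y * t + phase)))
    return points

# A 1:2 frequency ratio traces a figure-eight-like curve in the plane.
curve = lissajous(1.0, 2.0)
```

Each sample pairs the two signals at the same instant, so the frequency ratio and the phase together determine which region of the plane the curve visits.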

2022-11-21T10:53:46Z 44 pages, 22 figures. v2: fixed typos and clarified phrasing Kazuyoshi Tsutsumi Ernst Niebur http://arxiv.org/abs/2501.02378v2 A ghost mechanism: An analytical model of abrupt learning in recurrent networks 2026-04-15T05:10:03Z

Abrupt learning is a common phenomenon in recurrent neural networks (RNNs) trained on working memory tasks. In such cases, the networks develop transient slow regions in state space that extend the effective timescales of computation. However, the mechanisms driving sudden performance improvements and their causal role remain unclear. To address this gap, we introduce the ghost mechanism, a process by which dynamical systems exhibit transient slowdown near the remnant of a saddle-node bifurcation. By reducing the high-dimensional dynamics near ghost points, we derive a one-dimensional canonical form that analytically captures learning as a process controlled by a single scale parameter. Using this model, we study a form of abrupt learning emerging from ghost points and identify a critical learning rate that scales as an inverse power law with the timescale of the learned computation. Beyond this rate, learning collapses through two interacting modes: (i) vanishing gradients and (ii) oscillatory gradients near minima. These features can lock the system into high-confidence but incorrect predictions when parameter updates trigger a no-learning zone, a region of parameter space where gradients vanish. We validate these predictions in low-rank RNNs, where ghost points precede abrupt transitions, and further demonstrate their generality in full-rank RNNs trained on canonical working memory tasks. Our theory offers two approaches to address these learning difficulties: increasing trainable ranks stabilizes learning trajectories, while reducing output confidence mitigates entrapment in no-learning zones. Overall, the ghost mechanism reveals how the computational demands of a task constrain the optimization landscape, demonstrating that well-known learning difficulties in RNNs partly arise from the dynamical systems they must learn to implement.
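
The transient slowdown near the remnant of a saddle-node bifurcation can be sketched numerically. The abstract does not give the paper's canonical form, so the example below assumes the textbook saddle-node normal form dx/dt = r + x^2 with a small r > 0 as a stand-in; the function names and integration bounds are hypothetical.

```python
import math

def passage_time(r, x_start=-10.0, x_end=10.0, dt=1e-3):
    """Forward-Euler time for dx/dt = r + x**2 to travel from x_start to x_end.

    Assumes r > 0, so no fixed point exists and only the slow "ghost" region
    near x = 0 remains.
    """
    x, t = x_start, 0.0
    while x < x_end:
        x += dt * (r + x * x)
        t += dt
    return t

def analytic_passage_time(r, x_start=-10.0, x_end=10.0):
    """Closed-form passage time obtained by integrating dx / (r + x**2)."""
    s = math.sqrt(r)
    return (math.atan(x_end / s) - math.atan(x_start / s)) / s

# The passage time grows roughly like pi / sqrt(r) as r approaches zero.
for r in (0.01, 0.0025):
    print(r, passage_time(r), analytic_passage_time(r), math.pi / math.sqrt(r))
```

In this normal form, quartering r approximately doubles the time spent near the ghost, which is the sense in which a slow region extends the effective timescale of the dynamics.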

2025-01-04T20:49:20Z to appear in Physical Review X Fatih Dinc Ege Cirakman Bariscan Kurtkaya Mert Yuksekgonul Yiqi Jiang Mark J. Schnitzer Hidenori Tanaka http://arxiv.org/abs/2603.12416v2 Formation of Artificial Neural Assemblies by Biologically Plausible Inhibition Mechanisms 2026-04-14T22:11:55Z

As proposed by Hebb's theory, neural assemblies are groups of excitatory neurons that fire synchronously and exhibit high synaptic density, representing external stimuli and supporting cognitive functions such as language and decision-making. Recently, a model called Assembly Calculus (AC) was proposed, enabling the formation of artificial neural assemblies through the $k$-winners-take-all selection process and Hebbian learning. Although the model is capable of forming assemblies according to Hebb's theory, the adopted selection process does not incorporate essential aspects of biological neural computation, such as neural activity that is often governed by statistical distributions consistent with power-law scaling. Given this limitation, the present work aimed to bring the model's dynamics closer to those observed in real cortical networks. To achieve this, a new selection mechanism inspired by the dynamics of gamma oscillation cycles, called E%-winners-take-all, was implemented, combined with an inhibition process based on the ratio between excitatory and inhibitory neurons observed in various regions of the cerebral cortex. The results obtained from our model (called E%-WTA model) were compared with those of the original model, and the analyses demonstrated that the introduced modifications allowed the network's own dynamics to determine the size of the formed assemblies. Furthermore, the recovery rate of these groups, through the evocation of the stimuli that generated them, became superior to that obtained in the original model.
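
For orientation, the baseline $k$-winners-take-all selection combined with a Hebbian update can be sketched as follows. This is a minimal illustration of the original selection step only, not of the E%-WTA mechanism, whose details the abstract does not give. The multiplicative update by a factor (1 + beta), the weight layout, and all function names are assumptions made for this example.

```python
def k_winners_take_all(inputs, k):
    """Return the indices of the k neurons with the largest input.

    Ties are broken in favor of the lower index (an arbitrary choice).
    """
    order = sorted(range(len(inputs)), key=lambda i: (-inputs[i], i))
    return sorted(order[:k])

def hebbian_update(weights, pre_active, post_winners, beta=0.1):
    """Strengthen synapses from active presynaptic neurons to winning neurons.

    weights[i][j] is the synapse from presynaptic neuron i to postsynaptic
    neuron j; each selected synapse is multiplied by (1 + beta). Both the
    layout and the multiplicative rule are assumptions of this sketch.
    """
    for j in post_winners:
        for i in pre_active:
            weights[i][j] *= 1.0 + beta
    return weights

inputs = [0.2, 0.9, 0.5, 0.7]
winners = k_winners_take_all(inputs, k=2)  # neurons 1 and 3 receive the most input
weights = [[1.0] * 4 for _ in range(3)]
hebbian_update(weights, pre_active=[0, 2], post_winners=winners)
```

Because k is fixed in advance, the assembly size is imposed from outside, which is the property the abstract says the modified selection mechanism relaxes.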

2026-03-12T19:57:19Z 9 pages, 4 figures Lucas Hoff Gustavo Soroka Matheus Guimarães Aline Villavicencio Marco Idiart http://arxiv.org/abs/2601.07215v3 Neuronal Spike Trains as Functional-Analytic Distributions: Representation, Analysis, and Significance 2026-04-14T20:45:23Z

The action potential constitutes the digital component of the signaling dynamics of neurons. But the biophysical nature of the full time course of the action potential associated with changes in membrane potential is mathematically distinct from its representation as a discrete set of events that encode when action potentials are triggered in a collection of spike trains. In this paper, we develop from first principles a unified functional-analytic framework for neuronal spike trains, grounded in Schwartz distribution theory. We show how this representation provides an exact operational calculus for convolution, distributional differentiation, and distributional support, which enables closed-form analysis of spike train dynamics without discretization, rate approximation, or smoothing. We then analyze the framework in the context of a two-neuron reciprocal circuit with propagation latencies and refractoriness, deriving exact results for synaptic drive, spike timing sensitivity, and causal admissibility of inputs, quantities that are either ill-defined or require approximation in conventional treatments.
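
One consequence of treating a spike train as a sum of Dirac deltas is that convolving it with a kernel reduces exactly to a sum of shifted kernel evaluations, with no time discretization. The sketch below illustrates that standard identity. The causal exponential kernel, the time constant, and the function names are assumptions chosen for illustration, not the paper's definitions.

```python
import math

def exp_kernel(u, tau=5.0):
    """Causal exponential kernel: exp(-u / tau) for u >= 0, zero otherwise."""
    return math.exp(-u / tau) if u >= 0.0 else 0.0

def synaptic_drive(spike_times, t, kernel=exp_kernel):
    """Evaluate (s * h)(t) for s(t) = sum_i delta(t - t_i).

    Convolution with a delta shifts the kernel, so the result is the exact sum
    of h(t - t_i) over all spikes, without binning or smoothing the train.
    """
    return sum(kernel(t - t_i) for t_i in spike_times)

spikes = [1.0, 3.0]
print(synaptic_drive(spikes, 3.0))  # contributions from both spikes
print(synaptic_drive(spikes, 0.5))  # no spike has occurred yet, so the drive is zero
```

The causal kernel makes the drive vanish before the first spike, a simple instance of the kind of causal constraint on inputs that the abstract refers to.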

2026-01-12T05:14:51Z Peer-reviewed accepted version in press to be published in Neural Computation. 27 pages, 1 figure Gabriel A. Silva http://arxiv.org/abs/2604.13281v1 Attention to task structure for cognitive flexibility 2026-04-14T20:17:39Z

Humans and artificial agents must often learn and switch between multiple tasks in dynamic environments. Success in such settings requires cognitive flexibility: the ability to retain prior knowledge (cognitive stability) while also transferring it to novel tasks (cognitive generalization). Cognitive flexibility research has largely focused on the role of model architecture to achieve these complementary goals. However, it is less well understood how the structure of the environment itself influences cognitive flexibility, and how it interacts with model architecture. To address this gap, we design a multi-task learning environment in which tasks are defined by a combination of two cue dimensions, allowing us to characterize the environment with graph-theory methods. We also introduce gating-based (multiplicative) and concatenation-based attention models that can decompose tasks into components and can sequentially allocate attention to them. We compare the attention-based models' performance in the multi-task learning environment to multilayer perceptrons. Generalization and stability are systematically evaluated across environments that vary in richness and task connectivity. We observe that richer environments improve both generalization and stability. In addition, a critical novel observation is that (graph theory based) connectivity between the tasks in the environment strongly modulates both stability and generalization, with especially pronounced benefits for attention-based models. These findings underscore the importance of considering not only cognitive architectures but also environmental structure and their interaction in shaping multi-task learning, generalization, and stability.
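
The difference between gating-based (multiplicative) and concatenation-based attention can be sketched in a few lines. The abstract does not describe the models' layers, so everything below (the sigmoid gate, the per-feature cue weights, and the function names) is a hypothetical simplification meant only to contrast the two ways a task cue can enter the computation.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gating_attention(features, cue_logits):
    """Multiplicative attention: each feature is scaled by a cue-dependent gate in (0, 1)."""
    return [f * sigmoid(c) for f, c in zip(features, cue_logits)]

def concatenation_attention(features, cue):
    """Concatenation attention: the cue is appended so later layers can condition on it."""
    return list(features) + list(cue)

features = [1.0, 2.0]
print(gating_attention(features, [0.0, 0.0]))         # every gate is 0.5
print(concatenation_attention(features, [1.0, 0.0]))  # features followed by the cue
```

Gating changes the feature values directly, whereas concatenation leaves them untouched and enlarges the input that downstream layers must learn to use.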

2026-04-14T20:17:39Z Xiaoyu K. Zhang Mehdi Senoussi Tom Verguts http://arxiv.org/abs/2412.07238v3 Speaker effects in language comprehension: An integrative model of language and speaker processing 2026-04-14T16:21:58Z

The identity of a speaker influences language comprehension through modulating perception and expectation. This review explores speaker effects and proposes an integrative model of language and speaker processing that integrates distinct mechanistic perspectives. We argue that speaker effects arise from the interplay between bottom-up perception-based processes, driven by acoustic-episodic memory, and top-down expectation-based processes, driven by a speaker model. We show that language and speaker processing are functionally integrated through multi-level probabilistic processing: prior beliefs about a speaker modulate language processing at the phonetic, lexical, and semantic levels, while the unfolding speech and message continuously update the speaker model, refining broad demographic priors into precise individualized representations. Within this framework, we distinguish between speaker-idiosyncrasy effects arising from familiarity with an individual and speaker-demographics effects arising from social group expectations. We discuss how speaker effects serve as indices for assessing language development and social cognition, and we encourage future research to extend these findings to the emerging domain of artificial intelligence (AI) speakers, as AI agents represent a new class of social interlocutors that are transforming the way we engage in communication.

2024-12-10T07:03:06Z Psychon Bull Rev 33, 138 (2026) Hanlin Wu Zhenguang G. Cai 10.3758/s13423-026-02896-6 http://arxiv.org/abs/2604.12825v1 The illusory simplicity of the feedforward pass: evidence for the dynamical nature of stimulus encoding along the primate ventral stream 2026-04-14T14:48:25Z

In studying primate vision, a large body of work focuses on the first feedforward sweep. During this initial time window, information is thought to pass through ventral stream regions in a stage-like fashion in an effort to extract high-level information from the retinal input. Consequently, electrophysiological analyses commonly focus on spatial response patterns, either by averaging data in time, or by applying decoders in a temporally local fashion. By analysing data recorded simultaneously across multiple arrays placed along the macaque ventral stream, we here show that this prior approach may be missing key aspects of information encoding. First, time-resolved, multivariate analyses of information transfer between V4 and IT reveal temporally and semantically varied information content as being exchanged within the first 100ms of processing. Second, by employing recurrent neural network (RNN) decoding techniques that extend across the temporal domain, we demonstrate that the neural pattern dynamics themselves carry categorical information far beyond the spatially encoded information available at any given time point. These findings challenge the prevailing view of a single, stage-like feedforward process and suggest that even the earliest parts of visual processing are better characterised as a spatiotemporally evolving process that encodes information in its dynamics rather than purely spatial response patterns.

2026-04-14T14:48:25Z Daniel Anthes Sushrut Thorat Anna Mitola Paolo Papale Peter König Tim C Kietzmann http://arxiv.org/abs/2604.12683v1 Brain-DiT: A Universal Multi-state fMRI Foundation Model with Metadata-Conditioned Pretraining 2026-04-14T12:52:42Z

Current fMRI foundation models primarily rely on a limited range of brain states and mismatched pretraining tasks, restricting their ability to learn generalized representations across diverse brain states. We present \textit{Brain-DiT}, a universal multi-state fMRI foundation model pretrained on 349,898 sessions from 24 datasets spanning resting, task, naturalistic, disease, and sleep states. Unlike prior fMRI foundation models that rely on masked reconstruction in the raw-signal space or a latent space, \textit{Brain-DiT} adopts metadata-conditioned diffusion pretraining with a Diffusion Transformer (DiT), enabling the model to learn multi-scale representations that capture both fine-grained functional structure and global semantics. Across extensive evaluations and ablations on 7 downstream tasks, we find consistent evidence that diffusion-based generative pretraining is a stronger proxy than reconstruction or alignment, with metadata-conditioned pretraining further improving downstream performance by disentangling intrinsic neural dynamics from population-level variability. We also observe that downstream tasks exhibit distinct preferences for representational scale: ADNI classification benefits more from global semantic representations, whereas age/sex prediction comparatively relies more on fine-grained local structure. Code and parameters of Brain-DiT are available at \href{https://github.com/REDMAO4869/Brain-DiT}{Link}.

2026-04-14T12:52:42Z Junfeng Xia Wenhao Ye Xuanye Pan Xinke Shen Mo Wang Quanying Liu http://arxiv.org/abs/2503.14333v4 Characterizing higher-order representations through generative diffusion models explains human decoded neurofeedback performance 2026-04-14T05:13:11Z

Brains construct not only "first-order" representations of the environment but also "higher-order" representations about those representations -- including higher-order uncertainty estimates that guide learning and adaptive behavior. Higher-order expectations about representational uncertainty -- i.e., learned through experience -- may play a key role in guiding behavior and learning, but their characterization remains empirically and theoretically challenging. Here, we introduce the Noise Estimation through Reinforcement-based Diffusion (NERD) model, a novel computational framework that trains denoising diffusion models via reinforcement learning to infer distributions of noise in functional MRI data from a decoded neurofeedback task, where healthy human participants learn to achieve target neural states. We hypothesize that participants accomplish this task by learning about and then minimizing their own representational uncertainty. We test this hypothesis with NERD, which mirrors brain-like unsupervised learning. Our results show that NERD outperforms backpropagation-trained control models in capturing human performance with explanatory power enhanced by clustering learned noise distributions. Importantly, our results also reveal individual differences in expected-uncertainty representations that predict task success, demonstrating NERD's utility as a powerful tool for probing higher-order neural representations.

2025-03-18T15:08:19Z 25 pages, 7 figures Hojjat Azimi Asrari Megan A. K. Peters http://arxiv.org/abs/2602.05971v2 Characterizing Human Semantic Navigation in Concept Production as Trajectories in Embedding Space 2026-04-14T00:49:27Z

Semantic representations can be framed as a structured, dynamic knowledge space through which humans navigate to retrieve and manipulate meaning. To investigate how humans traverse this geometry, we introduce a framework that represents concept production as navigation through embedding space. Using different transformer text embedding models, we construct participant-specific semantic trajectories based on cumulative embeddings and extract geometric and dynamical metrics, including distance to next, distance to centroid, entropy, velocity, and acceleration. These measures capture both scalar and directional aspects of semantic navigation, providing a computationally grounded view of semantic representation search as movement in a geometric space. We evaluate the framework on four datasets across different languages, spanning different property generation tasks: Neurodegenerative, Swear verbal fluency, Property listing task in Italian, and in German. Across these contexts, our approach distinguishes between clinical groups and concept types, offering a mathematical framework that requires minimal human intervention compared to typical labor-intensive linguistic pre-processing methods. Comparison with a non-cumulative approach reveals that cumulative embeddings work best for longer trajectories, whereas shorter ones may provide too little context, favoring the non-cumulative alternative. Critically, different embedding models yielded similar results, highlighting similarities between different learned representations despite different training pipelines. By framing semantic navigation as a structured trajectory through embedding space, our approach bridges cognitive modeling with learned representation, thereby establishing a pipeline for quantifying semantic representation dynamics with applications in clinical research, cross-linguistic analysis, and the assessment of artificial cognition.
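
A minimal sketch of how such trajectory metrics might be computed is given below. The abstract does not define the metrics precisely, so this example assumes that a cumulative embedding is the running mean of the item embeddings produced so far, that distances are Euclidean, and that velocity and acceleration are first and second differences along the trajectory. Entropy is omitted because its definition is not stated. All function names are hypothetical.

```python
import math

def cumulative_embeddings(vectors):
    """Running mean of the first t vectors for t = 1..n (an assumed definition)."""
    out, total = [], [0.0] * len(vectors[0])
    for t, v in enumerate(vectors, start=1):
        total = [a + b for a, b in zip(total, v)]
        out.append([a / t for a in total])
    return out

def differences(points):
    """Consecutive differences between points along the trajectory."""
    return [[b - a for a, b in zip(p, q)] for p, q in zip(points, points[1:])]

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def trajectory_metrics(points):
    """Geometric and dynamical summaries of one trajectory of embedding points."""
    centroid = [sum(col) / len(points) for col in zip(*points)]
    velocity = differences(points)
    acceleration = differences(velocity)
    return {
        "distance_to_next": [norm(v) for v in velocity],
        "distance_to_centroid": [
            norm([a - c for a, c in zip(p, centroid)]) for p in points
        ],
        "velocity": velocity,
        "acceleration_magnitude": [norm(a) for a in acceleration],
    }

# Toy 2-D "embeddings"; real text embeddings have hundreds of dimensions.
metrics = trajectory_metrics([[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]])
```

Averaging in the cumulative variant smooths the path, which may be one reason short sequences carry too little context for it, as the abstract reports.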

2026-02-05T18:23:04Z 10 pages, 6 figures (excluding refs/appendix). Accepted to ICLR 2026 International Conference on Learning Representations (ICLR) 2026 Felipe D. Toro-Hernández Jesuino Vieira Filho Rodrigo M. Cabral-Carvalho http://arxiv.org/abs/2604.15363v1 Machine learning approaches to uncover the neural mechanisms of motivated behaviour: from ADHD to individual differences in effort and reward sensitivity 2026-04-13T23:47:46Z

Motivated behaviour relies on the brain's capacity to evaluate effort and reward. Dysregulation within these processes contributes to a spectrum of conditions, from hyperactivity in attention-deficit/hyperactivity disorder (ADHD) to diminished goal-directed behaviour in apathy. This thesis investigates the neural mechanisms underlying ADHD using electroencephalography (EEG) and examines individual differences in effort and reward sensitivity using neuroimaging, applying machine learning approaches through three main studies. In Study 1, task-based and resting-state EEG were employed with machine learning models to classify adult individuals with ADHD and healthy controls. Machine learning classifiers trained on task-based EEG during a stop signal task outperformed those trained on resting-state EEG, with the strongest predictive features arising from gamma-band spectral power over fronto-central and parietal regions. In Study 2, diffusion MRI and whole-brain permutation-based analyses identified associations between white matter integrity and computationally modelled parameters reflecting effort and reward sensitivity, with SMA-connected tracts emerging as a central hub. In Study 3, grey matter volumes from structural T1-weighted MRI were used to examine correlates of effort sensitivity, reward sensitivity, and subclinical apathy, with machine learning confirming robust decoding of reward sensitivity and apathy levels. Across studies, fronto-parietal circuits emerged as central to effort valuation and reward processing. These findings may serve as neural biomarkers for improving diagnostic accuracy in ADHD and motivational impairments, and for guiding personalised neurotechnological interventions.

2026-04-13T23:47:46Z PhD thesis, Dublin City University, December 2025. 194 pages Nam Trinh http://arxiv.org/abs/2604.11482v1 Integrated information theory: the good, the bad and the misunderstood 2026-04-13T13:49:21Z

The integrated information theory of consciousness (IIT) is uniquely ambitious in proposing a mathematical formula, derived from apparently fundamental properties of conscious experience, to describe the quantity and quality of consciousness for any physical system that possesses it. IIT has generated considerable debate, which has engendered some misunderstandings and misrepresentations. Here we address and hope to remedy this. We begin by concisely summarising the essentials of IIT. Given IIT is supposed to apply universally, we do this with reference to an arbitrary patch of matter, as opposed to the usual system of discrete computational units. Then, after briefly summarising IIT's theoretical and empirical achievements, we focus on five points which we consider especially important for driving forward new theory and increasing understanding. First, a high value of the measure $Φ$ is not synonymous with `more consciousness'. We describe how $Φ$ might be replaced with a suite of quantities to obtain a multi-dimensional characterisation of states of consciousness. Second, we describe with nuance the distinct flavour of panpsychism implied by IIT -- whereby space (and time) are tiled with substrates of (proto-) consciousness -- and find this is not problematic for the theory. Third, $Φ$ is not well-defined for real physical systems, and has not been computed on any real physical system. Fourth, so far only proxies for IIT measures have been computed, and not approximations. Fifth, for IIT to fit with current successful theories in fundamental physics, a reformulation in terms of continuous fields would be needed.

2026-04-13T13:49:21Z 19 pages, 3 figures Adam B. Barrett Borjan Milinkovic Pedro A. M. Mediano Fernando E. Rosas Daniel Bor Lionel Barnett Anil K. Seth http://arxiv.org/abs/2604.22796v1 Relationship between the level of mental fatigue induced by a prolonged cognitive task and the degree of balance disturbance 2026-04-13T12:55:20Z

This study investigated the effects of mental fatigue (MF) induced by a 90-min AX-continuous performance test (AX-CPT) on balance control by addressing the issue of the heterogeneity of individuals' responses. Twenty healthy young active participants were recruited. They had to carry out two balance tasks (sway as little as possible on a stable support with the eyes open and closed) when standing on a force platform before and after performing a 90-min AX-CPT. The NASA-TLX test was used to assess the subjective manifestations of MF. Objective cognitive performance was measured using results from the AX-CPT. Inter-individual differences in behavioral deterioration due to MF were analyzed with a hierarchical cluster analysis, which categorizes participants' behaviors into subgroups with similar characteristics. The cluster analysis revealed that completing the AX-CPT induced various levels of MF and balance impairments within the whole sample. A significant relationship between the level of MF and the degree of balance disturbance was observed only when participants stood with the eyes open, thus suggesting that inter-individual differences in vulnerability to MF could stem from differences between subjects in the level of engagement of visual attention and/or from differences in field dependency for balance control. These findings show that the completion of the same prolonged demanding cognitive task induces a strong heterogeneity in subjects' responses, with marked individual differences in MF vulnerability that affect balance control differently according to the sensory context.

2026-04-13T12:55:20Z Experimental Brain Research, 2021, 239 (7), pp.2273-2283 Frédéric Noé MEPS Betty Hachard MEPS Hadrien Ceyte DevAH, ISM Noëlle Bru LMAP Thierry Paillard MEPS http://arxiv.org/abs/2604.08587v2 Covariant quantum error correction in a three-layer quantum brain model: computational analysis of layer-specific coherence dynamics 2026-04-13T11:27:25Z

Quantum brain proposals require coherence on behaviorally relevant timescales, yet the gap between spin coherence times and neural decision windows has remained a quantitative obstacle. We evaluate approximate covariant quantum error correction (CQEC) -- a purification protocol constrained by the Eastin-Knill theorem -- across two radical-pair proteins parameterized by \textit{ab initio} spin Hamiltonians: monoamine oxidase~A (MAO-A) and cryptochrome (CRY, PDB~4I6G). Both share a three-layer architecture (${}^{31}$P nuclear spin memory, electron spin interface, classical electrochemistry) and identical hyperfine coupling ($A = 200$~MHz), but differ 16-fold in nuclear $T_2$: 3.2~ms (MAO-A) versus 52~ms (CRY). We test whether CQEC preserves coherence over the 200~ms Schultze-Kraft veto window by mapping each protein's $T_2$ gap onto a simulation decoherence rate ($γ_\mathrm{veto} = T_2~\text{gap}/2T_\mathrm{sim}$): 3.08 for MAO-A, 0.19 for CRY. At $γ_\mathrm{veto} = 0.19$, CQEC maintains tunneling coherence of 0.83 (95\% CI [0.76, 0.79]; versus 0.12 without correction, $\times$6.9 improvement). At $γ_\mathrm{veto} = 3.08$, coherence collapses to 0.012 even with CQEC. A $T_2$ sensitivity analysis confirms robustness: at $T_2 = 26$~ms (half the CRY estimate), CQEC-protected coherence remains 0.69. A classical Markov baseline produces only monotonic relaxation, confirming that CQEC-maintained oscillatory dynamics are genuinely quantum. However, no single protein optimizes both layers: CRY's shorter $T_2^e$ (0.53~ns versus 1.1~ns) worsens Layer~2 fidelity. This layer-protein tradeoff, together with unresolved challenges in state preparation and entanglement distribution, defines the next targets for quantum brain research.
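
The two ratios quoted above can be reproduced directly from the stated values. The short check below uses only numbers that appear in the abstract; the variable names are illustrative.

```python
# Values quoted in the abstract.
t2_mao_a_ms = 3.2        # nuclear T2 for MAO-A
t2_cry_ms = 52.0         # nuclear T2 for CRY
coherence_cqec = 0.83    # tunneling coherence with CQEC at gamma_veto = 0.19
coherence_plain = 0.12   # tunneling coherence without correction

t2_ratio = t2_cry_ms / t2_mao_a_ms
improvement = coherence_cqec / coherence_plain

print(round(t2_ratio, 2))     # the "16-fold" difference in nuclear T2
print(round(improvement, 1))  # the quoted improvement factor
```

Both computed ratios agree with the rounded figures given in the text.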

2026-03-31T11:47:00Z Hikaru Wakaura