https://arxiv.org/api/kn41r8BkJkkoEDJzuuCNy+k8uF0 2026-03-24T21:42:00Z 11816 255 15 http://arxiv.org/abs/2601.02885v2 A Mathematical Formalization of Self-Determining Agency 2026-02-06T08:45:56Z Defining agency is an extremely important challenge for cognitive science and artificial intelligence. Physics generally describes mechanical happenings, but there remains an unbridgeable gap between these and the acts of agents. To discuss the morality and responsibility of agents, it is necessary to model acts; whether such responsible acts can be fully explained by physical determinism remains an ongoing debate. Although we have already proposed a physical agent determinism model that appears to go beyond mere mechanical happenings, we have not yet established a strict mathematical formalism to eliminate ambiguity. Here, we explain why a physical system can follow coarse-graining agent-level determination without violating physical laws by formulating supervenient causation. Generally, supervenience including coarse graining does not change without a change in its lower base; therefore, a single supervenience alone cannot define supervenient causation. We define supervenient causation as the causal efficacy from the supervenience level to its lower base level. Although an algebraic expression composed of the multiple supervenient functions does supervenes on the base, an index sequence that determines the algebraic expression does not supervene on the base. Therefore, the sequence can possess unique dynamical laws that are independent of the lower base level. This independent dynamics creates the possibility for temporally preceding changes at the supervenience level to cause changes at the lower base level. Such a dual-laws system is considered useful for modeling self-determining agents such as humans. 2026-01-06T10:13:02Z Yoshiyuki Ohmura Earnest Kota Carr Yasuo Kuniyoshi http://arxiv.org/abs/2602.05971v1 Characterizing Human Semantic Navigation in Concept Production as Trajectories in Embedding Space 2026-02-05T18:23:04Z Semantic representations can be framed as a structured, dynamic knowledge space through which humans navigate to retrieve and manipulate meaning. To investigate how humans traverse this geometry, we introduce a framework that represents concept production as navigation through embedding space. Using different transformer text embedding models, we construct participant-specific semantic trajectories based on cumulative embeddings and extract geometric and dynamical metrics, including distance to next, distance to centroid, entropy, velocity, and acceleration. These measures capture both scalar and directional aspects of semantic navigation, providing a computationally grounded view of semantic representation search as movement in a geometric space. We evaluate the framework on four datasets across different languages, spanning different property generation tasks: Neurodegenerative, Swear verbal fluency, Property listing task in Italian, and in German. Across these contexts, our approach distinguishes between clinical groups and concept types, offering a mathematical framework that requires minimal human intervention compared to typical labor-intensive linguistic pre-processing methods. Comparison with a non-cumulative approach reveals that cumulative embeddings work best for longer trajectories, whereas shorter ones may provide too little context, favoring the non-cumulative alternative. Critically, different embedding models yielded similar results, highlighting similarities between different learned representations despite different training pipelines. By framing semantic navigation as a structured trajectory through embedding space, bridging cognitive modeling with learned representation, thereby establishing a pipeline for quantifying semantic representation dynamics with applications in clinical research, cross-linguistic analysis, and the assessment of artificial cognition. 2026-02-05T18:23:04Z 10 pages, 6 figures (excluding refs/appendix). Accepted to ICLR 2026 Felipe D. Toro-Hernández Jesuino Vieira Filho Rodrigo M. Cabral-Carvalho http://arxiv.org/abs/2505.17329v3 Transformer brain encoders explain human high-level visual responses 2026-02-05T04:56:32Z A major goal of neuroscience is to understand brain computations during visual processing in naturalistic settings. A dominant approach is to use image-computable deep neural networks trained with different task objectives as a basis for linear encoding models. However, in addition to requiring estimation of a large number of linear encoding parameters, this approach ignores the structure of the feature maps both in the brain and the models. Recently proposed alternatives factor the linear mapping into separate sets of spatial and feature weights, thus finding static receptive fields for units, which is appropriate only for early visual areas. In this work, we employ the attention mechanism used in the transformer architecture to study how retinotopic visual features can be dynamically routed to category-selective areas in high-level visual processing. We show that this computational motif is significantly more powerful than alternative methods in predicting brain activity during natural scene viewing, across different feature basis models and modalities. We also show that this approach is inherently more interpretable as the attention-routing signals for different high-level categorical areas can be easily visualized for any input image. Given its high performance at predicting brain responses to novel images, the model deserves consideration as a candidate mechanistic model of how visual information from retinotopic maps is routed in the human brain based on the relevance of the input content to different category-selective regions. 2025-05-22T22:48:15Z Hossein Adeli Sun Minni Nikolaus Kriegeskorte http://arxiv.org/abs/2602.23382v1 Audited calibration under regime shift as a computational test of support-structured broadcast 2026-02-04T14:55:13Z A central prediction of the accompanying theoretical framework is that metacognitive calibration can vary even when content-level performance is held approximately fixed, depending on whether support structure is preserved in a globally reusable broadcast state. We provide a minimal computational test of this claim using a two-channel probabilistic cue-integration task with regime shifts that induce systematic miscalibration in one channel. We compare content-dominated architectures, in which confidence is calibrated by a single global mapping from evidence strength to probability, to an auditor architecture that learns a regime-conditioned calibration mapping from an audit trail of outcomes. We then couple confidence to control by implementing a policy that either acts immediately or requests one additional sample when confidence falls below a threshold. Across matched evidence streams, the auditor substantially improves calibration, particularly in the degraded regime, and produces qualitatively different control behavior by selectively requesting additional evidence under low-support conditions. These results demonstrate a concrete, testable dissociation between content performance and system-level confidence and policy that arises from globally reusable support summaries. 2026-02-04T14:55:13Z 12 pages, 1 table, 5 figures Mark Walsh http://arxiv.org/abs/2511.16465v5 Mesoscale tissue properties and electric fields in brain stimulation: Bridging the macroscopic and microscopic scales using layer-specific cortical conductivity 2026-02-04T13:31:47Z Accurate simulations of electric fields (E-fields) in neural stimulation depend on tissue conductivity representations that link underlying microscopic tissue structure with macroscopic assumptions. Mesoscale conductivity variations can produce meaningful changes in E-fields and neural activation thresholds but remain largely absent from standard macroscopic models. Conductivity variations within the cortex are expected given the differences in cell density and volume fraction across layers. We review recent efforts modeling microscopic and mesoscopic E-fields and outline approaches that bridge micro- and macroscales to derive consistent mesoscale conductivity distributions. Using simplified microscopic models, effective tissue conductivity was estimated as a function of volume fraction of extracellular space, and the conductivities of different cortical layers were interpolated based on experimental volume fraction. The effective tissue conductivities were monotonically decreasing convex functions of the cell volume fraction. With decreasing cell volume fraction, the conductivity of cortical layers increased with depth from layer 2 to 6. Although the variation of conductivity within the cortex was small when compared to the conductivity of extracellular fluid (9% to 15%), the conductivity difference was considerably larger when compared between layers, e.g., with layer 3 and 6 being 20% and 50% more conductive than layer 2, respectively. The review and analysis provide a foundation for accurate multiscale models of E-fields and neural stimulation. Using layer-specific conductivity values within the cortex could improve the accuracy of estimations of thresholds and distributions of neural activation in E-field models of brain stimulation. 2025-11-20T15:29:43Z 20 pages, 6 figures, 4 tables Boshuo Wang Torge H. Worbs Minhaj A. Hussain Aman S. Aberra Axel Thielscher Warren M. Grill Angel V. Peterchev http://arxiv.org/abs/2602.04512v1 BrainVista: Modeling Naturalistic Brain Dynamics as Multimodal Next-Token Prediction 2026-02-04T13:00:06Z Naturalistic fMRI characterizes the brain as a dynamic predictive engine driven by continuous sensory streams. However, modeling the causal forward evolution in realistic neural simulation is impeded by the timescale mismatch between multimodal inputs and the complex topology of cortical networks. To address these challenges, we introduce BrainVista, a multimodal autoregressive framework designed to model the causal evolution of brain states. BrainVista incorporates Network-wise Tokenizers to disentangle system-specific dynamics and a Spatial Mixer Head that captures inter-network information flow without compromising functional boundaries. Furthermore, we propose a novel Stimulus-to-Brain (S2B) masking mechanism to synchronize high-frequency sensory stimuli with hemodynamically filtered signals, enabling strict, history-only causal conditioning. We validate our framework on Algonauts 2025, CineBrain, and HAD, achieving state-of-the-art fMRI encoding performance. In long-horizon rollout settings, our model yields substantial improvements over baselines, increasing pattern correlation by 36.0\% and 33.3\% on relative to the strongest baseline Algonauts 2025 and CineBrain, respectively. 2026-02-04T13:00:06Z 17 pages, 7 figures, 11 tables Xuanhua Yin Runkai Zhao Lina Yao Weidong Cai http://arxiv.org/abs/2602.04492v1 Discovering Mechanistic Models of Neural Activity: System Identification in an in Silico Zebrafish 2026-02-04T12:33:29Z Constructing mechanistic models of neural circuits is a fundamental goal of neuroscience, yet verifying such models is limited by the lack of ground truth. To rigorously test model discovery, we establish an in silico testbed using neuromechanical simulations of a larval zebrafish as a transparent ground truth. We find that LLM-based tree search autonomously discovers predictive models that significantly outperform established forecasting baselines. Conditioning on sensory drive is necessary but not sufficient for faithful system identification, as models exploit statistical shortcuts. Structural priors prove essential for enabling robust out-of-distribution generalization and recovery of interpretable mechanistic models. Our insights provide guidance for modeling real-world neural recordings and offer a broader template for AI-driven scientific discovery. 2026-02-04T12:33:29Z Jan-Matthis Lueckmann Viren Jain Michał Januszewski http://arxiv.org/abs/2601.15313v2 Attention Is Not Retention: The Orthogonality Constraint in Infinite-Context Architectures 2026-02-04T07:21:37Z Biological memory solves a problem that eludes current AI: storing specific episodic facts without corrupting general semantic knowledge. Complementary Learning Systems theory explains this through two subsystems - a fast hippocampal system using sparse, pattern-separated representations for episodes, and a slow neocortical system using distributed representations for statistical regularities. Current AI systems lack this separation, attempting to serve both functions through neural weights alone. We identify the Orthogonality Constraint: reliable memory requires orthogonal keys, but semantic embeddings cannot be orthogonal because training clusters similar concepts together. The result is Semantic Interference (connecting to what cognitive psychologists have long observed in human memory), where neural systems writing facts into shared continuous parameters collapse to near-random accuracy within tens of semantically related facts. Through semantic density (rho), the mean pairwise cosine similarity, we show collapse occurs at N=5 facts (rho > 0.6) or N ~ 20-75 (moderate rho). We validate across modalities: 16,309 Wikipedia facts, scientific measurements (rho = 0.96, 0.02% accuracy at N=10,000), and image embeddings (rho = 0.82, 0.05% at N=2,000). This failure is geometric - no increase in model capacity can overcome interference when keys share semantic overlap. We propose Knowledge Objects (KOs): structured facts with hash-based identity, controlled vocabularies, and explicit version chains. On Wikipedia facts, KO retrieval achieves 45.7% where Modern Hopfield Networks collapse to near-zero; hash-based retrieval maintains 100%. Production systems (Claude Memory, ChatGPT Memory) store unstructured text, causing schema drift (40-70% consistency) and version ambiguity. Knowledge Objects provide the discrete hippocampal component that enables reliable bicameral memory. 2026-01-14T18:55:23Z 32 Pages, 7 Figures Oliver Zahn Matt Beton Simran Chana http://arxiv.org/abs/2602.04270v1 Multi-Integration of Labels across Categories for Component Identification (MILCCI) 2026-02-04T07:00:33Z Many fields collect large-scale temporal data through repeated measurements (trials), where each trial is labeled with a set of metadata variables spanning several categories. For example, a trial in a neuroscience study may be linked to a value from category (a): task difficulty, and category (b): animal choice. A critical challenge in time-series analysis is to understand how these labels are encoded within the multi-trial observations, and disentangle the distinct effect of each label entry across categories. Here, we present MILCCI, a novel data-driven method that i) identifies the interpretable components underlying the data, ii) captures cross-trial variability, and iii) integrates label information to understand each category's representation within the data. MILCCI extends a sparse per-trial decomposition that leverages label similarities within each category to enable subtle, label-driven cross-trial adjustments in component compositions and to distinguish the contribution of each category. MILCCI also learns each component's corresponding temporal trace, which evolves over time within each trial and varies flexibly across trials. We demonstrate MILCCI's performance through both synthetic and real-world examples, including voting patterns, online page view trends, and neuronal recordings. 2026-02-04T07:00:33Z Noga Mudrik Yuxi Chen Gal Mishne Adam S. Charles http://arxiv.org/abs/2602.04095v1 A computational account of dreaming: learning and memory consolidation 2026-02-04T00:09:26Z A number of studies have concluded that dreaming is mostly caused by randomly arriving internal signals because "dream contents are random impulses", and argued that dream sleep is unlikely to play an important part in our intellectual capacity. On the contrary, numerous functional studies have revealed that dream sleep does play an important role in our learning and other intellectual functions. Specifically, recent studies have suggested the importance of dream sleep in memory consolidation, following the findings of neural replaying of recent waking patterns in the hippocampus. The randomness has been the hurdle that divides dream theories into either functional or functionless. This study presents a cognitive and computational model of dream process. This model is simulated to perform the functions of learning and memory consolidation, which are two most popular dream functions that have been proposed. The simulations demonstrate that random signals may result in learning and memory consolidation. Thus, dreaming is proposed as a continuation of brain's waking activities that processes signals activated spontaneously and randomly from the hippocampus. The characteristics of the model are discussed and found in agreement with many characteristics concluded from various empirical studies. 2026-02-04T00:09:26Z 30 pages, 4 tables, 2 figures Cognitive System Research, 2009 Qi Zhang 10.1016/j.cogsys.2008.06.002 http://arxiv.org/abs/2602.03766v1 FOVI: A biologically-inspired foveated interface for deep vision models 2026-02-03T17:26:54Z Human vision is foveated, with variable resolution peaking at the center of a large field of view; this reflects an efficient trade-off for active sensing, allowing eye-movements to bring different parts of the world into focus with other parts of the world in context. In contrast, most computer vision systems encode the visual world at a uniform resolution, raising challenges for processing full-field high-resolution images efficiently. We propose a foveated vision interface (FOVI) based on the human retina and primary visual cortex, that reformats a variable-resolution retina-like sensor array into a uniformly dense, V1-like sensor manifold. Receptive fields are defined as k-nearest-neighborhoods (kNNs) on the sensor manifold, enabling kNN-convolution via a novel kernel mapping technique. We demonstrate two use cases: (1) an end-to-end kNN-convolutional architecture, and (2) a foveated adaptation of the foundational DINOv3 ViT model, leveraging low-rank adaptation (LoRA). These models provide competitive performance at a fraction of the computational cost of non-foveated baselines, opening pathways for efficient and scalable active sensing for high-resolution egocentric vision. Code and pre-trained models are available at https://github.com/nblauch/fovi and https://huggingface.co/fovi-pytorch. 2026-02-03T17:26:54Z Nicholas M. Blauch George A. Alvarez Talia Konkle http://arxiv.org/abs/2506.04536v4 NOBLE -- Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models 2026-02-03T15:24:18Z Characterizing the cellular properties of neurons is fundamental to understanding their function in the brain. In this quest, the generation of bio-realistic models is central towards integrating multimodal cellular data sets and establishing causal relationships. However, current modeling approaches remain constrained by the limited availability and intrinsic variability of experimental neuronal data. The deterministic formalism of bio-realistic models currently precludes accounting for the natural variability observed experimentally. While deep learning is becoming increasingly relevant in this space, it fails to capture the full biophysical complexity of neurons, their nonlinear voltage dynamics, and variability. To address these shortcomings, we introduce NOBLE, a neural operator framework that learns a mapping from a continuous frequency-modulated embedding of interpretable neuron features to the somatic voltage response induced by current injection. Trained on synthetic data generated from bio-realistic neuron models, NOBLE predicts distributions of neural dynamics accounting for the intrinsic experimental variability. Unlike conventional bio-realistic neuron models, interpolating within the embedding space offers models whose dynamics are consistent with experimentally observed responses. NOBLE enables the efficient generation of synthetic neurons that closely resemble experimental data and exhibit trial-to-trial variability, offering a $4200\times$ speedup over the numerical solver. NOBLE is the first scaled-up deep learning framework that validates its generalization with real experimental data. To this end, NOBLE captures fundamental neural properties in a unique and emergent manner that opens the door to a better understanding of cellular composition and computations, neuromorphic architectures, large-scale brain circuits, and general neuroAI applications. 2025-06-05T01:01:18Z Luca Ghafourpour Valentin Duruisseaux Bahareh Tolooshams Philip H. Wong Costas A. Anastassiou Anima Anandkumar http://arxiv.org/abs/2506.04289v2 Relational reasoning and inductive bias in transformers and large language models 2026-02-03T13:28:47Z Transformer-based models have demonstrated remarkable reasoning abilities, but the mechanisms underlying relational reasoning remain poorly understood. We investigate how transformers perform \textit{transitive inference}, a classic relational reasoning task which requires inference indirectly related items (e.g., if $A>B$ and $B>C$, then $A>C$), comparing in-weights learning (IWL) and in-context learning (ICL) strategies. We find that IWL naturally induces a generalization bias towards transitive inference despite training only on adjacent items, whereas ICL models develop induction circuits implementing match-and-copy strategies that fail to encode hierarchical relationships. However, when pre-trained on in-context linear regression tasks, transformers successfully exhibit in-context generalizable transitive inference, displaying both \textit{symbolic distance} and \textit{terminal item effects} characteristic of human and animal performance, without forming induction circuits. We extend these findings to large language models, demonstrating that prompting with linear geometric scaffolds improves transitive inference, while circular geometries (which violate transitivity by allowing wraparound) impair performance, particularly when models cannot rely on stored knowledge. Together, these results reveal that both the training regime and the geometric structure of induced representations critically determine transformers' capacity for transitive inference. 2025-06-04T10:15:05Z 15 pages, 10 figures Jesse Geerts Andrew Liu Stephanie Chan Claudia Clopath Kimberly Stachenfeld http://arxiv.org/abs/2602.03490v1 A Minimal Task Reveals Emergent Path Integration and Object-Location Binding in a Predictive Sequence Model 2026-02-03T13:08:27Z Adaptive cognition requires structured internal models representing objects and their relations. Predictive neural networks are often proposed to form such "world models", yet their underlying mechanisms remain unclear. One hypothesis is that action-conditioned sequential prediction suffices for learning such world models. In this work, we investigate this possibility in a minimal in-silico setting. Sequentially sampling tokens from 2D continuous token scenes, a recurrent neural network is trained to predict the upcoming token from current input and a saccade-like displacement. On novel scenes, prediction accuracy improves across the sequence, indicating in-context learning. Decoding analyses reveal path integration and dynamic binding of token identity to position. Interventional analyses show that new bindings can be learned late in sequence and that out-of-distribution bindings can be learned. Together, these results demonstrate how structured representations that rely on flexible binding emerge to support prediction, offering a mechanistic account of sequential world modeling relevant to cognitive science. 2026-02-03T13:08:27Z 7 pages, 4 figures Linda Ariel Ventura Victoria Bosch Tim C Kietzmann Sushrut Thorat http://arxiv.org/abs/2508.15784v3 Emergent time-keeping mechanisms in a deep reinforcement learning agent performing an interval timing task 2026-02-03T09:46:37Z Drawing parallels between Deep Artificial Neural Networks (DNNs) and biological systems can aid in understanding complex biological mechanisms that are difficult to disentangle. Temporal processing, an extensively researched topic, is one such example that lacks a coherent understanding of its underlying mechanisms. In this study, we investigate temporal processing in a Deep Reinforcement Learning (DRL) agent performing an interval timing task and explore potential biological counterparts to its emergent behavior. The agent was successfully trained to perform a duration production task, which involved marking successive occurrences of a target interval while viewing a video sequence. Analysis of the agent's internal states revealed oscillatory neural activations, a ubiquitous pattern in biological systems. Interestingly, the agent's actions were predominantly influenced by neurons exhibiting these oscillations with high amplitudes and frequencies corresponding to the target interval. Parallels are drawn between the agent's time-keeping strategy and the Striatal Beat Frequency (SBF) model, a biologically plausible model of interval timing. Furthermore, the agent maintained its oscillatory representations and task performance when tested on different video sequences (including a blank video). Thus, once learned, the agent internalized its time-keeping mechanism and showed minimal reliance on its environment to perform the timing task. A hypothesis about the resemblance between this emergent behavior and certain aspects of the evolution of biological processes like circadian rhythms, has been discussed. This study aims to contribute to recent research efforts of utilizing DNNs to understand biological systems, with a particular emphasis on temporal processing. 2025-08-06T13:56:41Z Accepted at 2025 Artificial Life Conference Amrapali Pednekar Alvaro Garrido Pieter Simoens Yara Khaluf