https://arxiv.org/api/nQDMSiyLxF01MAYykKLtMaQ3BnE2026-06-21T15:20:37Z1218112015http://arxiv.org/abs/2606.01661v1Feature leakage and the identifiability of direct-dependency entropy models of neural activity2026-06-01T04:15:49ZBiological neurons receive thousands of synaptic inputs on branching, electrically excitable dendrites, yet population activity is often modeled with direct input-output rules in which each input contributes independently to a scalar drive. We study what successful prediction by such models does, and does not, reveal about neural computation. For conditional maximum-entropy models that match output rates and pairwise output-input coactivities, the entropy explained by a direct model is a prediction measure under the sampled input distribution, not a mechanism-identification test. A restricted MaxEnt fit is an information projection: omitted interaction, temporal, or hidden-state terms can be absorbed into fitted first-order parameters whenever they are correlated with the included sufficient statistics. For sparse correlated binary inputs, this absorption has an explicit coskewness form. We introduce diagnostics that separate in-distribution prediction from recovery of the response rule: state reweighting that holds P(y|x) fixed while changing P(x), conditional log-odds contrasts for local additivity, and temporal leakage controls. In ground-truth simulations, purely higher-order responses can pass first-order entropy and raw coactivity tests under leakage-prone sampling, but are correctly classified after reweighting. Applied to selected, leakage-enriched local tables from CA1 hippocampal recordings, approximately half of tables that appear first-order under empirical weights become distribution-sensitive under balanced reweighting, far above a matched additive-surrogate null. Thus direct entropy-explained fractions and raw coactivity predictions should be interpreted as predictions under the observed state distribution, not as evidence that mechanisms outside the direct model are absent or small.2026-06-01T04:15:49ZHouman SafaaiBernardo L. Sabatinihttp://arxiv.org/abs/2606.01357v1Hypergraphs from multivariate connectivity: caCoh-based EEG/MEG representation2026-05-31T17:29:31ZHypergraphs provide a natural framework for representing neurophysiological interactions distributed across sets of sensors. A key methodological question is how hyperedges should be defined from frequency-resolved electroencephalography/magnetoencephalography (EEG/MEG) data. We demonstrate a construction strategy in which hyperedges are obtained from canonical coherence (caCoh), an extension of coherence that estimates coupling between multidimensional signal spaces. To our knowledge, this is the first work to construct hypergraphs directly from a multivariate connectivity measure specifically designed for frequency-resolved neurophysiological analysis. We propose two caCoh-based representations: a one-to-space hypergraph, where each external signal defines a hyperedge over the EEG/MEG sensor space, and a space-to-space hypergraph, where two multidimensional signal spaces are represented by a single hyperedge. We evaluate the approach in controlled simulations with known coupling frequencies and varying signal-to-noise ratio (SNR). Compared with graphs based on magnitude-squared coherence (MSC), caCoh-based hypergraphs showed statistically higher target-baseline contrasts at almost all SNR levels, indicating stronger recovery of coupling frequencies. They also recovered sensor-level spatial patterns associated with the simulated sources. In addition, one-to-space and space-to-space representations reduced 610 MSC edges per frequency to 10 and 1 hyperedges, respectively. These results establish multivariate spectral connectivity as a natural methodological basis for EEG/MEG hypergraphs.2026-05-31T17:29:31ZDaniil VlasenkoIrina SaranskaiaDenis Zakharovhttp://arxiv.org/abs/2606.01264v1A 1000-hour EEG-EMG-audio dataset of Japanese speech production2026-05-31T14:30:46ZWe present a multimodal dataset of 1020 hours of simultaneously recorded scalp electroencephalography (EEG), facial electromyography (EMG), and speech audio from three healthy native Japanese speakers during open-vocabulary overt speech. Recordings were acquired with three EEG systems-an ultra-high-density system (g.Pangolin) and two cap-type systems (g.SCARABEO and eegosports), spanning 62-128 channels-across many sessions over several months. Each session provides time-synchronized EEG, facial EMG, and audio, together with speech-event annotations and transcriptions. Although collected with speech decoding as a primary motivation, the dataset also supports work on multimodal signal processing, artifact modeling, longitudinal and cross-device adaptation, and EEG representation learning. Technical validation included power spectral density and event-related potential analyses across participants, devices, and tasks, which showed the expected 1/f spectral profile, task-related alpha-band attenuation, and time-locked evoked responses. The dataset is released in Brain Imaging Data Structure (BIDS) format via OpenNeuro under a CC0 waiver to support both speech-related and broader EEG research.2026-05-31T14:30:46ZMotoshige SatoIlya HoriguchiMasakazu InoueKenichi TomeokaEri HatakeyamaYuya KitaAtsushi YamamotoIppei FujisawaShuntaro Sasaihttp://arxiv.org/abs/2606.01227v1DAGGER: Gradient-Free Construction of Transiently Amplifying Networks under Hard Connectivity Constraints2026-05-31T13:20:26ZMany networks not only support but also rely on transient non-normal amplification, an orders-of-magnitude increase in the activity of an otherwise stable system. Constructing such networks under hard sign/sparsity/diagonal constraints -- the regime relevant for biological connectomes and structured RNN initializations -- has so far required either gradient-based local search with thousands of inner-loop eigendecompositions or Schur-form direct construction in an abstract basis that breaks the constraints under projection.
Here we introduce DAGGER (Directed Acyclic Graph Guided Edge Reweighting), a gradient-free single-pass algorithm. Given a stable signed sparse matrix, DAGGER produces an output with the same sign, sparsity, and diagonal. A single scalar $β$ controls a Wasserstein-2 budget that smoothly trades exact multiset preservation ($β= 0$) for amplification; peak amplification grows essentially without bound with $β$, empirically reaching $10^{10}$ before numerical overflow.
DAGGER matches or exceeds gradient-based methods at multiset preservation in a single forward pass -- 30-100$\times$ fewer eigendecompositions than a typical gradient inner loop -- and at moderate $β$ beats them by orders of magnitude with connectivity exactly preserved. We develop the algorithm, compare it to the existing methods and on a downstream signal-detection task, and examine the diagnostics that show why DAGGER is structurally different from other amplifying networks.2026-05-31T13:20:26Z12 pages, 7 figuresJames C. Fergusonhttp://arxiv.org/abs/2606.12449v1A quantum-like benchmark for context-sensitive associative memory with adaptive plasticity2026-05-30T19:25:20ZLearning and memory require a balance between plasticity and stability: synaptic connections must encode new information without collapsing, saturating, or erasing previously useful structure. Associative-memory models can appear to learn successfully when fixed background connectivity already carries part of the task, making it difficult to distinguish genuine recall dynamics from structural assistance. We test this issue using an order-sensitive adaptive-plasticity benchmark for staged associative recall. The benchmark compares a quantum-like associative-memory model with matched real-valued no-phase and Markov-rate controls under the same task schedule, perturbation profiles, weak-support conditions, and plasticity settings. Here, "quantum-like" refers to the modeling formalism, not to a biological claim about quantum computation. We first screen weak structural support and then fix a conservative operating point for factorial comparisons across model families and plasticity mechanisms. The useful weak-support regime is narrow and non-monotonic. Weak structure alone does not rescue recall in the no-plasticity ablation, whereas most useful recall gains arise from adaptive plasticity, especially homeostatic stabilization. The Markov-rate control often achieves stronger raw recall, but the quantum-like model more consistently preserves order sensitivity and stage-dependent organization. These results do not support a universal quantum-like advantage. Instead, they show that model classes are better distinguished by a multi-objective profile combining recall, temporal organization, and context sensitivity than by any single recall score. The benchmark therefore provides a controlled framework for studying context-sensitive memory dynamics under weak support, regulated plasticity, and matched classical comparison.2026-05-30T19:25:20ZYashine H. Goolam HossenLea GassabTravis J. A. Craddockhttp://arxiv.org/abs/2605.01430v2Measuring Understanding Through Discrete Compositional Knowledge Structures in Hierarchical Automata2026-05-30T19:21:50ZHow do we measure genuine understanding in artificial cognitive systems? Current approaches face a measurement gap: probabilistic systems refine confidence gradually, practice-based systems compile knowledge through repeated execution, and neural systems distribute understanding across opaque embedding spaces. We propose that making understanding measurable requires architectures where understanding formation produces discrete, inspectable structural signatures. This paper presents hierarchical automata built from finite state machines representing patterns and higher-order automata representing compositions. Constrained inference constructs automata from single observations. Similarity detection clusters related automata, making concept robustness quantifiable. Graph memory makes compositional knowledge directly inspectable. Metacognitive mechanisms enable observable reconfiguration. We demonstrate understanding measurement in a simple geometric domain. Graph evolution tracking reveals five measurable signatures: immediate representation formation, structural knowledge, generalization capacity, compositional awareness, and metacognitive access. These measurements distinguish structural understanding from statistical correlation. Our contribution is a framework for making understanding measurable through discrete compositional knowledge structures. This measurement capability complements perceptual learning in neural systems and task execution in neurosymbolic architectures.2026-05-02T13:02:34ZAGI 2026 ConferenceIgor Balazhttp://arxiv.org/abs/2606.04011v1Towards an Ideometrics-Based Understanding of Consciousness, Time, Space and Dreams2026-05-30T15:27:04ZFrom an ideometrics-based perspective, consciousness may reduce the informational entropy of many randomly possible future outcomes through ideometric processes. Consciousness enables a system to internally simulate alternative futures and then voluntarily act, based on ideometric processes, towards realising preferred states in external reality. This may explain why most humans gravitate towards futures that minimise threat and maximise survival, reproduction, safety and well-being. Ideometrics typically uses three fundamental criteria: attractiveness, feasibility and potential impact of many competing ideas. Feasibility and potential impact can, in principle, be computed by non-conscious systems, including artificial intelligence (AI). However, attractiveness may represent the consciously and emotionally experienced valuation of possible futures. Feasibility may have appeared first during evolution, while potential impact required predictive processing, and consciousness added subjective attractiveness to many alternative futures. Within this framework, subjective sense of time may be intertwined with consciousness, providing causal relating and internal ordering to external changes perceived by the senses. Time may require conscious beings to have a meaning, while consciousness may require the subjective sense of time to have a meaning. Space, in turn, provides the structured field in which ideas can acquire causal impact across nested scales. Dreaming may represent remnants of earlier evolutionary stages of internal modelling.2026-05-30T15:27:04Z38 pages, 2 tables, 95 referencesIgor Rudanhttp://arxiv.org/abs/2512.07842v2State and Parameter Estimation for a Neural Model of Local Field Potentials2026-05-30T12:29:00ZThe study of cortical dynamics during different states such as decision making, sleep and movement, is an important topic in Neuroscience. Modelling efforts aim to relate the neural rhythms present in cortical recordings to the underlying dynamics responsible for their emergence. We present an effort to characterize the neural activity from the cortex of a mouse during natural sleep, captured through local field potential measurements. Our approach relies on using a discretized Wilson--Cowan Amari neural field model for neural activity, along with a data assimilation method that allows the Bayesian joint estimation of the state and parameters. We demonstrate the feasibility of our approach on synthetic measurements before applying it to a dataset available in literature. Our findings suggest the potential of our approach to characterize the stimulus received by the cortex from other brain regions, while simultaneously inferring a state that aligns with the observed signal.2025-11-24T15:18:05ZDaniele AvitabileGabriel J. LordKhadija Meddounihttp://arxiv.org/abs/2606.00667v1Cortex and subcortex play distinct roles over learning when cortical memory is limited2026-05-30T10:48:32ZIt has been proposed that the brain integrates flexible, computationally expensive cortical processing with simpler, lower-cost subcortical mechanisms to achieve resource-efficient performance greater than that of either system alone. Despite the allure of this perspective, satisfying theoretical frameworks that explore this hypothesis are still limited. We extend existing frameworks in which a model-based module and model-free module learn in tandem by explicitly constraining the memory resources of the model-based module, and investigate the impact of this constraint in a simple decision-making setting. Memory constraints naturally give rise to strategies for allocating memory resources. We evaluate the performance of different strategies in different situations and demonstrate that when the rewarded states change often, it can be advantageous for the model-based module to focus its memory resources not on exploiting the current reward, but on capturing general structure of the environment. This work provides a theoretical foundation for a functional dissociation between cortical and subcortical systems during learning: the cortex supports general structure learning, while subcortical circuits specialize in reward-based learning. We further detail how these hypotheses can be tested on experimental data.2026-05-30T10:48:32ZPreprint. 19 pages, 4 figuresMatthew FarrellTaro Toyoizumihttp://arxiv.org/abs/2606.04010v1The Variance Brain Foundation Models Forgot: Third-Order Statistics Predict Cognition Where Billion-Parameter Models Fail2026-05-29T23:57:21ZBrain foundation models (BFMs) are self-supervised Transformers pretrained on fMRI data. We posit that these models should capture each subject's cognitive performance from their fMRI signal. Yet across three state-of-the-art BFMs and every readout we test, they predict cognition worse than a linear regression from the $\sim$80K parameters of the functional connectivity matrix (FC). The gap widens with scale: BrainLM's 650M model predicts cognition worse than its 111M. We attribute this to a \textbf{variance allocation problem}: BFM pretraining captures the variance components that dominate fMRI but not the higher-order structure that predicts cognition. Our per-cumulant analysis of the reconstructed signal shows that the second-order covariance is partially preserved, while the third-order co-skewness tensor is largely destroyed. To recover what BFMs lose, we design a linear pipeline that projects the fMRI signal into the subspace that best preserves its co-skewness and computes FC there. This \textbf{exceeds raw FC and every pretrained BFM} on every dataset and parcellation we test, outperforming prior state-of-the-art under controlled evaluation \textbf{with no pretraining and no GPU}. We \textbf{recover the raw-FC ceiling on BrainLM's forward pass} by finetuning with a loss targeted at this same subspace. This shows that the bottleneck is the pretraining objective, not the architecture or the model size.2026-05-29T23:57:21Z37 pages, 16 figures, 23 tablesGiovanni MarraffiniGabriel MahuasTrinidad BorrellVictoria ShevchenkoDemian Wassermannhttp://arxiv.org/abs/2606.00373v1Sequential chaotic oscillations in excitatory-inhibitory threshold-linear networks2026-05-29T21:32:06ZMetastable states, a phenomenon observed in brain dynamics and many other systems, have been proposed as a key feature of healthy brain function, reflecting a balance between integration and segregation. However, it remains unclear how to capture this behavior within a dynamical-systems framework. In this paper, we propose sequential chaotic oscillations (SCOs), arising in excitatory-inhibitory threshold-linear networks (E-I TLNs), as a candidate dynamical mechanism for sequential metastability. As a simple form of chaotic itinerancy, SCOs occur under constant input and consist of a sequence of metastable states whose transition order can be predicted by the underlying graph. To identify the parameter regime for SCOs, we develop new graph rules for E-I TLNs and use them to characterize the fixed point structure of E-I TLNs on paths and cycles. Our results show that the emergence of SCOs requires unstable singleton fixed points and sufficiently strong inhibition. In addition to SCOs, we find that E-I oscillations need not be synchronized. Motivated by this, we introduce a decomposition into the z-mode and the mean mode, which capture excitatory differences and overall network activity, respectively. These modes are then used to distinguish attractors associated with the full-support fixed point of E-I TLNs on cycles.2026-05-29T21:32:06Z37 pages, 12 figuresJie ZangCarina Curtohttp://arxiv.org/abs/2602.03766v2FOVI: A biologically-inspired foveated interface for deep vision models2026-05-29T21:29:57ZHuman vision is foveated, with variable resolution peaking at the center of a large field of view; this reflects an efficient trade-off for active sensing, allowing eye-movements to bring different parts of the world into focus with other parts of the world in context. In contrast, most computer vision systems encode the visual world at a uniform resolution, raising challenges for processing full-field high-resolution images efficiently. We propose a foveated vision interface (FOVI) based on the human retina and primary visual cortex (V1), that reformats a variable-resolution retina-like sensor array into a uniformly dense, V1-like sensor manifold. Receptive fields are defined as k-nearest-neighborhoods (kNNs) on the sensor manifold, enabling kNN-convolution via a novel kernel mapping technique. We demonstrate two use cases: (1) an end-to-end kNN-convolutional architecture, and (2) a foveated adaptation of the DINOv3 ViT foundation model, leveraging low-rank adaptation (LoRA). These models provide competitive performance with a fraction of the pixels and computational cost of full resolution non-foveated baselines, opening pathways for efficient and scalable active sensing for high-resolution egocentric vision. Code (https://github.com/nblauch/fovi) and pre-trained models (https://huggingface.co/fovi-pytorch) are available.2026-02-03T17:26:54ZICML 2026Nicholas M. BlauchGeorge A. AlvarezTalia Konklehttp://arxiv.org/abs/2606.00326v1On the synaptic matrix eigenvalues of sparsely connected neural networks2026-05-29T20:09:09ZThe spectral behaviour of the synaptic matrix, representing the neuronal connection strengths, is an important tool to analyze the stability and transient dynamics of a typical brain as well as its learning process and memory capacity. The complexity of the brain due to large number of neurons as well as underlying transient mechanisms e.g. homeostasis, seizure or synaptic plasticity can lead to networks with time-varying degree and type of sparsity. This renders an exact determination of the synaptic matrix not only technically difficult but also meaningless, leaving its statistical analysis as the best available theoretical approach. This motivates us to pursue a spectral analysis of the synaptic matrix models with different type of sparsity and thereby analyze latter's role on various aspects of network dynamics and stability. Our results have potential relevance for detemining the type of synaptic sparsity required to induce a specific brain function or desired transient mechanism e.g for pharmacological effects or physiological modulators.2026-05-29T20:09:09Z46 pages, 12 figuresMohd. Gayas AnsariPragya Shuklahttp://arxiv.org/abs/2603.03312v3Escaping the BLEU Trap: A Signal-Grounded Framework with Decoupled Semantic Guidance for EEG-to-Text Decoding2026-05-29T19:20:43ZDecoding natural language from non-invasive EEG signals is a promising yet challenging task. However, current state-of-the-art models remain constrained by three fundamental issues: Semantic Bias, where outputs collapse into generic linguistic templates; Signal Neglect, where models rely heavily on LLM priors to hallucinate fluent text even in the absence of meaningful signals; and the "BLEU Trap", where high-frequency stopwords inflate n-gram metrics, masking a lack of true semantic fidelity. To resolve these challenges, we move beyond conventional end-to-end pipelines and propose SemKey, a novel multi-stage framework that enforces signal-grounded generation through four decoupled semantic objectives: sentiment, topic, length, and surprisal. We extract these semantic anchors from EEG embeddings directly, then unify them with an Active Retrieval Decoding mechanism, compelling the LLM to ground its token generation in the neural signals rather than defaulting to linguistic priors. Furthermore, we break the BLEU Trap by establishing a comprehensive evaluation protocol using rigorous retrieval and distribution-based metrics such as Fréchet Distance. Extensive experiments demonstrate that SemKey effectively mitigates hallucinations on noise inputs and achieves SOTA performance on these robust protocols. Code will be released upon acceptance at https://github.com/xmed-lab/SemKey.2026-02-09T02:47:07ZYuchen WangHaonan WangYu GuoHonglong YangXiaomeng Lihttp://arxiv.org/abs/2606.00243v1Dynamics and Representation Structure of Local Approximations to Gradient-Based Learning in Linear Recurrent Neural Networks2026-05-29T18:19:45ZBiological and neuromorphic recurrent neural networks (RNNs) are subject to spatial and temporal locality constraints on the information that can plausibly be used during learning. A common strategy to satisfy these constraints is to modify gradient descent by neglecting non-local terms to varying degrees, as in random feedback local online (RFLO) learning and truncated backpropagation through time (tBPTT). However, the learning dynamics of these algorithms, and how they compare with BPTT, remain poorly understood. We apply dynamical systems theory to data-aligned linear RNNs -- whose dynamics can be separated into orthogonal modes -- to compare stationary solutions, stability properties, and convergence rates, finding qualitatively distinct behaviour for RFLO versus BPTT and one-step tBPTT. We further observe that the solutions learned by RFLO are restricted to low-rank perturbations of initial parameters, a result which holds beyond the data-aligned setting. Our work provides analytical insight into how locality constraints shape learning dynamics, with implications for neuroscientific models of learning and alternative optimization approaches for RNNs.2026-05-29T18:19:45Zaccepted to ICML 2026 as poster. Current version is camera-ready submissionEzekiel WilliamsAlexandre PayeurGuillaume Lajoie