http://arxiv.org/abs/2604.05215v1 Hierarchical Mesh Transformers with Topology-Guided Pretraining for Morphometric Analysis of Brain Structures 2026-04-06T22:27:36Z

Representation learning on large-scale unstructured volumetric and surface meshes poses significant challenges in neuroimaging, especially when models must incorporate diverse vertex-level morphometric descriptors, such as cortical thickness, curvature, sulcal depth, and myelin content, which carry subtle disease-related signals. Current approaches either ignore these clinically informative features or support only a single mesh topology, restricting their use across imaging pipelines. We introduce a hierarchical transformer framework designed for heterogeneous mesh analysis that operates on spatially adaptive tree partitions constructed from simplicial complexes of arbitrary order. This design accommodates both volumetric and surface discretizations within a single architecture, enabling efficient multi-scale attention without topology-specific modifications. A feature projection module maps variable-length per-vertex clinical descriptors into the spatial hierarchy, separating geometric structure from feature dimensionality and allowing seamless integration of different neuroimaging feature sets. Self-supervised pretraining via masked reconstruction of both coordinates and morphometric channels on large unlabeled cohorts yields a transferable encoder backbone applicable to diverse downstream tasks and mesh modalities. We validate our approach on Alzheimer's disease classification and amyloid burden prediction using volumetric brain meshes from ADNI, as well as focal cortical dysplasia detection on cortical surface meshes from the MELD dataset, achieving state-of-the-art results across all benchmarks.

2026-04-06T22:27:36Z Yujian Xiong Mohammad Farazi Yanxi Chen Wenhui Zhu Xuanzhao Dong Natasha Lepore Yi Su Raza Mushtaq Stephen Foldes Andrew Yang Yalin Wang http://arxiv.org/abs/2509.10547v2 Pursuit of biomarkers of brain diseases: Beyond cohort comparisons 2026-04-06T20:28:02Z

Despite the diversity and volume of brain data now acquired, and the advanced AI-based algorithms available to analyze them, brain features are rarely used in clinics for diagnosis and prognosis. Here we argue that the field continues to rely on cohort comparisons to seek biomarkers, despite the well-established degeneracy of brain features. Using a thought experiment (Brain Swap), we show that more data and more powerful algorithms will not be sufficient to identify biomarkers of brain diseases. We argue that instead of comparing patients with healthy controls using a single data type, we should use multimodal (e.g., brain activity, neurotransmitters, neuromodulators, brain imaging) and longitudinal brain data to guide the grouping before defining multidimensional biomarkers for brain diseases.

2025-09-08T11:58:09Z Pascal Helson Arvind Kumar 10.1038/s41746-026-02614-5 http://arxiv.org/abs/2604.16434v1 Support Sufficiency as Consequence-Sensitive Compression in Belief Arbitration 2026-04-06T18:28:45Z

When a system commits to a hypothesis, much of the evidential structure behind that commitment is lost to compression. Standard accounts assume that selected content and scalar confidence suffice for downstream control. This paper argues that they do not, and that determining what must survive compression is itself a consequence-sensitive problem. We develop a recurrent arbitration architecture in which active constraint fields jointly determine a hypothesis geometry over candidates. Rather than carrying that geometry forward in full, the system compresses it into a support-aware control state whose resolution is regulated by current consequence geometry, arbitration memory, and resource constraints. A bounded objective formalizes the tradeoff. Too little retained support collapses policy-relevant distinctions, producing controllers that select content adequately while misrouting verification, abstention, and recovery. Too much retained support fragments learning across overly fine contexts, degrading adaptation even as discrimination improves. These failure modes yield ordered controller predictions confirmed by a minimal repeated-interaction simulation. Adaptive controllers that regulate support resolution outperform all fixed-resolution controllers in cumulative utility. Agile adaptive control outperforms sluggish adaptive control. Fixed high-resolution control achieves the best commitment accuracy but still trails adaptive controllers because resource cost and learning fragmentation offset the gains from richer retention. Support sufficiency should be understood not as a static representational threshold, but as a dynamic compression criterion. Robust arbitration depends on preserving the smallest support structure adequate for policy under the current consequence landscape, and on regulating that structure as conditions change across repeated cycles of inference and action.

2026-04-06T18:28:45Z 27 pages, 3 figures, 1 table Mark Walsh http://arxiv.org/abs/2604.05042v1 Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization 2026-04-06T18:00:17Z

Recent advances at the intersection of control theory, neuroscience, and machine learning have revealed novel mechanisms by which dynamical systems perform computation. These advances encompass a wide range of conceptual, mathematical, and computational ideas, with applications for model learning and training, memory retrieval, data-driven control, and optimization. This tutorial focuses on neuro-inspired approaches to computation that aim to improve scalability, robustness, and energy efficiency across such tasks, bridging the gap between artificial and biological systems. Particular emphasis is placed on energy-based dynamical models that encode information through gradient flows and energy landscapes. We begin by reviewing classical formulations, such as continuous-time Hopfield networks and Boltzmann machines, and then extend the framework to modern developments. These include dense associative memory models for high-capacity storage, oscillator-based networks for large-scale optimization, and proximal-descent dynamics for composite and constrained reconstruction. The tutorial demonstrates how control-theoretic principles can guide the design of next-generation neurocomputing systems, steering the discussion beyond conventional feedforward and backpropagation-based approaches to artificial intelligence.
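
As a rough illustration of the energy-landscape idea in this abstract, the sketch below uses a discrete, binary Hopfield network rather than the continuous-time formulation the tutorial reviews; that substitution is a simplification made here, not something taken from the paper. The function names (`hebbian_weights`, `energy`, `recall`), the pattern count, and the network size are all hypothetical choices. It shows that asynchronous sign updates never increase the energy E = -1/2 s^T W s when W is symmetric with a zero diagonal.

```python
import numpy as np


def hebbian_weights(patterns):
    """Hebbian outer-product weights with the self-connections removed."""
    n_units = patterns.shape[1]
    w = patterns.T @ patterns / n_units
    np.fill_diagonal(w, 0.0)
    return w


def energy(w, state):
    """Hopfield energy E = -1/2 * s^T W s."""
    return -0.5 * state @ w @ state


def recall(w, state, sweeps=5):
    """Asynchronous sign updates; records the energy after every sweep."""
    state = state.copy()
    energies = [energy(w, state)]
    for _ in range(sweeps):
        for i in range(len(state)):
            # Align unit i with its local field; ties resolve to +1.
            state[i] = 1.0 if w[i] @ state >= 0 else -1.0
        energies.append(energy(w, state))
    return state, energies


# Illustrative setup (assumed values): 2 random patterns over 64 units.
rng = np.random.default_rng(0)
patterns = rng.choice([-1.0, 1.0], size=(2, 64))
w = hebbian_weights(patterns)

# Corrupt the first stored pattern by flipping 6 units, then let it relax.
probe = patterns[0].copy()
probe[:6] *= -1
recovered, energies = recall(w, probe)
print("energy per sweep:", [round(float(e), 3) for e in energies])
```

The recorded energies form a non-increasing sequence, which is the property that lets an energy landscape act as a memory: stored patterns sit at or near local minima, and the dynamics descend toward them.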

2026-04-06T18:00:17Z Arthur N. Montanari Francesco Bullo Dmitry Krotov Adilson E. Motter http://arxiv.org/abs/2604.04770v1 Regime Mapping of Oscillatory States in Balanced Spiking Networks with Multiple Time Scales 2026-04-06T15:43:16Z

Balanced spiking networks can transition between silent, asynchronous-irregular, and oscillatory states depending on interacting synaptic and temporal time scales, while their joint parameter structure remains incompletely characterized. In this work, we systematically map how postsynaptic decay (τs), conduction delay (d), and plasticity rate (λp) jointly shape oscillatory regimes in recurrent leaky integrate-and-fire networks. By combining Brian2 simulations across the (τs, d, λp) space with a coarse Hopf-reference boundary, we construct regime maps that directly visualize SIL-AI-OSC transitions and corresponding spectral prominence landscapes. The mapped results show that increasing λp expands oscillatory regions toward shorter τs and moderate-to-long delays, while prominence maps identify parameter regions with the strongest rhythmic coherence. Representative control experiments further connect this global landscape to local rhythm-forming mechanisms, showing that STDP freezing weakens rhythmic coherence whereas delay jitter enhances it with minimal change in mean firing rate. As a result, these findings provide a useful reference for operating-point selection, synchrony modulation studies, and future biologically grounded spiking-network modeling within similar balanced-network settings.

2026-04-06T15:43:16Z Tsung-Han Kuo Tzu-Chia Tung http://arxiv.org/abs/2410.08823v3 Gray Anchoring: a New Computational Theory for Biological Color Constancy 2026-04-06T06:45:43Z

It is still challenging for computer vision to imitate human color perception, e.g., color constancy, a fundamental perceptual ability that humans rely on to perceive, interpret, and interact with their surroundings. The anchoring theory provides valuable insights into human lightness perception, yet the specific anchoring rules underlying color constancy have remained contentious for decades. In this work, we introduce a novel computational theory, the gray-anchoring (GA) theory, to explain how the early stage of the visual system contributes to color constancy, and we demonstrate how our GA rule applies to the chromatic domain by identifying gray surfaces within complex scenes. We also demonstrate a potential neural implementation of gray anchoring by quantitatively analyzing the computational flows of concentric double-opponent (DO) cells in V1. The simulation results show that concentric DO cells can identify gray surfaces within color-biased scenes, and that higher-level cortices can then use these gray surfaces to estimate the illuminant easily. This finding offers not only a clear functional explanation of the concentric DO receptive fields of this cell type in the visual system but also an effective and efficient solution to computational color constancy for computer vision.

2024-10-11T14:04:31Z 22 pages, 6 figures Kai-Fu Yang Dajun Xing Yong-Jie Li http://arxiv.org/abs/2604.04154v1 Non-Equilibrium Stochastic Dynamics as a Unified Framework for Insight and Repetitive Learning: A Kramers Escape Approach to Continual Learning 2026-04-05T15:42:23Z

Continual learning in artificial neural networks is fundamentally limited by the stability--plasticity dilemma: systems that retain prior knowledge tend to resist acquiring new knowledge, and vice versa. Existing approaches, most notably elastic weight consolidation (EWC), address this empirically without a physical account of why plasticity eventually collapses as tasks accumulate. Separately, the distinction between sudden insight and gradual skill acquisition through repetitive practice has lacked a unified theoretical description. Here, we show that both problems admit a common resolution within non-equilibrium statistical physics. We model the state of a learning system as a particle evolving under Langevin dynamics on a double-well energy landscape, with the noise amplitude governed by a time-dependent effective temperature $T(t)$. The probability density obeys a Fokker--Planck equation, and transitions between metastable states are governed by the Kramers escape rate $k = (\omega_0 \omega_b / 2\pi)\, e^{-\Delta E / T}$. We make two contributions. First, we identify the EWC penalty term as an energy barrier whose height grows linearly with the number of accumulated tasks, yielding an exponential collapse of the transition rate predicted analytically and confirmed numerically. Second, we show that insight and repetitive learning correspond to two qualitatively distinct temperature protocols within the same Fokker--Planck equation: insight events produce transient spikes in $T(t)$ that drive rapid barrier crossing, whereas repetitive practice operates at a modestly elevated but fixed temperature, achieving transitions through sustained stochastic diffusion. These results establish a physically grounded framework for understanding plasticity and its failure in continual learning systems, and suggest principled design criteria for adaptive noise schedules in artificial intelligence.
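
The first claim above can be checked with a few lines of arithmetic. The sketch below evaluates the Kramers rate from the abstract for a barrier that grows linearly with the task count. The helper `barrier_height`, its coefficients, the unit frequencies, and the temperature are hypothetical values chosen for illustration; the paper's actual parameters are not given in the abstract.

```python
import math


def kramers_rate(omega_0, omega_b, delta_e, temperature):
    """Kramers escape rate k = (omega_0 * omega_b / (2*pi)) * exp(-delta_e / T)."""
    prefactor = omega_0 * omega_b / (2.0 * math.pi)
    return prefactor * math.exp(-delta_e / temperature)


def barrier_height(n_tasks, base_barrier=1.0, per_task_increment=0.5):
    """Hypothetical barrier that grows linearly with the number of tasks."""
    return base_barrier + per_task_increment * n_tasks


# Assumed illustrative values: unit well/barrier frequencies, fixed T = 0.5.
temperature = 0.5
rates = [
    kramers_rate(1.0, 1.0, barrier_height(n), temperature)
    for n in range(6)
]

# With a linear barrier, each extra task multiplies the rate by the same
# factor exp(-per_task_increment / T), i.e. the rate decays exponentially.
ratios = [rates[i + 1] / rates[i] for i in range(len(rates) - 1)]

for n, k in enumerate(rates):
    print(f"tasks={n}  escape rate={k:.3e}")
```

Because consecutive rates differ by the constant factor exp(-per_task_increment / T), a barrier that is linear in the number of tasks gives a transition rate that is exponential in it, which is the collapse the abstract describes.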

2026-04-05T15:42:23Z 12 pages, 4 figures Gunn Kim http://arxiv.org/abs/2604.04033v1 Topological Sensitivity in Connectome-Constrained Neural Networks 2026-04-05T09:23:01Z

Connectome-constrained neural networks are often evaluated against sparse random controls and then interpreted as evidence that biological graph topology improves learning efficiency. We revisit that claim in a controlled flyvis-based study using a Drosophila connectome, a naive self-loop-matched random graph, and a degree-preserving rewired null. Under weak controls, in which both models were recovered from a connectome-trained checkpoint and the null matched only global graph counts, the connectome appeared substantially better in early loss, mean activity, and runtime. That picture changed under stricter controls. Training both graphs from a shared random initialization removed the early loss advantage, and replacing the naive null by a degree-preserving null removed the apparent activity advantage. A five-sample degree-preserving ensemble and a pre-training activity-scale diagnostic further strengthened this revised interpretation. We also report a descriptive mechanism analysis of the earlier weak-control comparison, but we treat it as behavioral characterization rather than proof of causal superiority. We show that previously reported topology advantages in connectome-constrained neural networks can arise from initialization and null-model confounds, and largely disappear under fair from-scratch initialization and degree-preserving controls.
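
To make the degree-preserving null concrete, here is a minimal sketch of one common way to build such a control: repeated double-edge swaps on a directed edge list. This is a generic technique, not the flyvis-based procedure used in the study. The function names and the toy graph are hypothetical, and the sketch assumes the input has no self-loops and no duplicate edges.

```python
import random
from collections import Counter


def degrees(edges):
    """Return (out-degree, in-degree) counters for a directed edge list."""
    out_deg = Counter(src for src, _ in edges)
    in_deg = Counter(dst for _, dst in edges)
    return out_deg, in_deg


def degree_preserving_rewire(edges, n_swaps, seed=0):
    """Swap the targets of random edge pairs: (a,b),(c,d) -> (a,d),(c,b).

    Every node keeps its in-degree and out-degree. Swaps that would create
    a self-loop or a duplicate edge are rejected and retried.
    """
    rng = random.Random(seed)
    edge_list = list(edges)
    edge_set = set(edge_list)
    done, attempts = 0, 0
    max_attempts = 100 * n_swaps  # guard against graphs with no valid swap
    while done < n_swaps and attempts < max_attempts:
        attempts += 1
        i, j = rng.sample(range(len(edge_list)), 2)
        (a, b), (c, d) = edge_list[i], edge_list[j]
        new_1, new_2 = (a, d), (c, b)
        if a == d or c == b:
            continue  # would create a self-loop
        if new_1 in edge_set or new_2 in edge_set:
            continue  # would create a duplicate edge
        edge_set -= {(a, b), (c, d)}
        edge_set |= {new_1, new_2}
        edge_list[i], edge_list[j] = new_1, new_2
        done += 1
    return edge_list


# Toy directed graph (assumed for illustration): ring plus skip connections.
n_nodes = 8
original = [(i, (i + 1) % n_nodes) for i in range(n_nodes)]
original += [(i, (i + 3) % n_nodes) for i in range(n_nodes)]

rewired = degree_preserving_rewire(original, n_swaps=20)
print("degrees preserved:", degrees(original) == degrees(rewired))
```

A null of this kind keeps each node's in-degree and out-degree fixed while scrambling which nodes connect to which, so any remaining performance gap is harder to attribute to degree structure alone.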

2026-04-05T09:23:01Z 17 pages, 5 fig Nalin Dhiman http://arxiv.org/abs/2604.04025v1 Neurological Plausibility of AI-Generated Music for Commercial Environments: An In-Silico Cortical Investigation Using Wubble and TRIBE v2 2026-04-05T08:53:32Z

Background music shapes attention, affect, and approach behavior in commercial environments, yet the neural plausibility of AI-generated music for such settings remains poorly characterized. We present an in-silico pilot study that combines Wubble, a generative music system, with TRIBE v2, a publicly released whole-brain encoding model, to estimate cortical response profiles for prompt-conditioned retail music. Five fully instrumental tracks were generated to span low-to-high arousal, sparse-to-dense arrangement, and neutral-to-positive valence prompts, then analyzed with audio-only TRIBE v2 inference on loudness-normalized waveforms. Analysis focused on fsaverage5 cortical predictions summarized over auditory, superior temporal, temporo-parietal, and inferior frontal HCP parcels. The fast bright major-pop condition produced the largest whole-cortex mean activation (0.0402), the strongest prefrontal ROI composite response (0.0704), and the highest parcel means in IFJa (0.1102), IFJp (0.0995), A5 (0.0188), and area 45 (0.0015). Pairwise spatial correlations ranged from 0.787 to 0.974, indicating that prompt variation modulated predicted cortical states rather than yielding a single undifferentiated response profile. Predicted cortical surface maps further revealed visually distinct spatial organization between low-arousal and high-arousal conditions. These results support a cautious claim of cortical neurological plausibility: prompt-conditioned AI music can systematically shift predicted auditory-temporal-prefrontal patterns relevant to salience and valuation. Although the study does not establish subcortical reward engagement or consumer behavior, it provides a reproducible framework for neural pre-screening and pre-optimization of commercial music generation against biologically informed cortical proxies.

2026-04-05T08:53:32Z IEEE-style preprint; 4 figures; 4 tables Shaad Sufi http://arxiv.org/abs/2505.22680v3 Evidence for Bures--Wasserstein Boundary Dynamics in the Living Human Brain 2026-04-04T15:10:40Z

When substrate-constrained covariance flow on the Bures--Wasserstein manifold reaches the Williamson boundary, single-mode compression saturates and further admissible covariance evolution is forced into the cross-mode complement. This paper derives how that substrate boundary transition becomes experimentally visible in an embedded spin probe in the living human brain. We formulate a boundary-conditioned transfer theorem: when the substrate enters the deep boundary regime in a coupled mode, the boundary-selected cross-mode continuation of substrate covariance flow enters the reduced spin dynamics as a nonzero inter-spin correlation block. The spin probe does not inherit the substrate boundary as a state; it detects the boundary indirectly through the transferred cross-mode sector of the reduced dynamics. To leading order, this transfer is selective: it acts through an additive cross-diffusion channel while leaving conventional single-mode NMR observables such as $T_1$, $T_2$, linewidths, and the ordinary single-quantum response dominated by the thermal background. Projecting the induced spin cross-mode structure into the two-spin algebra, we argue that the experimentally relevant dominant recipient is the double-quantum SU(1,1) pair sector rather than the compact zero-quantum SU(2) exchange sector. We then derive the coherence-transfer pathway through which this double-quantum pair coherence is converted into a detectable signal by the $45^\circ$--gradient--$45^\circ$ readout block.

2025-05-19T09:43:25Z Christian Kerskens http://arxiv.org/abs/2604.03480v1 Large Language Models Align with the Human Brain during Creative Thinking 2026-04-03T22:02:15Z

Creative thinking is a fundamental aspect of human cognition, and divergent thinking, the capacity to generate novel and varied ideas, is widely regarded as its core generative engine. Large language models (LLMs) have recently demonstrated impressive performance on divergent thinking tests, and prior work has shown that models with higher task performance tend to be more aligned with human brain activity. However, existing brain-LLM alignment studies have focused on passive, non-creative tasks. Here, we explore brain alignment during creative thinking using fMRI data from 170 participants performing the Alternate Uses Task (AUT). We extract representations from LLMs varying in size (270M-72B) and measure alignment to brain responses via Representational Similarity Analysis (RSA), targeting the creativity-related default mode and frontoparietal networks. We find that brain-LLM alignment scales with model size (default mode network only) and idea originality (both networks), with effects strongest early in the creative process. We further show that post-training objectives shape alignment in functionally selective ways: a creativity-optimized \texttt{Llama-3.1-8B-Instruct} preserves alignment with high-creativity neural responses while reducing alignment with low-creativity ones; a human behavior fine-tuned model elevates alignment with both; and a reasoning-trained variant shows the opposite pattern, suggesting chain-of-thought training steers representations away from creative neural geometry toward analytical processing. These results demonstrate that post-training objectives selectively reshape LLM representations relative to the neural geometry of human creative thought.
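
For readers unfamiliar with RSA, the following sketch shows the basic computation on synthetic data: build a representational dissimilarity matrix (RDM) for each system, then rank-correlate the two RDMs. The correlation-distance RDM, the Spearman-style comparison, the array shapes, and every name here are assumptions made for illustration; the abstract does not specify the study's exact RSA variant.

```python
import numpy as np


def rdm(features):
    """RDM as 1 - Pearson correlation between rows (one row per stimulus)."""
    return 1.0 - np.corrcoef(features)


def upper_triangle(matrix):
    """Off-diagonal upper-triangle entries, flattened to a vector."""
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]


def ordinal_ranks(values):
    """Ordinal ranks; ties are not averaged (adequate for continuous data)."""
    order = np.argsort(values)
    ranks = np.empty(len(values), dtype=float)
    ranks[order] = np.arange(len(values))
    return ranks


def rsa_score(features_a, features_b):
    """Rank correlation between the two systems' RDM upper triangles."""
    ranks_a = ordinal_ranks(upper_triangle(rdm(features_a)))
    ranks_b = ordinal_ranks(upper_triangle(rdm(features_b)))
    return float(np.corrcoef(ranks_a, ranks_b)[0, 1])


# Synthetic stand-ins (assumed shapes): 12 stimuli, 50 "voxels", 30 "units".
rng = np.random.default_rng(0)
brain = rng.normal(size=(12, 50))
model = brain @ rng.normal(size=(50, 30))  # a model sharing brain structure
unrelated = rng.normal(size=(12, 30))      # a model with no shared structure

print("brain vs itself:  ", rsa_score(brain, brain))
print("brain vs model:   ", rsa_score(brain, model))
print("brain vs unrelated:", rsa_score(brain, unrelated))
```

Because RSA compares stimulus-by-stimulus dissimilarity structure rather than raw activations, the two systems can have different numbers of features (voxels versus hidden units) as long as they are measured on the same stimuli.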

2026-04-03T22:02:15Z Under review Mete Ismayilzada Simone A. Luchini Abdulkadir Gokce Badr AlKhamissi Antoine Bosselut Antonio Laverghetta Lonneke van der Plas Roger E. Beaty http://arxiv.org/abs/2604.03021v1 Temporal structure of the language hierarchy within small cortical patches 2026-04-03T13:13:22Z

Speech production requires the rapid coordination of a complex hierarchy of linguistic units, transforming a semantic representation into a precise sequence of articulatory movements. To unravel the neural mechanisms underlying this feat, we leverage recordings from eight 3.2 x 3.2 mm 64-microelectrode arrays implanted in the motor cortex and inferior frontal gyrus of two patients tasked with producing twenty thousand sentences. We show that a hierarchy of linguistic features is robustly encoded in most of these small cortical patches. Contrary to our expectations, instead of a clear macroscopic organization between patches, we observe a multiplexing of phonetic, syllabic, and lexical representations within each cortical patch. Critically, this coding scheme changes dynamically over time to allow successive phonemes, syllables, and words to be simultaneously represented without interference. Overall, these results, reminiscent of position encoding in transformers, show how small cortical patches organize the unfolding of the speech hierarchy during language production.

2026-04-03T13:13:22Z Julien Gadonneix Mingfang Zhang Jérémy Rapin Linnea Evanson Pierre Bourdillon Jean-Rémi King http://arxiv.org/abs/2604.14202v1 Bridging scalp and intracranial EEG in BCI via pretrained neural representations and geometric constraint embedding 2026-04-03T12:54:50Z

Electroencephalography (EEG) has become one of the key modalities underpinning brain-computer interfaces (BCIs) due to its high temporal resolution, rapid responsiveness, non-invasiveness, low cost, and portability. However, EEG signals are substantially inferior to intracranial EEG (iEEG) in signal-to-noise ratio and local spatial resolution, whereas iEEG suffers from extremely limited clinical accessibility owing to its invasive nature, hindering widespread application. To address this challenge, this study proposes a unified data- and prior-knowledge-driven framework for EEG-iEEG representational enhancement. Guided by the principle that "geometric structure dictates function", the framework maps static cortical anatomy onto dynamic constraints governing neural signal propagation and integrates general-purpose neural representations extracted by a pre-trained large EEG model to explicitly model signal transmission through the brain. Enhanced EEG signals are then synthesized via a multidimensional representation diffusion process. Experimental results demonstrate that the generated enhanced EEG signals effectively recover the neural activity patterns lost during propagation through the brain. This finding indicates that the performance ceiling of BCIs is constrained not only by acquisition hardware but also by the depth to which the generative model resolves the mechanisms of neural signal propagation. Collectively, the proposed framework provides a viable pathway toward acquiring high-fidelity neural signals at low cost.

2026-04-03T12:54:50Z Yihang Dong Changhong Jing Shuqiang Wang http://arxiv.org/abs/2604.14200v1 Retina gap junctions support the robust perception by warping neural representational geometries along the visual hierarchy 2026-04-03T11:23:26Z

Deep Neural Networks (DNNs) are vulnerable to elaborately designed adversarial noise, although they have achieved extraordinary success in many tasks. Compared with DNNs, the human visual system is highly robust. However, it is unclear how the human visual system defends against adversarial attacks, especially the role of the early visual system and its influence on the brain manifold. Because retina gap junctions are crucial for the denoising function of the early visual system, we combine a retina gap junction-based filter, G-filter, with a DNN as an abstract model of the human visual system, called the biological hybrid model. We adopt this model to study the defense performance of retina gap junctions and their impact on the brain manifold. Compared with other defense methods, the biological hybrid model is more robust and can be further improved by introducing noise during training. Next, we analyze the manifold of the biological hybrid model and its decision boundary from a geometric perspective. The results show that the biological hybrid model has a unique 2D decision boundary with high nonlinearity and a lower decision-boundary curvature than other defense methods. This transformation of the manifold may account for the high robustness of the biological hybrid model. Finally, to dissect G-filter and clarify its internal mechanism, we borrow the Neural Ordinary Differential Equation (ODE) concept and rewrite G-filter as an equivalent recurrent neural network. The results show that the decision boundary of the model's manifold gradually changes with time and eventually reaches a steady state, which is modulated by gap junction conductance, revealing that the influence of retina gap junctions on the brain manifold is a gradually evolving process.

2026-04-03T11:23:26Z 32 pages, 6 figures Yang Yue Shenjian Zhang Yonghong Tian Kai Du Tiejun Huang http://arxiv.org/abs/2510.20847v2 Integrated representational signatures strengthen specificity in brains and models 2026-04-03T08:07:12Z

The extent to which different biological or artificial neural networks (models) rely on equivalent representations to support similar tasks remains a central question in neuroscience and machine learning. Prior work has typically compared systems using a single representational similarity metric, yet each captures only one facet of representational structure. To address this, we leverage a suite of representational similarity metrics (each capturing a distinct facet of representational correspondence, such as geometry, unit-level tuning, or linear decodability) and assess brain region or model separability using multiple complementary measures. Metrics that preserve geometric or tuning structure (e.g., RSA, Soft Matching) yield stronger region-based discrimination, whereas more flexible mappings such as Linear Predictivity show weaker separation. These findings suggest that geometry and tuning encode brain-region- or model-family-specific signatures, while linearly decodable information tends to be more globally shared across regions or models. To integrate these complementary representational facets, we adapt Similarity Network Fusion (SNF), a framework originally developed for multi-omics data integration. SNF produces substantially sharper regional and model family-level separation than any single metric and yields robust composite similarity profiles. Moreover, clustering cortical regions using SNF-derived similarity scores reveals a clearer hierarchical organization that aligns closely with established anatomical and functional hierarchies of the visual cortex, surpassing the correspondence achieved by individual metrics.

2025-10-21T04:37:27Z Jialin Wu Shreya Saha Yiqing Bo Meenakshi Khosla