https://arxiv.org/api/csiiMo2QeeerqY7Bm53mv+US7FI 2026-06-21T09:10:40Z 12181 45 15 http://arxiv.org/abs/2605.16739v2 EmoMind: Decoding Affective Captions from Human Brain fMRI 2026-06-11T20:35:44Z Decoding visual experience from brain activity has advanced substantially, but current brain-to-text systems largely recover semantic content while discarding affect. Additionally, language models can generate emotional text when prompted with categorical labels, but such labels collapse rich inter-subject variability into coarse discrete bins. We present EmoMind, the first end-to-end pipeline for decoding affective captions directly from fMRI signals. EmoMind first retrieves a semantically grounded neutral scene description from brain-decoded visual features, then rewrites it using a continuous 34-dimensional emotion vector decoded from the same fMRI recording. To control the balance between content preservation and affective expression, we train the rewriter with classifier-free guidance against an identity-preserving null branch, enabling smooth interpolation between semantic fidelity and affective expressivity. We evaluate affective caption generation with a three-axis validation framework spanning subject-specificity, structural geometry, and causal control. We further augment this framework with a synthetic-brain substitution test that probes robustness to the measurement apparatus, and we benchmark each axis against GPT-4 prompted with brain-decoded top-5 emotion labels as a strong discrete baseline. Across two independent emotion fMRI datasets, EmoMind significantly outperforms label-prompted GPT-4 on all three axes, with the largest gains on metrics that require person-specific affective structure rather than population-level emotion aggregation. These results establish continuous brain-decoded affect as a viable control signal for individualized affective caption generation and open new directions for studying individual affective brain organisation. 
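The guidance step mentioned in the EmoMind abstract can be illustrated with the standard classifier-free guidance combination rule. This is a minimal sketch, not EmoMind's implementation: the function name `guided_logits`, the guidance weight `w`, and the toy scores are hypothetical, and it assumes only that the rewriter produces one set of next-token scores from the emotion-conditioned branch and one from the identity-preserving null branch.

```python
def guided_logits(null_logits, cond_logits, w):
    """Standard classifier-free guidance mix: null + w * (cond - null).

    w = 0 returns the null (content-preserving) scores, w = 1 returns the
    conditioned scores, and w > 1 extrapolates past the conditioned branch.
    """
    if len(null_logits) != len(cond_logits):
        raise ValueError("both branches must score the same vocabulary")
    return [n + w * (c - n) for n, c in zip(null_logits, cond_logits)]


# Hypothetical scores over a three-token vocabulary.
null_branch = [2.0, 0.5, -1.0]      # null branch: keep the neutral caption
emotion_branch = [1.0, 1.5, 0.0]    # branch conditioned on the emotion vector

neutral = guided_logits(null_branch, emotion_branch, 0.0)
blended = guided_logits(null_branch, emotion_branch, 0.5)
amplified = guided_logits(null_branch, emotion_branch, 2.0)
```

Sweeping `w` between 0 and values above 1 is one way to read the abstract's "smooth interpolation between semantic fidelity and affective expressivity"; how EmoMind parameterizes the weight is not stated in the abstract.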
2026-05-16T01:32:45Z Bilal A. Mohammed Lin Gu Ruogu Fang http://arxiv.org/abs/2606.13801v1 Neural Variability Enhances Artificial Network Robustness 2026-06-11T18:15:39Z Neural responses in cortex exhibit substantial trial-to-trial variability in response to repeated stimuli, while peripheral sensory neurons respond far more consistently, leading many to wonder whether stochasticity may carry meaning. Existing work has argued that noise and signal correlations may be optimized for discrimination in animals, whereas artificial neural network (ANN) studies have shown similar benefits of noise in machine learning tasks, although most ANN work has neglected the effects of correlations. Here we investigate whether correlated noise improves the robustness of artificial neural networks to adversarial attacks and naturalistic image modifications. Using the covariance of activations under modified versus clean inputs, we find that structured noise may significantly improve network robustness. Robustness to naturalistic image modifications benefits most from structure, but this structure transfers poorly across modification types. In contrast, noise structure from adversarial attacks can generalize to other kinds of attacks. These results suggest that structured noise in ANN activations generally improves robustness, establishing a biologically plausible strategy for creating robust artificial neural networks that only relies on local information. 2026-06-11T18:15:39Z Robin Preble Praveen Venkatesh Stefan Mihalas Kameron Decker Harris http://arxiv.org/abs/2510.00011v2 Robust State-space Reconstruction of Brain Dynamics via Bootstrap Monte Carlo SSA 2026-06-11T15:04:43Z Reconstructing latent state-space geometry from time series provides a powerful route to studying nonlinear dynamics across complex systems. Delay-coordinate embedding provides the theoretical basis but assumes long, noise-free recordings, which many domains violate. 
In many real-world domains, recordings are short, noisy, and coarsely sampled; in neuroimaging, for example, fMRI additionally contains autocorrelated background structure that can obscure oscillatory components and destabilize embeddings. We propose bootstrap Monte Carlo singular spectrum analysis (BMC-SSA), which combines Monte Carlo SSA with bootstrap stability to retain oscillatory modes that are statistically supported and reproducible across resampled data. This produces reconstructions that emphasize reliable oscillatory structure, enhancing determinism and stabilizing subsequent embeddings. Our results show that BMC-SSA improves the reliability of functional measures and uncovers differences in state-space dynamics in fMRI, offering a general framework for robust embedding of noisy, finite signals. 2025-09-16T20:47:14Z 6 pages, 2 figures, conference Sir-Lord Wiafe Carter Hinsley Vince D. Calhoun http://arxiv.org/abs/2606.13260v1 Extracting Governing Equations from Latent Dynamics via Multi-View Contrastive Learning 2026-06-11T12:16:35Z Identifying latent dynamical systems from noisy, high-dimensional measurements is a central problem at the intersection of representation learning, system identification, and scientific discovery. We present DYSCO, a multi-view temporal contrastive learning algorithm that jointly recovers latent trajectories and the governing dynamics from such observations, by leveraging multiple independent noisy views of the same underlying process to disentangle signal from noise. By parameterizing the dynamics in a structured functional basis, our framework further enables symbolic recovery of the governing equations within an affine gauge. We offer theoretical guarantees for strong identification up to an affine indeterminacy, extending prior identifiability results to the realistic setting of noisy nonlinear observations. 
Empirically, we demonstrate accurate recovery of both latent trajectories and flow fields across a diverse set of dynamical regimes (e.g., chaotic, oscillatory, and metastable) under both Gaussian and Poisson observation noise, the latter being particularly relevant for neural recordings. 2026-06-11T12:16:35Z Paolo Muratore Mackenzie Weygandt Mathis http://arxiv.org/abs/2508.14143v2 The Urysohn Machine: A Metric-Topological Model of Computation 2026-06-11T11:28:34Z We introduce the Urysohn Machine, an effective model of classification-oriented computation in which metric separation, frontier structure, and contraction are explicit parts of the computational state. Its basic object is a \emph{Urysohn Triple}: a support region, a target partition, and a separating classifier stored in a reusable Metric Library. The topological foundation is a constructive Urysohn Realization theorem for finite simplicial settings. It builds separators from dyadic ladders of nested polyhedral regions and equips their frontiers with a chain-level calculus: frontiers are cycles, and shells between levels have boundaries given by differences of frontiers. This construction yields two related complexity measures: decision-boundary width, the geometric measure of a single classifier's boundary, and Urysohn width, the total frontier mass represented by a library or realization. We prove an Amortized Separation Theorem showing that approximating a boundary of width to accuracy requires a number of simple basis triples proportional to boundary width and inversely proportional to resolution, under explicit boundary-footprint assumptions. We also introduce a contrastive separation operator whose graph-cut functional consistently estimates decision-boundary width from sampled metric data, while its Laplacian spectrum certifies class-component structure and conductance. 
Finally, we analyze the dynamic Urysohn ladder and prove four guarantees: separability under quotient collapse, stability of committed frontiers, bounded capacity under contraction, and scalability with quotient distance. Together, these results give a metric-topological account of classification complexity, amortized inference, and compositional reuse that preserves classical computability while exposing geometric structure hidden by purely symbolic descriptions. 2025-08-19T15:10:26Z Xin Li http://arxiv.org/abs/2606.13132v1 Including the Cost of Irreducible Uncertainty in the Policy Compression Framework 2026-06-11T09:55:02Z AI decision-support systems can benefit from anticipating biases in human decision-making. Many such biases may arise from human cognitive limitations. The policy compression framework models decision-making as a trade-off between reward maximization and the cognitive cost of encoding state-dependent action policies, formalized as the mutual information between states and actions (policy complexity). We argue that this account is incomplete because it treats conditional entropy--the irreducible uncertainty about which action should be selected given a state--as costless, even though empirical evidence suggests that it modulates reaction times. We therefore extend the framework by defining cognitive cost as the sum of policy complexity and a weighted conditional-entropy term, governed by a new parameter, $η$. The resulting optimal policy retains the standard exponential form but becomes sharper as $η$ increases, allowing policy precision to vary more independently of reward sensitivity. This modification implies that the standard policy compression framework may underestimate the cognitive cost of action selection, and it has the potential to better account for biases in human decision-making. At the same time, it introduces additional complexity for fitting the model to human data, which future work will need to address. 
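The sharpening effect of $η$ can be made concrete under one reading of the abstract. Assume the agent maximizes expected reward minus (1/β) times the cost I(S;A) + η·H(A|S), with 0 ≤ η < 1. Setting the derivative with respect to π(a|s) to zero then gives π(a|s) ∝ [P(a)·exp(β·Q(s,a))]^(1/(1−η)), which reduces to the standard policy compression solution at η = 0. The exact parameterization in the paper may differ. The sketch below performs a single policy update for a fixed action marginal; the names `extended_policy` and `entropy` and the toy values are hypothetical.

```python
import math


def extended_policy(q_row, marginal, beta, eta):
    """One policy update for a single state under the assumed objective.

    pi(a|s) is proportional to [P(a) * exp(beta * Q(s, a))] ** (1 / (1 - eta)).
    Requires 0 <= eta < 1 and strictly positive marginal probabilities.
    """
    if not 0.0 <= eta < 1.0:
        raise ValueError("this sketch assumes 0 <= eta < 1")
    sharpen = 1.0 / (1.0 - eta)
    log_w = [sharpen * (math.log(p) + beta * q) for q, p in zip(q_row, marginal)]
    top = max(log_w)  # subtract the maximum for numerical stability
    unnorm = [math.exp(v - top) for v in log_w]
    total = sum(unnorm)
    return [u / total for u in unnorm]


def entropy(probs):
    """Shannon entropy in nats; zero-probability actions contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0)


q_row = [1.0, 0.0]        # hypothetical action values for one state
marginal = [0.5, 0.5]     # hypothetical action marginal P(a)

standard = extended_policy(q_row, marginal, beta=1.0, eta=0.0)
sharper = extended_policy(q_row, marginal, beta=1.0, eta=0.5)
```

With the marginal held fixed, raising η increases the exponent 1/(1−η), so the policy concentrates on the higher-valued action and its entropy falls. A full solution would alternate this update with recomputing P(a) across states, as in Blahut-Arimoto-style iterations.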
2026-06-11T09:55:02Z Accepted at the 5th International Conference on Hybrid Human-Artificial Intelligence, 2026 Álvaro Garrido-Pérez Pieter Simoens Amrapali Pednekar Yara Khaluf http://arxiv.org/abs/2603.24603v2 Fusion Learning from Dynamic Functional Connectivity: Combining the Amplitude and Phase of fMRI Signals to Identify Brain Disorders 2026-06-11T09:47:52Z Dynamic functional connectivity (dFC) derived from resting-state functional magnetic resonance imaging (fMRI) has been extensively utilized in brain science research. The sliding window correlation (SWC) method is a widely used approach for constructing dFC by computing correlation coefficients between amplitude time series of signals from pairs of brain regions. In this study, we propose an integrated approach that incorporates both amplitude and phase information of fMRI signals to improve the detection of brain disorders. Specifically, we introduce a multi-scale fusion learning framework, namely MSFL, which leverages two complementary dFC features derived from SWC and phase synchronization (PS). Here, SWC captures amplitude correlations, while PS measures phase coherence within dFC. We evaluated the efficacy of MSFL in classifying autism spectrum disorder and major depressive disorder using two publicly available datasets: ABIDE I and REST-meta-MDD, respectively. The results indicate that MSFL significantly outperforms existing comparative models. Moreover, we performed model explanation analysis using the SHAP framework, which showed that both types of dFC features from SWC and PS contribute to detecting brain disorders. 
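The sliding window correlation step described in the MSFL abstract can be sketched for a single pair of regions. This is an illustrative, standard-library-only example, not the MSFL code: the function names, window length, step size, and toy signals are hypothetical, and the phase synchronization branch is not covered here.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def sliding_window_correlation(x, y, window, step=1):
    """Correlate x and y inside each window, giving one dFC value per window."""
    if len(x) != len(y):
        raise ValueError("time series must have the same length")
    if not 2 <= window <= len(x):
        raise ValueError("window must be between 2 and the series length")
    starts = range(0, len(x) - window + 1, step)
    return [pearson(x[s:s + window], y[s:s + window]) for s in starts]


# Hypothetical amplitude time series for two brain regions.
region_a = [0.0, 1.0, 2.0, 3.0, 2.0, 1.0, 0.0, 1.0]
region_b = [0.0, 2.0, 4.0, 6.0, 5.0, 7.0, 9.0, 8.0]

dfc = sliding_window_correlation(region_a, region_b, window=4, step=2)
```

Here the two regions are perfectly correlated in the first window and anti-correlated in the later ones, which is the kind of time-varying coupling that a single static correlation would average away. Applying the same function to every pair of regions yields one connectivity matrix per window.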
2026-03-14T04:57:17Z Jinlong Hu Jiatong Huang Zijian Cai http://arxiv.org/abs/2606.13017v1 Deep Sleep Classification via EEG Signal Criticality: A Passive BCI Approach for Sleep-Improvement Neurofeedback 2026-06-11T07:53:29Z Automated sleep staging is a fundamental application of passive Brain-Computer Interfaces (pBCI), decoding spontaneous neural states to enable closed-loop interventions independent of user intent. This study evaluates criticality features derived from Detrended Fluctuation Analysis (DFA) for the specific identification of deep sleep (N3). We analyzed $347,232$ EEG epochs from $290$ older women using UMAP manifold learning to visualize state transitions. Subsequently, six classifiers were benchmarked via 10-fold cross-validation, using balanced accuracy to determine the optimal "state-sensing" engine for neurofeedback. Naive Bayes achieved the highest mean balanced accuracy ($87.17\% \pm 0.24\%$), significantly outperforming a fully connected deep neural network (FNN: $81.58\%$) and Random Forest ($80.97\%$). Linear models (LDA: $57.21\%$; SVM: $51.01\%$) performed poorly, indicating that DFA-derived criticality features reside on a distinct, non-linear manifold. Probabilistic decoding of EEG criticality provides a high-accuracy sensing mechanism for pBCIs. This robust classification pipeline supports the development of state-dependent neurofeedback, such as targeted auditory stimulation, to enhance cognitive recovery. 2026-06-11T07:53:29Z 7 pages, 3 figures, accepted for publication in the Proceedings of the 10th Graz Brain-Computer Interface Conference 2026, Graz, Austria, September 14-17, 2026 Stanisław Narębski Tomasz Komendziński Tomasz M. Rutkowski http://arxiv.org/abs/2606.12684v1 Phase model analysis of the effect of M-current on neural synchrony in hippocampal networks 2026-06-10T21:14:54Z Neural assemblies, transiently coordinated groups of neurons, observed in the hippocampus are thought to underlie the formation of episodic memories. 
Acetylcholine (ACh), a neuromodulator received by the hippocampus, plays a critical role in memory and learning. A well-supported hypothesis suggests that high levels of ACh during active exploration and rapid eye movement (REM) sleep promote memory encoding, while low levels during quiet waking and slow-wave sleep (SWS) support memory consolidation. We study this bidirectional role of ACh in neural assembly formation through its effect on the synchrony among neurons. We consider a network model of pyramidal neurons, each equipped with a slow, voltage-dependent, non-inactivating potassium current (M-current), which is downregulated in the presence of ACh. Neural assemblies are represented as cluster solutions to this system. Using a one-dimensional phase model reduction of a pair of weakly coupled pyramidal neurons under different levels of the M-current, we predict the symmetric cluster solutions that may emerge in larger networks equipped with all-to-all globally homogeneous, symmetric distance-dependent, and nearest-neighbour coupling architectures. We find that under low ACh conditions, the network can fully synchronize, whereas high levels can desynchronize the network into multiple stable symmetric cluster solutions representing distinct neural assemblies. 2026-06-10T21:14:54Z 39 pages, 14 figures Megha Manoj Sue Ann Campbell http://arxiv.org/abs/2606.12600v1 Multifractal human signals at the edge of life reveal a heart-brain anti-correlation 2026-06-10T18:56:09Z This study investigates the terminal breakdown of human neurophysiological function through the lens of non-linear dynamics by analyzing the multifractal spectrum. Using Multifractal Detrended Fluctuation Analysis (MF-DFA), we quantify the temporal evolution of complexity in synchronized electroencephalogram (EEG) and electrocardiogram (ECG) time series from patients in the terminal stage. 
Our results reveal a marked divergence in multifractal spectrum width: while neural activity exhibits a collapse of multifractality toward a more constrained state, cardiac signals undergo anomalous spectral broadening, indicating increased non-linear fluctuations and dynamical instability. A negative correlation between these spectral widths suggests effective functional decoupling and the emergence of anti-correlated dynamics between neural and cardiac systems. Rather than reflecting a uniform physiological decline, this divergence is consistent with a body-to-brain breakdown in which peripheral dysfunction progressively overwhelms central regulatory processes. In a broader context, the observed opposing trends resemble patterns reported in other body-driven adaptive processes, suggesting that inverse dynamics across coupled systems may emerge when constraints originate from peripheral rather than central mechanisms. Ultimately, the dying process appears to represent an extreme form of cross-system disintegration, marked by the collapse of the hierarchical coordination that normally sustains integrated physiological function. 2026-06-10T18:56:09Z Yago Emanoel Ramos Maria Eloá do Ó Henrique Ferraz de Arruda Mauro Copelli G. Camelo-Neto Pedro V. Carelli http://arxiv.org/abs/2605.23111v4 Contextual Role Modulates Object Representational Geometry in the Human Brain 2026-06-10T18:25:19Z The human brain represents objects in a way that is both invariant across instances and flexible enough to support different contexts and tasks. Yet it remains unknown how object representations are dynamically remapped as the same object shifts across contextual roles. Using fMRI during naturalistic movie viewing we investigated how the same objects are represented when they are passive scene elements versus targets of goal-directed actions. 
Action targets engaged a parietal action network centered in the supramarginal and postcentral gyri, while passive objects recruited a distributed occipito-temporal network involved in visual object recognition. Within context-selective networks, representational geometry showed a double dissociation: target objects were organized by action affordance and hand posture affordance dimensions, while passive objects aligned with semantic dimensions. Visual representational structure was invariant to context. Outside these networks, representational content retained invariance, indicating that flexibility and invariance operate at different levels of the same representational system. These findings demonstrate neural remapping of object representations depending on moment-to-moment changes in contextual roles during a naturalistic scene. 2026-05-22T00:13:27Z Julien Dirani Shankar Chawla Leila Wehbe Bradford Z. Mahon http://arxiv.org/abs/2606.11893v1 Beyond representational alignment with brain-guided language models for robust reasoning 2026-06-10T10:22:49Z The correspondence between large language models (LLMs) and the neural mechanisms underlying human higher-order cognition remains insufficiently characterized. Given that language and reasoning in the human brain appear dissociable, an open question is whether LLMs align with neural signals from reasoning-related regions and whether such signals can improve them. Here, focusing on deductive reasoning, we show that LLM internal representations are not only partially aligned with task-fMRI activity but can also be directly enhanced by these signals. Using a neural-predictivity metric, we find that LLMs explain a substantial fraction of the explainable variance in reasoning-related regions at the aggregate level, whereas predictivity within specific reasoning types is lower, indicating both alignment and divergence. 
Building on this, we propose a brain-guided framework: we steer model representations along directions induced by the joint structure of model and brain representations, applying intervention at inference and fine-tuning during training. We demonstrate that task-evoked brain signals can directly enhance LLM reasoning, yielding gains orthogonal to language-only supervision across 10 LLMs (1.5B-72B), with transfer across reasoning types and up to 13\% absolute accuracy gain. Our results advance LLM-brain correspondences from correlation to guidance, establishing a brain-signal-driven pathway toward more robust and cognitively aligned AI. 2026-06-10T10:22:49Z Mingqing Xiao Kai Du Zhouchen Lin http://arxiv.org/abs/2605.29588v2 Brain-IT-VQA: From Brain Signals to Answers 2026-06-10T10:19:45Z Decoding visual content from fMRI signals recorded while a person views images, and specifically answering questions about the seen images, is a long-standing challenge. While significant progress has been made in recent years in visual question answering (VQA) from fMRI, performance remains limited. Moreover, although recent models can make increasingly accurate predictions, they have rarely been used as tools for understanding the structure of visual representations in the brain. We present Brain-IT-VQA, a framework for visual question answering from fMRI. Building on the Brain Interaction Transformer (Brain-IT), our method decodes language tokens from brain activity and integrates them with a language model to answer visual questions. Our model substantially outperforms previous fMRI-based captioning and VQA approaches. We further introduce NSD-VQA, a new dataset and benchmark for visual question answering from fMRI. 
Unlike existing image-fMRI VQA datasets, which typically provide only a few broad and weakly controlled questions per image, NSD-VQA provides on average 20 question-answer pairs per image across 20 controlled question categories that disentangle multiple levels of visual understanding. This enables more reliable and interpretable evaluation despite limited fMRI test data. Together, Brain-IT-VQA and NSD-VQA provide both a strong predictive framework and a tool for studying brain representations. Using this benchmark, we quantify which forms of visual and semantic information can be reliably decoded from fMRI responses to natural images. We further analyze the contributions of different brain regions across question types. 2026-05-28T08:33:23Z Roman Beliy Matias Cosarinsky Oliver Heinimann Navve Wasserman Michal Irani http://arxiv.org/abs/2606.11833v1 Flow Matching with In-Context Priors for Out-of-Distribution Brain Dynamics 2026-06-10T09:15:33Z Flow matching and diffusion models enable conditional generation across domains ranging from images to proteins, with recent extensions to out-of-distribution contexts. Yet generative models of neural time series have largely remained restricted to categorical conditioning, precluding compositional and zero-shot generalization. In this work, we propose a per-timestep conditioned diffusion transformer for generating realistic fMRI brain dynamics during unseen cognitive tasks by injecting both compositional language and optional spatial priors in-context. Such zero-shot generation could enable counterfactual neuroscience by supporting in-silico design and evaluation of novel cognitive experiments before empirical validation. Leveraging this model, we evaluate across hundreds of held-out task conditions and characterize predictive performance in relation to the training manifold. From language alone, the model recovers region-specific recruitment across tasks and held-out spatial activation patterns. 
Spatial priors, when available, complement the text pathway by anchoring generation in regions of task space where language alone degrades, while retaining the compositional structure needed for counterfactual task specification. To our knowledge this is the first generative model of whole-cortex fMRI dynamics for unseen cognitive tasks, advancing counterfactual neuroscience and data-driven experimental design. 2026-06-10T09:15:33Z Code and pretrained models available at https://github.com/SamGijsen/pinc-flows Sam Gijsen Michał Łukomski Marc-André Schulz Kerstin Ritter http://arxiv.org/abs/2606.11598v1 Large language models selectively converge with human-shared neural semantic representations 2026-06-10T02:54:51Z Interpersonal communication requires building shared semantics that enable listeners to understand speakers' meanings from their unfolding language, but the dimensional structure of this shared neural representation remains unclear. LLMs increasingly approximate human language capability and neural responses, raising the question of whether they capture the same semantic structure shared between human brains. Here, we combined storytelling-listening pseudo-hyperscanning MEG with dimension-resolved interbrain encoding modeling to compare human- and LLM-derived accounts of shared neural semantic representations. Content words from the speaker's narratives were rated by humans and five recent LLMs along ten semantic dimensions (i.e., perception, motor, space, time, socialness, animacy, emotion, attention, causality, and drive). We tested whether these dimensions explained speaker-listener neural synchronization (NS) beyond acoustic and phonological features. Both human- and LLM-derived semantic spaces explained NS, but these shared semantics are better characterized as a multidimensional neural structure rather than a single global signal. 
These patterns also predicted individual differences in listeners' story comprehension, linking neural alignment to cognition. However, comparable overall prediction concealed systematic differences in representational geometry. Larger LLMs aligned more closely and showed greater overlap with humans in semantic structure and NS, but this was incomplete and dimension-dependent. The largest divergences emerged for dimensions closely tied to agency, affect, and social experience. These findings show that LLMs capture substantial components of human shared neural semantics, but their alignment is selective. Larger or more capable models improve the approximation, whereas socially and affectively grounded dimensions are captured only partially. 2026-06-10T02:54:51Z Chen Hong Ximing Shao Gangyi Feng