https://arxiv.org/api/7DAp/jcplQZpBrXWkrQVkANLhcY 2026-06-22T01:16:24Z 12181 240 15 http://arxiv.org/abs/2604.21909v2 Directional Confusions Reveal Divergent Inductive Biases Through Rate-Distortion Geometry in Human and Machine Vision 2026-05-14T17:52:52Z To humans, a robin seems more like a bird than a bird seems like a robin, but does this asymmetry also hold for machine vision? Humans and modern vision models can match each other in accuracy while making systematically different kinds of errors, differing not in how often they fail, but in who gets mistaken for whom. We show these directional confusions reveal distinct inductive biases invisible to accuracy alone. Using matched human and deep neural network responses on a natural-image categorization task under 12 perturbation types, we quantify asymmetry in confusion matrices and link its organization to the geometry of the information--error trade-off - how efficiently, and how gracefully, a system generalizes under distortion. We find that humans exhibit broad but weak asymmetries across many class pairs, whereas deep vision models show sparser, stronger directional collapses into a few dominant categories. Robustness training reduces overall asymmetry magnitude but fails to recover this human-like distributed structure. Generative simulations further show that these two asymmetry organizations shift the trade-off geometry in opposite directions even at matched accuracy, explaining why the same scalar asymmetry score can reflect fundamentally different generalization strategies. Together, these results establish directional confusion structure as a sensitive, interpretable signature of inductive bias that accuracy-based evaluation cannot recover. 2026-04-23T17:52:16Z Leyla Roksan Caglar Pedro A. M. Mediano Baihan Lin http://arxiv.org/abs/2605.14867v1 REALM: Retrospective Encoder Alignment for LFP Modeling 2026-05-14T14:16:22Z Spike activity has been the dominant neural signal for behavior decoding due to its high spatial and temporal resolution. However, as brain-computer interfaces (BCIs) move toward high channel counts and wireless operation, the high sampling frequency of spike signals becomes a bottleneck due to high power and bandwidth requirements. Local field potentials (LFPs) represent a different spatial-temporal scale of brain activity compared to spikes, offering key advantages including improved long-term stability, reduced energy consumption, and lower bandwidth requirement. Despite these benefits, LFP-based decoding models typically show reduced accuracy and often rely on non-causal architectures that are unsuitable for real-time deployment. To address these challenges, we propose REALM: a retrospective distillation framework that enables causal LFP decoding. Inspired by offline-to-online distillation strategies in speech recognition, REALM transfers representational knowledge from a pretrained multi-session bidirectional LFP model to a causal version for real-time deployment. We first pretrain a bidirectional Mamba-2 teacher model using a masked autoencoding objective. We then distill this teacher model into a compact student model via a combined objective of representation alignment and task supervision. REALM consistently outperforms both causal and non-causal LFP-based SOTA methods for behavior decoding. Notably, our REALM improves decoding performance while achieving a $2\times$ reduction in parameter count and a $10\times$ reduction in training time. These results demonstrate that retrospective distillation effectively bridges the gap between offline and real-time neural decoding. REALM shows that LFP-only models can achieve competitive decoding performance without reliance on spike signals, offering a practical and scalable alternative for next-generation wireless implantable BCIs. 2026-05-14T14:16:22Z Peicheng Wu Zhenyu Bu Runze Ma Lin Du http://arxiv.org/abs/2506.21000v4 Modulating task outcome value to mitigate real-world procrastination via noninvasive brain stimulation 2026-05-14T13:16:42Z Procrastination represents one of the most prevalent behavioral problems associated with individual health and societal productivity. Despite its high prevalence and substantial impact on daily functioning, its underlying neurocognitive mechanisms remain poorly understood. A leading model posits that procrastination arises from imbalanced competing motivations: the avoidance of negative task aversiveness and the pursuit of positive task outcomes, yet this framework has not been fully validated in real-world settings and not applied effectively to guide interventions. Here, we addressed this gap with a double-blind, randomized controlled trial. We applied seven sessions of high-definition transcranial direct current stimulation (HD-tDCS) to the left dorsolateral prefrontal cortex (DLPFC) in chronic procrastinators. Using the intensive experience sampling method (iESM), we assessed the effect of anodal HD-tDCS on real-world procrastination at offline after-effect (2-day interval) and long-term after-effect (6-month follow-up). We found that this neuromodulation produced a lasting reduction in real-world procrastination, with effects sustained at a 6-month follow-up. While the intervention is significantly associated with both decreased task aversiveness and increased perceived task outcome value, a mediation analysis indicated a disassociable mechanism: the increase in task outcome value (but not task aversiveness) showed a statistical pattern consistent with accounting for the observed behavioral improvement. In conclusion, the findings are consistent with the hypothesis that enhancing DLPFC function may reduce procrastination by selectively amplifying the valuation of future rewards, not by simply reducing negative feelings about the task. These results align with established decision-theoretic frameworks and suggest a targeted, theory-informed avenue for future behavioral interventions. 2025-06-26T04:30:51Z Zhiyi Chen Zhilin Ren Wei Li ZhenZhen Huo ZhuangZheng Wang Ye Liu Bowen Hu Wanting Chen Ting Xu Artemiy Leonov Chenyan Zhang Bernhard Hommel Tingyong Feng 10.7554/eLife.108241.2.sa3 http://arxiv.org/abs/2605.14680v1 Are cortical microcircuits optimized for information flux? -- A simulation-based reverse engineering study 2026-05-14T10:48:53Z A sufficiently large information flux in recurrent neural networks, quantified by the mutual information between successive network states, is considered a prerequisite for rich information processing capabilities. This raises the question of whether biological neural networks, such as cortical microcolumns, may be structurally organized to enhance information flux. To investigate this possibility, we study a simplified model of the cortical layer 5 architecture, in which a densely and strongly interconnected core population is embedded within a larger supporting network. Surprisingly, we find that the embedding network exerts a pronounced flux-enhancing effect on the core dynamics. Systematic reverse-engineering analyses reveal that the embedding network provides two key contributions: first, it generates effective biases that shift core neurons into a higher-entropy operating regime; second, it supplies stochastic fluctuations that prevent the network from becoming trapped in simple fixed-point or oscillatory attractors through the mechanism of Recurrence Resonance. We further show that the information flux can be increased even beyond the biologically embedded case by applying individually optimized biases to the core neurons, and that these biases can emerge from a simple self-organization principle. Our findings are relevant both for the functional interpretation of biological neural circuits and for the design of artificial recurrent systems such as reservoir computers. 2026-05-14T10:48:53Z Claus Metzner Ali Ghebleh Karin Prebeck Achim Schilling Andreas Maier Thomas Kinfe Patrick Krauss http://arxiv.org/abs/2605.10310v2 Positive Alignment: Artificial Intelligence for Human Flourishing 2026-05-14T08:50:45Z Existing alignment research is dominated by concerns about safety and preventing harm: safeguards, controllability, and compliance. This paradigm of alignment parallels early psychology's focus on mental illness: necessary but incomplete. What we call Positive Alignment is the development of AI systems that (i) actively support human and ecological flourishing in a pluralistic, polycentric, context-sensitive, and user-authored way while (ii) remaining safe and cooperative. It is a distinct and necessary agenda within AI alignment research. We argue that several existing failures of alignment (e.g., engagement hacking, loss of human autonomy, failures in truth-seeking, low epistemic humility, error correction, lack of diverse viewpoints, and being primarily reactive rather than proactive) may be better addressed through positive alignment, including cultivating virtues and maximizing human flourishing. We highlight a range of challenges, open questions, and technical directions (e.g., data filtering and upsampling, pre- and post-training, evaluations, collaborative value collection) for different phases of the LLM and agents lifecycle. We end with design principles for promoting disagreement and decentralization through contextual grounding, community customization, continual adaptation, and polycentric governance; that is, many legitimate centers of oversight rather than one institutional or moral chokepoint. 2026-05-11T10:11:08Z Ruben Laukkonen Seb Krier Chloé Bakalar Shamil Chandaria Morten Kringelbach Adam Elwood Daniel Ford Fernando Rosas Maty Bohacek Matija Franklin Nenad Tomašev Stephanie Chan Verena Rieser Roma Patel Michael Levin Arun Rao http://arxiv.org/abs/2605.12534v2 BioSEN: A Bio-acoustic Signal Enhancement Network for Animal Vocalizations 2026-05-14T08:05:44Z Most work in audio enhancement targets human speech, while bioacoustics is less studied due to noisy recordings and the distinct traits of animal sounds. To fill this gap, we adapt speech enhancement methods and build BioSEN, a model made for bioacoustic signals. BioSEN has three modules: a multi-scale dual-axis attention unit for time-frequency feature extraction, a bio-harmonic multi-scale enhancement unit for capturing harmonic structures, and an energy-adaptive gating connection unit that uses frequency weights to keep vocalizations from being removed as noise. Tests on three bioacoustic datasets show that BioSEN matches or exceeds state-of-the-art speech enhancement models while using far less computation. These results show BioSEN's strength for bioacoustic audio enhancement and its promise for biodiversity monitoring and conservation. 2026-05-02T00:19:24Z ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Tianyu Song Ton Viet Ta Ngamta Thamwattana Hisako Nomura Linh Thi Hoai Nguyen 10.1109/ICASSP55912.2026.11463818 http://arxiv.org/abs/2605.14388v1 Multiple mechanisms of rhythm switching in recurrent neural networks with adaptive time constants 2026-05-14T05:11:54Z Although recurrent neural networks (RNNs) trained on cognitive tasks have become a widely used framework for studying neural computation, the internal mechanisms by which RNNs switch between rhythms across multiple frequency bands, and how these mechanisms relate to neuronal time constants, have not been systematically analyzed. We trained leaky integrator RNNs with neuron-specific learnable time constants on a four-band (theta, alpha, beta, gamma) rhythm-switching task and analyzed 20 independently trained networks. Whereas low-frequency rhythms were produced by distributed participation of many neurons, high-frequency rhythms were dominated by a small subpopulation of short-time-constant neurons, and the negative correlation between time constant and matched-mode amplitude strengthened monotonically with frequency. Rhythm switching was supported by multiple coexisting mechanisms: turnover of the active subpopulation, network-wide baseline shifts that reposition the operating point near distinct unstable fixed points, and inter-neuronal phase reorganization that selectively cancels or supports band components in the population output. The mechanism deployed for each mode pair varied across training runs, exposing a degeneracy of learned solutions. These findings parallel the coexistence of rhythm-specific and multi-rhythm interneurons reported in biological circuits and provide a candidate framework for interpreting frequency-band-specific functional differentiation in neural systems. 2026-05-14T05:11:54Z 19 pages, 8 figures Yutaka Yamaguti Shota Nakamura http://arxiv.org/abs/2605.14319v1 Approximate Macroscopic Dynamics of Spiking Neural Networks Based on Solutions to the Transport Equation 2026-05-14T03:30:32Z Firing rate fluctuations in neural populations are observed experimentally over multiple time scales, in single neurons, across trials when elicited by stimuli, and across populations. In this work, we examine how firing rate fluctuations emerge in networks of coupled integrate-and-fire neurons as a function of the initial distribution of voltages in networks with time-varying inputs. We analytically derive an approximation for the evolution of the instantaneous population rate or flux as a function of the initial voltage distribution through a Fokker-Planck system. Unlike earlier mean field approaches based on asynchronous or constant flux steady state solutions to the Fokker-Planck system, the approach considered here is based on the transport solution to the advection equation and assumes that the time-varying inputs are slow, and the neurons are in the excitation-driven regime. The transport mean field system predicts how firing rate fluctuations emerge from a dynamic interaction between time-varying inputs, initial densities, and coupling in populations of neurons. 2026-05-14T03:30:32Z 20 pages, 5 figures Wilten Nicola Sue Ann Campbell http://arxiv.org/abs/2605.23981v1 Metacognition Should Be the Scientific Framework for Bounded and Effective Self-Governance in Generative AI 2026-05-13T23:40:56Z Generative AI research increasingly confronts a shared problem: systems must sustain yet govern their own generative activity when uncertainty is high, evidence is missing, or context is insufficient. This position paper argues that metacognition should become the scientific framework for bounded and effective self governance in generative AI, where output generation is properly evaluated together with the capacities through which generative systems navigate and regulate their own activity. We advance this position by showing that bounded and effective AI self-governance requires metacognitive alignment across computational, algorithmic, and ecological levels. At the computational level, metacognition specifies the meta-level functions a system is meant to serve, such as monitoring, evaluation, control, and adaptation. At the algorithmic level, these functions are realized through procedures such as elicitation, iteration, and modularization. At the ecological level, metacognitive signals become meaningful, actionable, and accountable within the interface, workflow, and accountability arrangements. Metacognition thus makes it possible to conceive generative AI as both capable and well-governed, rather than treating capability and governance as competing aims. 2026-05-13T23:40:56Z 16 pages, 1 figure, 1 table Eugene Yu Ji Igor Grossmann Amir-Hossein Karimi http://arxiv.org/abs/2605.10947v2 Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation 2026-05-13T22:39:22Z EEG microstate analysis segments continuous brain electrical activity into brief, quasi-stable topographic configurations that reflect discrete functional brain states. Conventional approaches such as Modified K-Means operate directly in electrode space with hard assignment, offering no learned latent representation, no generative decoder, and no mechanism to decode latent configurations into verifiable scalp topographies, limiting both model transparency and interpretability. To address this, we present a Convolutional Variational Deep Embedding (Conv-VaDE) model that jointly learns topographic reconstruction and probabilistic soft clustering in a shared latent space. Conv-VaDE enables generative decoding of cluster prototypes into verifiable scalp topographies, replacing opaque hard partitioning with probabilistic soft assignment. A polarity invariance scheme and a four-dimensional grid search over cluster count (K from 3 to 20), latent dimensionality, network depth, and channel width are conducted to systematically reveal how each architectural design choice shapes the quality, stability, and interpretability of learned EEG microstate representations. The model is evaluated on the LEMON resting-state eyes-closed EEG dataset with ten participants using topographic template formation, clustering stability, and global explained variance (GEV). The architecture search reveals that depth L = 4 appears consistently across all 18 best-performing configurations, yielding a best-case GEV of 0.730 and a silhouette of 0.229 at K = 4 across the model sweeps, where moderately deep networks with compact channel widths and small latent dimensionality dominate across the full K range. These results establish that principled architecture search, rather than model scale, is the key to interpretable and stable EEG microstate discovery via variational deep embedding. 2026-04-29T12:07:01Z Saheed Faremi Andrea Visentin Luca Longo http://arxiv.org/abs/2605.14025v1 Do Language Models Align with Brains? Prediction Scores Are Not Enough 2026-05-13T18:37:17Z Brain-language model comparisons often interpret neural prediction scores as evidence that model representations capture brain-relevant language computation. We asked whether language models align with brains, and whether prediction scores are enough to support that claim, using L-PACT, a source-audited framework that evaluates predictive, relational, mechanism-stripping, and reliability-bounded evidence. Across primary naturalistic language neural datasets and derived language-model representations, L-PACT compared real model features with nuisance baselines and severe controls, tested whether model-to-brain profiles reproduced brain-to-brain patterns, recomputed held-out scores after mechanism stripping, and normalized evidence against brain-brain ceilings. The locked analysis set contains 414 predictive-control rows, 2304 relational profile rows, 4320 mechanism-stripping rows, 420 brain-brain ceiling rows, and 146 integrated decision rows. Assay-sensitivity checks showed that brain-brain reliability, brain-as-model run-to-run relational profiles, independent low-level neural and WAV-derived acoustic-envelope gates, and a deterministic implanted-signal simulation can produce positive evidence when expected. Nevertheless, no real model row passed the predictive, relational, mechanism-stripping, or operational Turing-bounded reliability gates; all 146 integrated rows were control-explained. Less stringent single-criterion rules would have counted raw positive predictive, relational, stripping-delta, and ceiling-normalized effects, but L-PACT downgraded them because controls explained the apparent evidence. In the analyzed derived artifact set, the tested language-model representations do not satisfy L-PACT alignment gates; apparent positives are converted into an auditable control-explained taxonomy rather than treated as structural alignment. 2026-05-13T18:37:17Z 39 pages, 4 main figures, 6 supplementary figures Xiao Jia http://arxiv.org/abs/2603.03337v2 Does the motor cortex draw on a wire plane? 2026-05-13T15:44:54Z The two-thirds power law of human motor control ($v \propto κ^{-1/3}$) is geometrically equivalent to constant equi-affine speed. In classical differential geometry, however, the equi-affine metric is not a tensor: it depends on acceleration, which does not transform covariantly under arbitrary coordinate changes. To recover tensorial behavior, one must either restrict the symmetry group to the affine group or introduce an affine connection -- sacrificing full diffeomorphism covariance. This article proposes a different geometric setting. We equip the Euclidean plane with the "wire diffeology', the smooth structure generated by all smooth curves. In this diffeological space, the equi-affine metric becomes a true covariant $3$-tensor under the **full** diffeomorphism group -- no restriction of symmetries, no additional structure required. The construction is motivated by a simple fact: the motor cortex traces curves, not two-dimensional patches. Accordingly, curves are taken as primitive, echoing the motor control literature in which movements are built from a repertoire of elementary building blocks -- motor primitives. The wire plane offers a geometric formalization of this idea in which the two-thirds power law emerges as a fully covariant invariant. 2026-02-12T22:01:31Z 7.33 pages. This note applies the framework of Diffeology (specifically the Wire Plane) to resolve the non-tensorial nature of the equi-affine metric in motor control Patrick Iglesias-Zemmour http://arxiv.org/abs/2605.13675v1 Characterizing Universal Object Representations Across Vision Models 2026-05-13T15:34:41Z Deep neural networks trained with different architectures, objectives, and datasets have been reported to converge on similar visual representations. However, what remains unknown is which visual properties models actually converge on and which factors may underlie this convergence. To address this, we decompose the object similarity structure of 162 diverse vision models into a small set of non-negative dimensions. To determine universal versus model-specific dimensions, we then estimate how often each dimension reappears across models. In contrast to model-specific dimensions, universal dimensions are more interpretable and more strongly driven by conceptual image properties, indicating the relevance of interpretability and semantic content as implicit factors driving universality across models. Differences in architecture, objective function, training data, model size, and model performance do not explain the emergence of universal dimensions. However, models with more universal dimensions also better predict macaque IT activity and human similarity judgments, suggesting that universality reflects representations relevant to biological vision. These findings have important implications for understanding the emergent representations underlying deep neural network models and their alignment with biological vision. 2026-05-13T15:34:41Z Florian P. Mahner Johannes Roth Ka Chun Lam Michael F. Bonner Francisco Pereira Martin N. Hebart http://arxiv.org/abs/2510.04698v3 The Bayesian Origin of the Probability Weighting Function in Human Representation of Probabilities 2026-05-13T13:27:40Z Humans systematically misrepresent probability in a stereotyped inverse-S pattern. It has been documented for decades, but its origin remains unexplained. We propose a Bayesian encoding-decoding account in which probabilities are represented by noisy internal signals and decoded by Bayes-risk minimization. For bounded probability stimuli, we show that distortion decomposes into boundary regression, likelihood repulsion, and prior attraction, yielding a key prediction: the classic inverse-S-shaped weighting pattern implies a U-shaped allocation of encoding precision with greater sensitivity near 0 and 1. Across judgment of relative frequency, lottery pricing, and risky choice, this U-shape is recovered from data without imposing any functional form on the encoding, and our framework outperforms deterministic weighting functions, bounded log-odds models, uniform-encoding Bayesian accounts, and matched efficient-coding models on held-out data. In a new dot probability estimation experiment with bimodal stimulus statistics, the recovered prior tracks the new distribution while the recovered encoding remains U-shaped. Together, these results identify the inverse-S-shaped probability weighting function as the joint product of a stable U-shaped encoding and a flexible prior, integrated by optimal Bayesian decoding. 2025-10-06T11:10:55Z Xin Tong Thi Thu Uyen Hoang Xue-Xin Wei Michael Hahn http://arxiv.org/abs/2605.13315v1 Embodied Neurocomputation: A Framework for Interfacing Biological Neural Cultures with Scaled Task-Driven Validation 2026-05-13T10:27:05Z Biological neural networks (BNNs) have been established as a powerful and adaptive substrate that offer the potential for incredibly energy and data efficient information processing with distinct learning mechanisms. Yet a core challenge to utilizing BNN for neurocomputation is determining the optimal encoding and decoding mechanisms between the traditional silicon computing interface and the living biology. Here, we propose an Embodied Neurocomputation framework as a systems-level approach to this multi-variable optimization encoding/decoding problem. We operationalize this approach through the first large-scale parameter optimization of encoding configurations for a BNN agent performing closed-loop navigation along an odor-style gradient in a simulated grid-world. Despite the relative simplicity of the task, the biological interactions gave rise to a massive multi-combinatorial search space for optimal parameters. By considering how the components of the system are interconnected and parameterized, we evaluated approximately 1,300 parameter combinations, over 4,000 hours of real-time agent-environment interactions, to identify 12 configurations that consistently demonstrated learning across multiple episodes. These configurations achieved significantly higher task performances than optimized silicon-based DQN agents under the same interaction budget. These findings represent an initial step toward robust and scalable goal-oriented learning using BNNs. Our framework establishes a foundation for applying task-driven neurocomputing and supports the development of field-wide benchmarks. In the long term, this work supports the development of hybrid bio-silicon architectures capable of efficient, adaptive and real-time computation, including the potential for robotic control applications. 2026-05-13T10:27:05Z Johnson Zhou Daniel Tanneberg Forough Habibollahi Alon Loeffler Kiaran Lawson Valentina Baccetti Kwaku Dad Abu-Bonsrah Candice Desouza Finn Doensen Bradley Watmuff Daria Kornienko Azin Azadi Justin Leigh Bourke Bernhard Sendhoff Brett J. Kagan