https://arxiv.org/api/3dc0lULI8YQs0kF8sVLPZnJ4viA2026-06-21T11:40:34Z121817515http://arxiv.org/abs/2606.08720v1This is how the Neocortex Learns2026-06-07T16:31:28ZA sufficient account of how the neocortex learns must meet three criteria: 1. Computationally, it must approximate a powerful, general-purpose learning algorithm known to scale to human-level intelligence; 2. Algorithmically, it must be implementable using known, well-established neural circuits within the neocortex and associated brain structures; 3. Implementationally, there must be a detailed account for how all of the algorithmic mechanisms actually function at a neurochemical level. At present, there is only one framework that meets all of these criteria: error-driven predictive learning via temporal derivatives, driven by corticothalamic circuits, based on competitive kinase synaptic plasticity induction mechanisms. This has been implemented in the Axon neural simulation framework using spiking neurons, and demonstrated to learn across a wide range of challenging cognitively motivated tasks.2026-06-07T16:31:28Z9 pages, 4 figuresRandall C. O'Reillyhttp://arxiv.org/abs/2512.11000v2Unambiguous Representations in Neural Networks: An Information-Theoretic Approach to Intentionality2026-06-06T16:37:20ZRepresentations pervade our daily experience, from letters representing sounds to bit strings encoding digital files. While such representations require externally defined decoders to convey meaning, conscious experience is fundamentally different: a neural state corresponding to perceiving a red square cannot alternatively encode the experience of a green triangle. This intrinsic property of consciousness suggests that conscious representations must be unambiguous in a way that conventional representations are not. We formalize this intuition using information theory, defining representational ambiguity as the conditional entropy H(I|R) over possible interpretations I given a representation R. Through experiments on neural networks trained to classify MNIST digits, we demonstrate that relational structures in network connectivity can unambiguously encode representational content. From relational structure alone, we achieve perfect (100%) accuracy for dropout-trained networks and 38% for standard backpropagation (chance: 10%) in identifying output neuron class identity, despite identical task performance, demonstrating that representational ambiguity can arise orthogonally to behavioral accuracy. We further show that spatial position of input neurons, relevant to phenomenal properties like visual field location, can be decoded from network connectivity with R^2 up to 0.844. These results provide a quantitative method for measuring representational ambiguity in neural systems and demonstrate that neural networks can exhibit the low-ambiguity representations posited as necessary (though not sufficient) by theoretical accounts such as narrow representationalism and IIT.2025-12-10T19:00:34ZPresented at the Models of Consciousness 6 (MoC6) conference (https://amcs-community.org/moc6-schedule-information/#abstract-36)Francesco Lässighttp://arxiv.org/abs/2606.08202v1Vector Space of Cycles2026-06-06T14:40:32ZMost statistical and machine learning methods for directed interactions focus on pairwise effects among variables. Even existing cyclic models represent feedback primarily through node-level dependencies, making large-scale recurrent organization difficult to estimate and compare. This limitation is particularly acute in biological and neural systems, where interactions are highly recurrent and involve many overlapping cycles. We introduce a variational framework for statistical inference on cyclic interactions. Directed interactions are represented as edge flows on a simplicial complex and evolved under an energy-minimizing dynamical system. The resulting dynamics separate transient interaction components from persistent harmonic flows, yielding a low-dimensional cycle space that captures stable recurrent organization. Rather than enumerating individual cycles, the proposed framework represents cyclic interactions as elements of a Hilbert space, enabling projection, averaging, comparison, and population-level statistical inference. We establish theoretical properties of the harmonic projection, including characterization of the cycle space, variance reduction, and population inference. Simulations demonstrate substantially improved recovery of cyclic structure in dense recurrent systems compared with existing directed-interaction methods. Applied to resting-state fMRI from 400 human subjects, the framework reveals reproducible large-scale cyclic organization that is not detectable through edgewise averaging. These results provide a scalable statistical framework for studying recurrent interactions in high-dimensional dynamical systems.2026-06-06T14:40:32ZMoo K. ChungAnass B. El-YaagoubiHernando Ombaohttp://arxiv.org/abs/2504.12310v2Reflective Empiricism: Bias Reflection and Introspection as a Scientific Method2026-06-06T12:05:55ZThis paper introduces Reflective Empiricism, an extension of empirical science that incorporates subjective perception and consciousness processes as equally valid sources of knowledge. It views reality as an interplay of subjective experience and objective laws, comprehensible only through systematic introspection, bias reflection, and premise-based logical-explorative modeling. This approach overcomes paradigmatic blindness arising from unreflected subjective filters in established paradigms, promoting an adaptable science. Innovations include a method for bias recognition, premise-based models grounded in observed phenomena to unlock new conceptual spaces, and Eureka moments - intuitive insights - as starting points for hypotheses, subsequently tested empirically. The author's self-observation, such as analyzing belief formation, demonstrates its application and transformative power. Rooted in philosophical and scientific-historical references (e.g., Archimedes' intuition, quantum observer effect), Reflective Empiricism connects physics, psychology, and philosophy, enhancing interdisciplinary synthesis and accelerating knowledge creation by leveraging anomalies and subjective depth. It does not seek to replace empirical research but to enrich it, enabling a more holistic approach to phenomena that have not yet been fully grasped. A subsequent body of the author's work is presented as case studies demonstrating the application of the method introduced here.2025-04-07T08:36:26Z21 pages, 0 figures. Version 2 added Section 10 with case studies illustrating the application of the reflective method to subsequent works in epistemology, cognitive psychology, emergent ethics, and foundational physicsOliver Marc Wittwerhttp://arxiv.org/abs/2506.19094v5Accurate identification of communication between multiple interacting neural populations2026-06-06T00:11:40ZNeural recording technologies now enable simultaneous recording of population activity across many brain regions, motivating the development of data-driven models of inter-regional communication. However, existing models can struggle to disentangle the influences that drive recorded population activity, leading to inaccurate portraits of communication. Here, we introduce Multi-Region Latent Factor Analysis via Dynamical Systems (MR-LFADS), a sequential variational autoencoder designed to disentangle inter-regional communication, inputs from unobserved regions, and local neural population dynamics. We show that MR-LFADS outperforms existing approaches at identifying communication across dozens of simulations of task-trained multi-region networks. When applied to large-scale electrophysiology, MR-LFADS predicts brain-wide effects of circuit perturbations that were held out during model fitting. These validations on synthetic and real neural data position MR-LFADS as a promising tool for discovering principles of brain-wide information processing.2025-06-23T20:15:29ZForty-second International Conference on Machine Learning (2025)Belle LiuJacob SacksMatthew D. Golubhttp://arxiv.org/abs/2603.23082v2Spatial navigation in preclinical Alzheimer's disease: A review2026-06-05T20:06:49ZAlzheimer's disease (AD) develops over a prolonged preclinical phase, during which neuropathological changes accumulate long before cognitive symptoms appear. Identifying cognitive functions affected at early stages is critical for the preclinical detection of asymptomatic individuals at-risk of AD. Early risk identification could enable timely interventions aimed at mitigating the development of significant future cognitive impairment. While episodic memory decline typically appears after substantial medial temporal lobe damage, spatial navigation has emerged as a particularly sensitive cognitive function in preclinical AD. In this review, we provide an overview of spatial navigation computations and the tasks used to assess them, highlighting how spatial navigation relies on neural circuits corresponding to the earliest sites of AD pathology. We synthesize evidence from cognitively unimpaired individuals with AD biomarkers, i.e. individuals at-risk of AD, and discuss future research directions. Overall, performance on spatial navigation tasks, particularly path integration and wayfinding, correlates with plasma and CSF biomarkers of AD pathology, notably p-tau. Spatial navigation assessment can represent a sensitive and scalable approach for early detection of individuals at-risk of AD in preclinical stages, and will inform future interventions to mitigate the progression toward clinically significant cognitive impairment.2026-03-24T11:25:52ZSyrine SalouhouVictor GillesRemi ValléeGillian T. CoughlanRomain BacheletMichael HornbergerHugo SpiersAntoine CoutrotAntoine Garnier-Crussardhttp://arxiv.org/abs/2606.07798v1Reconstructing and forecasting disease trajectories of patients with Alzheimer's disease using routine data in resource-constrained settings2026-06-05T19:23:56ZAlzheimer's disease is a progressive neurodegenerative disorder, and its progression varies substantially across patients. Existing work aims to forecast patients' future cognitive state, with minimal focus on reconstructing the state from past visits. Furthermore, in current research, quantifying predictive uncertainty remains underexplored and relies on costly modalities such as MRI, PET, and CSF, limiting their deployment in resource-limited settings. In this research, our primary objectives are: First, bidirectional prediction of cognitive scores from irregular visits to present the complete disease trajectory. Second, to enable interpolation and extrapolation capabilities to assist clinicians in informed prognostic decision making, and third, to provide a well-calibrated uncertainty estimate for all predictions, and finally, to achieve the objectives using the modalities available during routine visits. We propose a unified framework, GNOVA: A GRU-Neural ODE Variational Autoencoder. The architecture combines a Gated Recurrent Unit encoder and a Neural ODE decoder within a variational autoencoder framework. In our work, we forecast the CDR-SB and MMSE Scores. The GRU encoder allows for any number of inputs at any time point. The Neural-ODE decoder performs continuous estimation, allowing interpolation and extrapolation at any desired time point. The Variational autoencoder allows for uncertainty estimation in predictions. We worked with 1,727 patients from the ADNI dataset over 10 years; the model achieved mean absolute errors of 1.35 and 2.28 for CDR-SB and MMSE scores, respectively, without requiring any neuroimaging or biomarker data. Feature-ablation studies revealed that age, BMI, and APOE4 status were strong predictors. The proposed framework enables the reconstruction of incomplete patient histories and the anticipation of future cognitive states.2026-06-05T19:23:56ZRatnadeep DasAtri ChatterjeeSitikantha Royhttp://arxiv.org/abs/2606.11245v1Position: Hippocampal Explicit Memory Is the Cornerstone for AGI2026-06-05T17:40:34ZLarge Language Models (LLMs) have demonstrated remarkable capabilities across various tasks, raising expectations for Artificial General Intelligence (AGI). This position paper argues that integrating explicit memory is the cornerstone for advancing LLMs toward AGI. The key reason is that the underlying learning mechanism of LLMs is highly analogous to human implicit memory. However, higher-order cognitive functions necessary for AGI, such as long-term strategic planning, metacognition, and symbolic reasoning, heavily rely on hippocampal explicit memory and cannot arise solely from implicit statistical learning. Drawing on findings from neuroscience, I advance this perspective and complement it with computational requirements for artificial explicit memory systems, hoping to foster further research and lay the groundwork for explicit memory integration.2026-06-05T17:40:34ZAccepted to ICML 2026 (Position Paper Track)Sangjun Parkhttp://arxiv.org/abs/2606.07336v1Fixed point compositionality via low-rank gluing rules in inhibition-dominated threshold-linear networks2026-06-05T14:49:30ZBrains routinely generate highly flexible and complex behaviors on a relatively stable structure and limited resources. A key mechanism underlying this ability is compositionality, which allows the brain to efficiently decompose complex tasks into simpler, reusable primitives. While network modularity has often been linked to compositionality in biological and artificial networks, a rigorous mathematical characterization of this relationship in nonlinear networks is still lacking. In this work, we formally investigate how structural modularity supports functional compositionality in inhibition-dominated threshold-linear networks (TLNs). We introduce a novel class of modular network assembly called low-rank gluings, where component subnetworks with arbitrary internal connectivity are connected via specific low-rank couplings. We prove that the global fixed points of these networks are constrained to be combinations of the local fixed points of their constituent modules. For a more structured subclass, called rank-1 gluings, we provide a complete characterization that determines which combinations of local fixed points yield global ones. We apply these results to graph-based networks, extending fixed point decomposition rules from combinatorial threshold-linear networks (CTLNs) to the more flexible family of generalized CTLNs (gCTLNs), thereby proving that these structural rules are more robust than initially posited. Finally, we demonstrate that these gluing rules provide a mathematically tractable recipe for engineering compositional dynamics, enabling the construction of networks with a combinatorially large repertoire of predictable attractors that can be understood from simpler component motifs, ranging from compositions of fixed points to compositional limit cycles.2026-06-05T14:49:30Z39 pages, 18 figuresJuliana Londono Alvarezhttp://arxiv.org/abs/2602.09997v2Popularity Feedback Constrains Innovation in Cultural Markets2026-06-05T14:05:49ZReal-world creative processes ranging from art to science rely on social feedback-loops between selection and creation. Yet, the effects of popularity feedback on collective creativity remain poorly understood. We investigate how popularity ratings influence cultural dynamics in a large-scale online experiment where participants ($N = 1\,008$) iteratively \textit{select} images from evolving markets and \textit{produce} their own modifications. Results show that exposing the popularity of images reduces cultural diversity and slows innovation, delaying aesthetic improvements. Popularity feedback is associated with changes to both selection and creative stages. During selection, popularity information triggers cumulative advantage, with participants preferentially building upon popular images, reducing diversity. During creation, participants make less disruptive changes, and are more likely to expand existing visual patterns. Feedback loops in cultural markets thus not only shape selection, but also, directly or indirectly, the form and direction of cultural innovation.2026-02-10T17:20:40ZLucas GautheronRaja MarjiehDalton C. ConleySeth FreyHannah RubinMike D. SchneiderOfer TchernichovskiNori Jacobyhttp://arxiv.org/abs/2606.06647v1The Identity Trap in EEG Foundation Models: A Diagnostic Audit2026-06-04T18:54:34ZObjective. EEG foundation models (FMs) report strong accuracy on clinical resting-state EEG. However, high accuracy under subject-disjoint cross-validation remains ambiguous: it can reflect a genuine clinical biomarker, or subject-identity features that correlate with the label. We name this the Identity Trap and ask whether it can be diagnosed at the representation level before fine-tuning.
Approach. We propose FMScope, a frozen-representation protocol packaging five diagnostics: variance decomposition, subject-axis erasure, aperiodic 1/f ablation, layer-wise label probing, and within-subject direction consistency. We apply it to three pretrained FMs (LaBraM, CBraMod, REVE) across four datasets in a 2x2 layout: subject relation of label x presence of a consensus cross-subject EEG marker.
Main results. (i) The Identity Trap is universal: frozen subject-variance is 13-89x a random null in 12/12 pairs, rising in all 12 under fine-tuning (+10 to +63 pp). This dominance is a removable linear axis: erasing it improves label decoding where the label varies within subject (+6 to +12 pp in primary cells; +4 to +27 pp across external cohorts). (ii) Aperiodic 1/f is one subject carrier: removing it drops the subject probe by 9-19 pp on LaBraM and CBraMod. REVE saturates subject identity without measurable aperiodic dependence. (iii) Fine-tuning amplifies label-variance only in cells with a literature-established cross-subject marker.
Significance. The Identity Trap is a physically-grounded instance of shortcut learning: the preferred cue has a measurable physiological component, and subject-disjoint splitting alone cannot rule it out. FMScope separates gains reflecting a biological marker from those reflecting subject identity.2026-06-04T18:54:34Z28 pages, 6 figures, 8 tables. Code available at https://github.com/Jimmy110101013/fmscopeJun-You LinYing Choon WuTzyy-Ping Junghttp://arxiv.org/abs/2602.00163v2Deep Learning Pose Estimation for Multi-Label Recognition of Combined Hyperkinetic Movement Disorders2026-06-04T18:17:28ZHyperkinetic movement disorders (HMDs) such as dystonia, tremor, chorea, myoclonus, and tics are disabling motor manifestations across childhood and adulthood. Their fluctuating, intermittent, and frequently co-occurring expressions hinder clinical recognition and longitudinal monitoring, which remain largely subjective and vulnerable to inter-rater variability. Objective and scalable methods to distinguish overlapping HMD phenotypes from routine clinical videos are still lacking. Here, we developed a pose-based machine-learning framework that converts standard outpatient videos into anatomically meaningful keypoint time series and computes kinematic descriptors spanning statistical, temporal, spectral, and higher-order irregularity-complexity features.2026-01-29T21:55:48ZLaura CifDiane DemaillyGabriella A. HorvàthJuan Dario Ortigoza EscobarNathalie DorisonMayté Castro JiménezCécile A. HubschThomas WirthGun-Marie HarizSophie HubyMorgan DornadicZohra SoueiMuhammad Mushhood Ur RehmanSimone HemmMehdi BoulaymeEduardo M. MoraudJocelyne BlochXavier Vasqueshttp://arxiv.org/abs/2606.07674v1Simultaneous hyperkinetic movement disorders phenotyping: a cross-cohort pediatric transfer study using routine videos, markerless pose estimation and a tabular foundation model2026-06-04T18:13:13ZObjective: To develop and externally test a video-based framework for simultaneous detection of hyperkinetic MDs phenomenologies: dystonia, tremor, myoclonus, chorea, athetosis, ballismus, stereotypies, and tics using routine clinical recordings, with explicit testing of external, cross-cohort transfer from adult to pediatric populations. Methods: In this proof-of-concept study, the framework combines markerless pose estimation, kinematic descriptors, and a pretrained fondation model. A shared predictive backbone was developed on 21 adults with confirmed hyperkinetic MDs and 4 healthy controls assessed under a standardized protocol. External validation was performed on an independent external cohort: a real-world pediatric sample (n=12, monogenic combined MDs). For the external dataset, the backbone was deployed without retraining; lightweight calibration adjusted only the final subject-level decision step using a small labeled subset of patients selected by clinicians as representative of the cohort's phenotypic range. Results: After local calibration of the decision layer on the clinician-selected subset, performance improved consistently on the held-out pediatric patients (n=7): Hamming accuracy rose from 0.804 to 0.839 and the Jaccard index from 0.548 to 0.633. This calibrated performance was preserved, and the Jaccard index further improved, when the evaluation was restricted to the phenomenologies with more definite clinician agreement (Hamming accuracy 0.9, Jaccard index 0.786), indicating that the gains did not rest on the least-reliable labels.2026-06-04T18:13:13ZLaura CifDiane DemaillyZohra SoueiMuhammad Mushhood Ur RehmanJuan Dario Ortigoza EscobarMayté Castro JiménezCécile A. HubschSophie HubyMorgan DornadicGun-Marie HarizEduardo M. MoraudJocelyne BlochGabriella A. HorvathXavier Vasqueshttp://arxiv.org/abs/2606.06424v1Intrinsic Computational Functionalism: From Observer-Relative Maps to Observer-Independent Structures2026-06-04T17:28:43ZAnti-computational arguments show that externally imposed computational interpretations cannot ground consciousness, but they do not establish that all computational organisations are observer-relative. We develop intrinsic computational functionalism: the view that, if consciousness is computationally constituted, it depends on physically realised computational structures the system has in virtue of itself rather than on labels imposed by an external interpreter. Two criteria operationalise this view. (C1) System-intrinsic instantiation: the relevant property must be specifiable without an observer's labelling, and invariant under structure-preserving relabellings of the system's variables. (C2) Causal-dynamical organisation under intervention: the property must be grounded in a state-space structure whose variables mutually constrain one another, and whose organisation is exhibited in counterfactual response under intervention. Together these criteria specify what any candidate computational account must satisfy to remain observer-independent, without selecting which intrinsic structures bear on experience. The argumentative core is a three-tier decomposition of identification work: interpreter-relative label selection (tier i), theoretically constrained partition selection (tier ii), and dynamics-internal grain selection (tier iii). We argue that any computational property capable of avoiding the observer-relativity objection must be identified, if at all, through tier (iii) dynamics-internal grain selection, conditional on empirically disciplined tier (ii) choices. Syntax-is-not-semantics arguments, mapmaker arguments, and the observer-relativity component of biological-naturalist objections succeed against views that locate the consciousness-relevant property at tier (i); once the tiers are distinguished, intrinsic computational functionalism survives.2026-06-04T17:28:43Z23 pages, no figures. Shuqin Ma and Ryota Kanai contributed equally (joint first authors)Shuqin MaSchool of Philosophy, Fudan University, Shanghai, ChinaSussex Centre for Consciousness Science, University of Sussex, United KingdomRyota KanaiAraya Inc., Tokyo, Japanhttp://arxiv.org/abs/2606.06345v1Boosting Brain-to-Image Decoding with TRIBE v2 Data Augmentation2026-06-04T16:18:08ZBrain decoding is limited by the availability of labeled neural data, and remains challenging in low-data regimes. To address this issue, we investigate whether and when brain decoding can be boosted by augmenting small fMRI datasets with synthetic data generated by a pretrained model of fMRI responses to stimuli. We use TRIBE v2, a large encoding model pretrained on more than 1000 hours of fMRI responses to video, audio and language. For each dataset, we evaluate systematic grids that show how the performance of image decoders varies with the amount of synthetic data used for training. Our results, based on two datasets (the 7T fMRI Natural Scenes Dataset and 3T fMRI BOLD5000), show up to 68% improvement in Top-10 image-retrieval accuracy compared to decoders trained only on real data. Importantly, the proportion of augmented data required to reach a given image decoding performance needs to be adjusted depending on the data source. Surprisingly, image decoders trained exclusively on synthetic fMRI can perform above chance in some settings, suggesting that TRIBE v2 can support zero-shot brain-to-image decoding. Together, these results show how large-scale models of the fMRI responses to sight, sound and language may provide a foundation to improve the data efficiency for image decoding.2026-06-04T16:18:08ZYohann BenchetritMarlène CareilSimon DahanHubert BanvilleStéphane d'AscoliJean-Rémi King