https://arxiv.org/api/uxKfXehGiagy5IKJwIyGWQp+XkI
2026-06-18T07:33:09Z
2898325515

http://arxiv.org/abs/2407.10883v2
Data Want to be Free: An Innovation Resistance Theory Model for Identifying Barriers to Government Data Sharing
2026-06-06T10:05:07Z
Data sharing is increasingly essential for digital government and data-driven innovation, yet many public organizations remain reluctant to make their data openly available. While prior research has examined factors influencing open data adoption, little theoretical work explores why resistance persists within public agencies. This study develops an Innovation Resistance Theory (IRT) model tailored to government data sharing to identify predictors of organizational resistance. An initial model was derived from literature and refined through interviews with 21 public organizations across six European countries. The resulting IRT4DS model identifies 39 barriers spanning usage, value, risk, tradition, and image dimensions, and 23 countermeasures mapped to the most critical barriers and the actors responsible for addressing them. By extending IRT into the context of governmental data sharing, the study advances theoretical understanding of why public data often remains closed and provides actionable guidance for policymakers seeking to design enabling data ecosystems and reduce structural and cultural barriers to OGD adoption.
2024-07-15T16:35:38Z
Anastasija Nikiforova, Antoine Clarinval, Anneke Zuiderwijk, Daniel Rudmark, Petar Milic, Charalampos Alexopoulos, Katrin Rajamäe-Soosaar

http://arxiv.org/abs/2606.08076v1
"I understand your perspective": LLM Persuasion and Sycophancy through the Lens of Communicative Action Theory
2026-06-06T09:54:31Z
Large Language Models (LLMs) can generate high-quality arguments, yet their ability to engage in nuanced and persuasive communicative actions remains largely unexplored. This work explores the persuasive potential of LLMs through the framework of Jürgen Habermas' Theory of Communicative Action.
It examines whether LLMs express illocutionary intent (i.e., pragmatic functions of language such as conveying knowledge, building trust, or signaling similarity) in ways that are comparable to human communication. We simulate online discussions between opinion holders and LLMs using conversations from the persuasive subreddit ChangeMyView. We then compare the likelihood of illocutionary intents in human-written and LLM-generated counter-arguments, specifically those that successfully changed the original poster's view. We find that all three LLMs effectively convey illocutionary intent -- often more so than humans -- potentially increasing their anthropomorphism. Further, LLMs craft sycophantic responses that closely align with the opinion holder's intent, a strategy strongly associated with opinion change. Finally, crowd-sourced workers find LLM-generated counter-arguments more agreeable and consistently prefer them over human-written ones. These findings suggest that LLMs' persuasive power extends beyond merely generating high-quality arguments. On the contrary, training LLMs with human preferences effectively tunes them to mirror human communication patterns, particularly nuanced communicative actions, potentially increasing individuals' susceptibility to their influence.
2026-06-06T09:54:31Z
Findings of the Association for Computational Linguistics: ACL 2025
Esra Dönmez, Agnieszka Falenska
10.18653/v1/2025.findings-acl.793

http://arxiv.org/abs/2604.06278v4
Predictive Volatility of Machine Learning in Micro-Samples: A Regularised Assessment of Regional Poverty
2026-06-06T07:22:29Z
Small regional datasets pose a dual statistical problem: correlated predictors inflate estimation variance, while flexible learners can become unstable because the available information per adaptive degree of freedom is limited. We examine this issue through predictive volatility, defined as the cross-sample dispersion and upper-tail behaviour of out-of-sample loss.
Using simulation evidence reported for sparse linear, near-linear and heavy-tailed settings, we compare ordinary least squares, frequentist penalties, Bayesian shrinkage models, bounded-response and spatial specifications, and flexible machine-learning procedures. In the reported simulation results, regularised linear estimators generally dominate in the linear high-collinearity micro-sample settings and remain the most reliable overall, whereas tree-based methods become more competitive only when the signal is weakly nonlinear and the sample size is larger. In the empirical application to 34 Indonesian provinces, ridge yields the best leave-one-out performance, followed by elastic net and lasso. Across the Bayesian shrinkage specifications, ICT skills show the most consistent negative association with poverty, with the strongest support under horseshoe and spike-and-slab formulations. These results suggest that, in micro-sample regional modelling, the main constraint is limited information per effective degree of freedom rather than insufficient algorithmic flexibility.
2026-04-07T09:41:12Z
Corrections are needed
A. H. Jamaluddin, A. T. R. Dani, N. I. Mahat, V. Ratnasari, S. S. M. Fauzi

http://arxiv.org/abs/2604.02720v3
Cognitive Comparability and the Limits of Governance: Evaluating Authority Under Radical Capability Asymmetry
2026-06-06T04:29:46Z
Governance theory presupposes a rough cognitive comparability between governors and governed. This paper makes that assumption explicit and testable through a six-dimension evaluation framework covering legitimacy, accountability, corrigibility, non-domination, subsidiarity, and institutional resilience, drawn from political legitimacy theory, principal-agent models, republican theory, and the AI alignment literature.
The framework is first demonstrated on existing non-majoritarian institutions, where capability asymmetry is real but bounded, and then applied to a prospective case of bounded superintelligent authority, where the asymmetry is radical. Four of six dimensions show structural failures. Two of the four appear tractable to institutional design (subsidiarity scope limitation and institutional resilience). The other two, the public reason problem under cognitive incomprehensibility and the non-domination problem under permanent capability asymmetry, call for new normative theory rather than better institutional design. The analysis also finds that dimensions which operate as independent checks under bounded asymmetry begin to degrade together under radical asymmetry, because each depends on the same oversight capacity. The assumptions that allowed these checks to remain independent have gone unexamined so far because they have always held.
2026-04-03T04:26:18Z
20 pages, 2 tables. Interdisciplinary paper on AI governance and political theory
Tony Rost

http://arxiv.org/abs/2606.07948v1
EduMirror: Modeling Educational Social Dynamics with Value-driven Multi-agent Simulation
2026-06-06T02:38:30Z
Understanding how educational social dynamics evolve is critical for informing effective educational policies and counterfactual interventions. However, traditional methods face a fundamental dilemma: observational studies often lack causal power, while controlled experiments are frequently constrained by ethical concerns. Although LLM-based multi-agent simulations offer a scalable in silico alternative, existing approaches remain limited by weak psychological grounding and insufficient measurement of latent psychological states. To address this, we introduce EduMirror, a multi-agent simulator for the scientific study of educational social dynamics.
We provide configurable education-oriented agent forms, including value-driven agents grounded in psychological needs and social value orientation, together with a dual-track measurement protocol for quantifying observable behaviors and latent psychological states. We validate the realism and usability of EduMirror through case studies on school bullying and group cooperation, as well as broader evaluations across diverse educational scenarios. The results show that EduMirror generates educational social dynamics that are realistic, theory-consistent, and measurable by empirical criteria. These properties enable structured in silico educational research, providing a computational tool for hypothesis testing and counterfactual intervention analysis in educational science. Project page: https://edumirror.net.
2026-06-06T02:38:30Z
ICML 2026
Jingzhe Lin, Hengbin Yu, Yongdan Zeng, Fangwei Zhong

http://arxiv.org/abs/2606.07939v1
Stable Geometry, Reversing Poles: The Bipolar Structure of AI Occupational Substitutability and Its Decade-Scale Inversion
2026-06-06T02:00:32Z
Empirical research on the labor-market impact of artificial intelligence has converged, since Frey and Osborne (2017), on a continuous-gradient representation in which each occupation is assigned a real-valued exposure score on [0,1] obtained by linear aggregation across capability dimensions. This continuity is rarely articulated as an assumption and has not been tested at the micro-action level where substitution actually occurs. We decompose 1,961 O*NET Detailed Work Activities into 15,817 micro-actions using a multi-agent LLM pipeline with 31-expert HITL calibration, then project the DWA-level Occupational Automation Index from our prior work onto a 7-macro semantic typology. The result is a bipolar structure. Tool-Mediated Physical (M2, mean OAI = 0.054) and Planning & Design (M7, mean OAI = 0.499) form two extremes separated by Cohen's d = 2.41 (H = 172.88, p = 6.21e-34).
The geometry is robust under three independent stress tests: resolution (K=7 to K=15, polar gap widens from 0.45 to 0.57), encoder swap to BGE (LLM-class OAI lead replicates at 3.37x), and Eloundou's GPT-4 task ratings (DWA-level rho = 0.635). The six middle macros form a low-contrast band between the poles (TOST at d=0.2 admits only 1/15 pairs as equivalent), not a flat plain. The geometry's stability does not, however, extend to its content. Across a decade, the polarity has inverted. Frey-Osborne (2013) placed Tool-Mediated Physical near the highest computerisation risk and Planning & Design near the lowest; our LLM-era OAI reverses that order, with macro-level FO-Eloundou Spearman rho = -0.750, p = 0.020, against the original Oxford Martin appendix. Which pole is high is therefore contingent on the era's dominant capability frontier, while the stable geometry itself is the structurally robust object.
2026-06-06T02:00:32Z
57 pages, 13 figures, 10 tables. Companion paper to arXiv:2604.04464 (Gao & Huang 2026). Code and data: https://github.com/ShuyaoGao/bipolar-action-substrate
Shuyao Gao (aSSIST University, Seoul, South Korea), Minghao Huang (aSSIST University, Seoul, South Korea)

http://arxiv.org/abs/2605.01616v2
Learning Behavioral Signals from Encrypted Smartphone Network Traffic
2026-06-05T22:55:35Z
Human behavior is challenging to measure continuously at scale, yet traces of daily routines and well-being may be reflected in interactions with personal devices. We investigate whether encrypted smartphone network traffic can serve as a passive sensing signal for behavioral states related to sleep disturbance, stress, and loneliness. To capture both population-level patterns and individual-specific behavior, we employ a transformer-based model with user-specific adapters that learns representations of network activity while accounting for personal baselines and deviations from them.
To improve interpretability, we further analyze these representations using sparse representation learning to identify latent behavioral features associated with distinct activity patterns. We relate the resulting features to sleep disturbance, stress, and loneliness using generalized estimating equations with Mundlak decomposition, enabling separation of stable between-person differences from within-person changes over time. Our analysis reveals that the three outcomes are characterized by different temporal dynamics: stress is predominantly associated with persistent between-person variation, loneliness is more strongly linked to within-person fluctuations, and sleep disturbance reflects a combination of both. Importantly, these within-person behavioral signals are not recovered by conventional handcrafted network-traffic features, highlighting the advantages of learned representations for longitudinal behavioral modeling. Overall, our findings demonstrate that encrypted network traffic contains interpretable behavioral information and can support passive, scalable monitoring of behavioral dynamics, particularly changes relative to an individual's typical pattern of activity.
2026-05-02T21:40:07Z
19 pages, 6 figures
Rameen Mahmood, Omar El Shahawy, Souptik Barua, Zachary Beattie, Jeffrey Kaye, Xuhai "Orson" Xu, Chao-Yi Wu, Danny Yuxing Huang

http://arxiv.org/abs/2606.04819v2
The Usefulness Gap in Proof-of-Useful-Work: An Empirical Study of Pearl's cuPOW Protocol
2026-06-05T21:17:13Z
Pearl, a Layer-1 blockchain with high-profile AI industry endorsements, markets its Proof-of-Useful-Work (PoUW) protocol as simultaneously securing the network and performing AI inference. We present the first systematic empirical measurement of a deployed PoUW system, finding that Pearl's 24 EH/s network -- representing approximately 320,000 GPU-equivalents consuming an estimated 112 MW -- produces zero useful AI computation.
Budget GPU rental prices rose 38% and utilization surged from 57% to 94% following the mining software's public release, displacing legitimate research workloads.
Our measurements span five dimensions: (1) network composition analysis of 8,012 workers shows all have inference-capable hardware, yet the dominant mining software contains no inference code; (2) the verification protocol accepts random matrices by design, confirmed by 44 pool-accepted shares from our open-source miner across NVIDIA, AMD, CPU, and Apple Silicon hardware; (3) statistical distribution checks are trivially defeated by adversarial Gaussian sampling; (4) mining economics are marginal at current PRL prices ($0.76), with ROI ranging from -1% to +67% depending on GPU tier -- near breakeven for most hardware; and (5) the mining computation is commodity integer arithmetic portable to any hardware platform, offering no vendor lock-in. These findings quantify the verifiability-usefulness tension identified theoretically by Leinweber et al., providing concrete measurements of its magnitude and economic consequences in a deployed system.
2026-06-03T12:42:29Z
Abhinaba Basu

http://arxiv.org/abs/2606.07802v1
Memetic Capture: A Pluralistic Policy Framework for Governing AI-Driven Cultural Disempowerment
2026-06-05T19:32:36Z
Culture is the most insidious vector of gradual human disempowerment by AI: unlike economic or political displacement, cultural displacement attacks the very preferences and values through which humans recognise and resist disempowerment itself. We argue that existing AI governance frameworks suffer from a critical blind spot by treating cultural impact as secondary to economic and safety concerns. This paper develops memetic capture as a unifying concept for AI-driven cultural disempowerment, and proposes the Cultural Pluralistic Governance Framework (CPGF), a four-tier policy architecture combining quantitative cultural influence metrics, democratic value assemblies, pluralistic deployment standards, and transnational coordination mechanisms.
We argue that pluralism is not merely an ethical requirement for such governance but a structural necessity: monocultural AI governance accelerates the very disempowerment it claims to prevent. We identify concrete policy levers, discuss implementation tensions, and outline a research agenda at the intersection of pluralistic alignment and cultural AI governance.
2026-06-05T19:32:36Z
Paper accepted in Pluralistic Alignment Workshop at ICML 2026
Subramanyam Sahoo

http://arxiv.org/abs/2604.04464v2
Bounded by Risk, Not Capability: Quantifying AI Occupational Substitution Rates via a Tech-Risk Dual-Factor Model
2026-06-05T19:13:05Z
The deployment of Large Language Models (LLMs) has ignited concerns about technological unemployment. Existing task-based evaluations predominantly measure theoretical "exposure" to AI capabilities, ignoring critical frictions of real-world commercial adoption: liability, compliance, and physical safety. We argue occupations are not eradicated instantaneously, but gradually encroached upon via atomic actions. We introduce a Tech-Risk Dual-Factor Model to re-evaluate this. By deconstructing 923 occupations into 2,087 Detailed Work Activities (DWAs), we utilize a multi-agent LLM ensemble to score both technical feasibility and business risk. Through variance-based Human-in-the-Loop (HITL) validation with an expert panel, we demonstrate a profound cognitive gap: isolated algorithmic probabilities fail to encapsulate the "institutional premium" imposed by experts bounded by professional liability. Applying a strictly algorithmic baseline via mathematical bottleneck aggregation, we calculate Relative Occupational Automation Indices ($OAI$) for the U.S. labor market. Our findings challenge the traditional Routine-Biased Technological Change (RBTC) hypothesis. Non-routine cognitive roles highly dependent on symbolic manipulation (e.g., Data Scientists) face unprecedented exposure ($OAI \approx 0.70$).
Conversely, unstructured physical trades and high-stakes caretaking roles exhibit absolute resilience, quantifying a profound "Cognitive Risk Asymmetry." We hypothesize the emergent necessity of a "Compliance Premium," indicating wage resilience increasingly tied to risk-absorption capacity. We frame these findings as a cross-sectional diagnostic of systemic vulnerability, establishing a foundation for subsequent Computable General Equilibrium (CGE) econometric modeling involving dynamic wage elasticity and structural labor reallocation.
2026-04-06T06:21:08Z
32 pages, 4 figures. v2: added link to the reproducibility repository (https://github.com/ShuyaoGao/bounded-risk-oai), updated author email, and updated several references
Shuyao Gao (aSSIST University, Seoul, South Korea), Minghao Huang (aSSIST University, Seoul, South Korea)

http://arxiv.org/abs/2606.07764v1
Reimagining Open Source and Openness in AI: Co-Creating Responsible Technological Futures
2026-06-05T18:23:39Z
Debates over open source and openness in artificial intelligence have intensified as policymakers, researchers, and practitioners grapple with how foundation models should be developed and governed to balance innovation, accountability, and public interest. However, there has been limited empirical work examining how diverse stakeholders collectively understand and negotiate responsible openness in AI, particularly through participatory processes that extend beyond industry-led definitions and frameworks. This paper presents findings from a multi-sectoral workshop grounded in futures thinking and participatory design methods. The workshop generated co-created visions of desirable futures and the role of AI, alongside a set of action pathways and a research roadmap focused on responsible open source and openness in AI. This paper makes three key contributions. First, it empirically documents the co-created visions, actions, and research priorities.
Second, it identifies four core tensions that emerged as participants translated high-level aspirations into concrete actions, revealing conflicting interpretations of openness regarding its purpose (as an end or a means), its scope (expansion versus meaningful access), and its operation (mandatory versus conditional, sufficient versus dependent on governance and use). These tensions illustrate that responsible openness is not a singular technical solution, but a negotiated sociotechnical project shaped by values, positionalities, and priorities. Third, the paper advances methodological approaches in AI governance by demonstrating how participatory futures methods can surface plural visions, actions, and research priorities that extend beyond dominant, largely corporate, narratives, offering empirical insight into how openness, power, and accountability are negotiated in practice.
2026-06-05T18:23:39Z
To appear in the 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT '26)
Genevieve Smith, Hiral Patel, Steven Luo, Monica G. Bobra, Judy Brewer, Cathryn Carson, Isadora Cruxen, Shachee Doshi, Maximilian Gahntz, Nicholas Garcia, Natalia Luka, Meredith M. Lee, Min Kyung Lee, Woohyeuk Lee, Jarrod Millman, Ricardo Miron Torres, Chinasa T. Okolo, Cailean Osborne, Derek Slater, Katie Steen-James, Nikko Stevens, Jennifer Tridgell, David Gray Widder
10.1145/3805689.3806719

http://arxiv.org/abs/2606.07279v1
Detective scaffolding for within-session reasoning development: a three-phase framework evaluated in polymer engineering and pre-university outreach
2026-06-05T13:55:54Z
This paper presents a detective scaffolding framework -- a three-phase instructional sequence (Hypothesis Activation -> Evidence Structuring -> Causal Integration) in which engineering students investigate a realistic industrial defect scenario using staged in-class polls as designed evidence probes.
Unlike conventional uses of student response systems for engagement, the framework positions each poll as an Evidence-Centred Design instrument targeting a specific reasoning capability. In the primary implementation, 80 Year 3 polymer engineering students progressed from prior-knowledge-driven misconception (71% attributing defects to temperature) to complete root-cause convergence (100% identifying humidity; Fisher's exact test, $p < .001$) across four sequenced prompts within a single 90-minute lecture slot. A dual-accuracy analysis revealed that at one intermediate stage, textbook-correct and analytically valid responses diverged, illustrating why conventional scoring can misrepresent reasoning quality. In a transferability study, 26 Year 12 students with no engineering background achieved identical root-cause identification rates across two adapted scenarios, with significant gains in data-analysis confidence and AI explanation ability. The results suggest that the pedagogical structure, rather than disciplinary content, drives the convergence effect, implying portability across disciplines and educational levels.
2026-06-05T13:55:54Z
24 pages, 3 figures
Haolin Feng, Holly Barrett, Xinru Deng, Dimitrios G Papageorgiou, Yiwei Sun

http://arxiv.org/abs/2606.07270v1
Two-Phase Simulated Annealing for Equitable Team Formation: Eliminating Complaints in Large Engineering Cohorts
2026-06-05T13:44:35Z
Contribution: This paper presents a novel two-phase algorithmic approach that decouples preference satisfaction from fairness optimization in student team formation, achieving both objectives without compromise. The method applies simulated annealing -- a core materials science technique -- to an educational challenge, demonstrating pedagogical integration of administrative processes. Background: Forming effective teams in large engineering cohorts (100+ students) requires balancing student preferences, academic fairness, and demographic diversity.
Existing tools either optimize for fairness while ignoring preferences (CATME, Team-Anneal) or accommodate preferences while compromising balance (self-selection), leaving complaint rates at 5-35%. Intended Outcomes: Eliminate formal complaints, achieve near-zero GPA variance between teams, prevent gender isolation, and maintain high preference satisfaction while creating a scalable, reproducible solution applicable across engineering programs. Application Design: Phase 1 forms fixed triads through graph-theoretic clustering that maximizes mutual preferences, preserving social bonds. Phase 2 employs simulated annealing to pair triads into teams of six while optimizing GPA variance, gender balance, and size constraints. This decomposition mirrors hierarchical optimization in materials processing. Findings: Deployed across 238 students, the algorithm eliminated formal complaints entirely (vs. >30% baseline), achieved GPA variance of 0.005 (vs. historical mean 9.74), eliminated gender-isolated individuals, and maintained 94.3% preference satisfaction. Validation against 82 historical grouping instances (1,538 teams, 6 academic years) confirmed significant improvement over conventional methods.
2026-06-05T13:44:35Z
9 pages, 3 figures
Yiwei Sun, Xinru Deng, Dimitrios G Papageorgiou

http://arxiv.org/abs/2511.06080v4
AIDEN: Design and Pilot Study of an AI Assistant for the Visually Impaired
2026-06-05T13:26:09Z
This paper presents AIDEN, an artificial intelligence-based assistant designed to enhance the autonomy and daily quality of life of visually impaired individuals, who often struggle with object identification, text reading, and navigation in unfamiliar environments. Existing solutions such as screen readers or audio-based assistants facilitate access to information but frequently lead to auditory overload and raise privacy concerns in open environments.
AIDEN addresses these limitations with a hybrid architecture that integrates You Only Look Once (YOLO) for real-time object detection and a Large Language and Vision Assistant (LLaVA) for scene description and Optical Character Recognition (OCR). A key novelty of the system is a continuous haptic guidance mechanism based on a Geiger-counter metaphor, which supports object centering without occupying the auditory channel, while privacy is preserved by ensuring that no personal data are stored. Empirical evaluations with visually impaired participants assessed perceived ease of use and acceptance using the Technology Acceptance Model (TAM). Results indicate high user satisfaction, particularly regarding intuitiveness and perceived autonomy. Moreover, the "Find an Object" feature achieved effective real-time performance. These findings provide promising evidence that multimodal haptic-visual feedback can improve daily usability and independence compared to traditional audio-centric methods, motivating larger-scale clinical validations.
2025-11-08T17:23:51Z
Luis Marquez-Carpintero, Francisco Gomez-Donoso, Zuria Bauer, Bessie Dominguez-Dager, Alvaro Belmonte-Baeza, Mónica Pina-Navarro, Francisco Morillas-Espejo, Felix Escalona, Miguel Cazorla

http://arxiv.org/abs/2606.07245v1
AI Sovereignty: A Qualitative Model of Strategic Competition as AI Becomes an Instrument of National Power
2026-06-05T13:11:39Z
AI sovereignty is the extent to which a nation independently controls its artificial intelligence (AI) technologies. The race toward ever-more-sophisticated frontier AI models is of increasing strategic importance, with nations considering how AI might improve their economic situations, competitive advantage, and overall national power. However, the costs of AI sovereignty are enormous, and we lack definitions and conceptual models to navigate evolving AI sovereignty dynamics.
We address this gap with definitions relevant to AI sovereignty, along with a first-of-its-kind qualitative model that incorporates micro, meso, and macro contributors. Model-based qualitative forecasts highlight competitive dynamics and evolving potential for AI-driven national power. The model identifies key leverage points that nations can use to enhance their own growth or degrade an adversary's, including consideration of accelerators, electricity, water, data sets and skilled workforce. These leverage points can be activated at strategic and operational levels through both direct kinetic actions, such as Iran's targeting of data centers with drones, and indirect non-kinetic effects including cyber, space, information, economic coercion and diplomacy. If our assumptions and hypotheses are valid, this strategic competition may come to define how nations improve their economic situations, competitive advantage, and overall national power in the 21st Century.
2026-06-05T13:11:39Z
Main article: 19 pages, 10 figures. Supplementary: 19 pages, 7 figures, 7 tables. To be presented at the 2026 International System Dynamics Conference (ISDC), July 20-24, TU Delft, Delft, Netherlands
Timothy Clancy, Asmeret Naugle