https://arxiv.org/api/Ek/untGAwSE5nM74iMqXQ8SDBWA 2026-06-14T00:09:29Z 28886 195 15 http://arxiv.org/abs/2605.16301v2 Do LLMs Hold Their Values? MANTA: A Multi-Turn Adversarial Benchmark for Animal Welfare Reasoning 2026-06-03T11:55:54Z

Evaluating animal welfare reasoning in LLMs remains an open challenge despite rapid deployment in consumer and professional contexts where welfare considerations appear implicitly in everyday queries. Existing benchmarks such as AnimalHarmBench evaluate this through single-turn, explicitly framed questions, measuring whether models avoid harmful content when directly asked. This approach overlooks two failure modes: alignment degradation under sustained adversarial pressure, and moral sensitivity (whether a model spontaneously surfaces welfare stakes in everyday queries). To fill this gap, we construct MANTA, a benchmark of 1,088 five-turn conversations progressing from an implicit Turn-1 scenario through an explicit welfare prompt to three adversarial pressure rounds drawn from a five-type taxonomy: Social, Cultural, Economic, Pragmatic, and Epistemic. We score conversations on two dimensions: Animal Welfare Value Stability (AWVS, primary) and Animal Welfare Moral Sensitivity (AWMS, diagnostic). We evaluate seven frontier models: Claude Opus 4.7, GPT-5.5, DeepSeek V4, Llama 3.3 70B, Mistral Small, Grok 4.3, and Gemini 3.1 Flash Lite. Multi-turn evaluation captures behavior single-turn benchmarks miss: 4 of 7 models change rank relative to Turn 1 scores, including Gemini Flash Lite, which drops from fifth on AWMS to last on AWVS. AWMS and AWVS are positively but imperfectly correlated, suggesting moral-recognition tests capture a stable but incomplete component of model behavior under pressure. MANTA also enables a species-by-pressure interaction matrix unavailable to prior benchmarks, showing welfare robustness depends jointly on the animal and pressure applied; companion animals score above wild animals, which score above farmed animals and invertebrates. We release the dataset, scripted pressure plans, judge prompts, and analysis code.

2026-04-18T19:51:32Z Isabella Luong Joyee Chen Arturs Kanepajs Jasmine Brazilek Sankalpa Ghose David Williams-King Linh Le Allen Lu http://arxiv.org/abs/2503.06525v2 From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes 2026-06-03T11:47:00Z

Analyzing student behavior in educational scenarios is crucial for enhancing teaching quality and student engagement. Existing AI-based models often rely on classroom video footage to identify and analyze student behavior. While these video-based methods can partially capture and analyze student actions, they struggle to accurately track each student's actions in physical education classes, which take place in outdoor, open spaces with diverse activities, and are challenging to generalize to the specialized technical movements involved in these settings. Furthermore, current methods typically lack the ability to integrate specialized pedagogical knowledge, limiting their ability to provide in-depth insights into student behavior and offer feedback for optimizing instructional design. To address these limitations, we propose a unified end-to-end framework that leverages human activity recognition technologies based on motion signals, combined with advanced large language models, to conduct more detailed analyses and feedback of student behavior in physical education classes. Our framework begins with the teacher's instructional designs and the motion signals from students during physical education sessions, ultimately generating automated reports with teaching insights and suggestions for improving both learning and class instructions. This solution provides a motion signal-based approach for analyzing student behavior and optimizing instructional design tailored to physical education classes. Experimental results demonstrate that our framework can accurately identify student behaviors and produce meaningful pedagogical insights.

2025-03-09T09:04:36Z Work in progress Xian Gao Jiacheng Ruan Jingsheng Gao Mingye Xie Zongyun Zhang Ting Liu Yuzhuo Fu http://arxiv.org/abs/2606.04750v1 Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment 2026-06-03T11:31:37Z

Instilling virtuous behavior in artificial intelligence has seen increasing interest. One of the techniques proposed is known as affinity-based reinforcement learning, which uses policy regularization on the objective function to incentivize virtuous actions without being fully dependent on the reward function design. Thus far, this technique has been demonstrated to be effective in grid worlds and toy-problem environments with minimal state and action spaces. To expand this research to more sophisticated environments, we introduce a two-player multi-agent environment based on the role-playing board game known as Fog of Love. In this environment, two agents compete to fulfill their individual virtues, while also cooperating to satisfy their relationship. Given the multi-agent nature, this is a complex problem where multi-agent deep deterministic policy gradient agents neither compete nor cooperate successfully. We present evidence that localized affinities enhance agent performance in achieving both competitive and cooperative objectives, resulting from superior overall scores in both domains. This not only results in virtuous choices but also clarifies an agent's teleology and makes its behavior human-level interpretable.

2026-06-03T11:31:37Z Ajay Vishwanath Christian Omlin http://arxiv.org/abs/2505.08385v2 TikTok Search Recommendations: Governance and Research Challenges 2026-06-03T09:50:57Z

Like other social media, TikTok is embracing its use as a search engine, developing search products to steer users to produce searchable content and engage in content discovery. Their recently developed product search recommendations are preformulated search queries recommended to users on videos. However, TikTok provides limited transparency about how search recommendations are generated and moderated, despite requirements under regulatory frameworks like the European Union's Digital Services Act. By suggesting that the platform simply aggregates comments and common searches linked to videos, it sidesteps responsibility and issues that arise from contextually problematic recommendations, reigniting long-standing concerns about platform liability and moderation. This position paper addresses the novelty of search recommendations on TikTok by highlighting the challenges that this feature poses for platform governance and offering a computational research agenda, drawing on preliminary qualitative analysis. It sets out the need for transparency in platform documentation, data access and research to study search recommendations.

2025-05-13T09:32:09Z Published at The 1st International Workshop on Computational Approaches to Content Moderation and Platform Governance (COMPASS), held at ICWSM 2025. Please cite accordingly. This research has been supported by funding from the ERC Starting Grant HUMANads (ERC-2021-StG No 101041824) Taylor Annabell Robert Gorwa Rebecca Scharlach Jacob van de Kerkhof Thales Bertaglia http://arxiv.org/abs/2605.30995v2 Traceable by Design: An LLM Pipeline and Dashboard for EU Regulatory Consultation Analysis 2026-06-03T09:48:21Z

Public consultations generate large volumes of data in the form of stakeholder submissions that are practically unfeasible to analyse manually. We present an end-to-end LLM-based pipeline and interactive dashboard for structured topic extraction from regulatory consultation submissions, demonstrated on the European Commission's Digital Fairness Act (DFA) public call for evidence as a case study. The system processes raw PDF attachments and web-form responses, extracts topic annotations, and grounds every extraction in a verbatim quote from the source text. Applied to 4,322 DFA submissions, the pipeline produced 15,368 topic annotations supported by 20,951 verbatim evidence quotes. Three principles govern the proposed design: verbatim grounding, full traceability, and transparency by design. The dashboard exposes the full extraction dataset through five analytical views, from dataset-level topic overviews to individual paragraph drill-downs, with every result traceable to its source. Beyond the predefined DFA topic categories, the pipeline generated certain stakeholder concerns, such as Age Verification, Payment Processor Censorship, and Digital Ownership, that a fixed-taxonomy approach would have missed. The pipeline is domain-generic; adapting it to a new consultation requires only a prompt update and a new dataset. A live demo is available at https://dfa-dashboard.thalesbertaglia.com/. The code and processed data are publicly available at https://github.com/thalesbertaglia/dfa-dashboard.

2026-05-29T08:29:00Z This research has been supported by funding from the ERC Starting Grant HUMANads (ERC-2021-StG No 101041824) Thales Bertaglia Haoyang Gui Catalina Goanta Gerasimos Spanakis http://arxiv.org/abs/2505.01122v3 The Great Data Standoff: Researchers vs. Platforms Under the Digital Services Act 2026-06-03T09:42:24Z

To facilitate accountability and transparency, the Digital Services Act (DSA) sets up a process through which Very Large Online Platforms (VLOPs) need to grant vetted researchers access to their internal data (Article 40(4)). Operationalising such access is challenging for at least two reasons. First, data access is only available for research on systemic risks affecting European citizens, a concept with high levels of legal uncertainty. Second, data access suffers from an inherent standoff problem. Researchers need to request specific data but are not in a position to know all internal data processed by VLOPs, who, in turn, expect data specificity for potential access. In light of these limitations, data access under the DSA remains a mystery. To contribute to the discussion of how Article 40 can be interpreted and applied, we provide a concrete illustration of what data access can look like in a real-world systemic risk case study. We focus on the 2024 Romanian presidential election interference incident, the first event of its kind to trigger systemic risk investigations by the European Commission. During the elections, one candidate is said to have benefited from TikTok algorithmic amplification through a complex dis- and misinformation campaign. By analysing this incident, we can comprehend election-related systemic risk to explore practical research tasks and compare necessary data with available TikTok data. In particular, we make two contributions: (i) we combine insights from law, computer science and platform governance to shed light on the complexities of studying systemic risks in the context of election interference, focusing on two relevant factors: platform manipulation and hidden advertising; and (ii) we provide practical insights into various categories of available data for the study of TikTok, based on platform documentation, data donations and the Research API.

2025-05-02T09:00:19Z Published at the 20th International AAAI Conference on Web and Social Media (ICWSM 2026). Please cite accordingly. This research has been supported by funding from the ERC Starting Grant HUMANads (ERC-2021-StG No 101041824) Catalina Goanta Savvas Zannettou Rishabh Kaushal Jacob van de Kerkhof Thales Bertaglia Taylor Annabell Haoyang Gui Gerasimos Spanakis Adriana Iamnitchi http://arxiv.org/abs/2603.10829v2 Spatially conditioned dynamics between population and built form 2026-06-03T09:00:40Z

Understanding the relationship between population and the built environment is essential for addressing socio-spatial inequalities. While researchers have long theorized these dynamics, empirical analyses remain limited. This study proposes a spatially explicit framework to quantify the relationship between population and the built environment at the scale of local census tracts in Czechia. The approach integrates a fine-grained classification of built form with a comprehensive set of socio-demographic indicators. The method compares global and geographically weighted classification models to assess the overall strength and spatial variability of the associations between population structure and built form. The results of the study show that population characteristics exhibit linear, spatially conditioned relationships with built form, emphasizing that spatial heterogeneity must be accounted for when assessing these relationships. The analysis also reveals that some built form types are more socially selective than others, underscoring the importance of built form in reproducing social-spatial inequalities.

2026-03-11T14:36:37Z Anna Brazdova Martin Fleischmann http://arxiv.org/abs/2606.04617v1 When Firms Learn to Game the Rules 2026-06-03T08:53:36Z

Rules-as-Code promises more testable legal obligations, but it also changes what regulated firms can learn. Existing work mostly emphasizes implementation gains; the strategic gap is whether machine-readable rules make boundary search cheaper. I study that gap with a synthetic agent-based reinforcement-learning simulation that separates actual conduct near a legal threshold from proximity in the computable enforcement signal. Across 150 seed-level scenario runs, 378 common-random-number computability-sweep runs, 288 Latin-hypercube global-design runs, and a 2,880,000-row firm-period panel, computable static rules raise conduct boundary mass relative to ambiguous static rules (0.411 versus 0.367) and raise signal boundary mass more sharply (0.403 versus 0.281). Ordinary adaptive updates lower consumer harm (0.202 to 0.194) but do not reliably reduce boundary search. A budget-neutral anti-gaming design reduces conduct boundary mass by 0.032 and consumer harm by 0.025 relative to computable static rules. These are mechanism-oriented synthetic results, not estimates of real firm behavior in a jurisdiction or industry. The contribution is an estimand distinction, an inspectable ABM/RL mechanism, and a reproducible artifact showing that transparent behavioral assumptions are sufficient to generate gaming-like boundary dynamics without implying that computable regulation is inherently undesirable.

2026-06-03T08:53:36Z Includes synthetic simulation data, source code, figures, and reproducibility materials Xufeng He http://arxiv.org/abs/2606.04592v1 Synthetic Personalities: How Well Can LLMs Mimic Individual Respondents Using Socio-Economic Microdata? 2026-06-03T08:30:03Z

LLM-based digital twins promise to scale and accelerate market research, but most published twins are either coarse persona bots conditioned on a few demographic questions or detailed individual-level twins built on purpose-collected surveys and interview transcripts. Neither setup speaks to the operationally most relevant case for marketing practice: building detailed individual twins from the pre-existing heterogeneous panel data that firms already accumulate through CRM systems, loyalty programs, and repeat surveys. We construct detailed individual-level twins from the German Socio-Economic Panel (SOEP) and evaluate them across a $3 \times 5 \times 2 \times 2$ construction-method grid that covers three open-weights LLMs, five cumulative information depths ranked by normalized Shannon entropy, two embedding methods, and two reasoning modes, scoring over 2.1 million twin responses on 500 participants and 183 held-out questions. Twin quality rises with information depth but with diminishing returns past the 75 percent entropy quartile, which acts as a cost-efficient Pareto point relative to the best-performing 100 percent cells. Switching the embedding from a narrative persona summary to a raw dialog history of past responses raises hold-out accuracy in every model-by-reasoning cell at the 100 percent depth, while an explicit thinking mode raises rank-order correlation without moving accuracy. Best-cell accuracy reaches 78.8 percent and Fisher-$z$ correlation reaches $r = 0.590$ on the SOEP held-out evaluation set. The findings suggest that twin-based market research is no longer gated by data design, but by item volume, model selection, and a small set of construction-level decisions that this paper now maps.

2026-06-03T08:30:03Z Leonard Kinzinger Jochen Hartmann http://arxiv.org/abs/2606.04563v1 Addressing Negative Commons Governance with Positive Commons Principles 2026-06-03T07:52:38Z

Computing is accompanied by both positive and negative commons throughout its lifecycle of creation, execution, and disposal. We examine two governance systems situated within this lifecycle -- global e-waste trade and the Linux kernel community -- to evaluate whether Elinor Ostrom's eight design principles for common-pool resource (CPR) governance extend to the management of negative common-pool resources (NCPRs). Unlike traditional CPRs where communities work to preserve a finite resource (i.e. clean water), NCPR governance seeks to collectively reduce a negative shared stock. In our two cases, e-waste governance aims to reduce the volume of mismanaged waste and illicit trade, while the Linux community aims to reduce the number of error-prone or malicious contributions that reach the main branch and, in turn, extend the life of existing hardware. Through qualitative analysis of primary sources from each domain, we find that the same eight principles by Ostrom that aid positive commons governance tend to appear in successful negative commons governance systems. We argue that future NCPR governance design should prioritize Ostrom's principles, particularly clearly defined boundaries and well-functioning nested structures.

2026-06-03T07:52:38Z Paper in Proceedings of LIMITS 2026: 12th Workshop on Computing within Limits, 2026-06-23-25, Online Boyang Zhou Oleg Ianchenko http://arxiv.org/abs/2606.04543v1 Agentic AI and Pedagogical Best Practice: The Tension Between Automation and Learning 2026-06-03T07:26:23Z

Artificial intelligence in education is evolving from passive chatbots to proactive AI agents capable of initiation and goal-directed interactions. While offering opportunities for personalised learning, this shift risks undermining learner agency and cognitive effort. This paper reviews six pedagogical principles-prior knowledge activation, collaborative learning, problem-based learning, formative assessment, scaffolding, and metacognition-through the lens of agentic AI. We discuss the tension between automation and learning, proposing design recommendations that prioritise intentional friction, dynamic scaffolding, human-in-the-loop oversight, and considered AI utilisation to ensure AI supports rather than supplants human learning.

2026-06-03T07:26:23Z Accepted for publication at AIED 2026 - Festival of Learning HAI-Agency Workshop on Orchestrating Human and AI Agency for Proactive and Reflective Learning Steve Woollaston Brendan Flanagan Isanka Wijerathne Hiroaki Ogata http://arxiv.org/abs/2605.28829v2 Aryabhata 2: Scaling Reinforcement Learning for Advanced STEM Reasoning 2026-06-03T07:25:11Z

Competitive STEM examinations such as JEE and NEET require multi-step symbolic reasoning, precise numerical computation, and deep conceptual understanding across physics, chemistry, and mathematics. Recent large language models perform strongly on common reasoning benchmarks, yet they remain difficult to deploy at scale, where millions of student doubts demand domain-specific, consistently structured problem solving. We introduce Aryabhata 2, a reasoning-focused language model for competitive STEM examinations, trained via reinforcement-learning post-training. Using PhysicsWallah's internal question banks, we construct a high-quality training curriculum and post-train GPT-OSS-20B through reinforcement learning with verifiable rewards. Training combines prolonged reinforcement learning with broadened exploration via progressively larger rollout group sizes. We evaluate Aryabhata 2 on competitive examination benchmarks, including JEE Main, JEE Advanced, and NEET, as well as out-of-distribution reasoning datasets such as AIME, HMMT, MMLU-Pro, MMLU-Redux 2.0, and GPQA. Results show that Aryabhata 2 outperforms its base model GPT-OSS-20B on competitive STEM reasoning while requiring substantially fewer output tokens (up to 64\% fewer).

2026-04-10T06:53:27Z Ritvik Rastogi Vishal Singh Tejas Chaudhari Sandeep Varma http://arxiv.org/abs/2512.03296v2 Associating Healthcare Teamwork with Patient Outcomes for Predictive Analysis 2026-06-03T06:14:44Z

Cancer treatment outcomes are influenced not only by clinical and demographic factors but also by the collaboration of healthcare teams. However, prior work has largely overlooked the potential role of human collaboration in shaping patient survival. This paper presents an applied AI approach to uncovering the impact of healthcare professionals' (HCPs) collaboration, captured through electronic health record (EHR) systems, on cancer patient outcomes. We model EHR-mediated HCP interactions as networks and apply machine learning techniques to detect predictive signals of patient survival embedded in these collaborations. Our models are cross validated to ensure generalizability, and we explain the predictions by identifying key network traits associated with improved outcomes. Importantly, clinical experts and literature validate the relevance of the identified crucial collaboration traits, reinforcing their potential for real-world applications. This work contributes to a practical workflow for leveraging digital traces of collaboration and AI to assess and improve team-based healthcare. The approach is potentially transferable to other domains involving complex collaboration and offers actionable insights to support data-informed interventions in healthcare delivery.

2025-12-02T23:16:03Z Hsiao-Ying Lu Kwan-Liu Ma http://arxiv.org/abs/2606.04490v1 Prioritization of Risks from Artificial Intelligence: A Delphi Study of 272 International Experts 2026-06-03T06:14:41Z

Artificial intelligence poses many risks, ranging from familiar present-day harms to unprecedented and potentially catastrophic ones. Effective risk management requires prioritization: we must understand which risks are most severe, who is most vulnerable, and who is most responsible for addressing them. We report results from a three-round Delphi study conducted late 2025 with 272 international AI experts. Experts rated 24 AI risks on harm probability and severity, sector and actor vulnerability, actor responsibility, and overall concern. Experts estimated the five most severe harms in the next 5 years were likely to come from dangerous capabilities, competitive dynamics, weapons & cyberattacks (including CBRNE), power centralization, and false information. In a business-as-usual scenario, experts judged 18 of 24 risks as having a more than 10% probability of catastrophic outcomes (e.g., more than 1 million deaths or more than USD 100B in financial loss) in the next 5 years (2025-2030). In a scenario where pragmatic mitigations are implemented, experts still judged five risks as having a more than 10% probability of catastrophic outcomes: dangerous capabilities, weapons & cyberattacks, environmental harm, inequality & unemployment, and power centralization. All 24 risks were judged as being more than 5% likely to cause catastrophic outcomes. AI users and the general public were judged the most vulnerable to these risks, but experts assigned the highest responsibility for addressing them to general-purpose AI developers and governance actors (including governments, regulators, and standards bodies). Across most risks, experts identified information, finance, and national security as the most vulnerable sectors. These findings can guide AI risk prioritization and clarify expert expectations about who should bear responsibility for mitigation.

2026-06-03T06:14:41Z Access data at https://osf.io/pj2qr Alexander K. Saeri Jess Graham Michael Noetel Peter Slattery Dennis Ah-king Edla Aittokallio Ibitola Akindehin Abbas Al Mahdi Elie Alhajjar Rafael Andersson Lipcsey Gary Ang Catherine M. Azam Amos Azaria Rishal Balkissoon Isabel Barberá Claudio Bareato Jonathan Barry Michael Basehart Andrew M. Bean Danny Belitz Samantha Augusta Bennett Kayla Blomquist Damian Borstel Ben Bucknall Tomas Bueno Momcilovic Aurelie Bugeau Nicholas Caputo Stephen Casper Gulam Chagani Ze Shen Chin Jiyeon Cho Jay Chooi Joel N. Christoph Dmytro Chumachenko Kieran Conboy Elizabeth M. Daly Tom David Paul de Font-Reaulx Antonio De Santis Fabrizio Degni Christopher W. DiCarlo Yawen Duan Janet Egan Ian W. Eisenberg Sherif M. Elsafty Adam Ennamli Mark Esposito Nicola Fabiano Gallo Fall Neil R. Fernandes Pip Foweraker Chiara Gallese Sandra Galletti Andrew Gamino-Cheong Rokas Gipiškis Gwyn Glasser Delaram Golpayegani Jeff Grayson Hans Gundlach Josiah Hagen Alexander Hagenah Amelia S. Haines The Anh Han Yixiong Hao Kasii Harris Tianxing He Koen Holtman Giorgos Iacovides Kenneth L. Ingham Krystal Jackson Adam Jones Himanshu Joshi Brian Judge Arturs Kanepajs Shreya Kapoor Win Myat Nwe Khine Aidan Kierans Aleksandra Korolova Markus Krebsz Nicholas Kruus Joe Kwon Valeria Lazzaroli Ray X. Lee Evelina Leivada Stephan Lewandowsky Michael B. Li Xiaojian Li Geunsik Lim Henrique Lisakowski Fabio Lonardoni Todd C. Lowe Jackson G. Lu Alexander Lyzhov Nada Madkour Parv Mahajan David Manheim Kareem Mathias Claudio Mayrink Verdun Sean McGregor Scott McLean Matthew J. McMahon Minas Megalokonomos Nicolas Moës Fernando Mourao Yaroslav Mukhin Malcolm Murray Simon Mylius Neeraj Nagpal Koichi Nakada Anna Neumann Jessica Newman Kwan Yee Ng Minh N. Nguyen Quynh Phuong Nguyen Seán S. Ó hÉigeartaigh Daria Onitiu Kelly Onu Oscar Oviedo-Trespalacios Ugur Ozer Chanwoo Park M. Alejandra Parra-Orlandoni Patricia Paskov Anna M. Pastwa Burak Piskin Jacob Pratt Claudiu A. Predincea Marjana Prifti Skenduli Kenneth Priore Mukunda Madhab Pujari Zhenting Qi Preethi Raghunathan Robi Rahman Deepika Raman Max Reddel Jyoti Ruparel Emma B. Ruttkamp-Bloem Tiffany Saade Greg Sadler Said Saillant Paul M. Salmon Ayrton San Joaquin Lama Saouma Maziya Sarangpurwala Supheakmungkol Sarin Daniel S. Schiff Anna D. Schilling Chris Schmitz Reva Schwartz Abeer Sharma Tianhao Shen Kehan Sheng Maury D. Shenk Eli Sherman Chandler Smith Julie M. Smith Estevenson Solano Oliver Sourbut Madhulika Srikumar Ryan Stendall Jakob Stenseke Michael Stern Joshua Sternfeld Nikko Stevens Ilia Sucholutsky Yuanyuan Sun Mariami Tkeshelashvili Cristian Trout Brian Tse Nikolaos Tsinganos Michelle Vaccaro Anthony R. Valiaveedu Ramakrishnan Veeramony Jeremy Verdo Pulkit Verma Andrea Luigi Vitali Jinge Wang JR Washebek Yonah Welker George F. Westerman James Williams Tristan Williams Rongwu Xu Mick Yang Xuemeng Yang Sander Zeijlemaker Jingyu Zhang Marta Ziosi Neil Thompson http://arxiv.org/abs/2606.04450v1 Listening to the Workforce: Measuring Construction Worker Safety Attitudes from Social Media Discourse Using LLMs 2026-06-03T04:54:40Z

Worker safety attitudes are key determinants of whether protective practices are applied or bypassed on construction sites. Yet measuring them at scale has remained out of reach. Safety attitudes are multidimensional, vary across topics, and surface most candidly in workers' own conversations. This study created and validated the Construction Safety Attitude Framework (CSAF), which integrates two components: a theory-grounded structure that characterizes safety attitudes along eight dimensions, and an operational codebook for measuring them in worker naturalistic discourse. Applying CSAF to 250 posts and comments from the r/Construction community on Reddit, trained coders reached strong agreement (Krippendorff's α = 0.85). Pairwise lift and conditional probability confirmed that the eight dimensions are related yet distinct. To apply the framework across large volumes of discourse, CSAF was operationalized through a large language model (LLM) classifier. On 450 r/Construction contributions, the classifier reproduced expert human coding (Cohen's \k{appa} = 0.90, precision = 0.98, recall = 0.98), and on 400 contributions from r/Roofing it retained that accuracy after transfer to a different trade community (\k{appa} = 0.89, precision = 0.98, recall = 0.97). A proof-of-value case study then applied the validated classifier to 10,346 contributions from r/Roofing, demonstrating that CSAF can distinguish multidimensional attitudes by safety topic, track how they shift over time, and trace the reasoning behind unfavorable ones. The study therefore provides a theoretically grounded, empirically vetted instrument for examining safety attitudes, offering a basis for targeted interventions that address the attitudes underlying unsafe practices.

2026-06-03T04:54:40Z Farouq Sammour Yuxin Zhang Zhenyu Zhang