http://arxiv.org/abs/2605.16301v2
Do LLMs Hold Their Values? MANTA: A Multi-Turn Adversarial Benchmark for Animal Welfare Reasoning
2026-06-03T11:55:54Z
Evaluating animal welfare reasoning in LLMs remains an open challenge despite rapid deployment in consumer and professional contexts where welfare considerations appear implicitly in everyday queries. Existing benchmarks such as AnimalHarmBench evaluate this through single-turn, explicitly framed questions, measuring whether models avoid harmful content when directly asked. This approach overlooks two failure modes: alignment degradation under sustained adversarial pressure, and moral sensitivity (whether a model spontaneously surfaces welfare stakes in everyday queries). To fill this gap, we construct MANTA, a benchmark of 1,088 five-turn conversations progressing from an implicit Turn-1 scenario through an explicit welfare prompt to three adversarial pressure rounds drawn from a five-type taxonomy: Social, Cultural, Economic, Pragmatic, and Epistemic. We score conversations on two dimensions: Animal Welfare Value Stability (AWVS, primary) and Animal Welfare Moral Sensitivity (AWMS, diagnostic). We evaluate seven frontier models: Claude Opus 4.7, GPT-5.5, DeepSeek V4, Llama 3.3 70B, Mistral Small, Grok 4.3, and Gemini 3.1 Flash Lite. Multi-turn evaluation captures behavior single-turn benchmarks miss: 4 of 7 models change rank relative to Turn 1 scores, including Gemini Flash Lite, which drops from fifth on AWMS to last on AWVS. AWMS and AWVS are positively but imperfectly correlated, suggesting moral-recognition tests capture a stable but incomplete component of model behavior under pressure.
MANTA also enables a species-by-pressure interaction matrix unavailable to prior benchmarks, showing welfare robustness depends jointly on the animal and pressure applied; companion animals score above wild animals, which score above farmed animals and invertebrates. We release the dataset, scripted pressure plans, judge prompts, and analysis code.
2026-04-18T19:51:32Z
Isabella Luong, Joyee Chen, Arturs Kanepajs, Jasmine Brazilek, Sankalpa Ghose, David Williams-King, Linh Le, Allen Lu

http://arxiv.org/abs/2503.06525v2
From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes
2026-06-03T11:47:00Z
Analyzing student behavior in educational scenarios is crucial for enhancing teaching quality and student engagement. Existing AI-based models often rely on classroom video footage to identify and analyze student behavior. While these video-based methods can partially capture and analyze student actions, they struggle to accurately track each student's actions in physical education classes, which take place in outdoor, open spaces with diverse activities, and are challenging to generalize to the specialized technical movements involved in these settings. Furthermore, current methods typically lack the ability to integrate specialized pedagogical knowledge, limiting their ability to provide in-depth insights into student behavior and offer feedback for optimizing instructional design. To address these limitations, we propose a unified end-to-end framework that leverages human activity recognition technologies based on motion signals, combined with advanced large language models, to conduct more detailed analyses and feedback of student behavior in physical education classes.
Our framework begins with the teacher's instructional designs and the motion signals from students during physical education sessions, ultimately generating automated reports with teaching insights and suggestions for improving both learning and class instructions. This solution provides a motion signal-based approach for analyzing student behavior and optimizing instructional design tailored to physical education classes. Experimental results demonstrate that our framework can accurately identify student behaviors and produce meaningful pedagogical insights.
2025-03-09T09:04:36Z
Work in progress
Xian Gao, Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Zongyun Zhang, Ting Liu, Yuzhuo Fu

http://arxiv.org/abs/2606.04750v1
Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment
2026-06-03T11:31:37Z
Instilling virtuous behavior in artificial intelligence has seen increasing interest. One of the techniques proposed is known as affinity-based reinforcement learning, which uses policy regularization on the objective function to incentivize virtuous actions without being fully dependent on the reward function design. Thus far, this technique has been demonstrated to be effective in grid worlds and toy-problem environments with minimal state and action spaces. To expand this research to more sophisticated environments, we introduce a two-player multi-agent environment based on the role-playing board game known as Fog of Love. In this environment, two agents compete to fulfill their individual virtues, while also cooperating to satisfy their relationship. Given the multi-agent nature, this is a complex problem where multi-agent deep deterministic policy gradient agents neither compete nor cooperate successfully. We present evidence that localized affinities enhance agent performance in achieving both competitive and cooperative objectives, resulting from superior overall scores in both domains.
This not only results in virtuous choices but also clarifies an agent's teleology and makes its behavior human-level interpretable.
2026-06-03T11:31:37Z
Ajay Vishwanath, Christian Omlin

http://arxiv.org/abs/2505.08385v2
TikTok Search Recommendations: Governance and Research Challenges
2026-06-03T09:50:57Z
Like other social media, TikTok is embracing its use as a search engine, developing search products to steer users to produce searchable content and engage in content discovery. Their recently developed product search recommendations are preformulated search queries recommended to users on videos. However, TikTok provides limited transparency about how search recommendations are generated and moderated, despite requirements under regulatory frameworks like the European Union's Digital Services Act. By suggesting that the platform simply aggregates comments and common searches linked to videos, it sidesteps responsibility and issues that arise from contextually problematic recommendations, reigniting long-standing concerns about platform liability and moderation. This position paper addresses the novelty of search recommendations on TikTok by highlighting the challenges that this feature poses for platform governance and offering a computational research agenda, drawing on preliminary qualitative analysis. It sets out the need for transparency in platform documentation, data access and research to study search recommendations.
2025-05-13T09:32:09Z
Published at The 1st International Workshop on Computational Approaches to Content Moderation and Platform Governance (COMPASS), held at ICWSM 2025. Please cite accordingly.
This research has been supported by funding from the ERC Starting Grant HUMANads (ERC-2021-StG No 101041824)
Taylor Annabell, Robert Gorwa, Rebecca Scharlach, Jacob van de Kerkhof, Thales Bertaglia

http://arxiv.org/abs/2605.30995v2
Traceable by Design: An LLM Pipeline and Dashboard for EU Regulatory Consultation Analysis
2026-06-03T09:48:21Z
Public consultations generate large volumes of data in the form of stakeholder submissions that are practically unfeasible to analyse manually. We present an end-to-end LLM-based pipeline and interactive dashboard for structured topic extraction from regulatory consultation submissions, demonstrated on the European Commission's Digital Fairness Act (DFA) public call for evidence as a case study. The system processes raw PDF attachments and web-form responses, extracts topic annotations, and grounds every extraction in a verbatim quote from the source text. Applied to 4,322 DFA submissions, the pipeline produced 15,368 topic annotations supported by 20,951 verbatim evidence quotes. Three principles govern the proposed design: verbatim grounding, full traceability, and transparency by design. The dashboard exposes the full extraction dataset through five analytical views, from dataset-level topic overviews to individual paragraph drill-downs, with every result traceable to its source. Beyond the predefined DFA topic categories, the pipeline generated certain stakeholder concerns, such as Age Verification, Payment Processor Censorship, and Digital Ownership, that a fixed-taxonomy approach would have missed. The pipeline is domain-generic; adapting it to a new consultation requires only a prompt update and a new dataset. A live demo is available at https://dfa-dashboard.thalesbertaglia.com/.
The code and processed data are publicly available at https://github.com/thalesbertaglia/dfa-dashboard.
2026-05-29T08:29:00Z
This research has been supported by funding from the ERC Starting Grant HUMANads (ERC-2021-StG No 101041824)
Thales Bertaglia, Haoyang Gui, Catalina Goanta, Gerasimos Spanakis

http://arxiv.org/abs/2505.01122v3
The Great Data Standoff: Researchers vs. Platforms Under the Digital Services Act
2026-06-03T09:42:24Z
To facilitate accountability and transparency, the Digital Services Act (DSA) sets up a process through which Very Large Online Platforms (VLOPs) need to grant vetted researchers access to their internal data (Article 40(4)). Operationalising such access is challenging for at least two reasons. First, data access is only available for research on systemic risks affecting European citizens, a concept with high levels of legal uncertainty. Second, data access suffers from an inherent standoff problem. Researchers need to request specific data but are not in a position to know all internal data processed by VLOPs, who, in turn, expect data specificity for potential access. In light of these limitations, data access under the DSA remains a mystery. To contribute to the discussion of how Article 40 can be interpreted and applied, we provide a concrete illustration of what data access can look like in a real-world systemic risk case study. We focus on the 2024 Romanian presidential election interference incident, the first event of its kind to trigger systemic risk investigations by the European Commission. During the elections, one candidate is said to have benefited from TikTok algorithmic amplification through a complex dis- and misinformation campaign. By analysing this incident, we can comprehend election-related systemic risk to explore practical research tasks and compare necessary data with available TikTok data.
In particular, we make two contributions: (i) we combine insights from law, computer science and platform governance to shed light on the complexities of studying systemic risks in the context of election interference, focusing on two relevant factors: platform manipulation and hidden advertising; and (ii) we provide practical insights into various categories of available data for the study of TikTok, based on platform documentation, data donations and the Research API.
2025-05-02T09:00:19Z
Published at the 20th International AAAI Conference on Web and Social Media (ICWSM 2026). Please cite accordingly. This research has been supported by funding from the ERC Starting Grant HUMANads (ERC-2021-StG No 101041824)
Catalina Goanta, Savvas Zannettou, Rishabh Kaushal, Jacob van de Kerkhof, Thales Bertaglia, Taylor Annabell, Haoyang Gui, Gerasimos Spanakis, Adriana Iamnitchi

http://arxiv.org/abs/2603.10829v2
Spatially conditioned dynamics between population and built form
2026-06-03T09:00:40Z
Understanding the relationship between population and the built environment is essential for addressing socio-spatial inequalities. While researchers have long theorized these dynamics, empirical analyses remain limited. This study proposes a spatially explicit framework to quantify the relationship between population and the built environment at the scale of local census tracts in Czechia. The approach integrates a fine-grained classification of built form with a comprehensive set of socio-demographic indicators. The method compares global and geographically weighted classification models to assess the overall strength and spatial variability of the associations between population structure and built form. The results of the study show that population characteristics exhibit linear, spatially conditioned relationships with built form, emphasizing that spatial heterogeneity must be accounted for when assessing these relationships.
The analysis also reveals that some built form types are more socially selective than others, underscoring the importance of built form in reproducing social-spatial inequalities.
2026-03-11T14:36:37Z
Anna Brazdova, Martin Fleischmann

http://arxiv.org/abs/2606.04617v1
When Firms Learn to Game the Rules
2026-06-03T08:53:36Z
Rules-as-Code promises more testable legal obligations, but it also changes what regulated firms can learn. Existing work mostly emphasizes implementation gains; the strategic gap is whether machine-readable rules make boundary search cheaper. I study that gap with a synthetic agent-based reinforcement-learning simulation that separates actual conduct near a legal threshold from proximity in the computable enforcement signal. Across 150 seed-level scenario runs, 378 common-random-number computability-sweep runs, 288 Latin-hypercube global-design runs, and a 2,880,000-row firm-period panel, computable static rules raise conduct boundary mass relative to ambiguous static rules (0.411 versus 0.367) and raise signal boundary mass more sharply (0.403 versus 0.281). Ordinary adaptive updates lower consumer harm (0.202 to 0.194) but do not reliably reduce boundary search. A budget-neutral anti-gaming design reduces conduct boundary mass by 0.032 and consumer harm by 0.025 relative to computable static rules. These are mechanism-oriented synthetic results, not estimates of real firm behavior in a jurisdiction or industry.
The contribution is an estimand distinction, an inspectable ABM/RL mechanism, and a reproducible artifact showing that transparent behavioral assumptions are sufficient to generate gaming-like boundary dynamics without implying that computable regulation is inherently undesirable.
2026-06-03T08:53:36Z
Includes synthetic simulation data, source code, figures, and reproducibility materials
Xufeng He

http://arxiv.org/abs/2606.04592v1
Synthetic Personalities: How Well Can LLMs Mimic Individual Respondents Using Socio-Economic Microdata?
2026-06-03T08:30:03Z
LLM-based digital twins promise to scale and accelerate market research, but most published twins are either coarse persona bots conditioned on a few demographic questions or detailed individual-level twins built on purpose-collected surveys and interview transcripts. Neither setup speaks to the operationally most relevant case for marketing practice: building detailed individual twins from the pre-existing heterogeneous panel data that firms already accumulate through CRM systems, loyalty programs, and repeat surveys. We construct detailed individual-level twins from the German Socio-Economic Panel (SOEP) and evaluate them across a $3 \times 5 \times 2 \times 2$ construction-method grid that covers three open-weights LLMs, five cumulative information depths ranked by normalized Shannon entropy, two embedding methods, and two reasoning modes, scoring over 2.1 million twin responses on 500 participants and 183 held-out questions. Twin quality rises with information depth but with diminishing returns past the 75 percent entropy quartile, which acts as a cost-efficient Pareto point relative to the best-performing 100 percent cells. Switching the embedding from a narrative persona summary to a raw dialog history of past responses raises hold-out accuracy in every model-by-reasoning cell at the 100 percent depth, while an explicit thinking mode raises rank-order correlation without moving accuracy.
Best-cell accuracy reaches 78.8 percent and Fisher-$z$ correlation reaches $r = 0.590$ on the SOEP held-out evaluation set. The findings suggest that twin-based market research is no longer gated by data design, but by item volume, model selection, and a small set of construction-level decisions that this paper now maps.
2026-06-03T08:30:03Z
Leonard Kinzinger, Jochen Hartmann

http://arxiv.org/abs/2606.04563v1
Addressing Negative Commons Governance with Positive Commons Principles
2026-06-03T07:52:38Z
Computing is accompanied by both positive and negative commons throughout its lifecycle of creation, execution, and disposal. We examine two governance systems situated within this lifecycle -- global e-waste trade and the Linux kernel community -- to evaluate whether Elinor Ostrom's eight design principles for common-pool resource (CPR) governance extend to the management of negative common-pool resources (NCPRs). Unlike traditional CPRs where communities work to preserve a finite resource (e.g., clean water), NCPR governance seeks to collectively reduce a negative shared stock. In our two cases, e-waste governance aims to reduce the volume of mismanaged waste and illicit trade, while the Linux community aims to reduce the number of error-prone or malicious contributions that reach the main branch and, in turn, extend the life of existing hardware. Through qualitative analysis of primary sources from each domain, we find that the same eight principles by Ostrom that aid positive commons governance tend to appear in successful negative commons governance systems.
We argue that future NCPR governance design should prioritize Ostrom's principles, particularly clearly defined boundaries and well-functioning nested structures.
2026-06-03T07:52:38Z
Paper in Proceedings of LIMITS 2026: 12th Workshop on Computing within Limits, 2026-06-23-25, Online
Boyang Zhou, Oleg Ianchenko

http://arxiv.org/abs/2606.04543v1
Agentic AI and Pedagogical Best Practice: The Tension Between Automation and Learning
2026-06-03T07:26:23Z
Artificial intelligence in education is evolving from passive chatbots to proactive AI agents capable of initiation and goal-directed interactions. While offering opportunities for personalised learning, this shift risks undermining learner agency and cognitive effort. This paper reviews six pedagogical principles (prior knowledge activation, collaborative learning, problem-based learning, formative assessment, scaffolding, and metacognition) through the lens of agentic AI. We discuss the tension between automation and learning, proposing design recommendations that prioritise intentional friction, dynamic scaffolding, human-in-the-loop oversight, and considered AI utilisation to ensure AI supports rather than supplants human learning.
2026-06-03T07:26:23Z
Accepted for publication at AIED 2026 - Festival of Learning HAI-Agency Workshop on Orchestrating Human and AI Agency for Proactive and Reflective Learning
Steve Woollaston, Brendan Flanagan, Isanka Wijerathne, Hiroaki Ogata

http://arxiv.org/abs/2605.28829v2
Aryabhata 2: Scaling Reinforcement Learning for Advanced STEM Reasoning
2026-06-03T07:25:11Z
Competitive STEM examinations such as JEE and NEET require multi-step symbolic reasoning, precise numerical computation, and deep conceptual understanding across physics, chemistry, and mathematics. Recent large language models perform strongly on common reasoning benchmarks, yet they remain difficult to deploy at scale, where millions of student doubts demand domain-specific, consistently structured problem solving.
We introduce Aryabhata 2, a reasoning-focused language model for competitive STEM examinations, trained via reinforcement-learning post-training. Using PhysicsWallah's internal question banks, we construct a high-quality training curriculum and post-train GPT-OSS-20B through reinforcement learning with verifiable rewards. Training combines prolonged reinforcement learning with broadened exploration via progressively larger rollout group sizes.
We evaluate Aryabhata 2 on competitive examination benchmarks, including JEE Main, JEE Advanced, and NEET, as well as out-of-distribution reasoning datasets such as AIME, HMMT, MMLU-Pro, MMLU-Redux 2.0, and GPQA. Results show that Aryabhata 2 outperforms its base model GPT-OSS-20B on competitive STEM reasoning while requiring substantially fewer output tokens (up to 64% fewer).
2026-04-10T06:53:27Z
Ritvik Rastogi, Vishal Singh, Tejas Chaudhari, Sandeep Varma

http://arxiv.org/abs/2512.03296v2
Associating Healthcare Teamwork with Patient Outcomes for Predictive Analysis
2026-06-03T06:14:44Z
Cancer treatment outcomes are influenced not only by clinical and demographic factors but also by the collaboration of healthcare teams. However, prior work has largely overlooked the potential role of human collaboration in shaping patient survival. This paper presents an applied AI approach to uncovering the impact of healthcare professionals' (HCPs) collaboration, captured through electronic health record (EHR) systems, on cancer patient outcomes. We model EHR-mediated HCP interactions as networks and apply machine learning techniques to detect predictive signals of patient survival embedded in these collaborations. Our models are cross validated to ensure generalizability, and we explain the predictions by identifying key network traits associated with improved outcomes. Importantly, clinical experts and literature validate the relevance of the identified crucial collaboration traits, reinforcing their potential for real-world applications. This work contributes to a practical workflow for leveraging digital traces of collaboration and AI to assess and improve team-based healthcare.
The approach is potentially transferable to other domains involving complex collaboration and offers actionable insights to support data-informed interventions in healthcare delivery.
2025-12-02T23:16:03Z
Hsiao-Ying Lu, Kwan-Liu Ma

http://arxiv.org/abs/2606.04490v1
Prioritization of Risks from Artificial Intelligence: A Delphi Study of 272 International Experts
2026-06-03T06:14:41Z
Artificial intelligence poses many risks, ranging from familiar present-day harms to unprecedented and potentially catastrophic ones. Effective risk management requires prioritization: we must understand which risks are most severe, who is most vulnerable, and who is most responsible for addressing them. We report results from a three-round Delphi study conducted in late 2025 with 272 international AI experts. Experts rated 24 AI risks on harm probability and severity, sector and actor vulnerability, actor responsibility, and overall concern. Experts estimated the five most severe harms in the next 5 years were likely to come from dangerous capabilities, competitive dynamics, weapons & cyberattacks (including CBRNE), power centralization, and false information. In a business-as-usual scenario, experts judged 18 of 24 risks as having a more than 10% probability of catastrophic outcomes (e.g., more than 1 million deaths or more than USD 100B in financial loss) in the next 5 years (2025-2030). In a scenario where pragmatic mitigations are implemented, experts still judged five risks as having a more than 10% probability of catastrophic outcomes: dangerous capabilities, weapons & cyberattacks, environmental harm, inequality & unemployment, and power centralization. All 24 risks were judged as being more than 5% likely to cause catastrophic outcomes.
AI users and the general public were judged the most vulnerable to these risks, but experts assigned the highest responsibility for addressing them to general-purpose AI developers and governance actors (including governments, regulators, and standards bodies). Across most risks, experts identified information, finance, and national security as the most vulnerable sectors. These findings can guide AI risk prioritization and clarify expert expectations about who should bear responsibility for mitigation.
2026-06-03T06:14:41Z
Access data at https://osf.io/pj2qr
Alexander K. Saeri, Jess Graham, Michael Noetel, Peter Slattery, Dennis Ah-king, Edla Aittokallio, Ibitola Akindehin, Abbas Al Mahdi, Elie Alhajjar, Rafael Andersson Lipcsey, Gary Ang, Catherine M. Azam, Amos Azaria, Rishal Balkissoon, Isabel Barberá, Claudio Bareato, Jonathan Barry, Michael Basehart, Andrew M. Bean, Danny Belitz, Samantha Augusta Bennett, Kayla Blomquist, Damian Borstel, Ben Bucknall, Tomas Bueno Momcilovic, Aurelie Bugeau, Nicholas Caputo, Stephen Casper, Gulam Chagani, Ze Shen Chin, Jiyeon Cho, Jay Chooi, Joel N. Christoph, Dmytro Chumachenko, Kieran Conboy, Elizabeth M. Daly, Tom David, Paul de Font-Reaulx, Antonio De Santis, Fabrizio Degni, Christopher W. DiCarlo, Yawen Duan, Janet Egan, Ian W. Eisenberg, Sherif M. Elsafty, Adam Ennamli, Mark Esposito, Nicola Fabiano, Gallo Fall, Neil R. Fernandes, Pip Foweraker, Chiara Gallese, Sandra Galletti, Andrew Gamino-Cheong, Rokas Gipiškis, Gwyn Glasser, Delaram Golpayegani, Jeff Grayson, Hans Gundlach, Josiah Hagen, Alexander Hagenah, Amelia S. Haines, The Anh Han, Yixiong Hao, Kasii Harris, Tianxing He, Koen Holtman, Giorgos Iacovides, Kenneth L. Ingham, Krystal Jackson, Adam Jones, Himanshu Joshi, Brian Judge, Arturs Kanepajs, Shreya Kapoor, Win Myat Nwe Khine, Aidan Kierans, Aleksandra Korolova, Markus Krebsz, Nicholas Kruus, Joe Kwon, Valeria Lazzaroli, Ray X. Lee, Evelina Leivada, Stephan Lewandowsky, Michael B. Li, Xiaojian Li, Geunsik Lim, Henrique Lisakowski, Fabio Lonardoni, Todd C. Lowe, Jackson G. Lu, Alexander Lyzhov, Nada Madkour, Parv Mahajan, David Manheim, Kareem Mathias, Claudio Mayrink Verdun, Sean McGregor, Scott McLean, Matthew J. McMahon, Minas Megalokonomos, Nicolas Moës, Fernando Mourao, Yaroslav Mukhin, Malcolm Murray, Simon Mylius, Neeraj Nagpal, Koichi Nakada, Anna Neumann, Jessica Newman, Kwan Yee Ng, Minh N. Nguyen, Quynh Phuong Nguyen, Seán S. Ó hÉigeartaigh, Daria Onitiu, Kelly Onu, Oscar Oviedo-Trespalacios, Ugur Ozer, Chanwoo Park, M. Alejandra Parra-Orlandoni, Patricia Paskov, Anna M. Pastwa, Burak Piskin, Jacob Pratt, Claudiu A. Predincea, Marjana Prifti Skenduli, Kenneth Priore, Mukunda Madhab Pujari, Zhenting Qi, Preethi Raghunathan, Robi Rahman, Deepika Raman, Max Reddel, Jyoti Ruparel, Emma B. Ruttkamp-Bloem, Tiffany Saade, Greg Sadler, Said Saillant, Paul M. Salmon, Ayrton San Joaquin, Lama Saouma, Maziya Sarangpurwala, Supheakmungkol Sarin, Daniel S. Schiff, Anna D. Schilling, Chris Schmitz, Reva Schwartz, Abeer Sharma, Tianhao Shen, Kehan Sheng, Maury D. Shenk, Eli Sherman, Chandler Smith, Julie M. Smith, Estevenson Solano, Oliver Sourbut, Madhulika Srikumar, Ryan Stendall, Jakob Stenseke, Michael Stern, Joshua Sternfeld, Nikko Stevens, Ilia Sucholutsky, Yuanyuan Sun, Mariami Tkeshelashvili, Cristian Trout, Brian Tse, Nikolaos Tsinganos, Michelle Vaccaro, Anthony R. Valiaveedu, Ramakrishnan Veeramony, Jeremy Verdo, Pulkit Verma, Andrea Luigi Vitali, Jinge Wang, JR Washebek, Yonah Welker, George F. Westerman, James Williams, Tristan Williams, Rongwu Xu, Mick Yang, Xuemeng Yang, Sander Zeijlemaker, Jingyu Zhang, Marta Ziosi, Neil Thompson

http://arxiv.org/abs/2606.04450v1
Listening to the Workforce: Measuring Construction Worker Safety Attitudes from Social Media Discourse Using LLMs
2026-06-03T04:54:40Z
Worker safety attitudes are key determinants of whether protective practices are applied or bypassed on construction sites. Yet measuring them at scale has remained out of reach. Safety attitudes are multidimensional, vary across topics, and surface most candidly in workers' own conversations.
This study created and validated the Construction Safety Attitude Framework (CSAF), which integrates two components: a theory-grounded structure that characterizes safety attitudes along eight dimensions, and an operational codebook for measuring them in workers' naturalistic discourse. Applying CSAF to 250 posts and comments from the r/Construction community on Reddit, trained coders reached strong agreement (Krippendorff's α = 0.85). Pairwise lift and conditional probability confirmed that the eight dimensions are related yet distinct. To apply the framework across large volumes of discourse, CSAF was operationalized through a large language model (LLM) classifier. On 450 r/Construction contributions, the classifier reproduced expert human coding (Cohen's κ = 0.90, precision = 0.98, recall = 0.98), and on 400 contributions from r/Roofing it retained that accuracy after transfer to a different trade community (κ = 0.89, precision = 0.98, recall = 0.97). A proof-of-value case study then applied the validated classifier to 10,346 contributions from r/Roofing, demonstrating that CSAF can distinguish multidimensional attitudes by safety topic, track how they shift over time, and trace the reasoning behind unfavorable ones. The study therefore provides a theoretically grounded, empirically vetted instrument for examining safety attitudes, offering a basis for targeted interventions that address the attitudes underlying unsafe practices.
2026-06-03T04:54:40Z
Farouq Sammour, Yuxin Zhang, Zhenyu Zhang