https://arxiv.org/api/Ts6crr6+axV4c6HA1nlcie8Dhj4 2026-07-17T19:55:57Z 23656 0 15 http://arxiv.org/abs/2607.15114v1 CoSimRec: Measuring Coordinated-Content Penetration in Recommender Feedback Loops 2026-07-16T15:24:09Z

Recommender systems increasingly shape which content reaches users, making it important to understand whether coordinated activity is amplified beyond the accounts that initiate it. Existing robustness evaluations largely focus on static target-rank changes and do not capture how coordinated interactions, recommendation, and user response evolve within a feedback loop. To address this gap, we propose CoSimRec, an offline agent-based evaluation framework that models coordinated accounts, dynamic ranking, non-bot responses, and ranking interventions in a shared closed-loop process. CoSimRec introduces the Algorithmic Penetration Rate (APR) metric family to measure target content's share of non-bot exposure and engagement, lift against matched no-attack baselines, and exposure gained per coordinated interaction. We evaluate CoSimRec on MIND, MovieLens, and LastFM using random, popularity-based, feedback-sensitive, MF, and BPR-MF recommenders, with ten-seed inference for the primary APR analysis and population-scale experiments of up to 1000 users. Random controls show no statistically supported positive penetration, whereas popularity-based and feedback-sensitive ranking produce significant positive APR-Lift in all six master-worker dataset--recommender settings, reaching 0.4505 on LastFM; synchronization-aware ranking reduces APR in every corresponding defense setting.

2026-07-16T15:24:09Z 9 pages, 4 figures, 4 tables Nan Li Jiahong Shao Jiuyang Lyu http://arxiv.org/abs/2601.11338v2 Walk based Laplacians for Modeling Diffusion on Complex Networks 2026-07-16T14:53:46Z

We develop a novel framework for modeling diffusion on complex networks by constructing Laplacian-like operators based on walks around a graph. Our approach introduces a parametric family of walk-based Laplacians that naturally incorporate memory effects by excluding or downweighting backtracking trajectories, where walkers immediately revisit nodes. The framework includes: (i) walk-based Laplacians that count all traversals in the network; (ii) nonbacktracking variants that eliminate immediate reversals; and (iii) backtrack-downweighted variants that provide a continuous interpolation between these two regimes. We establish that these operators extend the definition of the standard Laplacian and also preserve some of its properties. We present efficient algorithms using Krylov subspace methods for computing them, ensuring applicability of our proposed framework to large networks. Extensive numerical experiments on real-world networks validate the modeling flexibility of our approach and demonstrate the computational efficiency of the proposed algorithms, including GPU acceleration.

2026-01-16T14:36:08Z Francesca Arrigo Fabio Durastante http://arxiv.org/abs/2607.13713v2 Comprehensive, Efficient Large-Scale Community Detection via Structural Entropy Game 2026-07-16T07:33:20Z

Community detection is a critical task in graph theory, social network analysis, and bioinformatics, where communities are defined as clusters of densely interconnected nodes. However, detecting communities in large-scale networks with millions of nodes and billions of edges remains challenging due to the inefficiency and unreliability of existing methods. Moreover, many existing methods are limited to specific types of graph structures (such as unweighted or undirected graphs) or are designed solely for detecting static communities, reducing their broader applicability. To address these issues, we propose a novel heuristic community detection algorithm, termed CoDeSEG, which identifies communities by minimizing the network's two-dimensional (2D) structural entropy within a potential game framework. In the game, nodes decide to stay in the current community or move to another based on a strategy that maximizes the 2D structural entropy utility function. Additionally, we introduce a structural entropy-based node overlapping heuristic for detecting overlapping communities, with a near-linear time complexity. Furthermore, we design a cascading influence propagation-based adaptive community update strategy, which dynamically identifies and processes nodes whose community affiliations may change during graph evolution, thereby effectively extending CoDeSEG to dynamic community detection scenarios. Experimental results on fourteen large-scale networks demonstrate that CoDeSEG achieves state-of-the-art performance across three community detection tasks (overlapping, non-overlapping, dynamic), while also delivering substantial improvements in detection efficiency.

2026-07-15T11:29:56Z arXiv admin note: text overlap with arXiv:2501.15130 Pu Li Yantuan Xian Hao Peng Huafeng Li Zhengtao Yu Yan Xiang Philip S. Yu http://arxiv.org/abs/2607.14574v1 Collaborative Spatial Learning with Multi-LLM Agents in Networked Social Experiments 2026-07-16T05:11:42Z

Collective problem solving often requires that group members consider the tradeoff between exploitation of known solutions and exploration for new ones, where information of known solutions can be disseminated among individual members through communication networks. The Mason--Watts experiment (PNAS 2012) showed that human groups in shorter-path networks outperform those in longer-path networks on a two-dimensional search task. In this work, we focus on the investigation of such network-efficiency effects in the setting of a group of large language model (LLM) agents. Specifically, we consider groups of sixteen LLM agents playing the Mason--Watts experiment on the eight Mason--Watts network topologies. Moreover, we develop mechanistic Bayesian optimization agents such that the performance of LLM agents can be compared with both the mechanistic agents and the human experimental data. Our computational experiments indicate that the LLM agents show a significant network-efficiency effect when instructed to randomize their first-round choices, but not under the default initialization. In this experiment, adding a one-sentence first-round randomization instruction improves collective payoff by more than three times the estimated payoff difference across the eight network topologies. Also, the Bayesian optimization agents obtain higher payoffs than the evaluated LLM agents on this spatial search task. We further compare the agents' exploration--exploitation behavior, copying, and spatial diversity.

2026-07-16T05:11:42Z Accepted at ASONAM 2026; to appear in the Springer proceedings Hao He Chris J. Kuhlman Xinwei Deng http://arxiv.org/abs/2606.25286v3 Structuring International Governance through the Space of Concerns 2026-07-16T04:39:42Z

When institutions decide by consensus, the official record shows agreement but hides who shaped what was decided. We introduce a way to recover that hidden structure from the one trace consensus cannot suppress: the documentary record of what actors choose to work on. Adapting tools from economic complexity, we map a ``space of concerns'' in which issues lie close when the same actors repeatedly specialize in both -- turning a flat agenda into a measurable topology of attention. Across six decades of the Antarctic Treaty (6,591 documents, 66 actors), engagement is structured, local, and persistent, and the most specialized actors produce binding law at roughly five times the baseline rate. The approach generalizes to any document-rich consensus forum, showing that unanimity does not erase political structure -- it relocates it upstream, into the organization of attention.

2026-06-24T01:39:00Z Casper van Elteren Fiona Lippert Zachary Carter Maria Kleshnina Michael Bode Vítor V. Vasconcelos http://arxiv.org/abs/2607.14531v1 Overlapping Network Community Detection Using Sparse Backbones 2026-07-16T03:38:41Z

Community structures are common in real networks, and extracting them provides valuable insight in applications ranging from drug discovery to market segmentation. Overlapping community detection (OCD) is the task of clustering networked data in which nodes may belong to multiple clusters. Existing OCD algorithms often struggle to achieve a suitable balance between detection quality and scalability. We, therefore, propose Highway, a scalable OCD algorithm that exploits the sparse backbone of the input network to perform efficient community inference. We used 728 Lancichinetti-Fortunato-Radicchi benchmark networks to compare Highway and its ablated version against 10 existing OCD algorithms. Our results, based on five performance measures, demonstrate a competitive performance for Highway. It ranks first in overlapping normalized mutual information with a 6.9% improvement over the strongest baseline. It also ranks second in all the other four performance measures. These comparative results suggest that Highway coupled with its backbone procedure offers a suitable accuracy-efficiency trade-off. The Highway algorithm is open-source and available as part of the CDlib library.

2026-07-16T03:38:41Z Peer-reviewed and accepted author copy Zihe Zhou Samin Aref http://arxiv.org/abs/2607.14491v1 Manufactured Divisiveness: Decomposing the Hostile Content of Seven Social Media Influence Operations 2026-07-16T02:13:53Z

State-backed influence operations are routinely measured as high-prevalence sources of ``hate'' and ``toxicity.'' We argue those rates rest on a measurement error: the detectors behind them are validated to catch a broader definition inclusive of hostility or divisiveness aimed at an out-group, and so over-attribute hate to content better described as partisan or geopolitical invective. Across 25.08M tweets from seven government-attributed campaigns in the Twitter Information Operations archive (8,275 accounts), we separate hate from the other forms of divisiveness. We first validate a two-prompt LLM-based detector, matching human labels at Cohen's $κ=0.82$, to identify the broader hostility; we then develop an auditable rule, agreeing with an expert at $κ=0.52$, to further classify this content (5,457 posts) into three sub-categories. About 50.1% are identity-based attacks on people, whereas 30.4% are partisan attacks and 19.5% invective against states and their foreign policy. Reporting all of it as hate therefore overstates hate roughly twofold; only 18.7% is both identity-based and dehumanizing or inciting. Six of seven campaigns sort into three regimes that a single ``hate'' rate flattens, namely identity hate (RU-op and IRA, both Russia-attributed), geopolitical invective (both Iran operations), and partisan divisiveness (both Venezuela operations). We call the shared product $manufactured divisiveness$. The line to separate these constructs itself remains unsettled: on the hardest cases three independent human experts agree only moderately (pairwise $κ=0.37$--$0.50$), and the best of nineteen LLM models tops out at $κ=0.601$ against the experts' majority. Our findings can help redefine the study of hate in the context of influence campaigns and broader online discourse.

2026-07-16T02:13:53Z Emilio Ferrara http://arxiv.org/abs/2607.14430v1 Prices, Probabilities, and Parlays: Systematic Bias in Sports Prediction Markets 2026-07-15T23:53:56Z

Prediction market prices are routinely interpreted as probabilities, both in academic work and in derivative products built atop these markets. We document two systematic ways this interpretation fails on Kalshi, a major U.S. event-contract exchange, using 23 million moneyline trades across major sport leagues. First, calibration, the agreement between quoted prices and realized event frequencies, is not a static property of a contract: fitting calibration models within time-to-expiry buckets, we find that parameters sit at their perfect-calibration reference values in the middle of a contract's life but depart sharply as expiry approaches. In the final ten minutes before settlement the empirical calibration curve becomes step-like, fitting a Prelec form with curvature parameter well above one, the opposite sign of the canonical lottery-choice fit, consistent with insurance-demand behavior by traders holding losing positions. Second, cross-game parlays on Kalshi are systematically overpriced relative to the product of their contemporaneous leg prices, with overpricing growing in leg count. This holds when the parlay legs are drawn from the TTE regime in which leg-level calibration is essentially perfect, indicating a separate, market-level markup at the parlay-pricing stage. Both deviations are systematic and therefore admit computational correction. Practical use of prediction-market prices as probabilities requires conditioning on time to expiry and on product type, not on price alone.

2026-07-15T23:53:56Z Niusha Moshrefi http://arxiv.org/abs/2607.14288v1 Majority Correctness in Social Networks: From Well-Mixed Electorates to Complex Networks 2026-07-15T18:49:12Z

We study majority correctness when voting is preceded by sustained social interaction on a social network. Motivated by the Condorcet Jury Theorem, we consider a binary choice with an objectively correct alternative, where uninformed voters revise their vote intentions through repeated interaction in the presence of competing committed leaders (zealots). In this zealot--contrarian voter model, voters may either imitate or oppose the views they encounter. For fully mixed electorates, we characterize the long-run distribution of votes and the correlation structure induced among voters, and we show that Erdős--Rényi networks exhibit the same majority-correctness behavior after an appropriate rescaling of leader influence. Building on these results, we establish a finite-electorate Condorcet-type guarantee: when post-deliberation individual correctness exceeds random choice, a strict majority is more likely to select the correct alternative than a randomly chosen voter. At the same time, we identify an aggregation failure: social interaction can reduce majority accuracy relative to a no-deliberation benchmark in which voters respond only to zealots. As the electorate size tends to infinity, this finite-electorate advantage disappears unless social updating is purely conformist, revealing a tipping point at full conformity: any persistent contrarian updating drives both individual and majority correctness to the random choice level of one half. Simulations on scale-free, ring, and small-world networks further show that topology matters because it shapes the vote correlations generated by social influence: hub-dominated structures generate stronger positive correlations and lower majority accuracy, whereas spatially structured networks generate weaker correlations, preserve a larger effective number of independent judgments, and improve majority accuracy.

2026-07-15T18:49:12Z 48 pages, 8 figures Dan Braha Marcus A. M. de Aguiar http://arxiv.org/abs/2602.10125v7 How segmented is my network? 2026-07-15T18:37:52Z

Network segmentation is a popular security practice for limiting lateral movement, yet practitioners lack a metric to measure how segmented a network actually is. We define segmentedness as the fraction of potential node-pair communications disallowed by policy -- equivalently, the complement of graph edge density -- and show it to be the first statistically principled scalar metric for this purpose. Then, we derive a normalized estimator for segmentedness and evaluate its uncertainty using confidence intervals. For a 95\% confidence interval with a margin-of-error of $\pm 0.1$, we show that a minimum of $M=97$ sampled node pairs is sufficient. This result is independent of the total number of nodes in the network, provided that node pairs are sampled uniformly at random. We evaluate the estimator through Monte Carlo simulations on Erdős--Rényi, stochastic block models, and real-world enterprise network datasets, demonstrating accurate estimation. Finally, we discuss applications of the estimator, such as baseline tracking, zero trust assessment, and merger integration.

2026-01-31T15:28:09Z 5 Tables, 5 Figures Rohit Dube http://arxiv.org/abs/2607.14048v1 An Epidemic Threshold Set for Networks 2026-07-15T17:16:27Z

In this paper, we investigate a discrete-time SIS epidemic model and the epidemic thresholds on complex networks. We focus on proposing a community-level epidemic threshold set and establishing a comparative result between the local epidemic thresholds and the global epidemic threshold. To verify our theoretical findings and structural properties, we conduct numerical experiments on one synthetic network (Network1) and one real-world network (the Haslemere contact network). Our numerical simulations, along with the computation and statistical ranking of the epidemic threshold sets, align accurately with our theoretical results.

2026-07-15T17:16:27Z Hoang Phi Dung Nguyen Hong Phuc http://arxiv.org/abs/2607.14045v1 LLMs for Qualitative and Mixed-Methods Social Network Analysis 2026-07-15T17:14:12Z

This manuscript explores the integration of Large Language Models (LLMs) into the field of qualitative and mixed-methods social network analysis (SNA). We argue that the primary focus of this integration should be on enhancing the depth and rigor of qualitative SNA, rather than on replacing human researchers with automated systems. We begin by outlining the core principles of qualitative and mixed-methods SNA, emphasizing the importance of understanding the meaning of ties, the role of narratives, and the significance of relational identities. We then discuss how LLMs can be used as powerful tools to augment this work, from assisting with data collection and coding to supporting theory-building and abductive reasoning. We also address the limitations and ethical challenges of using LLMs in this context, including issues of bias, hallucination, and the need for reflexivity. We conclude with a series of research designs and practical recommendations for researchers who want to integrate LLMs into their work in a thoughtful and responsible way.

2026-07-15T17:14:12Z Moses Boudourides http://arxiv.org/abs/2607.13998v1 The Dynamic Verifiable Multi-Agent Human Agentic Loyalty Loop (DVM-HALL) Model and the Net Human-Agent Score (NHAS) in Autonomous Commerce 2026-07-15T16:27:48Z

The rapid proliferation of Agentic Artificial Intelligence fundamentally disrupts traditional customer loyalty paradigms. As AI evolves from passive recommendation algorithms to autonomous, goal-directed agents capable of executing purchasing decisions, the conventional understanding of consumer-brand relationships requires a structural reevaluation. By synthesizing extant literature across human-machine teaming, consumer decision-making, and algorithmic trust dynamics, we demonstrate that traditional loyalty models fail to account for algorithmic bounded rationality and constructed autonomy. To address this, we introduce the Dynamic Verifiable Multi-Agent Human Agentic Loyalty Loop (DVM-HALL) model. We formalize brand choice via a softmax probability formulation where human emotional equity, agentic machine-experience utility, calibrated trust, delegated authority, and verifiable execution jointly determine selection. The model features recursive updating mechanisms to dynamically calibrate trust and delegation after each interaction. Crucially, the framework integrates a verifiable execution layer for Decentralized Finance (DeFi) and tokenized loyalty settings, incorporating execution risks -- such as gas costs, slippage, MEV exposure, and smart-contract vulnerabilities -- as core predictors of agentic brand preference. Furthermore, we introduce the Net Human-Agent Score (NHAS), an auditable, risk-weighted metric designed to measure human-agent alignment using human feedback, execution logs, benchmark comparisons, and verifiable receipts. Finally, we propose a comprehensive three-stage empirical validation plan spanning controlled shopping experiments, multi-agent market simulations, and DeFi testbeds. This framework provides the foundational theory required for brands to navigate the impending transition toward machine customers.

2026-07-15T16:27:48Z Sai Srikanth Madugula Peplluis Esteva de la Rosa Daya Shankar http://arxiv.org/abs/2603.06502v2 Mapping the long-term trajectories of political violence in Africa 2026-07-15T15:36:09Z

Existing models of political violence often emphasize discrete transitions, when conflicts emerge, escalate, or subside, without considering the longer trajectories of violence that accumulate across time and space. This paper introduces a spatially explicit longitudinal sequence analysis to address this gap. Using event-level data from the Armed Conflict Location and Event Dataset covering Africa from 1997 to 2024, we classify locations according to the intensity and spatial concentration of violence, tracing how these states evolve into distinct conflict trajectories. Applying optimal matching and clustering techniques, we identify six recurrent patterns ranging from short-lived, localized outbreaks to protracted high-intensity conflicts. We further assess how these trajectories align across neighboring areas, revealing evidence of spatial interdependence, particularly in border regions. By highlighting the temporal rhythms and geographic linkages of political violence, the study advances conflict research beyond isolated transitions and provides a framework for understanding the life cycles of violence.

2026-03-06T17:36:13Z Applied Geography, 2026, 194, 104089 Steven M. Radil Nick Dorward Olivier Walther Levi John Wolf 10.1016/j.apgeog.2026.104089 http://arxiv.org/abs/2510.09416v4 What Do Temporal Graph Learning Models Learn? 2026-07-15T15:14:01Z

Learning on temporal graphs has become a central topic in graph representation learning, with numerous benchmarks indicating the strong performance of state-of-the-art models. However, recent work has raised concerns about the reliability of benchmark results, noting issues with commonly used evaluation protocols and the surprising competitiveness of simple heuristics. This contrast raises the question of which characteristics of the underlying graphs temporal graph learning models actually use to form their predictions. We address this by systematically evaluating eight models on their ability to capture eight fundamental characteristics related to the link structure of temporal graphs. These include structural characteristics such as density, temporal patterns such as recency, and edge formation mechanisms such as homophily. Using both synthetic and real-world datasets, we analyze how well models learn these characteristics. Our findings reveal a mixed picture: models capture some characteristics well but fail to reproduce others. With this, we expose important limitations. Overall, we believe that our results provide practical insights for the application of temporal graph learning models and motivate more interpretability-driven evaluations in graph learning research.

2025-10-10T14:18:37Z Abigail J. Hayes Tobias Schumacher Markus Strohmaier