https://arxiv.org/api/E2UUPfqNWWmZIUbS67ONHyJywTM2026-03-18T10:11:23Z14312015http://arxiv.org/abs/2509.02380v3Faster Algorithms for the Least-Core value and the Nucleolus in Convex Games2026-03-17T17:05:03ZThe nucleolus is a central solution concept in cooperative game theory. While its computation is NP-hard in general, it can be computed in polynomial time for convex games; however, the only published polynomial-time algorithm relies on the ellipsoid method. We develop a combinatorial alternative based on reduced games and iterative least-core value computations. Leveraging submodular function minimization and polyhedral structure in a novel way, we obtain a faster combinatorial algorithm for computing the least-core value, improving the oracle complexity by a factor $n^3$ over previous approaches. As a consequence, we obtain a new strongly polynomial-time and combinatorial algorithm for computing the nucleolus in convex games. Preliminary analysis indicates an improved oracle complexity compared to the ellipsoid-based algorithm.2025-09-02T14:46:24ZGiacomo MaggioranoAlessandro SossoGautier Staufferhttp://arxiv.org/abs/2603.16751v1Finding Common Ground in a Sea of Alternatives2026-03-17T16:28:37ZWe study the problem of selecting a statement that finds common ground across diverse population preferences. Generative AI is uniquely suited for this task because it can access a practically infinite set of statements, but AI systems like the Habermas machine leave the choice of generated statement to a voting rule. What it means for this rule to find common ground, however, is not well-defined. In this work, we propose a formal model for finding common ground in the infinite alternative setting based on the proportional veto core from social choice. To provide guarantees relative to these infinitely many alternatives and a large population, we wish to satisfy a notion of proportional veto core using only query access to the unknown distribution of alternatives and voters. We design an efficient sampling-based algorithm that returns an alternative in the (approximate) proportional veto core with high probability and prove matching lower bounds, which show that no algorithm can do the same using fewer queries. On a synthetic dataset of preferences over text, we confirm the effectiveness of our sampling-based algorithm and compare other social choice methods as well as LLM-based methods in terms of how reliably they produce statements in the proportional veto core.2026-03-17T16:28:37ZJay ChooiPaul GölzAriel D. ProcacciaBenjamin SchifferShirley Zhanghttp://arxiv.org/abs/2510.20606v2Strategic Costs of Perceived Bias in Fair Selection2026-03-17T14:17:24ZMeritocratic systems, from admissions to hiring, aim to impartially reward skill and effort. Yet persistent disparities across race, gender, and class challenge this ideal. Some attribute these gaps to structural inequality; others to individual choice. We develop a game-theoretic model in which candidates from different socioeconomic groups differ in their perceived post-selection value--shaped by social context and, increasingly, by AI-powered tools offering personalized career or salary guidance. Each candidate strategically chooses effort, balancing its cost against expected reward; effort translates into observable merit, and selection is based solely on merit. We characterize the unique Nash equilibrium in the large-agent limit and derive explicit formulas showing how valuation disparities and institutional selectivity jointly determine effort, representation, social welfare, and utility. We further propose a cost-sensitive optimization framework that quantifies how modifying selectivity or perceived value can reduce disparities without compromising institutional goals. Our analysis reveals a perception-driven bias: when perceptions of post-selection value differ across groups, these differences translate into rational differences in effort, propagating disparities backward through otherwise "fair" selection processes. While the model is static, it captures one stage of a broader feedback cycle linking perceptions, incentives, and outcome--bridging rational-choice and structural explanations of inequality by showing how techno-social environments shape individual incentives in meritocratic systems.2025-10-23T14:38:05ZThe paper has been accepted by NeurIPS 2025L. Elisa CelisLingxiao HuangMilind SohoniNisheeth K. Vishnoihttp://arxiv.org/abs/2306.05221v5Steering No-Regret Learners to a Desired Equilibrium2026-03-17T13:42:53ZA mediator observes no-regret learners playing an extensive-form game repeatedly across $T$ rounds. The mediator attempts to steer players toward some desirable predetermined equilibrium by giving (nonnegative) payments to players. We call this the steering problem. The steering problem captures problems several problems of interest, among them equilibrium selection and information design (persuasion). If the mediator's budget is unbounded, steering is trivial because the mediator can simply pay the players to play desirable actions. We study two bounds on the mediator's payments: a total budget and a per-round budget. If the mediator's total budget does not grow with $T$, we show that steering is impossible. However, we show that it is enough for the total budget to grow sublinearly with $T$, that is, for the average payment to vanish. When players' full strategies are observed at each round, we show that constant per-round budgets permit steering. In the more challenging setting where only trajectories through the game tree are observable, we show that steering is impossible with constant per-round budgets in general extensive-form games, but possible in normal-form games or if the per-round budget may itself depend on $T$. We also show how our results can be generalized to the case when the equilibrium is being computed online while steering is happening. We supplement our theoretical positive results with experiments highlighting the efficacy of steering in large games.2023-06-08T14:18:46ZBrian Hu ZhangGabriele FarinaIoannis AnagnostidesFederico CacciamaniStephen Marcus McAleerAndreas Alexander HauptAndrea CelliNicola GattiVincent ConitzerTuomas Sandholmhttp://arxiv.org/abs/2402.09994v3Approximating Competitive Equilibrium by Nash Welfare2026-03-17T12:45:39ZWe study the relationship between two central concepts in the allocation of divisible goods: competitive equilibrium (CE) and allocations that maximize Nash welfare, i.e., allocations where the weighted geometric mean of the utilities is maximal. When agents have homogeneous concave utility functions, these concepts coincide: the classical Eisenberg-Gale convex program that maximizes Nash welfare over feasible allocations yields a competitive equilibrium. However, they diverge for non-homogeneous utilities. From a computational perspective, maximizing Nash welfare amounts to solving a convex program for any concave utility functions, whereas computing CE becomes PPAD-hard already for separable piecewise linear concave (SPLC) utilities.
We introduce the concept of Gale-substitute utility functions, an analogue of the weak gross substitutes (WGS) property for the so-called Gale demand system. For Gale-substitutes utilities, we show that any allocation maximizing Nash welfare provides an approximate-CE with surprisingly strong guarantees, where every agent gets at least half the maximum utility they can get at any CE, and is approximately envy-free. Gale-substitutes include utility functions where computing CE is PPAD hard, such as all separable concave utilities and the previously studied non-separable class of Leontief-free utilities. We introduce a broad new class of utility functions called generalized network utilities based on the generalized flow model. This class includes SPLC and Leontief-free utilities, and we show that all such utilities are Gale-substitutes.
Conversely, although some agents may get much higher utility at a Nash welfare maximizing allocation than at a CE, we show a `price of anarchy' type result: for general concave utilities, every CE achieves at least $(1/e)^{1/e} > 0.69$ fraction of the maximum Nash welfare, and this factor is tight.2024-02-15T14:58:00ZJugal GargYixin TaoLászló A. Véghhttp://arxiv.org/abs/2509.04143v4Disentangling trust from cooperation: Evolution of trust as reduced monitoring in social dilemmas2026-03-17T11:40:15ZIt is commonly assumed that trust increases cooperation. However, game-theoretic models often fail to distinguish between cooperative actions and trust, making it difficult to independently measure trust and determine how its effects vary in different social dilemmas. To address this, we build on influential theories that equate trust with reduced monitoring of an agent's actions. We implement this as a heuristic that cognitively bounded agents can use in repeated games to avoid spending time and effort always monitoring their partner. Agents using this heuristic reduce monitoring of a partner's actions once a threshold level of cooperativeness has been observed -- providing a measurable and architecture-agnostic definition of trust. Using evolutionary game theory, we systematically analyse the success of strategies that use this trust heuristic across the entire space of two-player symmetric social dilemma games. We demonstrate that trust-as-reduced-monitoring facilitates cooperation in two different ways. First, when monitoring is costly, trust heuristics allow for higher levels of cooperation in social dilemmas where the temptation to defect is high. Second, when agents can make action errors, trust heuristics promote cooperation even in coordination problems. Our results disentangle trust from cooperation, and provide a behavioural measure of trust that applies across interaction types.2025-09-04T12:13:38ZChaos, Solitons & Fractals 208, 118130 (2026)Cedric PerretThe Anh HanElias Fernández DomingosTheodor CimpeanuSimon T. Powers10.1016/j.chaos.2026.118130http://arxiv.org/abs/2512.22552v3Computing Pure-Strategy Nash Equilibria in a Two-Party Policy Competition: Existence and Algorithmic Approaches2026-03-17T09:02:34ZWe formulate two-party policy competition as a two-player non-cooperative game, generalizing Lin et al.'s work (2021). Each party selects a real-valued policy vector as its strategy from a compact subset of Euclidean space, and a voter's utility for a policy is given by the inner product with their preference vector. To capture the uncertainty in the competition, we assume that a policy's winning probability increases monotonically with its total utility across all voters, and we formalize this via an affine isotonic function. A player's payoff is defined as the expected utility received by its supporters. In this work, we first test and validate the isotonicity hypothesis through voting simulations. Next, we prove the existence of a pure-strategy Nash equilibrium (PSNE) in both one- and multi-dimensional settings. Although we construct a counterexample demonstrating the game's non-monotonicity, our experiments show that a decentralized gradient-based algorithm typically converges rapidly to an approximate PSNE. Finally, we present a grid-based search algorithm that finds an $ε$-approximate PSNE of the game in time polynomial in the input size and $1/ε$.2025-12-27T10:44:32ZA full version of the extended abstract in AAMAS 2026Chuang-Chieh LinChi-Jen LuPo-An ChenChih-Chieh Hunghttp://arxiv.org/abs/2504.10910v2Fostering Sustainable Cooperation through Strategic Resource Allocation and Utilization on Social Networks2026-03-17T05:46:50ZEfficient allocation and use of limited resources are fundamental to advancing collective welfare and achieving long-term societal sustainability. This challenge involves not only how policymakers distribute scarce resources among individuals, but also how individuals strategically utilize them. The complexity deepens when individuals are embedded in networks of social interactions, where outcomes are interdependent and future decisions are shaped by a dynamic tension between cooperation driven by collective long-term benefit and self-interest motivated by short-term personal gain. Here, we introduce a novel framework of generalized public goods games on hypergraphs to capture the multifaceted nature of real-world social interactions. Using Nash equilibrium analysis, we reveal how full cooperation (all individuals contribute all their resources to maximize collective benefit) emerges from the interplay between resource allocation strategies, individual usage behaviors, and the structure of interactions. We find that equal resource distribution enhances cooperation in homogeneous networks but may suppress it in heterogeneous ones, indicating that equity in allocation does not universally lead to optimal collective outcomes. To address this, we propose two complementary optimization strategies: one to guide policymakers in designing effective resource allocation schemes, and the other to support individuals in making sustainable use decisions. We validate the effectiveness of both approaches across a range of synthetic and empirical cases. Our findings provide actionable insights for designing governance frameworks and resource management policies that promote sustainable cooperation in complex socio-environmental systems.2025-04-15T06:45:15ZJuyi LiXiaoqun WuQi Suhttp://arxiv.org/abs/2602.02487v3Carry-Over Lottery Allocation: Practical Incentive-Compatible Drafts2026-03-16T21:25:28ZThe NBA draft can incentivize teams to deliberately lose. We propose a draft mechanism that is practical, incentive-compatible, and favors weaker teams. The Carry-Over Lottery Allocation (COLA) framework represents a paradigm shift in evaluating team quality, replacing single season standings with multi-year playoff outcomes. In our proposed mechanism, every non-playoff team receives the same number of lottery tickets, removing incentives to lose. Lottery tickets carry over to future lotteries, but playoff success or winning a top pick diminishes a team's accumulated tickets. The lottery is familiar and preserves fan engagement.
Implementation challenges are addressed to demonstrate feasibility, including transitioning to COLA, handling trades, and accommodating draft classes of varying strength. For exceptionally strong classes, teams may prefer the lottery to the playoffs. We provide a solution, employing a truth-elicitation mechanism to identify such years and expanding lottery eligibility to include as many playoff teams as necessary to preserve incentive compatibility.2026-02-02T18:58:35Z35 pages, 4 figuresTimothy HighleyTannah DuncanIlia Volkovhttp://arxiv.org/abs/2603.15338v1Strategic Partitioning and Manipulability in Two-Round Elections2026-03-16T14:27:23ZWe consider a two-round election model involving $m$ voters and $n$ candidates. Each voter is endowed with a strict preference list ranking the candidates. In the first round, the candidates are partitioned into two subsets, $A$ and $B$, and voters select their preferred candidate from each. Provided there are no ties, the two respective winners advance to a second round, where voters choose between them according to their initial preference lists. We analyze this scenario using a probabilistic framework based on a spatial voting model with cyclically constructed preference lists and uniformly distributed ideal points. Our objective is to determine the optimal initial partition of $A$ and $B$ that maximizes a target candidate's probability of winning. We analytically evaluate this success probability and derive its asymptotic behavior as the number of candidates $n \to \infty$. A key finding is that the asymptotically optimal relative width of the main discrete cluster converges precisely to one-fifth of the total number of candidates. Finally, we provide computational results and confidence intervals derived from simulation algorithms that validate the analytical framework. Specifically, we demonstrate that the probability of the universal victory event rapidly approaches $1$ as the electorate size increases.2026-03-16T14:27:23Z37 pagesEmilio De SantisAntonio Di CrescenzoVerdiana Mustarohttp://arxiv.org/abs/2502.20031v2On the Welfare of EIP-1559 with Patient Bidders2026-03-16T09:39:21ZThe ``EIP-1599 algorithm'' is used by the Ethereum blockchain to assemble transactions into blocks. While prior work has studied it under the assumption that bidders are ``impatient'', we analyze it under the assumption that bidders are ``patient'', which better corresponds to the fact that unscheduled transactions remain in the mempool and can be scheduled at a later time. We show that with ``patient'' bidders, this algorithm produces schedules of near-optimal welfare, provided it is given a mild resource augmentation (that does not increase with the time horizon). We prove some generalizations of the basic theorem, establish lower bounds that rule out several candidate improvements and extensions, and propose several questions for future work.2025-02-27T12:16:02ZMoshe BabaioffNoam Nisanhttp://arxiv.org/abs/2603.14867v1Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning2026-03-16T06:11:00ZMany strategic decision-making problems, such as environment design for warehouse robots, can be naturally formulated as bi-level reinforcement learning (RL), where a leader agent optimizes its objective while a follower solves a Markov decision process (MDP) conditioned on the leader's decisions. In many situations, a fundamental challenge arises when the leader cannot intervene in the follower's optimization process; it can only observe the optimization outcome. We address this decentralized setting by deriving the hypergradient of the leader's objective, i.e., the gradient of the leader's strategy that accounts for changes in the follower's optimal policy. Unlike prior hypergradient-based methods that require extensive data for repeated state visits or rely on gradient estimators whose complexity can increase substantially with the high-dimensional leader's decision space, we leverage the Boltzmann covariance trick to derive an alternative hypergradient formulation. This enables efficient hypergradient estimation solely from interaction samples, even when the leader's decision space is high-dimensional. Additionally, to our knowledge, this is the first method that enables hypergradient-based optimization for 2-player Markov games in decentralized settings. Experiments highlight the impact of hypergradient updates and demonstrate our method's effectiveness in both discrete and continuous state tasks.2026-03-16T06:11:00Z26 pages. Accepted at ICAPS 2026Mikoto KudoTakumi TanabeAkifumi WachiYouhei Akimotohttp://arxiv.org/abs/2405.06689v2Policy Iteration for Two-Player General-Sum Stochastic Stackelberg Games2026-03-16T06:00:46ZWe address two-player general-sum stochastic Stackelberg games (SSGs), where the leader's policy is optimized considering the best-response follower whose policy is optimal for its reward under the leader. Existing policy gradient and value iteration approaches for SSGs do not guarantee monotone improvement in the leader's policy under the best-response follower. Consequently, their performance is not guaranteed when their limits are not stationary Stackelberg equilibria (SSEs), which do not necessarily exist. In this paper, we derive a policy improvement theorem for SSGs under the best-response follower and propose a novel policy iteration algorithm that guarantees monotone improvement in the leader's performance. Additionally, we introduce Pareto-optimality as an extended optimality of the SSE and prove that our method converges to the Pareto front when the leader is myopic.2024-05-07T07:40:42Z29 pages. Accepted at ACML 2025. To appear in PMLR 304Mikoto KudoYouhei Akimotohttp://arxiv.org/abs/2510.11255v4Sequential Solution Concepts in Cooperative Games with Generalized Characteristic Functions2026-03-16T03:44:07ZMotivated by the fact that the worth of a coalition may depend on the order in which agents arrive, Nowak and Radzik (1994) (NR) introduced cooperative games with generalized characteristic functions. We study such temporal cooperative games (TCGs), where the worth function v is defined on sequences of agents π rather than sets S. This order sensitivity necessitates a re-examination of axioms for reward sharing. NR and subsequent work proposed several axioms; the resulting solution concepts are still inherently order-oblivious and closely tied to the Shapley value. In contrast, we focus on sequential solution concepts that explicitly depend on the realized order π. We study reward-sharing mechanisms satisfying incentive for optimal arrival (I4OA), which promotes orders maximizing total worth; online individual rationality (OIR), which ensures agents are not harmed by later arrivals; and sequential efficiency (SE), which requires that the worth of any sequence is fully distributed among its agents. These axioms are intrinsic to TCGs, and we characterize a class of reward-sharing mechanisms uniquely determined by them. The classical Shapley value does not directly extend to this setting. We therefore construct natural Shapley analogs in two worlds: a sequential world, where rewards are defined for each sequence agent pair, and an extended world, where rewards are defined per agent, consistent with the NR framework. In both cases, the axioms of efficiency, additivity, and null player uniquely characterize the corresponding Shapley analogs. But, these Shapley analogs are disjoint from the class of solutions satisfying the sequential axioms, even for convex and simple TCGs.2025-10-13T10:44:10Z22 pages, under reviewAshwin GoyalDrashthi DoshiSwaprava Nathhttp://arxiv.org/abs/2505.12010v4Incentivize Contribution and Learn Parameters Too: Federated Learning with Strategic Data Owners2026-03-16T01:56:51ZClassical federated learning (FL) assumes that the clients have a limited amount of noisy data with which they voluntarily participate and contribute towards learning a global, more accurate model in a principled manner. The learning happens in a distributed fashion without sharing the data with the center. However, these methods do not consider the incentive of an agent for participating and contributing to the process, given that data collection and running a distributed algorithm is costly for the clients. The question of rationality of contribution has been asked recently in the literature and some results exist that consider this problem. This paper addresses the question of simultaneous parameter learning and incentivizing contribution in a truthful manner, which distinguishes it from the extant literature. Our first mechanism incentivizes each client to contribute to the FL process at a Nash equilibrium and simultaneously learn the model parameters. We also ensure that agents are incentivized to truthfully reveal information in the intermediate stages of the algorithm. However, this equilibrium outcome can be away from the optimal, where clients contribute with their full data and the algorithm learns the optimal parameters. We propose a second mechanism that enables the full data contribution along with optimal parameter learning. Large scale experiments with real (federated) datasets (CIFAR-10, FEMNIST, and Twitter) show that these algorithms converge quite fast in practice, yield good welfare guarantees and better model performance for all agents.2025-05-17T14:04:20Z27 pages, under reviewDrashthi DoshiAditya Vema Reddy KesariAvishek GhoshSwaprava NathSuhas S Kowshik