https://arxiv.org/api/zY/e4u6fRUCdBqSfKRovnJYpnGQ2026-06-21T18:12:32Z405924015http://arxiv.org/abs/2412.09321v6Coarse Q-learning: Indifference, Indeterminacy, and Instability2026-05-12T12:52:59ZWe introduce Coarse Q-learning (CQL), a reinforcement-learning model for bandit problems with stochastically varying menus. Alternatives are exogenously partitioned into similarity classes, and feedback from sampled alternatives is pooled within classes into class-level valuations. Choices follow multinomial logit over class valuations, and valuations update toward realized payoffs as in Q-learning. Using stochastic approximation, we derive the mean-field dynamics and characterize the steady states as smooth analogues of Valuation Equilibria. The model yields novel long-run phenomena in the high payoff-sensitivity limit: depending on the environment, CQL may exhibit multiple stable strict equilibria, a unique globally stable mixed equilibrium with indifference across classes, or no stable equilibrium at all, with valuations and choice probabilities converging instead to a stable limit cycle. These outcomes are driven by coarse aggregation and do not arise in the standard alternative-level benchmark.2024-12-12T14:47:12Z45 Main pages + 26 Supplemental Appendix pagesPhilippe JehielAviman Satpathyhttp://arxiv.org/abs/2407.21198v3Lattice operations for the pairwise stable set in many-to-many markets via re-equilibration dynamics2026-05-12T09:09:44ZWe compute the lattice operations for the (pairwise) stable set in many-to-many matching markets when only path-independence on agents' choice functions is imposed. To do this, we first show that the sets of firm-quasi-stable and worker-quasi-stable many-to-many matchings form lattices. Then, we construct Tarski operators on these lattices whose fixed points coincide with the set of stable matchings, and show that iterating these operators from suitable quasi-stable matchings yields the lattice operations in the stable set. These operators resemble lay-off and vacancy chain dynamics, respectively.2024-07-30T21:20:18ZAgustin G. BonifacioNoelia JuarezPaola B. Manaserohttp://arxiv.org/abs/2605.11736v1Approximate Strategyproofness in Approval-based Budget Division2026-05-12T08:16:40ZIn approval-based budget division, the task is to allocate a divisible resource to the candidates based on the voters' approval preferences over the candidates. For this setting, Brandl et al. [2021] have shown that no distribution rule can be strategyproof, efficient, and fair at the same time. In this paper, we aim to circumvent this impossibility theorem by focusing on approximate strategyproofness. To this end, we analyze the incentive ratio of distribution rules, which quantifies the maximum multiplicative utility gain of a voter by manipulating. While it turns out that several classical rules have a large incentive ratio, we prove that the Nash product rule ($\mathsf{NASH}$) has an incentive ratio of $2$, thereby demonstrating that we can bypass the impossibility of Brandl et al. by relaxing strategyproofness. Moreover, we show that an incentive ratio of $2$ is optimal subject to some of the fairness and efficiency properties of $\mathsf{NASH}$, and that the positive result for the Nash product rule even holds when voters may report arbitrary concave utility functions. Finally, we complement our results with an experimental analysis.2026-05-12T08:16:40ZForthcoming at IJCAI'26Haris AzizPatrick LedererJeremy Vollenhttp://arxiv.org/abs/2504.01829v4Revealed Bayesian Persuasion2026-05-12T07:29:47ZHow does one test empirically the hypothesis that a decision maker (DM) is being influenced by information via Bayesian persuasion? In this paper, I consider a DM whose state-dependent preferences are known to an analyst, who sees the conditional distribution of choices given the state. I provide necessary and sufficient conditions for the dataset to be consistent with the DM being Bayesian persuaded by an unobserved sender who generates a distribution of signals to ex-ante optimize the sender's expected payoff. I thereby provide a tool for empirical work on information design.2025-04-02T15:38:24ZJeffrey Menschhttp://arxiv.org/abs/2605.11350v1Human-AI Productivity Paradoxes: Modeling the Interplay of Skill, Effort, and AI Assistance2026-05-12T00:12:27ZGenerative Artificial Intelligence (AI) tools are rapidly adopted in the workplace and in education, yet the empirical evidence on AI's impact remains mixed. We propose a model of human-AI interaction to better understand and analyze several mechanisms by which AI affects productivity. In our setup, human agents with varying skill levels exert utility-maximizing effort to produce certain task outcomes with AI assistance. We find that incorporating either endogeneity in skill development or in AI unreliability can induce a productivity paradox: increased levels of AI assistance may degrade productivity, leading to potentially significant shortfalls. Moreover, we examine the long-term distributional effect of AI on skill, and demonstrate that skill polarization can emerge in steady state when accounting for heterogeneity in AI literacy -- the agent's capability to identify and adapt to inaccurate AI outputs. Our results elucidate several mechanisms that may explain the emergence of human-AI productivity paradoxes and skill polarization, and identify simple measures that characterize when they arise.2026-05-12T00:12:27ZAli AouadThodoris LykourisHuiying Zhonghttp://arxiv.org/abs/2605.12559v1Coordination Failures and Stackelberg Leadership in Housing Development with Network Effects2026-05-11T21:03:38ZI study coordination failures in housing development markets with network effects, where the value of building depends on aggregate supply. When network effects are sufficiently strong and convex, multiple equilibria arise: a low-supply coordination failure and a high-supply outcome. Without a coordination mechanism, equilibrium is indeterminate. I introduce a large developer who moves first in a Stackelberg game, committing to housing supply before atomistic developers make entry decisions. The main result is that the large developer always commits at least to the high-supply equilibrium, eliminating the coordination failure by pushing past the unstable threshold that separates the low and high outcomes. The result is unconditional; it holds for general demand functions and cost distributions, and does not depend on which stable continuation equilibrium materializes. The leader's commitment inverts standard monopoly intuition: first-mover commitment can improve welfare by resolving a coordination problem that atomistic markets cannot solve on their own. I also characterize when the developer builds beyond the high equilibrium into a monopoly region, and show that the market underprovides housing relative to the social optimum.2026-05-11T21:03:38Z43 pages, 7 figuresVaibhav Ranganhttp://arxiv.org/abs/2605.11157v1The Price of Proportional Representation in Temporal Voting2026-05-11T19:05:30ZWe study proportional representation in the temporal voting model, where collective decisions are made repeatedly over time over a fixed horizon. Prior work has extensively investigated how proportional representation axioms from multiwinner voting (e.g., justified representation (JR) and its variants) can be adapted, satisfied, and verified in this setting. However, much less is understood about their interaction with social welfare. In this work, we quantify the efficiency cost of enforcing proportionality. We formalize the welfare-proportionality tension via the worst-case ratio between the maximum achievable utilitarian welfare and the maximum welfare attainable subject to a proportionality axiom. We show that imposing proportional representation in the temporal setting can incur a growing, yet sublinear, welfare loss as the number of voters or rounds increases. We further identify a clean separation among axioms: for JR, the welfare loss diminishes as the time horizon grows and vanishes asymptotically, whereas for stronger axioms this conflict persists even with many rounds. Moreover, we prove that welfare maximization under each axiom is NP-complete and APX-hard, even under static preferences and bounded-degree approvals, and provide fixed-parameter algorithms under several natural structural parameters.2026-05-11T19:05:30ZAppears in the 35th International Joint Conference on Artificial Intelligence (IJCAI), 2026Nicholas Tehhttp://arxiv.org/abs/2605.10505v1A Theory of Multilevel Interactive Equilibrium in NeuroAI2026-05-11T13:01:54ZWe propose a game-theoretic framework for adaptive multi-agent intelligent systems. Unlike classical game theory, which often treats strategies as primitive objects chosen by perfectly rational agents, the proposed framework provides a mathematical foundation for studying equilibrium in NeuroAI and can be viewed as an extension of game theory under relaxed assumptions, including partial observability, bounded computation, and uncertainty. At its core, Multilevel Interactive Equilibrium (MIE) generalizes the classical Nash equilibrium to intelligent systems with internal computation. Rather than being defined solely at the level of observable behavior, equilibrium emerges when neural learning dynamics, cognitive representations, and behavioral strategies mutually stabilize between interacting agents. This framework applies uniformly to interactions between two biological brains, two artificial agents, or hybrid human-AI systems. We discuss applications of multilevel game theory to human-autonomous vehicle driving, human-machine interaction, human-large language model (LLM) interaction, and computational psychiatry. We also outline experimental strategies and computational methods for estimating MIE and discuss challenges and prospects for future research.2026-05-11T13:01:54ZZhe Sage ChenQuanyan Zhuhttp://arxiv.org/abs/2605.10495v1Robust Bayes Acts under Prior Perturbations: Contamination, Stability, and Selection Paths2026-05-11T12:52:47ZThis paper develops a quantitative framework to assess the robustness of Bayes-optimal decisions in finite decision problems under model uncertainty. We introduce two complementary stability notions for acts: the robustness radius, measuring the largest perturbation of a reference prior under which an act remains Bayes-optimal, and the contamination need, quantifying the minimal perturbation required for an act to become Bayes-optimal under some nearby prior. Both concepts are characterized via linear programming formulations and computed efficiently using bisection methods exploiting monotonicity properties. Building on these stability measures, we propose a cost-adjusted stability criterion that integrates robustness considerations with act-specific selection costs, yielding a parametric family of decision rules indexed by a regularization parameter. We analyze how optimal act selection evolves along this parameter and derive selection paths that reveal structural transitions between stability-driven and cost-driven regimes. The framework is applied to a portfolio choice problem under uncertainty between different economic regimes. Concretely, using data on historical ETF returns, we compute robustness and contamination profiles for six portfolio strategies and analyze their behavior under heterogeneous belief specifications. The results illustrate that robustness-based selection refines classical expected utility by accounting for prior misspecification.2026-05-11T12:52:47ZChristoph JansenLancaster University Leipzig, GermanyGeorg SchollmeyerLudwig-Maximilians-Universität München, Germanyhttp://arxiv.org/abs/2505.14639v5Communication as Voting2026-05-11T09:32:46ZThis paper analyzes a cheap-talk model with multiple senders and one receiver. Each sender observes a noisy signal about an unknown state and sends a message; the receiver observes the message tally and chooses a policy. This setting shares certain features with voting models (e.g., Feddersen and Pesendorfer, 1997, 1998). The existing literature (e.g., Levit and Malenko, 2011; Battaglini, 2017) focuses on scenarios in which the receiver and the senders agree on the preferred policy in each state. In contrast, we explore environments in which the receiver and the senders disagree over the preferred policy in some states. We establish an equilibrium no-conflict result: in any non-babbling equilibrium, the senders and the receiver agree on the preferred policy at every realized message tally. We show that information aggregation fails, and the receiver cannot fully learn the state even as the number of senders grows large. We also identify a discontinuity in information transmission relative to the implications of the existing literature. Finally, introducing a mediator can improve information transmission and restore efficiency.2025-05-20T17:26:15ZKailin Chenhttp://arxiv.org/abs/2605.09747v1The Matching Function: A Unified Look into the Black Box2026-05-10T20:50:26ZIn this paper, we use tools from network theory to trace the properties of the matching function to the structure of granular connections between applicants and vacancies. We unify seemingly disparate parts of the literature by recovering multiple functional forms as special cases including the CES. We derive a testable condition under which matching in any network from the broad class we analyze can be thought "as if" it comes from a CES matching function, up to a first-order approximation. We provide a theory of match efficacy in which inequality in search intensities is the key determinant of how well the matching process works. A robust finding of our analysis is that dispersion of search intensities on either side of the market is bad for the matching process. We also show that a rise in the market's mean search intensity can reduce match efficacy when it is associated with a higher Gini coefficient of search intensities.2026-05-10T20:50:26ZGeorgios AngelisYann Bramoulléhttp://arxiv.org/abs/2505.03232v3Collective decisions under uncertainty: efficiency, ex-ante fairness, and normalization2026-05-10T04:41:09ZThis paper studies preference aggregation under uncertainty in the multi-profile framework and characterizes a new class of aggregation rules that address classical concerns about Harsanyi's (1955) utilitarian rules. Our aggregation rules, which we call relative fair aggregation rules, are grounded in three key ideas: utilitarianism, egalitarianism, and the 0--1 normalization of individual utilities. These rules are parameterized by a set of weight vectors over individuals and evaluate each ambiguous alternative by taking the minimum weighted sum of 0--1 normalized utility levels over the weight set. For the characterization, we propose two novel axioms -- weak preference for mixing and restricted certainty independence -- developed by using a new method of objectively randomizing outcomes within the Savagean setting. Additional results clarify how these axioms capture the utilitarian and egalitarian attitudes of the rules.2025-05-06T06:53:21ZThe file comprises the main body (22 pages), the Appendix (13 pages), and referencesLeo KurataKensei Nakamurahttp://arxiv.org/abs/2605.09136v1On the Possibility of Informationally Inefficient Markets Without Noise2026-05-09T19:42:45ZNoise traders can be dispensed with entirely. Partial revelation of information through prices arises under any non-exponential expected utility preference, including CRRA, without noise traders, random endowments, supply shocks, hedging motives, or behavioral biases. The model contains zero exogenous noise.
The mechanism is a mismatch between the space in which market clearing aggregates signals and the Bayesian sufficient statistic. CARA demand is linear in log-odds, so prices aggregate in log-odds space and reveal the statistic exactly. Every other preference aggregates differently; the resulting Jensen gap makes revelation partial. I prove that CARA is the unique fully revealing preference class, characterize the rational expectations equilibrium via a contour integration fixed point, and verify that partial revelation survives learning from prices. The Grossman-Stiglitz paradox is resolved: information acquisition has positive value within the rational class. Numerical solution of the rational expectations fixed point at K = 3 confirms partial revelation, positive trade volume, and positive value of information across the full range of CRRA risk aversion, vanishing only in the CARA limit.2026-05-09T19:42:45ZMattthijs Breugemhttp://arxiv.org/abs/2605.09029v1Secret Communication with Plausible Deniability2026-05-09T16:10:39ZCommunication is secret if a message is independent of the state; however, the receiver's subsequent action may still reveal that she has acted on hidden information. This paper studies when secret communication can also provide plausible deniability: under single-crossing preferences, every action induced by the sender's message must be rationalizable using the receiver's baseline information alone. We characterize joint information structures that satisfy both secrecy and plausible deniability. We show that plausible deniability restricts communication exactly when the baseline message is directional -- meaning its likelihood is monotone in the state. Combining this restriction with secrecy, we show that, for directional messages, frontier communication reveals at most whether the state lies above or below a cutoff. Finally, we identify conditions under which a greatest feasible communication structure exists and can be constructed explicitly in a simple way.2026-05-09T16:10:39ZXiaoyu ChengYonggyun KimMichael P. H. Tamhttp://arxiv.org/abs/2605.08989v1Aggregating Elo Ratings: An Axiomatization2026-05-09T15:15:54ZMany environments assign several Elo ratings to the same agent: a chess player has classical, rapid, and blitz ratings; an online platform may rate by time control, mode, or format; an evaluator may rate performance across tasks or roles. This paper axiomatizes when such a vector of ratings can be reduced to a single scalar rating that is itself on the Elo scale. We impose three substantive conditions: same-scale normalization (a uniform profile keeps its rating), recursive consistency (aggregating in blocks gives the same answer as aggregating directly, provided each block carries the total weight of its members), and marginal Elo-strength consistency (for two equally weighted coordinates, the ratio of marginal contributions to the combined rating equals the ordinary Elo odds). The unique rating rule satisfying these conditions converts each component to its Elo strength, takes a weighted arithmetic mean of strengths, and converts back. We show how this rule differs from a random-format lottery and from rating-scale averaging, prove the axioms are independent, and illustrate the rule on combining classical, rapid, and blitz ratings.2026-05-09T15:15:54ZMehmet Mars Seven