http://arxiv.org/abs/2605.30916v1 Welfare, Improvability, and Variance: A Principal-Agent Approach to Optimal Benchmark Item Aggregation 2026-05-29T07:01:38Z

AI benchmarks have well-documented limitations, with prior work examining contamination, saturation, and construct underspecification. Aggregation has received far less attention: benchmarks are typically summarized by uniformly averaging item-level scores, implicitly treating every test item as equally valuable. We model benchmarking as a multitask principal-agent game and show that the welfare loss from a benchmark is determined jointly by three item-level primitives: alignment with normative welfare priorities, marginal improvability, and performance variance. We translate the theory into an audit framework that ranks items along each of these three axes, and apply it to OLMES items using WORKBank for welfare, the EvoLM 4B suite for improvability, and the PolyPythias 410M panel for variance. The framework surfaces items that are Pareto-inferior within OLMES subject to a pro-worker welfare operationalization. All code is available at https://github.com/stair-lab/principal-agent-benchmarks.
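
The idea of flagging Pareto-inferior items across three item-level axes can be sketched as follows. This is a minimal illustration, not the authors' audit procedure: the abstract does not say how the scores are computed or oriented, so the sketch assumes each item already carries three hypothetical scores oriented so that larger is better (a variance score would be negated or inverted first). The names `pareto_inferior` and `Scores` are made up for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical per-item scores (welfare, improvability, variance-based),
# all oriented so that a larger value is better. This orientation is an
# assumption of the sketch, not something stated in the abstract.
Scores = Tuple[float, float, float]


def pareto_inferior(items: Dict[str, Scores]) -> List[str]:
    """Return the items that some other item weakly beats on every axis
    and strictly beats on at least one axis."""
    dominated = []
    for name, own in items.items():
        for other_name, other in items.items():
            if other_name == name:
                continue
            weakly_better = all(o >= s for s, o in zip(own, other))
            strictly_better = any(o > s for s, o in zip(own, other))
            if weakly_better and strictly_better:
                dominated.append(name)
                break  # one dominating item is enough
    return dominated


example = {
    "item_a": (0.9, 0.5, 0.7),
    "item_b": (0.4, 0.5, 0.2),  # beaten by item_a on axes 1 and 3, tied on axis 2
    "item_c": (0.1, 0.9, 0.3),  # best on axis 2, so nothing dominates it
}
print(pareto_inferior(example))
```

The pairwise comparison is quadratic in the number of items, which is usually acceptable for an audit over a fixed benchmark.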

2026-05-29T07:01:38Z Andreas Haupt Justin Hartenstein Anka Reuel Mykel Kochenderfer Sanmi Koyejo http://arxiv.org/abs/2605.30890v1 A Geometric Approach to the Transformation Problem of Values 2026-05-29T06:26:01Z

The reduction of complex labour to simple labour is an unresolved difficulty in Marx's labour theory of value, and a key obstacle that has prevented the transformation problem from being settled definitively. This paper proposes a two-step solution framework. First, we prove that as long as the macroeconomy generates a physical surplus, the reduction coefficients that respect the floor of labour-power reproduction form a bounded ``value feasible region''; within this region the two macro aggregate equalities can hold simultaneously for a reasonable range of the profit rate. Second, we propose a linear mapping method that exploits the observable structure of nominal wages and the reproduction floor constraint to systematically construct the implicit reduction coefficients from the value feasible region. We show that this mapping is a homeomorphism between the price feasible region and the value feasible region, and that it preserves the boundary structure. An empirical calibration based on China's 2017 inter-provincial input--output table with 1272 sectors shows that the reduction coefficients obtained by the mapping method substantially outperform the homogeneous labour method and the wage-proxy method in matching the macro profit share.

2026-05-29T06:26:01Z Jiyuan Lyu http://arxiv.org/abs/2605.30879v1 Competitive Many-to-One Matching: Sorting vs. Equality 2026-05-29T06:08:17Z

We study many-to-one matching with transfers and peer effects, such as matching workers to firms, students to schools, residents to neighborhoods, or consumers to status goods. With flexible prices (as in the labor market), competitive equilibrium exists and is efficient under general conditions. We characterize when workforces are segregated by skill and matched to firms in a positively assortative manner. In general, equilibrium features alternating intervals of workforce segregation and compression (mixing). Comparative statics characterize when workforces are more segregated or more compressed, and when profits and wages are more or less unequal. With uniform prices (as in school or neighborhood choice), the value generated by peer effects accrues to schools rather than students, and equilibrium can be excessively segregated. Our model generalizes both assignment models (optimal transport) and Bayesian persuasion.

2026-05-29T06:08:17Z Anton Kolotilin Alexander Wolitzky http://arxiv.org/abs/2508.17671v7 Consistent Opponent Modeling in Imperfect-Information Games 2026-05-29T00:10:05Z

The goal of agents in multi-agent environments is to maximize total reward against the opposing agents that are encountered. Following a game-theoretic solution concept, such as Nash equilibrium, may yield strong performance in some settings; however, such approaches fail to capitalize on historical and observed data from repeated interactions against our opponents. Opponent modeling algorithms integrate machine learning techniques to exploit suboptimal opponents using available data; however, the effectiveness of such approaches in imperfect-information games has so far been quite limited. We show that existing opponent modeling approaches fail to satisfy a simple desirable property even against static opponents drawn from a known prior distribution; namely, they do not guarantee that the model approaches the opponent's true strategy even in the limit as the number of game iterations approaches infinity. We develop a new algorithm that achieves this property and runs efficiently by solving a convex minimization problem based on the sequence-form game representation using projected gradient descent. The algorithm is guaranteed to efficiently converge to the opponent's true strategy under standard Bayesian identifiability and visitation assumptions, given observations from gameplay and possibly additional historical data if it is available.
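
Projected gradient descent on a convex objective can be illustrated in a much simpler setting than the one the abstract describes. The sketch below is not the paper's algorithm: it ignores the sequence form and the prior, and instead estimates an opponent's mixed strategy at a single decision point by minimizing the negative log-likelihood of observed action counts over the probability simplex. The function names, step size, and iteration count are assumptions chosen for illustration, and the example assumes every action was observed at least once.

```python
from typing import List


def project_to_simplex(v: List[float]) -> List[float]:
    """Euclidean projection of v onto the probability simplex
    (sort-based method)."""
    u = sorted(v, reverse=True)
    cumulative = 0.0
    theta = 0.0
    for k, u_k in enumerate(u, start=1):
        cumulative += u_k
        candidate = (cumulative - 1.0) / k
        if u_k - candidate > 0:
            theta = candidate  # keep the threshold from the last valid k
    return [max(x - theta, 0.0) for x in v]


def estimate_strategy(counts: List[int], step: float = 0.05,
                      iterations: int = 500) -> List[float]:
    """Minimize -sum_i f_i * log(p_i) over the simplex, where f_i are the
    empirical action frequencies, by projected gradient descent."""
    total = sum(counts)
    freqs = [c / total for c in counts]
    p = [1.0 / len(counts)] * len(counts)  # start from the uniform strategy
    eps = 1e-6  # guards against division by zero near the boundary
    for _ in range(iterations):
        grad = [-f / max(p_i, eps) for f, p_i in zip(freqs, p)]
        p = project_to_simplex([p_i - step * g for p_i, g in zip(p, grad)])
    return p


estimate = estimate_strategy([50, 30, 20])
print([round(x, 3) for x in estimate])
```

In this toy case the minimizer is simply the vector of empirical frequencies, so the iteration is only a check that the projection and the gradient step fit together; the interesting cases are those where the feasible set or the objective rules out a closed form.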

2025-08-25T05:08:49Z Sam Ganzfried http://arxiv.org/abs/2605.30566v1 Participation Costs Narrow Democratic Cooperation 2026-05-28T20:59:21Z

Collective action often requires institutions that make cooperation individually worthwhile. We ask whether democratic allocation of public-good return can transform a repeated public good into a self-sustaining cooperative institution, and how participation costs reshape that process. A simple evolutionary model shows that voted redistribution can support a prosocial allocation order, but can also sustain an antisocial allocation order or democratic free riding, in which individuals benefit from an institution maintained by others while avoiding the cost of participation. The model predicts competing effects of voting cost. Cost can suppress use of the institution to reward low contributors under strong selection, but can also thin the active electorate and erode contributor-rewarding support. We test these predictions in a preregistered online experiment with \NIncludedGroupsVone{} five-person groups. Endogenous democratic redistribution increased contributions relative to an equal-share public-goods control, with zero-cost voting producing the strongest temporal improvement. Voting costs did not mainly turn active voters toward low-contributor-rewarding allocation. Instead, they shifted behavior toward abstention and democratic free riding, made abstention locally rewarding, and widened the gap between post-task perceptions of democratic participation and the behavioral record. Democratic allocation can therefore stabilize cooperation, but participation costs can reduce the number of people actively sustaining the institution and can make that erosion less visible to participants themselves.

2026-05-28T20:59:21Z 32 pages, 6 figures Mohammad Salahshour Fjolle Shabani Urs Fischbacher Iain D. Couzin http://arxiv.org/abs/2403.11240v4 Speed, Accuracy, and Complexity 2026-05-28T20:22:57Z

This paper studies when response time is informative about problem complexity. It revisits a canonical sequential-sampling model in which a decision-maker chooses when to stop acquiring costly information. Problem complexity is measured by the noise-to-signal ratio of the evidence process. Under exogenous stopping rules -- as when the decision-maker does not optimally adjust to problem complexity -- response time increases with complexity. By contrast, this monotonicity breaks down when the decision-maker observes problem complexity ex ante and optimally adjusts to it. Expected stopping time is then inverse-U-shaped in complexity, so choices are fast in both very simple and very complex problems. Ability and response time are similarly ambiguously related: more able decision-makers are faster on simple problems but slower on complex ones. Finally, this paper shows that complexity and ability can be inferred from the sensitivity of choices to subsidies, which is greater in more complex problems and for less able decision-makers.

2024-03-17T15:06:39Z Duarte Gonçalves http://arxiv.org/abs/2605.30515v1 Obviously Strategy-proof Choice of Social Acts 2026-05-28T19:53:28Z

We study obviously strategy-proof implementation in the framework of social choice over acts introduced by Bahel and Sprumont (2020). We characterize the class of unanimous social choice functions that are implementable via obviously strategy-proof mechanisms. Our main result shows that a unanimous social choice function is obviously strategy-proof implementable if and only if it is dictatorial.

2026-05-28T19:53:28Z Abinash Panda Anup Pramanik http://arxiv.org/abs/2406.14198v4 Tight Guarantees in the Commons 2026-05-28T15:58:08Z

In our context-free model of a commons, the function $\mathcal{W}$ transforms the profile of the agents' types $(x_{1},..,x_{n})$ to a freely transferable output $\mathcal{W}(x_{1},..,x_{n})$ that they must share fairly. We expand the ubiquitous concept of \textit{endogenous fair shares} to include both a lower and an upper bound on agent $i$'s share at the interim stage where $i$ only knows its own type $x_{i}$. Two functions $(g^{-},g^{+})$ form a pair of tight guarantees if 1) they satisfy the system of inequalities $\sum_{i=1}^{n}g^{-}(x_{i})\leq \mathcal{W}(x)\leq \sum_{i=1}^{n}g^{+}(x_{i})$ for all profiles, and 2) the interval $[g^{-}(x_{i}),g^{+}(x_{i})]$ is inclusion minimal across all types. For super (resp. sub) modular functions 1) the \textit{Unanimity} share $\frac{1}{n}\mathcal{W}(x_{i},x_{i},..,x_{i})$ is the unique tight upper (resp. lower) guarantee, 2) two \textit{Stand Alone} shares $g(x_{i})=\mathcal{W}(x_{i},\overbrace{x_{0},..,x_{0}})-\frac{n-1}{n}\mathcal{W}(\overbrace{x_{0},..,x_{0}})$ (where $x_{0}$ is the smallest or largest type) bracket all tight guarantees on the other side of Unanimity, 3) serial cost sharing implements the Unanimity and Stand Alone guarantees. In applications to specific microeconomic models, tight guarantees vindicate or dismiss familiar deterministic sharing rules and suggest new ones with a clear normative interpretation. Our examples include joint production with substitute or complementary inputs, allocating an indivisible good and cash transfers, sharing the cost (or benefit) of the variance or the spread of types, the waiting cost in a queue, and more.
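
A small worked case may help fix the definitions. The choices below are illustrative assumptions, not taken from the abstract: $n=2$, types in $[0,1]$, and the supermodular function $\mathcal{W}(x_{1},x_{2})=x_{1}x_{2}$. The Stand Alone formula is read here with $\mathcal{W}$ taking $n$ arguments in both terms. The block only verifies the inequalities in condition 1; it does not prove tightness.

```latex
% Assumptions for illustration: n = 2, x_i in [0,1], W(x_1, x_2) = x_1 x_2.

% Unanimity share as an upper guarantee:
\[
g^{+}(x_i) = \tfrac{1}{2}\mathcal{W}(x_i, x_i) = \tfrac{1}{2}x_i^{2},
\qquad
x_1 x_2 \le \tfrac{1}{2}x_1^{2} + \tfrac{1}{2}x_2^{2}
\iff (x_1 - x_2)^{2} \ge 0 .
\]

% Stand Alone share with x_0 = 0 (smallest type) as a lower guarantee:
\[
g(x_i) = \mathcal{W}(x_i, 0) - \tfrac{1}{2}\mathcal{W}(0, 0) = 0,
\qquad
0 + 0 \le x_1 x_2 .
\]

% Stand Alone share with x_0 = 1 (largest type) as a lower guarantee:
\[
g(x_i) = \mathcal{W}(x_i, 1) - \tfrac{1}{2}\mathcal{W}(1, 1) = x_i - \tfrac{1}{2},
\qquad
x_1 + x_2 - 1 \le x_1 x_2
\iff (1 - x_1)(1 - x_2) \ge 0 .
\]
```

In this example the Unanimity share sits above $\mathcal{W}$ and both Stand Alone shares sit below it, which matches the abstract's description of the two sides for supermodular functions.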

2024-06-20T11:03:51Z Anna Bogomolnaia Hervé Moulin http://arxiv.org/abs/2605.30081v1 Tax Salience: How Requiring Transparency Affects the Price of Equality 2026-05-28T15:28:53Z

Less-salient taxes can ease the classic equality-efficiency trade-off by making people respond less to taxation. But deliberately obscuring taxes may be viewed as dishonest. This creates a three-way trade-off between equality, efficiency, and honesty. We analyze this trade-off in a simple setting with a linear income tax. We define and characterize the morally efficient frontier, trading off utilitarian welfare against honesty or transparency. Complete honesty is Pareto inefficient but not morally inefficient. More generally, any increase in honesty reduces utilitarian welfare. When utilitarian welfare is decomposed into equality and efficiency, the cost of honesty falls most robustly on equality: higher salience always reduces equality, while the effect on efficiency is ambiguous. This asymmetry is explained by the fact that salience increases the price of equality, which is the efficiency cost of a marginal increase in equality. Our approach could be applied to other settings in which utilitarian and procedural or deontological values conflict.

2026-05-28T15:28:53Z Ashley Craig Itai Sher http://arxiv.org/abs/2604.19044v2 Fair Commodity Taxation 2026-05-28T13:25:34Z

We study economies where consumers interact independently with many monopolists. When consumer valuations over goods are correlated, correlation can distort the induced distribution of consumer surplus (information rents). We identify which shifts in the correlation structure over values make the induced distribution more or less fair, in the sense of second order stochastic dominance. We then investigate the role taxation can have on information rents, and show the tax authority never benefits from randomizing the allocation of goods. We characterize the set of mechanisms that are on the fairness-efficiency frontier under regularity conditions on the distribution of types. Furthermore, under these conditions all allocations on the fairness-efficiency frontier ration the good more than an unregulated monopolist. Finally, we discuss implications of our model for luxury commodity taxation.

2026-04-21T03:50:42Z Eric Gao Daniel Luo http://arxiv.org/abs/2605.29361v1 The Empirical Content of Revealed Preference in High Dimensions 2026-05-28T05:00:52Z

We examine how the empirical content of revealed preference theory depends on the dimensionality of the choice environment. While higher-dimensional choice problems may appear more demanding, we show that revealed preference restrictions become less informative. Using Selten's Area measure, we establish that for any fixed number of observations, the empirical content of GARP converges to zero exponentially fast in the number of goods. We provide complementary proofs based on revealed preference graphs and the Afriat inequalities, and show in simulations calibrated to scanner data that the effect is quantitatively large. We also evaluate potential responses in observational and experimental settings and find that, while these can slow the rate, they do not eliminate this loss of empirical content.
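
For readers unfamiliar with GARP, the test itself is short to state in code. The sketch below is a standard textbook check, not the paper's method, and it does not compute Selten's Area measure: it builds the direct revealed preference relation from prices and chosen bundles, takes the transitive closure with Warshall's algorithm, and looks for a strict reversal. The function name and data layout are assumptions made for this example.

```python
from typing import List, Sequence


def satisfies_garp(prices: List[Sequence[float]],
                   bundles: List[Sequence[float]]) -> bool:
    """Check the Generalized Axiom of Revealed Preference.

    Observation t is (prices[t], bundles[t]). Bundle t is directly revealed
    preferred to bundle s if bundle s was affordable when t was chosen.
    """
    n = len(bundles)
    # cost[t][s] = expenditure needed to buy bundle s at the prices of t
    cost = [[sum(p * x for p, x in zip(prices[t], bundles[s]))
             for s in range(n)] for t in range(n)]

    # Direct revealed preference relation
    revealed = [[cost[t][t] >= cost[t][s] for s in range(n)] for t in range(n)]

    # Transitive closure (Warshall's algorithm)
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if revealed[i][k] and revealed[k][j]:
                    revealed[i][j] = True

    # GARP fails if t is revealed preferred to s while s is strictly
    # directly revealed preferred to t
    for t in range(n):
        for s in range(n):
            if revealed[t][s] and cost[s][s] > cost[s][t]:
                return False
    return True


# Two goods, two observations. In the first data set each chosen bundle is
# strictly cheaper than the other at its own prices' alternative, a violation.
violating = satisfies_garp([(1, 2), (2, 1)], [(1, 2), (2, 1)])
consistent = satisfies_garp([(2, 1), (1, 2)], [(1, 2), (2, 1)])
print(violating, consistent)
```

With few observations and many goods, budget sets rarely make other chosen bundles affordable, so the `revealed` matrix is mostly empty off the diagonal and the final loop has little to reject, which is the intuition behind the loss of empirical content the abstract describes.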

2026-05-28T05:00:52Z Ian Crawford Longye Tian http://arxiv.org/abs/2602.23098v2 Purification and Perturbations of Communication and Repeated Games 2026-05-28T03:59:35Z

I prove that it is irrational for agents with even slightly private preferences to condition their strategy on private information that is payoff-irrelevant to them, contrary to powerful techniques for analyzing communication and repeated games. In repeated games with public+private monitoring, this means all pure equilibria are perfect public equilibria, and non-trivial belief-free equilibria do not exist. In a wide class of communication games (up to allowing receiver commitment), this means persuasion is impossible with state-independent preferences. Nevertheless, these analytic techniques may be made compatible with private preferences through perturbation approaches: considering either payoff-relevance of the private information or correlation between parties' information. An example of the latter occurs by introducing `atonement' to repeated-game equilibria.

2026-02-26T15:14:41Z Alistair Barton http://arxiv.org/abs/2304.01385v6 Should the Timing of Inspections be Predictable? 2026-05-27T19:19:49Z

A principal hires an agent to work on a long-term project that culminates in a breakthrough or a breakdown. At each time, the agent privately chooses to work or shirk. Working increases the arrival rate of breakthroughs and decreases the arrival rate of breakdowns. To motivate the agent to work, the principal conducts costly inspections. She fires the agent if shirking is detected. We characterize the principal's optimal inspection policy. Predictable inspections are optimal if work primarily generates breakthroughs. Random inspections are optimal if work primarily prevents breakdowns. Crucially, the agent's actions affect the survival rate of the project, which determines his risk attitude over the timing of planned inspections.

2023-04-03T21:20:25Z Ian Ball Jan Knoepfle http://arxiv.org/abs/2505.06846v3 Utility Maximization Under Endogenous Uncertainty 2026-05-27T19:10:04Z

This paper studies decision problems where the decision maker's choice of action affects the probability distribution of a payoff relevant random variable. We establish sufficient conditions for the existence of an expected utility maximizing action in such settings. The main requirement is a mild continuity condition on the family of possible distributions. We also show that this condition is a minimal requirement. Our result does not require common assumptions such as the monotone likelihood ratio property (MLRP) or the convexity of distribution functions condition (CDFC). It can therefore be used to prove the existence of an optimal action in many settings where existing results do not apply, including an important class of problems where the support of the random variable depends on the decision maker's choice and the density functions are not pointwise continuous.

2025-05-11T05:11:22Z Ayush Gupta http://arxiv.org/abs/2605.28985v1 Subsidizing Sequential Search 2026-05-27T18:40:46Z

We study markets where firms compete for consumer attention by subsidizing costly product inspection. These subsidies do not change product quality, but they alter the order in which consumers search by lowering inspection costs. We establish a subsidy-sorting principle: in any equilibrium, higher-quality firms provide weakly larger subsidies, leading consumers to search in descending subsidy order. A unique equilibrium survives forward-induction reasoning in the spirit of the Intuitive Criterion: low-quality firms are never inspected, intermediate-quality firms separate with strictly increasing subsidies, and high-quality firms pool at the full subsidy. This equilibrium maximizes information revelation among all possible outcomes and ensures efficient inspection. We then extend the analysis to AI-mediated platforms that can create and price inspection tokens. The platform's optimal linear pricing leads to excessive inspection relative to the social optimum. While this distortion does not reduce consumer welfare, it reallocates surplus from sellers to the platform and consumers.

2026-05-27T18:40:46Z Salvador Candelas Nicole Immorlica Brendan Lucier