https://arxiv.org/api/iskVSPSKJUivYep020v7V/P5J4E
Updated: 2026-06-13T18:15:16Z

http://arxiv.org/abs/2604.25954v1
Fast Core Identification
Updated: 2026-04-25T14:19:29Z
This paper examines the computational complexity of the \emph{Core Identification Problem} (CIP) in one-sided matching markets governed by the Top Trading Cycles (TTC) algorithm. The central contribution is a formal complexity separation: this paper proves that identifying which agents receive a core allocation is strictly easier than computing the full TTC allocation. Specifically, we show that CIP can be solved in $\bigO{Ln}$ time, where $L$ is the maximum number of preferences reported per agent, by computing the leading eigenvector of a preference-derived Markov transition matrix via randomized SVD\@. For sparse preference profiles ($L = \bigO{1}$, as in NYC school choice, where $L = 12$), this yields an $\bigO{n}$ algorithm. This result strictly improves on the $\bigO{n \log n}$ complexity of the full TTC allocation (\cite{SabanSethuraman2013}) and matches the $\Omg{n}$ information-theoretic lower bound, establishing asymptotic optimality. The method inherits all properties of TTC: Pareto efficiency, individual rationality, and strategy-proofness, and is robust to preference noise for sufficiently large~$n$.
Published: 2026-04-25T14:19:29Z
Comment: 23 pages
Authors: Irene Aldridge

http://arxiv.org/abs/2604.22069v1
Liquidity provision in CLMMs: evidence from transactions data
Updated: 2026-04-23T20:48:47Z
The emergence of Concentrated Liquidity Market Makers (CLMMs) has made liquidity provision on decentralized exchanges an active and risk-sensitive task. However, the standalone profitability of liquidity provision remains unclear for liquidity providers (LPs) who neither hedge their inventory risk nor receive off-pool profits. This paper studies the actual outcomes of LP activity using historical transaction-level data from WETH/USD liquidity pools on the Base chain across the Uniswap, Aerodrome, PancakeSwap and SushiSwap protocols.
We propose a methodology for reconstructing LP PnL dynamics from on-chain events and introduce an original metric that captures both the terminal state of LP capital and its path over time. Based on this framework, we estimate the share of successful LPs, classify their behavior and develop a taxonomy of 15 position types as structural components of PnL. We further identify a distinct class of multi-LPs operating across several pools and show that the dominant profitable position configurations are concentrated around the current pool price. The results show that only about one out of six LPs avoids losses in the selected market segment, raising an open question about the true economic motives of LP participation. Evidence also suggests that successful LPs often close positions before the full range is traversed, making observed behavior closer to profit-target-based strategies.
Published: 2026-04-23T20:48:47Z
Authors: Andrey Urusov, Rostislav Berezovskiy, Anatoly Krestenko, Andrei Kornilov

http://arxiv.org/abs/2604.21581v1
Pricing and Hedging Financial Derivatives in Merger\&Acquisition Deals with Price Impact
Updated: 2026-04-23T12:03:56Z
We investigate the optimal execution of contracts that are used in merger\&acquisition deals. We consider cash-settled and physically delivered contracts between a broker and a counterpart. Contracts are linear (total return swaps), nonlinear (collar contracts) or Asian type (TWAP based contracts). We derive the optimal execution strategy and the optimal fee through indifference utility arguments allowing for linear market effects of trades. We show that linear cash-settled contracts are more expensive and more exposed to manipulation/statistical arbitrages by the broker.
Nonlinear and Asian type contracts are also exposed to these phenomena.
Published: 2026-04-23T12:03:56Z
Authors: Emilio Barucci, Yuheng Lan, Daniele Marazzina

http://arxiv.org/abs/2604.20949v1
Early Detection of Latent Microstructure Regimes in Limit Order Books
Updated: 2026-04-22T17:47:52Z
Limit order books can transition rapidly from stable to stressed conditions, yet standard early-warning signals such as order flow imbalance and short-term volatility are inherently reactive. We formalise this limitation via a three-regime causal data-generating process (stable $\to$ latent build-up $\to$ stress) in which a latent deterioration phase creates a prediction window prior to observable stress. Under mild assumptions on temporal drift and regime persistence, we establish identifiability of the latent build-up regime and derive guarantees for strictly positive expected lead-time and non-trivial probability of early detection. We propose a trigger-based detector combining MAX aggregation of complementary signal channels, a rising-edge condition, and adaptive thresholding. Across 200 simulations, the method achieves mean lead-time $+18.6 \pm 3.2$ timesteps with perfect precision and moderate coverage, outperforming classical change-point and microstructure baselines. A preliminary application to one week of BTC/USDT order book data shows consistent positive lead-times while baselines remain reactive. Results degrade in low signal-to-noise and short build-up regimes, consistent with theory.
Published: 2026-04-22T17:47:52Z
Comment: 48 pages, 7 figures. Combines theoretical guarantees (identifiability and early-detection bounds), 200-run simulation study, and preliminary real-data evaluation on BTC/USDT limit order books.
Code and data available
Authors: Prakul Sunil Hiremath, Vruksha Arun Hiremath

http://arxiv.org/abs/2605.00864v1
Arbitrage Analysis in Polymarket NBA Markets
Updated: 2026-04-22T03:01:49Z
While decentralized prediction markets like Polymarket have gained significant traction, their market microstructure and high-frequency pricing efficiency remain underexplored. This paper conducts a systematic empirical analysis of algorithmic arbitrage within Polymarket's NBA game markets. By reconstructing continuous market states from over 75 million limit order book snapshots across 173 games, we evaluate the frequency, duration, and profitability of both single-market and combinatorial arbitrage opportunities. Our findings demonstrate profound microstructural efficiency. Single-market anomalies are exceedingly rare, yielding only 7 executable in-game episodes that persist for a median duration of just 3.6 seconds. Combinatorial inefficiencies are more frequent, producing 290 active episodes overwhelmingly concentrated in the final minutes of live play. While combinatorial execution yields a statistically meaningful median return of 101 basis points, we find that the theoretical "Middle" jackpot is never empirically realized. Furthermore, execution is severely bottlenecked by shallow order book depth, with 76.9\% of combinatorial opportunities constrained to an average executable size of just 14.8 shares. Ultimately, while executable mispricings exist, they are structurally bounded by liquidity, confining risk-free extraction strictly to the retail scale.
Published: 2026-04-22T03:01:49Z
Authors: Guang Cheng, Jiaxin Yang, Haoxuan Zou

http://arxiv.org/abs/2604.20067v1
Testing replication for an agent-based model of market fragmentation and latency arbitrage
Updated: 2026-04-22T00:13:33Z
This study strengthens the foundations of multi-venue market modeling by attempting an independent replication of Wah and Wellman's 2016 model of latency arbitrage in a fragmented market.
We find that faithful replication is hindered by missing implementation details in the original paper and limited quantitative reporting. We demonstrate that increasing the number of simulation runs beyond the original design allows for the creation of bootstrap confidence intervals to support rigorous tests of quantitative alignment, compensating for missing distributional information (e.g., variance). We also demonstrate that increased complexity across the modeled scenarios corresponds with increased difficulty aligning to the original results. We draw on a codebase released by the original authors in connection with a later paper to recover additional implementation details; however, we reject quantitative alignment between that codebase and the published results. Combining information from the paper and the released code, we achieve relational equivalence for most metrics but reject quantitative alignment for model settings where latency is non-zero. We show that many of the qualitative takeaways from the original paper on the effects of market fragmentation and latency arbitrage are sensitive to the specifics of a `greedy strategy' extension given to the zero-intelligence (ZI) trader agents. Under an alternative interpretation of this strategy, we find that market fragmentation decreases execution times in all experiments and increases trader welfare in most experiments. Finally, to facilitate future replication, critique, and extension, we provide an ODD (Overview, Design concepts, Details) protocol for our implementations of the model.
Published: 2026-04-22T00:13:33Z
Authors: Ethan Ratliff-Crain, Colin M. Van Oort, Matthew T. K. Koehler, Brian F. Tivnan

http://arxiv.org/abs/2604.19956v1
On-chain Peak Shaving
Updated: 2026-04-21T20:04:50Z
Blockchain technology is widely expected to reduce transaction costs by automating contract enforcement and eliminating intermediaries; yet, the execution costs imposed by network congestion have received little attention in the operations management literature.
We study on-chain peak shaving, the systematic scheduling of Ethereum transactions toward low-congestion windows to reduce gas fee exposure. We use transaction-level data from seven firms across seven industries (N = 62,142 transactions, January-March 2026).
Gas fees vary significantly throughout the day: the peak-hour premium at 10 AM Eastern Time reaches USD 0.220 per transaction above the overnight baseline, driven primarily by speculative-arbitrage demand rather than operational activity. Firm-level scheduling responses are heterogeneous and not uniformly disciplined. Only three of seven firms transact disproportionately during off-peak hours; four transact counter-cyclically, concentrated in peak windows due to external deadlines or governance cycles. This heterogeneity is explained by two moderators: transaction deferrability and gas intensity. We formalize these into an On-Chain Scheduling Matrix that maps firms to four regimes: 1) full peak shaving, 2) selective peak shaving, 3) cost provisioning, and 4) accept-market-rate, with regime membership predicting both fee savings and residual cost floors (40-92 percent of actual expenditure).
Theoretically, we extend Transaction Cost Economics to account for time-varying execution costs imposed by congestion externalities. In addition to extending Williamson's original cost taxonomy, we introduce a dual classification of gas fees as execution costs in timing but maladaptation costs in origin. The findings reposition on-chain gas-fee management alongside energy procurement and foreign exchange hedging as a domain requiring systematic operational planning.
Published: 2026-04-21T20:04:50Z
Comment: 26 pages
Authors: Irene Aldridge, Gavhar Annaeva, Leyla Beriker, Zhiheng Cai, Samyak Choudhary, Camila Godoy, Kaicheng Gong, Zitao Huang, Jonah Ji, Hetvi Kharvasiya, Heng Li, Yuxuan Li, Tianchi Ma, Qingcheng Meng, Ruiyang Shi, Ananya Shrivastava, Jiaqi Wang, Yifan Wang, Zihua Wu, Jiayang Xu, Yuheng Yan, Zijun Zeng, Bowen Zhang, Francesco Zhang

http://arxiv.org/abs/2605.00854v1
Dynamics of Periodic Bubbles and Crashes: Modeling Market Overheating and Panic Selling via Cubic Momentum
Updated: 2026-04-21T00:17:00Z
This paper proposes a simple and parsimonious discrete-time simulation model to describe the endogenous formation and periodic collapse of financial bubbles. While existing literature has extensively explored the statistical properties of locally explosive bubble dynamics, capturing the micro-level interplay of investor herd behavior and panic selling within a unified framework remains a challenge. Our model addresses this by introducing a cubic function of market momentum to determine the balance of trading directions. This mechanism drives both trend-following behavior during the bubble phase and sudden market crashes when the momentum exceeds a critical threshold. Furthermore, inspired by the self-exciting nature of the Hawkes process, the model endogenizes ``market frenzy'' by linking trading frequency directly to the accumulated momentum.
Simulation results demonstrate that this minimal setup successfully replicates the complex, nonlinear dynamics of bubbles, including simultaneous surges in liquidity and price, followed by dramatic crashes.
Published: 2026-04-21T00:17:00Z
Comment: 12 pages, 2 figures
Authors: Naohiro Yoshida

http://arxiv.org/abs/2603.19944v2
Large Language Models and Stock Investing: Is the Human Factor Required?
Updated: 2026-04-20T14:40:05Z
This paper investigates whether large language models (LLMs) can generate reliable stock market predictions. We evaluate four state-of-the-art models - ChatGPT, Gemini, DeepSeek, and Perplexity - across three prompting strategies: a naive query, a structured approach, and chain-of-thought reasoning. Our results show that LLM-generated recommendations are hindered by recurring reasoning failures, including financial misconceptions, carryover errors, and reliance on outdated or hallucinated information. When appropriately guided and supervised, LLMs demonstrate the capacity to outperform the market, but realizing LLMs' full potential requires substantial human oversight. We also find that grounding stock recommendations in official regulatory filings increases their forecasting accuracy. Overall, our findings underscore the need for robust safeguards and validation when deploying LLMs in financial markets.
Published: 2026-03-20T13:47:13Z
Comment: 33 pages; 6 tables; 2 figures
Authors: Ricardo Crisostomo, Diana Mykhalyuk

http://arxiv.org/abs/2510.22341v2
Understanding Carbon Trade Dynamics: A European Union Emissions Trading System Perspective
Updated: 2026-04-18T10:19:25Z
The European Union Emissions Trading System (EU ETS), the world's first and largest cap-and-trade carbon market, is a cornerstone of EU climate policy. This study provides a comprehensive empirical analysis of the EU carbon market's efficiency, price dynamics, and structural network from 2010 to 2020.
First, we identify significant price clustering and short-term return predictability using an AR-GARCH model, achieving around 60 percent directional accuracy and an 80 percent hit rate within forecasted confidence intervals. These observed patterns motivate a deeper exploration of market structure. Second, leveraging this insight, a weighted network analysis of inter-country transactions uncovers a concentrated market where a few registries dominate high-value flows and exert disproportionate influence. Finally, building upon the network findings, country-specific log-log regressions of price on traded quantity reveal heterogeneous and sometimes counter-intuitive elasticities; in several cases, positive elasticities exceed unity, indicating that trading volumes rise with prices, a deviation from conventional demand behavior that highlights potential inefficiencies driven by speculation, strategic behavior, or policy distortions. Collectively, these results point to persistent inefficiencies within the EU ETS, including partial predictability, asymmetric market power, and anomalous price-volume relationships, implying that while the system has driven decarbonization, its trading and pricing mechanisms remain imperfect.
Published: 2025-10-25T15:55:02Z
Authors: Avirup Chakraborty

http://arxiv.org/abs/2604.10005v2
What Happens When Institutional Liquidity Enters Prediction Markets: Identification, Measurement, and a Synthetic Proof of Concept
Updated: 2026-04-17T23:06:18Z
Prediction markets are starting to look less like crowd polls and more like electronic markets. The central question is therefore no longer only whether these markets forecast well, but what happens when institutional liquidity enters: do spreads tighten, does price discovery improve, and do those gains actually reach the traders who are slowest to react when information arrives?
This paper offers a research design for answering that question. It defines a broad market-quality lens, separates the main channels through which institutional liquidity enters, and maps the identification problems that arise in live venue data. It also uses a synthetic microstructure laboratory as a proof of concept for the measurement pipeline.
The main lesson of the synthetic exercise is deliberately narrow. Market-maker coverage, liquidity incentives, and automation do not have to work through the same channel; average liquidity gains do not have to translate into equal gains for all traders; and the sharpest welfare losses are most likely to appear in shock states, when slower takers receive the least pass-through of tighter quoted markets. The synthetic results are useful because they stress-test the design, not because they settle the live empirical question.
Published: 2026-04-11T03:27:43Z
Authors: Shaw Dalen

http://arxiv.org/abs/2604.13334v1
Against a Universal Trading Strategy: No-Arbitrage, No-Free-Lunch, and Adversarial Cantor Diagonalization
Updated: 2026-04-14T22:52:12Z
We investigate the impossibility of universally winning trading strategies -- those generating strict profit across all market trajectories -- through three distinct mathematical paradigms. Fundamentally, under standard admissibility constraints, the existence of such a strategy is a strict subset of strong arbitrage, which is mathematically precluded in competitive markets admitting an equivalent martingale measure. Beyond this rigorous measure-theoretic foundation, we explore analogous limitations in two alternative modeling regimes. Combinatorially, the No-Free-Lunch theorem demonstrates that outperformance requires exploitation of non-uniform market structure, as uniform averaging precludes universal dominance. Computationally, a Turing diagonalization argument constructs an adversarial environment that defeats any computable trading algorithm, shifting the impossibility from exogenous price paths to adaptive adversaries. These mathematical limits are framed by a time-reversal heuristic that establishes a formal analogy between financial martingale measures and thermodynamic detailed balance, resolving the Maxwell's Demon analogy for markets without relying on physically irrelevant Landauer erasure costs.
Using the Wheel Options Strategy as a case study, we demonstrate that strategies succeeding ``for all practical purposes'' (FAPP) inherently depend on transient regime assumptions, meaning their automated execution systematically amplifies tail risks.
Published: 2026-04-14T22:52:12Z
Comment: 6 pages, 2 tables
Authors: Karl Svozil

http://arxiv.org/abs/2604.22818v1
Representation Homogeneity and Systemic Instability in AI-Dominated Financial Markets: A Structural Approach
Updated: 2026-04-14T21:14:28Z
This paper investigates how similarity in the informational representation of market states among Artificial Intelligence (AI) trading agents can generate systemic instability in financial markets. We construct a structural multi-agent market model calibrated using high-frequency microstructural moments. AI agents are modeled through a two-layer decision architecture consisting of a nonlinear representation layer and an adaptive linear readout layer. The representation layer maps raw market states into high-dimensional feature vectors, while the readout layer generates return forecasts that feed into a risk-controlled trading rule. This representation-based microfoundation separates two objects that are often conflated in the literature: representation homogeneity (the degree to which agents encode market states into similar feature spaces) and forecast overlap (the degree to which agents produce similar return predictions). We show theoretically that these two concepts are related but not equivalent, and that representation homogeneity can compress the effective space of forecast disagreement under stress even when predictions appear diverse in normal times. Through controlled factorial experiments that vary representation homogeneity while conditioning on alternative risk-aversion and learning-rate distributions, we hypothesize that increasing representation similarity amplifies synchronization in beliefs and positions, leading to volatility clustering, liquidity stress, and elevated tail risk.
Our structural mechanisms suggest that low perceived volatility regimes can endogenously accumulate hidden leverage through position stickiness, which subsequently collapses when shocks trigger synchronized deleveraging. The results provide a structural foundation for macroprudential policies aimed at monitoring and preserving diversity in how AI systems represent and process market information.
Published: 2026-04-14T21:14:28Z
Authors: Yimeng Qiu, Qiwei Han

http://arxiv.org/abs/2604.13260v1
Which Voices Move Markets? Speaker Identity and the Cross-Section of Post-Earnings Returns
Updated: 2026-04-14T19:48:48Z
We utilize FinBERT, a domain-specific transformer model, to parse 6.5 million sentences from 16,428 S&P 500 quarterly earnings call transcripts (2015-2025) and demonstrate that post-earnings stock returns are not equally affected by all speakers in a conference call. Our section-weighted sentiment, with empirically derived speaker weights (Analyst 49%, CFO 30%, Executive 16%, Other 5%), achieves an out-of-sample Spearman IC of 0.142 versus 0.115 in-sample, generates monthly long-short alpha of 2.03% unexplained by the Fama-French five-factor model (t = 6.49), and remains significant after controlling for standardized unexpected earnings (SUE). FinBERT section-weighted sentiment entirely subsumes the Loughran-McDonald dictionary approach (FinBERT t = 5.90; LM t = 0.86 in the combined specification). Signal decay analysis and cumulative abnormal return charts confirm gradual price adjustment consistent with sluggish assimilation of soft information.
All results undergo rigorous out-of-sample validation with an explicit temporal split, yielding improved rather than deteriorated predictive power.
Published: 2026-04-14T19:48:48Z
Comment: 22 tables, 2 figures, 16 references
Authors: Karmanpartap Singh Sidhu, Junyi Fan, Maryam Pishgar

http://arxiv.org/abs/2604.11477v1
OOM-RL: Out-of-Money Reinforcement Learning Market-Driven Alignment for LLM-Based Multi-Agent Systems
Updated: 2026-04-13T13:45:42Z
The alignment of Multi-Agent Systems (MAS) for autonomous software engineering is constrained by evaluator epistemic uncertainty. Current paradigms, such as Reinforcement Learning from Human Feedback (RLHF) and AI Feedback (RLAIF), frequently induce model sycophancy, while execution-based environments suffer from adversarial "Test Evasion" by unconstrained agents. In this paper, we introduce an objective alignment paradigm: \textbf{Out-of-Money Reinforcement Learning (OOM-RL)}. By deploying agents into the non-stationary, high-friction reality of live financial markets, we utilize critical capital depletion as an un-hackable negative gradient. Our longitudinal 20-month empirical study (July 2024 -- February 2026) chronicles the system's evolution from a high-turnover, sycophantic baseline to a robust, liquidity-aware architecture. We demonstrate that the undeniable ontological consequences of financial loss forced the MAS to abandon overfitted hallucinations in favor of the \textbf{Strict Test-Driven Agentic Workflow (STDAW)}, which enforces a Byzantine-inspired uni-directional state lock (RO-Lock) anchored to a deterministically verified $\geq 95\%$ code coverage constraint matrix. Our results show that while early iterations suffered severe execution decay, the final OOM-RL-aligned system achieved a stable equilibrium with an annualized Sharpe ratio of 2.06 in its mature phase.
We conclude that substituting subjective human preference with rigorous economic penalties provides a robust methodology for aligning autonomous agents in high-stakes, real-world environments, laying the groundwork for generalized paradigms where computational billing acts as an objective physical constraint.
Published: 2026-04-13T13:45:42Z
Comment: 13 pages, 3 figures
Authors: Kun Liu, Liqun Chen