https://arxiv.org/api/aaaIczErAYlsfDWTIwvOCnoY/WE
2026-03-28T12:53:40Z
217121015

http://arxiv.org/abs/2508.04003v1
The Marginal Effects of Ethereum Network MEV Transaction Re-Ordering
2025-08-06T01:32:02Z
Two MEV builders now produce nearly 80\% of Ethereum blocks. Block builders have the ability to reorder transactions on the blockchain in a way that can be harmful to participants. We estimate they would pay in the aggregate nearly \$14 million per month to ensure that they remained in the first quartile of the block. Sandwich attacks, in which a transaction is front-run, are frequent, averaging more than one per block. Gas fees on these transactions pay for nearly 15\% of the MEV payments to the validator. These attacks have especially large marginal effects and skew the distribution. Reforms such as gas fee priority or private transaction pools might be helpful.
2025-08-06T01:32:02Z
Bruce Mizrach, Nathaniel Yoshida

http://arxiv.org/abs/2508.03474v1
Unravelling the Probabilistic Forest: Arbitrage in Prediction Markets
2025-08-05T14:06:50Z
Polymarket is a prediction market platform where users can speculate on future events by trading shares tied to specific outcomes, known as conditions. Each market is associated with a set of one or more such conditions. To ensure proper market resolution, the condition set must be exhaustive -- collectively accounting for all possible outcomes -- and mutually exclusive -- only one condition may resolve as true. Thus, the collective prices of all related outcomes should be \$1, representing a combined probability of 1 of any outcome. Despite this design, Polymarket exhibits cases where dependent assets are mispriced, allowing for purchasing (or selling) a certain outcome for less than (or more than) \$1, guaranteeing profit. This phenomenon, known as arbitrage, could enable sophisticated participants to exploit such inconsistencies.
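The pricing condition described above can be illustrated with a minimal sketch. It assumes one share of each condition pays \$1 if that condition resolves true, and it ignores fees, order-book depth, and execution risk. The function names `rebalancing_gap` and `classify` are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def rebalancing_gap(prices: Sequence[float]) -> float:
    """Return 1 minus the summed prices of an exhaustive, mutually exclusive condition set.

    Positive gap: buying one share of every condition costs less than the $1 payout.
    Negative gap: selling one share of every condition collects more than the $1 liability.
    """
    return 1.0 - sum(prices)


def classify(prices: Sequence[float], tolerance: float = 1e-9) -> str:
    """Label the mispricing direction; `tolerance` is an illustrative threshold."""
    gap = rebalancing_gap(prices)
    if gap > tolerance:
        return "buy-all"
    if gap < -tolerance:
        return "sell-all"
    return "none"


if __name__ == "__main__":
    # Three conditions priced at 0.40, 0.35 and 0.20 sum to 0.95.
    print(classify([0.40, 0.35, 0.20]), rebalancing_gap([0.40, 0.35, 0.20]))
```

A real analysis would work from order-book quotes rather than a single price per condition, so the sketch only shows the sign of the opportunity, not its realizable size.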
In this paper, we conduct an empirical arbitrage analysis on Polymarket data to answer three key questions: (Q1) What conditions give rise to arbitrage? (Q2) Does arbitrage actually occur on Polymarket? (Q3) Has anyone exploited these opportunities? A major challenge in analyzing arbitrage between related markets lies in the scalability of comparisons across a large number of markets and conditions, with a naive analysis requiring $O(2^{n+m})$ comparisons. To overcome this, we employ a heuristic-driven reduction strategy based on timeliness, topical similarity, and combinatorial relationships, further validated by expert input.
Our study reveals two distinct forms of arbitrage on Polymarket: Market Rebalancing Arbitrage, which occurs within a single market or condition, and Combinatorial Arbitrage, which spans across multiple markets. We use on-chain historical order book data to analyze when these types of arbitrage opportunities have existed, and when they have been executed by users. We find a realized estimate of 40 million USD of profit extracted.
2025-08-05T14:06:50Z
Oriol Saguillo, Vahid Ghafouri, Lucianna Kiffer, Guillermo Suarez-Tangil

http://arxiv.org/abs/2508.03217v1
Measuring DEX Efficiency and The Effect of an Enhanced Routing Method on Both DEX Efficiency and Stakeholders' Benefits
2025-08-05T08:45:38Z
The efficiency of decentralized exchanges (DEXs) and the influence of token routing algorithms on market performance and stakeholder outcomes remain underexplored. This paper introduces the concept of Standardized Total Arbitrage Profit (STAP), computed via convex optimization, as a systematic measure of DEX efficiency. We prove that executing the trade order maximizing STAP and reintegrating the resulting transaction fees eliminates all arbitrage opportunities, both cyclic arbitrage within DEXs and arbitrage between DEXs and centralized exchanges (CEXs). In a fully efficient DEX (i.e., STAP = 0), the monetary value of target tokens received must not exceed that of the source tokens, regardless of the routing algorithm. Any violation indicates arbitrage potential, making STAP a reliable metric for arbitrage detection. Using a token graph comprising 11 tokens and 18 liquidity pools based on Uniswap V2 data, we observe a decline in DEX efficiency between June 21 and November 8, 2024. Simulations comparing two routing algorithms, Yu Zhang et al.'s line-graph-based method and the depth-first search (DFS) algorithm, show that employing more profitable routing improves DEX efficiency and trader returns over time.
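The efficiency condition above (a round trip should not return more than was put in) can be illustrated with a small sketch. This is not the paper's STAP computation, which uses convex optimization. It only routes a fixed input around one cycle of constant-product pools using the Uniswap V2 swap formula with its 0.3% fee. The reserves and the name `cycle_output` are hypothetical.

```python
from typing import List, Tuple

# Uniswap V2 charges a 0.3% fee, applied as 997/1000 on the input amount.
FEE_NUM, FEE_DEN = 997, 1000


def amount_out(amount_in: float, reserve_in: float, reserve_out: float) -> float:
    """Constant-product swap output after the 0.3% fee."""
    amount_in_with_fee = amount_in * FEE_NUM
    return amount_in_with_fee * reserve_out / (reserve_in * FEE_DEN + amount_in_with_fee)


def cycle_output(amount_in: float, pools: List[Tuple[float, float]]) -> float:
    """Route `amount_in` through pools given as (reserve_in, reserve_out) pairs.

    The pools are assumed to form a cycle that ends in the starting token.
    """
    amount = amount_in
    for reserve_in, reserve_out in pools:
        amount = amount_out(amount, reserve_in, reserve_out)
    return amount


if __name__ == "__main__":
    # Hypothetical reserves for a cycle A -> B -> C -> A.
    balanced = [(1000.0, 1000.0), (1000.0, 1000.0), (1000.0, 1000.0)]
    mispriced = [(1000.0, 1000.0), (1000.0, 1000.0), (1000.0, 1200.0)]
    print(cycle_output(1.0, balanced))   # below 1: fees make the round trip a loss
    print(cycle_output(1.0, mispriced))  # above 1: a cyclic arbitrage opportunity
```

A fixed trade size only detects whether an opportunity exists. Sizing the trade to maximize profit is the optimization problem the abstract refers to.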
Moreover, while total value locked (TVL) remains stable with the line-graph method, it increases under the DFS algorithm, indicating greater aggregate benefits for liquidity providers.
2025-08-05T08:45:38Z
Yu Zhang, Claudio J. Tessone

http://arxiv.org/abs/2508.02971v1
Modeling Loss-Versus-Rebalancing in Automated Market Makers via Continuous-Installment Options
2025-08-05T00:30:24Z
This paper mathematically models a constant-function automated market maker (CFAMM) position as a portfolio of exotic options, known as perpetual American continuous-installment (CI) options. This model replicates an AMM position's delta at each point in time over an infinite time horizon, thus taking into account the perpetual nature and optionality to withdraw of liquidity provision. This framework yields two key theoretical results: (a) It proves that the AMM's adverse-selection cost, loss-versus-rebalancing (LVR), is analytically identical to the continuous funding fees (the time value decay or theta) earned by the at-the-money CI option embedded in the replicating portfolio. (b) A special case of this model derives an AMM liquidity position's delta profile and boundaries that suffer approximately constant LVR, up to a bounded residual error, over an arbitrarily long forward window. Finally, the paper describes how the constant volatility parameter required by the perpetual option can be calibrated from the term structure of implied volatilities and estimates the errors for both implied volatility calibration and LVR residual error. Thus, this work provides a practical framework enabling liquidity providers to choose an AMM liquidity profile and price boundaries for an arbitrarily long, forward-looking time window where they can expect an approximately constant, price-independent LVR.
The results establish a rigorous option-theoretic interpretation of AMMs and their LVR, and provide actionable guidance for liquidity providers in estimating future adverse-selection costs and optimizing position parameters.
2025-08-05T00:30:24Z
Srisht Fateh Singh, Reina Ke Xin Li, Samuel Gaskin, Yuntao Wu, Jeffrey Klinck, Panagiotis Michalopoulos, Zissis Poulos, Andreas Veneris

http://arxiv.org/abs/2410.08744v3
No Tick-Size Too Small: A General Method for Modelling Small Tick Limit Order Books
2025-08-04T15:46:59Z
Tick-sizes not only influence the granularity of the price formation process but also affect market agents' behavior. We investigate the disparity in the microstructural properties of the Limit Order Book (LOB) across a basket of assets with different relative tick-sizes. A key contribution of this study is the identification of several stylized facts, which are used to differentiate between large, medium, and small-tick assets, along with clear metrics for their measurement. We provide cross-asset visualizations to illustrate how these attributes vary with relative tick-size. Further, we propose a Hawkes Process model that not only fits well for large-tick assets, but also accounts for sparsity, multi-tick level price moves, and the shape of the LOB in small-tick assets. Through simulation studies, we demonstrate the versatility of the model and identify key variables that determine whether a simulated LOB resembles a large-tick or small-tick asset. Our tests show that stylized facts like sparsity, shape, and relative returns distribution can be smoothly transitioned from a large-tick to a small-tick asset using our model.
We test this model's assumptions, showcase its challenges and propose questions for further directions in this area of research.
2024-10-11T12:02:21Z
Konark Jain, Jean-François Muzy, Jonathan Kochems, Emmanuel Bacry

http://arxiv.org/abs/2502.16246v2
The "double" square-root law: Evidence for the mechanical origin of market impact using Tokyo Stock Exchange data
2025-08-04T15:21:33Z
Understanding the impact of trades on prices is a crucial question for both academic research and industry practice. It is well established that impact follows a square-root law as a function of traded volume. However, the microscopic origin of such a law remains elusive: empirical studies are particularly challenging due to the anonymity of orders in public data. Indeed, there is ongoing debate about whether price impact has a mechanical origin or whether it is primarily driven by information, as suggested by many economic theories. In this paper, we revisit this question using a very detailed dataset provided by the Japanese stock exchange, containing the trader IDs for all orders sent to the exchange between 2012 and 2018. Our central result is that such a law has in fact microscopic roots and applies already at the level of single child orders, provided one waits long enough for the market to "digest" them. The mesoscopic impact of metaorders arises from a "double" square-root effect: square-root in volume of individual impact, followed by an inverse square-root decay as a function of time. Since market orders are anonymous, we expect and indeed find that these results apply to any market orders, and the impact of synthetic metaorders, reconstructed by scrambling the identity of the issuers, is described by the very same square-root impact law.
We conclude that price impact is essentially mechanical, at odds with theories that emphasize the information content of such trades to explain the square-root impact law.
2025-02-22T14:48:06Z
Guillaume Maitrier, Grégoire Loeper, Kiyoshi Kanazawa, Jean-Philippe Bouchaud

http://arxiv.org/abs/2507.13023v3
Measuring CEX-DEX Extracted Value and Searcher Profitability: The Darkest of the MEV Dark Forest
2025-08-03T08:26:10Z
This paper provides a comprehensive empirical analysis of the economics and dynamics behind arbitrages between centralized and decentralized exchanges (CEX-DEX) on Ethereum. We refine heuristics to identify arbitrage transactions from on-chain data and introduce a robust empirical framework to estimate arbitrage revenue without knowing traders' actual behaviors on CEX. Leveraging an extensive dataset spanning 19 months from August 2023 to March 2025, we estimate a total of 233.8M USD extracted by 19 major CEX-DEX searchers from 7,203,560 identified CEX-DEX arbitrages. Our analysis reveals increasing centralization trends as three searchers captured three-quarters of both volume and extracted value. We also demonstrate that searchers' profitability is tied to their integration level with block builders and uncover exclusive searcher-builder relationships and their market impact. Finally, we correct the previously underestimated profitability of block builders who vertically integrate with a searcher. These insights illuminate the darkest corner of the MEV landscape and highlight the critical implications of CEX-DEX arbitrages for Ethereum's decentralization.
2025-07-17T11:50:42Z
Accepted by AFT 2025
Fei Wu, Danning Sui, Thomas Thiery, Mallesh Pai

http://arxiv.org/abs/2408.02634v2
CLVR Ordering of Transactions on AMMs
2025-07-28T07:13:03Z
This paper introduces a trade ordering rule that aims to reduce intra-block price volatility in Automated Market Maker (AMM) powered decentralized exchanges.
The ordering rule introduced here, Clever Look-ahead Volatility Reduction (CLVR), operates under the (common) framework in decentralized finance that allows some entities to observe trade requests before they are settled, assemble them into "blocks", and order them as they like. On AMM exchanges, asset prices are continuously and transparently updated as a result of each trade and therefore, transaction order has high financial value. CLVR aims to order transactions for traders' benefit. Our primary focus is intra-block price stability (minimizing volatility), which has two main benefits for traders: it reduces the transaction failure rate and allows traders to receive prices closer to the reference price at which they submit their transactions. We show that CLVR constructs an ordering which approximately minimizes price volatility with a small computation cost and can be trivially verified externally.
2024-08-05T16:58:48Z
Robert McLaughlin, Nir Chemaya, Dingyue Liu, Dahlia Malkhi

http://arxiv.org/abs/2507.18417v1
FinDPO: Financial Sentiment Analysis for Algorithmic Trading through Preference Optimization of LLMs
2025-07-24T13:57:05Z
Opinions expressed in online finance-related textual data are having an increasingly profound impact on trading decisions and market movements. This trend highlights the vital role of sentiment analysis as a tool for quantifying the nature and strength of such opinions. With the rapid development of Generative AI (GenAI), supervised fine-tuned (SFT) large language models (LLMs) have become the de facto standard for financial sentiment analysis. However, the SFT paradigm can lead to memorization of the training data and often fails to generalize to unseen samples. This is a critical limitation in financial domains, where models must adapt to previously unobserved events and the nuanced, domain-specific language of finance.
To this end, we introduce FinDPO, the first finance-specific LLM framework based on post-training human preference alignment via Direct Preference Optimization (DPO). The proposed FinDPO achieves state-of-the-art performance on standard sentiment classification benchmarks, outperforming existing supervised fine-tuned models by 11% on average. Uniquely, the FinDPO framework enables the integration of a fine-tuned causal LLM into realistic portfolio strategies through a novel 'logit-to-score' conversion, which transforms discrete sentiment predictions into continuous, rankable sentiment scores (probabilities). In this way, simulations demonstrate that FinDPO is the first sentiment-based approach to maintain substantial positive returns of 67% annually and strong risk-adjusted performance, as indicated by a Sharpe ratio of 2.0, even under realistic transaction costs of 5 basis points (bps).
2025-07-24T13:57:05Z
Giorgos Iacovides, Wuyang Zhou, Danilo Mandic

http://arxiv.org/abs/2507.16548v2
Alternative Loss Function in Evaluation of Transformer Models
2025-07-24T09:56:46Z
The proper design and architecture of testing machine learning models, especially in their application to quantitative finance problems, is crucial. The most important aspect of this process is selecting an adequate loss function for training, validation, estimation purposes, and hyperparameter tuning. Therefore, in this research, through empirical experiments on equity and cryptocurrency assets, we apply the Mean Absolute Directional Loss (MADL) function, which is more adequate for optimizing forecast-generating models used in algorithmic investment strategies.
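The abstract names MADL without defining it. As a rough illustration only, the sketch below assumes the commonly stated form, in which each period contributes minus the sign of (observed return times predicted return) multiplied by the absolute observed return, averaged over periods. That definition is an assumption here and should be checked against the paper. The names `sign` and `madl` are hypothetical.

```python
from typing import Sequence


def sign(x: float) -> int:
    """Return -1, 0 or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def madl(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean Absolute Directional Loss under the assumed definition.

    A correct directional forecast contributes -|observed return| (a gain),
    a wrong one contributes +|observed return| (a loss), so lower is better.
    """
    if len(observed) != len(predicted) or len(observed) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(-sign(r * p) * abs(r) for r, p in zip(observed, predicted))
    return total / len(observed)


if __name__ == "__main__":
    returns = [0.02, -0.01]
    print(madl(returns, [0.5, -0.3]))   # both directions right: negative loss
    print(madl(returns, [-0.5, 0.3]))   # both directions wrong: positive loss
```

Under this form the loss depends on the forecast only through its direction, which is why it lines up with the profit of a simple long/short strategy rather than with forecast error.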
The MADL function results are compared between Transformer and LSTM models, and we show that in almost every case, Transformer results are significantly better than those obtained with LSTM.
2025-07-22T12:57:25Z
12 pages, fixed grammar, typos and minor error in tables
Jakub Michańków, Paweł Sakowski, Robert Ślepaczuk

http://arxiv.org/abs/2305.14604v2
Automated Market Making and Arbitrage Profits in the Presence of Fees
2025-07-23T15:39:32Z
We consider the impact of trading fees on the profits of arbitrageurs trading against an automated market maker (AMM) or, equivalently, on the adverse selection incurred by liquidity providers (LPs) due to arbitrage. We extend the model of Milionis et al. [2022] for a general class of two-asset AMMs to introduce both fees and discrete Poisson block generation times. In our setting, we are able to compute the expected instantaneous rate of arbitrage profit in closed form. When the fees are low, in the fast block asymptotic regime, the impact of fees takes a particularly simple form: fees simply scale down arbitrage profits by the fraction of blocks which present profitable trading opportunities to arbitrageurs. This fraction decreases with an increasing block rate, hence our model yields an important practical insight: faster blockchains will result in reduced LP losses. Further introducing gas fees (fixed costs) in our model, we show that, in the fast block asymptotic regime, lower gas fees lead to smaller losses for LPs.
2023-05-24T00:59:32Z
47 pages
Jason Milionis, Ciamac C. Moallemi, Tim Roughgarden

http://arxiv.org/abs/2507.17162v1
Optimal Trading under Instantaneous and Persistent Price Impact, Predictable Returns and Multiscale Stochastic Volatility
2025-07-23T02:54:38Z
We consider a dynamic portfolio optimization problem that incorporates predictable returns, instantaneous transaction costs, price impact, and stochastic volatility, extending the classical results of Garleanu and Pedersen (2013), which assume constant volatility.
Constructing the optimal portfolio strategy in this general setting is challenging due to the nonlinear nature of the resulting Hamilton-Jacobi-Bellman (HJB) equations. To address this, we propose a multi-scale volatility expansion that captures stochastic volatility dynamics across different time scales. Specifically, the analysis involves a singular perturbation for the fast mean-reverting volatility factor and a regular perturbation for the slow-moving factor. We also introduce an approximation for small price impact and demonstrate its numerical accuracy. We formally derive asymptotic approximations up to second order and use Monte Carlo simulations to show how incorporating these corrections improves the Profit and Loss (PnL) of the resulting portfolio strategy.
2025-07-23T02:54:38Z
Patrick Chan, Ronnie Sircar, Iosif Zimbidis

http://arxiv.org/abs/2507.17023v1
Modeling for the Growth of Unorganized Retailing in the Presence of Organized and E-Retailing in Indian Pharmaceutical Industry
2025-07-22T21:26:28Z
The present study considers the rural pharmaceutical retail sector in India, where the arrival of organized retailers and e-retailers is testing the survival strategies of unorganized retailers. Grounded in a field investigation of the Indian pharmaceutical retail sector, this study integrates primary data collection, consumer conjoint analysis and design of experiments to develop an empirically grounded agent-based simulation of multi-channel competition among unorganized, organized and e-pharmaceutical retailers. The results of the conjoint analysis reveal that store attributes of price discount, quality of products offered, variety of assortment, and degree of personalized service, and customer attributes of distance, degree of mobility, and degree of emergency are key determinants of optimal store choice strategies.
The primary insight obtained from the agent-based modeling is that the attribute levels of each individual retailer have some effect on other retailers' performance. The field-calibrated simulation also evidenced counterintuitive behavior that an increase in unorganized price discounts initially leads to an increase in average footprint at unorganized retailers, but eventually leads to these retailers moving out of the market. Hence, the unorganized retailers should not increase the price discount offered beyond a tipping point or it will be detrimental to them. Another counterintuitive behavior found was that high emergency customers give less importance to variety of assortment than low emergency customers. This study aids in understanding the levers for policy design towards improving the competition dynamics among retail channels in the pharmaceutical retail sector in India.
2025-07-22T21:26:28Z
Koushik Mondal, Balagopal G Menon, Sunil Sahadev

http://arxiv.org/abs/2407.10561v3
Nash Equilibrium between Brokers and Traders
2025-07-22T17:35:52Z
We study the perfect information Nash equilibrium between a broker and her clients -- an informed trader and an uninformed trader. In our model, the broker trades in the lit exchange where trades have instantaneous and transient price impact with exponential resilience, while both clients trade with the broker. The informed trader and the broker maximise expected wealth subject to inventory penalties, while the uninformed trader is not strategic and sends the broker random buy and sell orders. We characterise the Nash equilibrium of the trading strategies with the solution to a coupled system of forward-backward stochastic differential equations (FBSDEs).
We solve this system explicitly and study the effect of information, profitability, and inventory control in the trading strategies of the broker and the informed trader.
2024-07-15T09:23:05Z
24 pages, 3 figures
Álvaro Cartea, Sebastian Jaimungal, Leandro Sánchez-Betancourt

http://arxiv.org/abs/2508.02685v1
Benchmarking Classical and Quantum Models for DeFi Yield Prediction on Curve Finance
2025-07-22T06:55:20Z
The rise of decentralized finance (DeFi) has created a growing demand for accurate yield and performance forecasting to guide liquidity allocation strategies. In this study, we benchmark six models, XGBoost, Random Forest, LSTM, Transformer, quantum neural networks (QNN), and quantum support vector machines with quantum feature maps (QSVM-QNN), on one year of historical data from 28 Curve Finance pools. We evaluate model performance on test MAE, RMSE, and directional accuracy. Our results show that classical ensemble models, particularly XGBoost and Random Forest, consistently outperform both deep learning and quantum models. XGBoost achieves the highest directional accuracy (71.57%) with a test MAE of 1.80, while Random Forest attains the lowest test MAE of 1.77 and 71.36% accuracy. In contrast, quantum models underperform with directional accuracy below 50% and higher errors, highlighting current limitations in applying quantum machine learning to real-world DeFi time series data. This work offers a reproducible benchmark and practical insights into model suitability for DeFi applications, emphasizing the robustness of classical methods over emerging quantum approaches in this domain.
2025-07-22T06:55:20Z
Chi-Sheng Chen, Aidan Hung-Wen Tsai