http://arxiv.org/abs/2410.08744v3 No Tick-Size Too Small: A General Method for Modelling Small Tick Limit Order Books 2025-08-04T15:46:59Z

Tick-sizes not only influence the granularity of the price formation process but also affect market agents' behavior. We investigate the disparity in the microstructural properties of the Limit Order Book (LOB) across a basket of assets with different relative tick-sizes. A key contribution of this study is the identification of several stylized facts, which are used to differentiate between large, medium, and small-tick assets, along with clear metrics for their measurement. We provide cross-asset visualizations to illustrate how these attributes vary with relative tick-size. Further, we propose a Hawkes Process model that not only fits well for large-tick assets, but also accounts for sparsity, multi-tick level price moves, and the shape of the LOB in small-tick assets. Through simulation studies, we demonstrate the versatility of the model and identify key variables that determine whether a simulated LOB resembles a large-tick or small-tick asset. Our tests show that stylized facts like sparsity, shape, and relative returns distribution can be smoothly transitioned from a large-tick to a small-tick asset using our model. We test this model's assumptions, showcase its challenges and propose questions for further directions in this area of research.
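
The abstract does not spell out the model, so the following is only a minimal sketch of the basic building block it names: a univariate Hawkes process with an exponential kernel, simulated by Ogata-style thinning. The function name `simulate_hawkes` and the parameters `mu`, `alpha`, `beta` and `horizon` are illustrative assumptions; the paper's model is a richer, LOB-specific construction and is not reproduced here.

```python
import math
import random


def simulate_hawkes(mu, alpha, beta, horizon, seed=0):
    """Simulate event times of a univariate Hawkes process on [0, horizon).

    Assumed intensity: lambda(t) = mu + sum_i alpha * exp(-beta * (t - t_i)),
    summed over past events t_i. The process is stationary if alpha / beta < 1.
    """
    rng = random.Random(seed)
    events = []
    t = 0.0

    def intensity(s):
        # Baseline rate plus the decaying excitation from every past event.
        return mu + sum(alpha * math.exp(-beta * (s - ti)) for ti in events)

    while True:
        # Between events the intensity only decays, so its current value
        # is a valid upper bound until the next accepted event.
        lam_bar = intensity(t)
        t += rng.expovariate(lam_bar)
        if t >= horizon:
            break
        # Thinning step: accept the candidate with probability lambda(t) / lam_bar.
        if rng.random() * lam_bar <= intensity(t):
            events.append(t)
    return events


events = simulate_hawkes(mu=1.0, alpha=0.5, beta=1.0, horizon=50.0)
```

With these hypothetical parameters the long-run event rate is roughly `mu / (1 - alpha / beta)`, so self-excitation doubles the baseline rate. Recomputing the intensity from all past events is quadratic in the number of events, which is acceptable for a sketch but not for long simulations.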

2024-10-11T12:02:21Z Konark Jain Jean-François Muzy Jonathan Kochems Emmanuel Bacry

http://arxiv.org/abs/2502.16246v2 The "double" square-root law: Evidence for the mechanical origin of market impact using Tokyo Stock Exchange data 2025-08-04T15:21:33Z

Understanding the impact of trades on prices is a crucial question for both academic research and industry practice. It is well established that impact follows a square-root law as a function of traded volume. However, the microscopic origin of such a law remains elusive: empirical studies are particularly challenging due to the anonymity of orders in public data. Indeed, there is ongoing debate about whether price impact has a mechanical origin or whether it is primarily driven by information, as suggested by many economic theories. In this paper, we revisit this question using a very detailed dataset provided by the Japanese stock exchange, containing the trader IDs for all orders sent to the exchange between 2012 and 2018. Our central result is that such a law has in fact microscopic roots and applies already at the level of single child orders, provided one waits long enough for the market to "digest" them. The mesoscopic impact of metaorders arises from a "double" square-root effect: square-root in volume of individual impact, followed by an inverse square-root decay as a function of time. Since market orders are anonymous, we expect and indeed find that these results apply to any market orders, and the impact of synthetic metaorders, reconstructed by scrambling the identity of the issuers, is described by the very same square-root impact law. We conclude that price impact is essentially mechanical, at odds with theories that emphasize the information content of such trades to explain the square-root impact law.
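
The "double" square-root mechanism can be illustrated with a toy calculation. The sketch below assumes, purely for illustration, that child-order impacts add linearly, that child orders have equal size and are equally spaced in time, and that each contributes `sqrt(volume) / sqrt(lag)`. The function names and these assumptions are not taken from the paper.

```python
import math


def child_impact(volume, lag):
    """Toy impact of one child order: square-root in volume, inverse square-root decay in time."""
    return math.sqrt(volume) / math.sqrt(lag)


def metaorder_impact(n_children, volume=1.0):
    """Summed impact just after the last of n equally spaced, equal-sized child orders.

    The k-th most recent child order is observed at lag k (k = 1..n).
    """
    return sum(child_impact(volume, lag) for lag in range(1, n_children + 1))


# Quadrupling the number of child orders roughly doubles the summed impact,
# i.e. the toy metaorder impact grows like the square root of its size.
ratio = metaorder_impact(400) / metaorder_impact(100)
```

Under these assumptions the sum behaves like `2 * sqrt(n)` for large `n`, which is why `ratio` comes out close to 2. This shows the arithmetic only and says nothing about the empirical fit reported in the paper.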

2025-02-22T14:48:06Z Guillaume Maitrier Grégoire Loeper Kiyoshi Kanazawa Jean-Philippe Bouchaud

http://arxiv.org/abs/2507.13023v3 Measuring CEX-DEX Extracted Value and Searcher Profitability: The Darkest of the MEV Dark Forest 2025-08-03T08:26:10Z

This paper provides a comprehensive empirical analysis of the economics and dynamics behind arbitrages between centralized and decentralized exchanges (CEX-DEX) on Ethereum. We refine heuristics to identify arbitrage transactions from on-chain data and introduce a robust empirical framework to estimate arbitrage revenue without knowing traders' actual behaviors on CEX. Leveraging an extensive dataset spanning 19 months from August 2023 to March 2025, we estimate a total of 233.8M USD extracted by 19 major CEX-DEX searchers from 7,203,560 identified CEX-DEX arbitrages. Our analysis reveals increasing centralization trends as three searchers captured three-quarters of both volume and extracted value. We also demonstrate that searchers' profitability is tied to their integration level with block builders and uncover exclusive searcher-builder relationships and their market impact. Finally, we correct the previously underestimated profitability of block builders who vertically integrate with a searcher. These insights illuminate the darkest corner of the MEV landscape and highlight the critical implications of CEX-DEX arbitrages for Ethereum's decentralization.

2025-07-17T11:50:42Z Accepted by AFT 2025 Fei Wu Danning Sui Thomas Thiery Mallesh Pai

http://arxiv.org/abs/2507.18417v1 FinDPO: Financial Sentiment Analysis for Algorithmic Trading through Preference Optimization of LLMs 2025-07-24T13:57:05Z

Opinions expressed in online finance-related textual data are having an increasingly profound impact on trading decisions and market movements. This trend highlights the vital role of sentiment analysis as a tool for quantifying the nature and strength of such opinions. With the rapid development of Generative AI (GenAI), supervised fine-tuned (SFT) large language models (LLMs) have become the de facto standard for financial sentiment analysis. However, the SFT paradigm can lead to memorization of the training data and often fails to generalize to unseen samples. This is a critical limitation in financial domains, where models must adapt to previously unobserved events and the nuanced, domain-specific language of finance. To this end, we introduce FinDPO, the first finance-specific LLM framework based on post-training human preference alignment via Direct Preference Optimization (DPO). The proposed FinDPO achieves state-of-the-art performance on standard sentiment classification benchmarks, outperforming existing supervised fine-tuned models by 11% on average. Uniquely, the FinDPO framework enables the integration of a fine-tuned causal LLM into realistic portfolio strategies through a novel 'logit-to-score' conversion, which transforms discrete sentiment predictions into continuous, rankable sentiment scores (probabilities). In this way, simulations demonstrate that FinDPO is the first sentiment-based approach to maintain substantial positive returns of 67% annually and strong risk-adjusted performance, as indicated by a Sharpe ratio of 2.0, even under realistic transaction costs of 5 basis points (bps).
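
The abstract does not give the exact 'logit-to-score' conversion, so the following is a hedged sketch of one plausible reading: apply a softmax to the logits of the three sentiment labels and use the difference between the positive and negative probabilities as a rankable score. The function name `sentiment_score`, the label keys, and the choice of positive minus negative are assumptions made for illustration.

```python
import math


def sentiment_score(logits):
    """Map class logits to a continuous score in [-1, 1].

    `logits` is a dict with keys 'negative', 'neutral', 'positive'
    (hypothetical labels). The score is P(positive) - P(negative).
    """
    # Subtract the maximum logit before exponentiating for numerical stability.
    top = max(logits.values())
    exps = {label: math.exp(value - top) for label, value in logits.items()}
    total = sum(exps.values())
    probs = {label: e / total for label, e in exps.items()}
    return probs["positive"] - probs["negative"]


# Rank two hypothetical headlines by their scores.
bullish = sentiment_score({"negative": -5.0, "neutral": 0.0, "positive": 5.0})
mixed = sentiment_score({"negative": 0.0, "neutral": 0.0, "positive": 0.0})
```

A continuous score of this kind lets assets be sorted into long and short portfolios instead of relying on the discrete predicted label alone.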

2025-07-24T13:57:05Z Giorgos Iacovides Wuyang Zhou Danilo Mandic

http://arxiv.org/abs/2507.16548v2 Alternative Loss Function in Evaluation of Transformer Models 2025-07-24T09:56:46Z

The proper design and architecture of testing machine learning models, especially in their application to quantitative finance problems, is crucial. The most important aspect of this process is selecting an adequate loss function for training, validation, estimation purposes, and hyperparameter tuning. Therefore, in this research, through empirical experiments on equity and cryptocurrency assets, we apply the Mean Absolute Directional Loss (MADL) function, which is more adequate for optimizing forecast-generating models used in algorithmic investment strategies. The MADL function results are compared between Transformer and LSTM models, and we show that in almost every case, Transformer results are significantly better than those obtained with LSTM.
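
The abstract names the MADL function without stating its formula. The sketch below assumes the commonly cited form, MADL = (1/N) Σ −sign(R_i · R̂_i) · |R_i|, where R_i is the realized return and R̂_i the forecast. Both the formula and the helper name `madl` should be read as assumptions and checked against the paper.

```python
def madl(actual_returns, predicted_returns):
    """Mean Absolute Directional Loss under the assumed definition.

    Each observation contributes -|R_i| when the forecast has the right sign
    (a gain for a strategy trading on the forecast) and +|R_i| when it is wrong.
    Lower values are better.
    """
    if not actual_returns or len(actual_returns) != len(predicted_returns):
        raise ValueError("inputs must be non-empty and of equal length")

    def sign(x):
        return (x > 0) - (x < 0)

    total = sum(
        -sign(r * p) * abs(r)
        for r, p in zip(actual_returns, predicted_returns)
    )
    return total / len(actual_returns)


# Correct direction on both days gives a negative (good) loss.
good = madl([0.02, -0.01], [0.5, -0.3])
# Wrong direction on both days gives the mirrored positive loss.
bad = madl([0.02, -0.01], [-1.0, 1.0])
```

Unlike mean squared error, this loss ignores the size of the forecast and rewards only getting the direction right, weighted by the size of the realized move.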

2025-07-22T12:57:25Z 12 pages, fixed grammar, typos and minor error in tables Jakub Michańków Paweł Sakowski Robert Ślepaczuk

http://arxiv.org/abs/2305.14604v2 Automated Market Making and Arbitrage Profits in the Presence of Fees 2025-07-23T15:39:32Z

We consider the impact of trading fees on the profits of arbitrageurs trading against an automated market maker (AMM) or, equivalently, on the adverse selection incurred by liquidity providers (LPs) due to arbitrage. We extend the model of Milionis et al. [2022] for a general class of two asset AMMs to introduce both fees and discrete Poisson block generation times. In our setting, we are able to compute the expected instantaneous rate of arbitrage profit in closed form. When the fees are low, in the fast block asymptotic regime, the impact of fees takes a particularly simple form: fees simply scale down arbitrage profits by the fraction of blocks which present profitable trading opportunities to arbitrageurs. This fraction decreases with an increasing block rate, hence our model yields an important practical insight: faster blockchains will result in reduced LP losses. Further introducing gas fees (fixed costs) in our model, we show that, in the fast block asymptotic regime, lower gas fees lead to smaller losses for LPs.

2023-05-24T00:59:32Z 47 pages Jason Milionis Ciamac C. Moallemi Tim Roughgarden

http://arxiv.org/abs/2507.17162v1 Optimal Trading under Instantaneous and Persistent Price Impact, Predictable Returns and Multiscale Stochastic Volatility 2025-07-23T02:54:38Z

We consider a dynamic portfolio optimization problem that incorporates predictable returns, instantaneous transaction costs, price impact, and stochastic volatility, extending the classical results of Garleanu and Pedersen (2013), which assume constant volatility. Constructing the optimal portfolio strategy in this general setting is challenging due to the nonlinear nature of the resulting Hamilton-Jacobi-Bellman (HJB) equations. To address this, we propose a multi-scale volatility expansion that captures stochastic volatility dynamics across different time scales. Specifically, the analysis involves a singular perturbation for the fast mean-reverting volatility factor and a regular perturbation for the slow-moving factor. We also introduce an approximation for small price impact and demonstrate its numerical accuracy. We formally derive asymptotic approximations up to second order and use Monte Carlo simulations to show how incorporating these corrections improves the Profit and Loss (PnL) of the resulting portfolio strategy.

2025-07-23T02:54:38Z Patrick Chan Ronnie Sircar Iosif Zimbidis

http://arxiv.org/abs/2507.17023v1 Modeling for the Growth of Unorganized Retailing in the Presence of Organized and E-Retailing in Indian Pharmaceutical Industry 2025-07-22T21:26:28Z

The present study considers the rural pharmaceutical retail sector in India, where the arrival of organized retailers and e-retailers is testing the survival strategies of unorganized retailers. Grounded in a field investigation of the Indian pharmaceutical retail sector, this study integrates primary data collection, consumer conjoint analysis and design of experiments to develop an empirically grounded agent-based simulation of multi-channel competition among unorganized, organized and e-pharmaceutical retailers. The results of the conjoint analysis reveal that store attributes of price discount, quality of products offered, variety of assortment, and degree of personalized service, and customer attributes of distance, degree of mobility, and degree of emergency are key determinants of optimal store choice strategies. The primary insight obtained from the agent-based modeling is that the attribute levels of each individual retailer have some effect on other retailers' performance. The field-calibrated simulation also showed a counterintuitive behavior: an increase in unorganized price discounts initially leads to an increase in average footprint at unorganized retailers, but eventually leads to these retailers moving out of the market. Hence, the unorganized retailers should not increase the price discount offered beyond a tipping point or it will be detrimental to them. Another counterintuitive behavior found was that high-emergency customers give less importance to variety of assortment than low-emergency customers. This study aids in understanding the levers for policy design towards improving the competition dynamics among retail channels in the pharmaceutical retail sector in India.

2025-07-22T21:26:28Z Koushik Mondal Balagopal G Menon Sunil Sahadev

http://arxiv.org/abs/2407.10561v3 Nash Equilibrium between Brokers and Traders 2025-07-22T17:35:52Z

We study the perfect information Nash equilibrium between a broker and her clients -- an informed trader and an uninformed trader. In our model, the broker trades in the lit exchange where trades have instantaneous and transient price impact with exponential resilience, while both clients trade with the broker. The informed trader and the broker maximise expected wealth subject to inventory penalties, while the uninformed trader is not strategic and sends the broker random buy and sell orders. We characterise the Nash equilibrium of the trading strategies with the solution to a coupled system of forward-backward stochastic differential equations (FBSDEs). We solve this system explicitly and study the effect of information, profitability, and inventory control in the trading strategies of the broker and the informed trader.

2024-07-15T09:23:05Z 24 pages, 3 figures Álvaro Cartea Sebastian Jaimungal Leandro Sánchez-Betancourt

http://arxiv.org/abs/2508.02685v1 Benchmarking Classical and Quantum Models for DeFi Yield Prediction on Curve Finance 2025-07-22T06:55:20Z

The rise of decentralized finance (DeFi) has created a growing demand for accurate yield and performance forecasting to guide liquidity allocation strategies. In this study, we benchmark six models, XGBoost, Random Forest, LSTM, Transformer, quantum neural networks (QNN), and quantum support vector machines with quantum feature maps (QSVM-QNN), on one year of historical data from 28 Curve Finance pools. We evaluate model performance on test MAE, RMSE, and directional accuracy. Our results show that classical ensemble models, particularly XGBoost and Random Forest, consistently outperform both deep learning and quantum models. XGBoost achieves the highest directional accuracy (71.57%) with a test MAE of 1.80, while Random Forest attains the lowest test MAE of 1.77 and 71.36% accuracy. In contrast, quantum models underperform with directional accuracy below 50% and higher errors, highlighting current limitations in applying quantum machine learning to real-world DeFi time series data. This work offers a reproducible benchmark and practical insights into model suitability for DeFi applications, emphasizing the robustness of classical methods over emerging quantum approaches in this domain.
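
The three evaluation metrics named above can be sketched as follows. MAE and RMSE are standard. The abstract does not define directional accuracy, so the version here (the share of steps where the predicted and the actual move relative to the previous observed value share a sign) is an assumption, as are the function names.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))


def directional_accuracy(y_true, y_pred):
    """Share of steps where predicted and actual moves have the same direction.

    Both moves are measured against the previous observed value (assumed
    definition). Non-positive moves are treated as "down".
    """
    hits = 0
    for i in range(1, len(y_true)):
        actual_up = (y_true[i] - y_true[i - 1]) > 0
        predicted_up = (y_pred[i] - y_true[i - 1]) > 0
        hits += actual_up == predicted_up
    return hits / (len(y_true) - 1)


# Hypothetical yield series and forecasts, for illustration only.
observed = [1.0, 2.0, 1.5, 3.0]
forecast = [1.0, 1.8, 2.2, 2.0]
scores = (mae(observed, forecast), rmse(observed, forecast),
          directional_accuracy(observed, forecast))
```

Here the forecast misses the direction of the second move only, so two of the three moves count as hits.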

2025-07-22T06:55:20Z Chi-Sheng Chen Aidan Hung-Wen Tsai

http://arxiv.org/abs/2507.14960v1 A Comparative Analysis of Statistical and Machine Learning Models for Outlier Detection in Bitcoin Limit Order Books 2025-07-20T13:42:36Z

The detection of outliers within cryptocurrency limit order books (LOBs) is of paramount importance for comprehending market dynamics, particularly in highly volatile and nascent regulatory environments. This study conducts a comprehensive comparative analysis of robust statistical methods and advanced machine learning techniques for real-time anomaly identification in cryptocurrency LOBs. Within a unified testing environment, named AITA Order Book Signal (AITA-OBS), we evaluate the efficacy of thirteen diverse models to identify which approaches are most suitable for detecting potentially manipulative trading behaviours. An empirical evaluation, conducted via backtesting on a dataset of 26,204 records from a major exchange, demonstrates that the top-performing model, Empirical Covariance (EC), achieves a 6.70% gain, significantly outperforming a standard Buy-and-Hold benchmark. These findings underscore the effectiveness of outlier-driven strategies and provide insights into the trade-offs between model complexity, trade frequency, and performance. This study contributes to the growing corpus of research on cryptocurrency market microstructure by furnishing a rigorous benchmark of anomaly detection models and highlighting their potential for augmenting algorithmic trading and risk management.

2025-07-20T13:42:36Z Ivan Letteri

http://arxiv.org/abs/2507.15876v1 Re-evaluating Short- and Long-Term Trend Factors in CTA Replication: A Bayesian Graphical Approach 2025-07-17T12:09:29Z

Commodity Trading Advisors (CTAs) have historically relied on trend-following rules that operate on vastly different horizons, from long-term breakouts that capture major directional moves to short-term momentum signals that thrive in fast-moving markets. Despite a large body of work on trend following, the relative merits and interactions of short- versus long-term trend systems remain controversial. This paper adds to the debate by (i) dynamically decomposing CTA returns into short-term trend, long-term trend and market beta factors using a Bayesian graphical model, and (ii) showing how the blend of horizons shapes the strategy's risk-adjusted performance.

2025-07-17T12:09:29Z 13 pages Eric Benhamou Jean-Jacques Ohana Alban Etienne Béatrice Guez Ethan Setrouk Thomas Jacquot

http://arxiv.org/abs/2507.10701v1 Kernel Learning for Mean-Variance Trading Strategies 2025-07-14T18:17:50Z

In this article, we develop a kernel-based framework for constructing dynamic, path-dependent trading strategies under a mean-variance optimisation criterion. Building on the theoretical results of (Muca Cirone and Salvi, 2025), we parameterise trading strategies as functions in a reproducing kernel Hilbert space (RKHS), enabling a flexible and non-Markovian approach to optimal portfolio problems. We compare this with the signature-based framework of (Futter, Horvath, Wiese, 2023) and demonstrate that both significantly outperform classical Markovian methods when the asset dynamics or predictive signals exhibit temporal dependencies for both synthetic and market-data examples. Using kernels in this context provides significant modelling flexibility, as the choice of feature embedding can range from randomised signatures to the final layers of neural network architectures. Crucially, our framework retains closed-form solutions and provides an alternative to gradient-based optimisation.

2025-07-14T18:17:50Z 49 pages Owen Futter Nicola Muca Cirone Blanka Horvath

http://arxiv.org/abs/2507.10149v1 A Coincidence of Wants Mechanism for Swap Trade Execution in Decentralized Exchanges 2025-07-14T10:53:25Z

We propose a mathematically rigorous framework for identifying and completing Coincidence of Wants (CoW) cycles in decentralized exchange (DEX) aggregators. Unlike existing auction-based systems such as CoWSwap, our approach introduces an asset matrix formulation that not only verifies feasibility using oracle prices and formal conservation laws but also completes partial CoW cycles of swap orders that are discovered using graph traversal and are settled using imbalance correction. We define bridging orders and show that the resulting execution is slippage-free and capital-preserving for LPs. Applied to real-world Arbitrum swap data, our algorithm demonstrates efficient discovery of CoW cycles and supports the insertion of synthetic orders for atomic cycle closure. This work can be thought of as the detailing of a potential delta-neutral strategy by liquidity-providing market makers: a structured CoW cycle execution.
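
The graph-traversal step mentioned above can be illustrated with a small sketch. It assumes, for illustration only, that each swap order is reduced to a directed edge from the token sold to the token bought, and that a CoW cycle is a directed cycle in that graph. Amounts, oracle prices, bridging orders and imbalance correction are left out, and `find_cow_cycle` is a hypothetical helper, not the paper's algorithm.

```python
def find_cow_cycle(orders):
    """Return one directed token cycle as a list (first token repeated at the end), or None.

    `orders` is a list of (sell_token, buy_token) pairs.
    """
    graph = {}
    for sell, buy in orders:
        graph.setdefault(sell, []).append(buy)
        graph.setdefault(buy, [])

    state = {}  # token -> "visiting" while on the DFS path, "done" afterwards
    path = []

    def dfs(token):
        state[token] = "visiting"
        path.append(token)
        for nxt in graph[token]:
            if state.get(nxt) == "visiting":
                # Back edge: the cycle is the part of the path starting at nxt.
                return path[path.index(nxt):] + [nxt]
            if nxt not in state:
                found = dfs(nxt)
                if found:
                    return found
        state[token] = "done"
        path.pop()
        return None

    for token in graph:
        if token not in state:
            cycle = dfs(token)
            if cycle:
                return cycle
    return None


# Three hypothetical orders close a loop; the fourth merely feeds into it.
orders = [("ETH", "USDC"), ("USDC", "DAI"), ("DAI", "ETH"), ("WBTC", "ETH")]
cycle = find_cow_cycle(orders)  # ['ETH', 'USDC', 'DAI', 'ETH']
```

Depth-first search returns the first cycle it meets rather than the best one. A practical matcher would also need to check that the volumes along the cycle are compatible, which is where the paper's asset matrix and bridging orders come in.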

2025-07-14T10:53:25Z Abhimanyu Nag Madhur Prabhakar Tanuj Behl

http://arxiv.org/abs/2409.02025v2 Logarithmic regret in the ergodic Avellaneda-Stoikov market making model 2025-07-14T07:04:00Z

We analyse the regret arising from learning the price sensitivity parameter $κ$ of liquidity takers in the ergodic version of the Avellaneda-Stoikov market making model. We show that a learning algorithm based on a maximum-likelihood estimator for the parameter achieves the regret upper bound of order $\ln^2 T$ in expectation. To obtain the result we need two key ingredients. The first is the twice differentiability of the ergodic constant under the misspecified parameter in the Hamilton-Jacobi-Bellman (HJB) equation with respect to $κ$, which leads to a second-order performance gap. The second is the learning rate of the regularised maximum-likelihood estimator, which is obtained from concentration inequalities for Bernoulli signals. Numerical experiments confirm the convergence and the robustness of the proposed algorithm.

2024-09-03T16:20:07Z Jialun Cao David Šiška Lukasz Szpruch Tanut Treetanthiploet