https://arxiv.org/api/rtAdIZqGYoMWHQsUczzOpwF7KZo2026-06-14T10:16:38Z225931515http://arxiv.org/abs/2410.08744v3No Tick-Size Too Small: A General Method for Modelling Small Tick Limit Order Books2025-08-04T15:46:59ZTick-sizes not only influence the granularity of the price formation process but also affect market agents' behavior. We investigate the disparity in the microstructural properties of the Limit Order Book (LOB) across a basket of assets with different relative tick-sizes. A key contribution of this study is the identification of several stylized facts, which are used to differentiate between large, medium, and small-tick assets, along with clear metrics for their measurement. We provide cross-asset visualizations to illustrate how these attributes vary with relative tick-size. Further, we propose a Hawkes Process model that {\color{black}not only fits well for large-tick assets, but also accounts for }sparsity, multi-tick level price moves, and the shape of the LOB in small-tick assets. Through simulation studies, we demonstrate the {\color{black} versatility} of the model and identify key variables that determine whether a simulated LOB resembles a large-tick or small-tick asset. Our tests show that stylized facts like sparsity, shape, and relative returns distribution can be smoothly transitioned from a large-tick to a small-tick asset using our model. We test this model's assumptions, showcase its challenges and propose questions for further directions in this area of research.2024-10-11T12:02:21ZKonark JainJean-François MuzyJonathan KochemsEmmanuel Bacryhttp://arxiv.org/abs/2502.16246v2The "double" square-root law: Evidence for the mechanical origin of market impact using Tokyo Stock Exchange data2025-08-04T15:21:33ZUnderstanding the impact of trades on prices is a crucial question for both academic research and industry practice. It is well established that impact follows a square-root impact as a function of traded volume. However, the microscopic origin of such a law remains elusive: empirical studies are particularly challenging due to the anonymity of orders in public data. Indeed, there is ongoing debate about whether price impact has a mechanical origin or whether it is primarily driven by information, as suggested by many economic theories. In this paper, we revisit this question using a very detailed dataset provided by the Japanese stock exchange, containing the trader IDs for all orders sent to the exchange between 2012 and 2018. Our central result is that such a law has in fact microscopic roots and applies already at the level of single child orders, provided one waits long enough for the market to "digest" them. The mesoscopic impact of metaorders arises from a "double" square-root effect: square-root in volume of individual impact, followed by an inverse square-root decay as a function of time. Since market orders are anonymous, we expect and indeed find that these results apply to any market orders, and the impact of synthetic metaorders, reconstructed by scrambling the identity of the issuers, is described by the very same square-root impact law. We conclude that price impact is essentially mechanical, at odds with theories that emphasize the information content of such trades to explain the square-root impact law.2025-02-22T14:48:06ZGuillaume MaitrierGrégoire LoeperKiyoshi KanazawaJean-Philippe Bouchaudhttp://arxiv.org/abs/2507.13023v3Measuring CEX-DEX Extracted Value and Searcher Profitability: The Darkest of the MEV Dark Forest2025-08-03T08:26:10ZThis paper provides a comprehensive empirical analysis of the economics and dynamics behind arbitrages between centralized and decentralized exchanges (CEX-DEX) on Ethereum. We refine heuristics to identify arbitrage transactions from on-chain data and introduce a robust empirical framework to estimate arbitrage revenue without knowing traders' actual behaviors on CEX. Leveraging an extensive dataset spanning 19 months from August 2023 to March 2025, we estimate a total of 233.8M USD extracted by 19 major CEX-DEX searchers from 7,203,560 identified CEX-DEX arbitrages. Our analysis reveals increasing centralization trends as three searchers captured three-quarters of both volume and extracted value. We also demonstrate that searchers' profitability is tied to their integration level with block builders and uncover exclusive searcher-builder relationships and their market impact. Finally, we correct the previously underestimated profitability of block builders who vertically integrate with a searcher. These insights illuminate the darkest corner of the MEV landscape and highlight the critical implications of CEX-DEX arbitrages for Ethereum's decentralization.2025-07-17T11:50:42ZAccepted by AFT 2025Fei WuDanning SuiThomas ThieryMallesh Paihttp://arxiv.org/abs/2507.18417v1FinDPO: Financial Sentiment Analysis for Algorithmic Trading through Preference Optimization of LLMs2025-07-24T13:57:05ZOpinions expressed in online finance-related textual data are having an increasingly profound impact on trading decisions and market movements. This trend highlights the vital role of sentiment analysis as a tool for quantifying the nature and strength of such opinions. With the rapid development of Generative AI (GenAI), supervised fine-tuned (SFT) large language models (LLMs) have become the de facto standard for financial sentiment analysis. However, the SFT paradigm can lead to memorization of the training data and often fails to generalize to unseen samples. This is a critical limitation in financial domains, where models must adapt to previously unobserved events and the nuanced, domain-specific language of finance. To this end, we introduce FinDPO, the first finance-specific LLM framework based on post-training human preference alignment via Direct Preference Optimization (DPO). The proposed FinDPO achieves state-of-the-art performance on standard sentiment classification benchmarks, outperforming existing supervised fine-tuned models by 11% on the average. Uniquely, the FinDPO framework enables the integration of a fine-tuned causal LLM into realistic portfolio strategies through a novel 'logit-to-score' conversion, which transforms discrete sentiment predictions into continuous, rankable sentiment scores (probabilities). In this way, simulations demonstrate that FinDPO is the first sentiment-based approach to maintain substantial positive returns of 67% annually and strong risk-adjusted performance, as indicated by a Sharpe ratio of 2.0, even under realistic transaction costs of 5 basis points (bps).2025-07-24T13:57:05ZGiorgos IacovidesWuyang ZhouDanilo Mandichttp://arxiv.org/abs/2507.16548v2Alternative Loss Function in Evaluation of Transformer Models2025-07-24T09:56:46ZThe proper design and architecture of testing machine learning models, especially in their application to quantitative finance problems, is crucial. The most important aspect of this process is selecting an adequate loss function for training, validation, estimation purposes, and hyperparameter tuning. Therefore, in this research, through empirical experiments on equity and cryptocurrency assets, we apply the Mean Absolute Directional Loss (MADL) function, which is more adequate for optimizing forecast-generating models used in algorithmic investment strategies. The MADL function results are compared between Transformer and LSTM models, and we show that in almost every case, Transformer results are significantly better than those obtained with LSTM.2025-07-22T12:57:25Z12 pages, fixed grammar, typos and minor error in tablesJakub MichańkówPaweł SakowskiRobert Ślepaczukhttp://arxiv.org/abs/2305.14604v2Automated Market Making and Arbitrage Profits in the Presence of Fees2025-07-23T15:39:32ZWe consider the impact of trading fees on the profits of arbitrageurs trading against an automated market maker (AMM) or, equivalently, on the adverse selection incurred by liquidity providers (LPs) due to arbitrage. We extend the model of Milionis et al. [2022] for a general class of two asset AMMs to introduce both fees and discrete Poisson block generation times. In our setting, we are able to compute the expected instantaneous rate of arbitrage profit in closed form. When the fees are low, in the fast block asymptotic regime, the impact of fees takes a particularly simple form: fees simply scale down arbitrage profits by the fraction of blocks which present profitable trading opportunities to arbitrageurs. This fraction decreases with an increasing block rate, hence our model yields an important practical insight: faster blockchains will result in reduced LP losses. Further introducing gas fees (fixed costs) in our model, we show that, in the fast block asymptotic regime, lower gas fees lead to smaller losses for LPs.2023-05-24T00:59:32Z47 pagesJason MilionisCiamac C. MoallemiTim Roughgardenhttp://arxiv.org/abs/2507.17162v1Optimal Trading under Instantaneous and Persistent Price Impact, Predictable Returns and Multiscale Stochastic Volatility2025-07-23T02:54:38ZWe consider a dynamic portfolio optimization problem that incorporates predictable returns, instantaneous transaction costs, price impact, and stochastic volatility, extending the classical results of Garleanu and Pedersen (2013), which assume constant volatility. Constructing the optimal portfolio strategy in this general setting is challenging due to the nonlinear nature of the resulting Hamilton-Jacobi-Bellman (HJB) equations. To address this, we propose a multi-scale volatility expansion that captures stochastic volatility dynamics across different time scales. Specifically, the analysis involves a singular perturbation for the fast mean-reverting volatility factor and a regular perturbation for the slow-moving factor. We also introduce an approximation for small price impact and demonstrate its numerical accuracy. We formally derive asymptotic approximations up to second order and use Monte Carlo simulations to show how incorporating these corrections improves the Profit and Loss (PnL) of the resulting portfolio strategy.2025-07-23T02:54:38ZPatrick ChanRonnie SircarIosif Zimbidishttp://arxiv.org/abs/2507.17023v1Modeling for the Growth of Unorganized Retailing in the Presence of Organized and E-Retailing in Indian Pharmaceutical Industry2025-07-22T21:26:28ZThe present study considers the rural pharmaceutical retail sector in India, where the arrival of organized retailers and e-retailers is testing the survival strategies of unorganized retailers. Grounded in a field investigation of the Indian pharmaceutical retail sector, this study integrates primary data collection, consumer conjoint analysis and design of experiments to develop an empirically grounded agent-based simulation of multi-channel competition among unorganized, organized and e-pharmaceutical retailers. The results of the conjoint analysis reveal that store attributes of price discount, quality of products offered, variety of assortment, and degree of personalized service, and customer attributes of distance, degree of mobility, and degree of emergency are key determinants of optimal store choice strategies. The primary insight obtained from the agent-based modeling is that the attribute levels of each individual retailer have some effect on other retailers performance. The field-calibrated simulation also evidenced counterintuitive behavior that an increase in unorganized price discounts initially leads to an increase in average footprint at unorganized retailers, but eventually leads to these retailers moving out of the market. Hence, the unorganized retailers should not increase the price discount offered beyond a tipping point or it will be detrimental to them. Another counterintuitive behavior found was that high emergency customers give less importance to variety of assortment than low emergency customers. This study aids in understanding the levers for policy design towards improving the competition dynamics among retail channels in the pharmaceutical retail sector in India.2025-07-22T21:26:28ZKoushik MondalBalagopal G MenonSunil Sahadevhttp://arxiv.org/abs/2407.10561v3Nash Equilibrium between Brokers and Traders2025-07-22T17:35:52ZWe study the perfect information Nash equilibrium between a broker and her clients -- an informed trader and an uniformed trader. In our model, the broker trades in the lit exchange where trades have instantaneous and transient price impact with exponential resilience, while both clients trade with the broker. The informed trader and the broker maximise expected wealth subject to inventory penalties, while the uninformed trader is not strategic and sends the broker random buy and sell orders. We characterise the Nash equilibrium of the trading strategies with the solution to a coupled system of forward-backward stochastic differential equations (FBSDEs). We solve this system explicitly and study the effect of information, profitability, and inventory control in the trading strategies of the broker and the informed trader.2024-07-15T09:23:05Z24 pages, 3 figuresÁlvaro CarteaSebastian JaimungalLeandro Sánchez-Betancourthttp://arxiv.org/abs/2508.02685v1Benchmarking Classical and Quantum Models for DeFi Yield Prediction on Curve Finance2025-07-22T06:55:20ZThe rise of decentralized finance (DeFi) has created a growing demand for accurate yield and performance forecasting to guide liquidity allocation strategies. In this study, we benchmark six models, XGBoost, Random Forest, LSTM, Transformer, quantum neural networks (QNN), and quantum support vector machines with quantum feature maps (QSVM-QNN), on one year of historical data from 28 Curve Finance pools. We evaluate model performance on test MAE, RMSE, and directional accuracy. Our results show that classical ensemble models, particularly XGBoost and Random Forest, consistently outperform both deep learning and quantum models. XGBoost achieves the highest directional accuracy (71.57%) with a test MAE of 1.80, while Random Forest attains the lowest test MAE of 1.77 and 71.36% accuracy. In contrast, quantum models underperform with directional accuracy below 50% and higher errors, highlighting current limitations in applying quantum machine learning to real-world DeFi time series data. This work offers a reproducible benchmark and practical insights into model suitability for DeFi applications, emphasizing the robustness of classical methods over emerging quantum approaches in this domain.2025-07-22T06:55:20ZChi-Sheng ChenAidan Hung-Wen Tsaihttp://arxiv.org/abs/2507.14960v1A Comparative Analysis of Statistical and Machine Learning Models for Outlier Detection in Bitcoin Limit Order Books2025-07-20T13:42:36ZThe detection of outliers within cryptocurrency limit order books (LOBs) is of paramount importance for comprehending market dynamics, particularly in highly volatile and nascent regulatory environments. This study conducts a comprehensive comparative analysis of robust statistical methods and advanced machine learning techniques for real-time anomaly identification in cryptocurrency LOBs. Within a unified testing environment, named AITA Order Book Signal (AITA-OBS), we evaluate the efficacy of thirteen diverse models to identify which approaches are most suitable for detecting potentially manipulative trading behaviours. An empirical evaluation, conducted via backtesting on a dataset of 26,204 records from a major exchange, demonstrates that the top-performing model, Empirical Covariance (EC), achieves a 6.70% gain, significantly outperforming a standard Buy-and-Hold benchmark. These findings underscore the effectiveness of outlier-driven strategies and provide insights into the trade-offs between model complexity, trade frequency, and performance. This study contributes to the growing corpus of research on cryptocurrency market microstructure by furnishing a rigorous benchmark of anomaly detection models and highlighting their potential for augmenting algorithmic trading and risk management.2025-07-20T13:42:36ZIvan Letterihttp://arxiv.org/abs/2507.15876v1Re-evaluating Short- and Long-Term Trend Factors in CTA Replication: A Bayesian Graphical Approach2025-07-17T12:09:29ZCommodity Trading Advisors (CTAs) have historically relied on trend-following rules that operate on vastly different horizons from long-term breakouts that capture major directional moves to short-term momentum signals that thrive in fast-moving markets. Despite a large body of work on trend following, the relative merits and interactions of short-versus long-term trend systems remain controversial. This paper adds to the debate by (i) dynamically decomposing CTA returns into short-term trend, long-term trend and market beta factors using a Bayesian graphical model, and (ii) showing how the blend of horizons shapes the strategy's risk-adjusted performance.2025-07-17T12:09:29Z13 pagesEric BenhamouJean-Jacques OhanaAlban EtienneBéatrice GuezEthan SetroukThomas Jacquothttp://arxiv.org/abs/2507.10701v1Kernel Learning for Mean-Variance Trading Strategies2025-07-14T18:17:50ZIn this article, we develop a kernel-based framework for constructing dynamic, pathdependent trading strategies under a mean-variance optimisation criterion. Building on the theoretical results of (Muca Cirone and Salvi, 2025), we parameterise trading strategies as functions in a reproducing kernel Hilbert space (RKHS), enabling a flexible and non-Markovian approach to optimal portfolio problems. We compare this with the signature-based framework of (Futter, Horvath, Wiese, 2023) and demonstrate that both significantly outperform classical Markovian methods when the asset dynamics or predictive signals exhibit temporal dependencies for both synthetic and market-data examples. Using kernels in this context provides significant modelling flexibility, as the choice of feature embedding can range from randomised signatures to the final layers of neural network architectures. Crucially, our framework retains closed-form solutions and provides an alternative to gradient-based optimisation.2025-07-14T18:17:50Z49 pagesOwen FutterNicola Muca CironeBlanka Horvathhttp://arxiv.org/abs/2507.10149v1A Coincidence of Wants Mechanism for Swap Trade Execution in Decentralized Exchanges2025-07-14T10:53:25ZWe propose a mathematically rigorous framework for identifying and completing Coincidence of Wants (CoW) cycles in decentralized exchange (DEX) aggregators. Unlike existing auction based systems such as CoWSwap, our approach introduces an asset matrix formulation that not only verifies feasibility using oracle prices and formal conservation laws but also completes partial CoW cycles of swap orders that are discovered using graph traversal and are settled using imbalance correction. We define bridging orders and show that the resulting execution is slippage free and capital preserving for LPs. Applied to real world Arbitrum swap data, our algorithm demonstrates efficient discovery of CoW cycles and supports the insertion of synthetic orders for atomic cycle closure. This work can be thought of as the detailing of a potential delta-neutral strategy by liquidity providing market makers: a structured CoW cycle execution.2025-07-14T10:53:25ZAbhimanyu NagMadhur PrabhakarTanuj Behlhttp://arxiv.org/abs/2409.02025v2Logarithmic regret in the ergodic Avellaneda-Stoikov market making model2025-07-14T07:04:00ZWe analyse the regret arising from learning the price sensitivity parameter $κ$ of liquidity takers in the ergodic version of the Avellaneda-Stoikov market making model. We show that a learning algorithm based on a maximum-likelihood estimator for the parameter achieves the regret upper bound of order $\ln^2 T$ in expectation. To obtain the result we need two key ingredients. The first is the twice differentiability of the ergodic constant under the misspecified parameter in the Hamilton-Jacobi-Bellman (HJB) equation with respect to $κ$, which leads to a second--order performance gap. The second is the learning rate of the regularised maximum-likelihood estimator which is obtained from concentration inequalities for Bernoulli signals. Numerical experiments confirm the convergence and the robustness of the proposed algorithm.2024-09-03T16:20:07ZJialun CaoDavid ŠiškaLukasz SzpruchTanut Treetanthiploet