https://arxiv.org/api/WrFQabRE2TkdzMHTaD0ldWp7Bw4
2026-03-26T18:56:56Z
217115015

http://arxiv.org/abs/2510.22685v1
TABL-ABM: A Hybrid Framework for Synthetic LOB Generation
2025-10-26T14:04:49Z
The recent application of deep learning models to financial trading has heightened the need for high fidelity financial time series data. This synthetic data can be used to supplement historical data to train large trading models. The state-of-the-art models for the generative application often rely on huge amounts of historical data and large, complicated models. These models range from autoregressive and diffusion-based models through to architecturally simpler models such as the temporal-attention bilinear layer. Agent-based approaches to modelling limit order book dynamics can also recreate trading activity through mechanistic models of trader behaviours. In this work, we demonstrate how a popular agent-based framework for simulating intraday trading activity, the Chiarella model, can be combined with one of the most performant deep learning models for forecasting multi-variate time series, the TABL model. This forecasting model is coupled to a simulation of a matching engine with a novel method for simulating deleted order flow. Our simulator gives us the ability to test the generative abilities of the forecasting model using stylised facts.
Our results show that this methodology generates realistic price dynamics; however, on deeper analysis, parts of the market's microstructure are not accurately recreated, highlighting the necessity of including more sophisticated agent behaviors in the modeling framework to help account for tail events.
2025-10-26T14:04:49Z
8 pages, 5 figures, accepted to the Workshop on AI in Finance at ECAI2025
Ollie Olby, Rory Baggott, Namid Stillman

http://arxiv.org/abs/2508.02366v3
Language Model Guided Reinforcement Learning in Quantitative Trading
2025-10-25T22:25:16Z
Algorithmic trading requires short-term tactical decisions consistent with long-term financial objectives. Reinforcement Learning (RL) has been applied to such problems, but adoption is limited by myopic behaviour and opaque policies. Large Language Models (LLMs) offer complementary strategic reasoning and multi-modal signal interpretation when guided by well-structured prompts. This paper proposes a hybrid framework in which LLMs generate high-level trading strategies to guide RL agents. We evaluate (i) the economic rationale of LLM-generated strategies through expert review, and (ii) the performance of LLM-guided agents against unguided RL baselines using Sharpe Ratio (SR) and Maximum Drawdown (MDD). Empirical results indicate that LLM guidance improves both return and risk metrics relative to standard RL.
2025-08-04T12:52:11Z
12 pages (4 pages appendix and references) and 6 figures. Accepted for presentation at FLLM 2025, Vienna
Adam Darmanin, Vince Vella

http://arxiv.org/abs/2510.22341v1
Understanding Carbon Trade Dynamics: A European Union Emissions Trading System Perspective
2025-10-25T15:55:02Z
The European Union Emissions Trading System (EU ETS), the world's largest cap-and-trade carbon market, is central to EU climate policy. This study analyzes its efficiency, price behavior, and market structure from 2010 to 2020.
Using an AR-GARCH framework, we find pronounced price clustering and short-term return predictability, with 60.05 percent directional accuracy and a 70.78 percent hit rate within forecast intervals. Network analysis of inter-country transactions shows a concentrated structure dominated by a few registries that control most high-value flows. Country-specific log-log regressions of price on traded quantity reveal heterogeneous and sometimes positive elasticities exceeding unity, implying that trading volumes often rise with prices. These results point to persistent inefficiencies in the EU ETS, including partial predictability, asymmetric market power, and unconventional price-volume relationships, suggesting that while the system contributes to decarbonization, its trading dynamics and price formation remain imperfect.
2025-10-25T15:55:02Z
Avirup Chakraborty

http://arxiv.org/abs/2510.22206v1
Right Place, Right Time: Market Simulation-based RL for Execution Optimisation
2025-10-25T08:10:18Z
Execution algorithms are vital to modern trading: they enable market participants to execute large orders while minimising market impact and transaction costs. As these algorithms grow more sophisticated, optimising them becomes increasingly challenging. In this work, we present a reinforcement learning (RL) framework for discovering optimal execution strategies, evaluated within a reactive agent-based market simulator. This simulator creates reactive order flow and allows us to decompose slippage into its constituent components: market impact and execution risk. We assess the RL agent's performance using the efficient frontier based on work by Almgren and Chriss, measuring its ability to balance risk and cost. Results show that the RL-derived strategies consistently outperform baselines and operate near the efficient frontier, demonstrating a strong ability to optimise for risk and impact.
These findings highlight the potential of reinforcement learning as a powerful tool in the trader's toolkit.
2025-10-25T08:10:18Z
8 pages, 4 figures, accepted to ICAIF 2025
Ollie Olby, Andreea Bacalum, Rory Baggott, Namid Stillman

http://arxiv.org/abs/2506.21775v3
On the hidden costs of passive investing
2025-10-20T19:24:01Z
Passive investing has gained immense popularity due to its low fees and the perceived simplicity of focusing on zero tracking error, rather than security selection. However, our analysis shows that the passive (zero tracking error) approach of waiting until the market close on the day of index reconstitution to purchase a stock (that was announced days earlier as an upcoming addition) results in costs amounting to hundreds of basis points compared to strategies that involve gradually acquiring a small portion of the required shares in advance with minimal additional tracking errors. In addition, we show that under all scenarios analyzed, a trader who builds a small inventory post-announcement and provides liquidity at the reconstitution event can consistently earn several hundreds of basis points in profit and often much more, assuming minimal risk.
2025-06-26T21:18:11Z
v2-Introduction: expanded to incorporate additional empirical evidence from recent studies that further substantiate the motivation and real-world relevance of the work. Discussion and Conclusion: included illustrative price path linking the modeling framework to observed data and estimates of the performance drag. All other sections and results remain unchanged
Iro Tasitsiomi

http://arxiv.org/abs/2509.06510v2
Optimal Exit Time for Liquidity Providers in Automated Market Makers
2025-10-20T09:26:52Z
We study the problem of optimal liquidity withdrawal for a representative liquidity provider (LP) in an automated market maker (AMM). LPs earn fees from trading activity but are exposed to impermanent loss (IL) due to price fluctuations.
While existing work has focused on static provision and exogenous exit strategies, we characterise the optimal exit time as the solution to a stochastic control problem with an endogenous stopping time. Mathematically, the LP's value function is shown to satisfy a Hamilton-Jacobi-Bellman quasi-variational inequality, for which we establish uniqueness in the viscosity sense. To solve the problem numerically, we develop two complementary approaches: an Euler scheme based on operator splitting and a Longstaff-Schwartz regression method. Calibrated simulations highlight how the LP's optimal exit strategy depends on the oracle price volatility, fee levels, and the behaviour of arbitrageurs and noise traders. Our results show that while arbitrage generates both fees and IL, the LP's optimal decision balances these opposing effects based on the pool state variables and price misalignments. Lastly, we find the optimal fee level for the representative LP when they play the exit strategy we derived. This work contributes to a deeper understanding of dynamic liquidity provision in AMMs and provides insights into the sustainability of passive LP strategies under different market regimes.
2025-09-08T10:15:23Z
Philippe Bergault, Sébastien Bieber, Leandro Sánchez-Betancourt

http://arxiv.org/abs/2510.17165v1
Trading with the Devil: Risk and Return in Foundation Model Strategies
2025-10-20T05:12:52Z
Foundation models - already transformative in domains such as natural language processing - are now starting to emerge for time-series tasks in finance. While these pretrained architectures promise versatile predictive signals, little is known about how they shape the risk profiles of the trading strategies built atop them, leaving practitioners reluctant to commit serious capital.
In this paper, we propose an extension to the Capital Asset Pricing Model (CAPM) that disentangles the systematic risk introduced by a shared foundation model - potentially capable of generating alpha if the underlying model is genuinely predictive - from the idiosyncratic risk attributable to custom fine-tuning, which typically accrues no systematic premium. To enable a practical estimation of these separate risks, we align this decomposition with the concepts of uncertainty disentanglement, casting systematic risk as epistemic uncertainty (rooted in the pretrained model) and idiosyncratic risk as aleatory uncertainty (introduced during custom adaptations). Under the Aleatory Collapse Assumption, we illustrate how Monte Carlo dropout - among other methods in the uncertainty-quantization toolkit - can directly measure the epistemic risk, thereby mapping trading strategies to a more transparent risk-return plane. Our experiments show that isolating these distinct risk factors yields deeper insights into the performance limits of foundation-model-based strategies, their model degradation over time, and potential avenues for targeted refinements. Taken together, our results highlight both the promise and the pitfalls of deploying large pretrained models in competitive financial markets.
2025-10-20T05:12:52Z
Jinrui Zhang

http://arxiv.org/abs/2506.18147v2
Causal Interventions in Bond Multi-Dealer-to-Client Platforms
2025-10-17T12:00:53Z
The digitalization of financial markets has shifted trading from voice to electronic channels, with Multi-Dealer-to-Client (MD2C) platforms now enabling clients to request quotes (RfQs) for financial instruments like bonds from multiple dealers simultaneously. In this competitive landscape, dealers cannot see each other's prices, making a rigorous analysis of the negotiation process crucial to ensure their profitability.
This article introduces a novel general framework for analyzing the RfQ process using probabilistic graphical models and causal inference. Within this framework, we explore different inferential questions that are relevant for dealers participating in MD2C platforms, such as the computation of optimal prices, the estimation of potential revenues, and the identification of clients that might be interested in trading the dealer's axes. We then analyze two different approaches for model specification: a generative model built on the work of (Fermanian, Guéant, & Pu, 2017); and discriminative models utilizing machine learning techniques. Our results show that generative models can match the predictive accuracy of leading discriminative algorithms such as LightGBM (ROC-AUC: 0.742 vs. 0.743) while simultaneously enforcing critical business requirements, notably spread monotonicity.
2025-06-22T19:30:36Z
Paloma Marín, Sergio Ardanza-Trevijano, Javier Sabio

http://arxiv.org/abs/2510.15988v1
On Bellman equation in the limit order optimization problem for high-frequency trading
2025-10-13T20:40:08Z
An approximation method for the construction of optimal strategies in the bid & ask limit order book in high-frequency trading (HFT) is studied. The basis is the article by M. Avellaneda & S. Stoikov 2008, in which certain seemingly serious gaps have been found; in the present paper they are carefully corrected. However, a bit surprisingly, our corrections do not change the main answer in the cited paper, so that, in fact, the gaps turn out to be unimportant. An explanation of this effect is offered.
2025-10-13T20:40:08Z
19 pages, 7 references
M. I. Balakaeva, A. Yu. Veretennikov

http://arxiv.org/abs/2502.03194v2
Efficient Triangular Arbitrage Detection via Graph Neural Networks
2025-10-13T03:03:56Z
Triangular arbitrage is a profitable trading strategy in financial markets that exploits discrepancies in currency exchange rates.
Traditional methods for detecting triangular arbitrage opportunities, such as exhaustive search algorithms and linear programming solvers, often suffer from high computational complexity and may miss potential opportunities in dynamic markets. In this paper, we propose a novel approach to triangular arbitrage detection using Graph Neural Networks (GNNs). By representing the currency exchange network as a graph, we leverage the powerful representation and learning capabilities of GNNs to identify profitable arbitrage opportunities more efficiently. Specifically, we formulate the triangular arbitrage problem as a graph-based optimization task and design a GNN architecture that captures the complex relationships between currencies and exchange rates. We introduce a relaxed loss function to enable more flexible learning and integrate Deep Q-Learning principles to optimize the expected returns. Our experiments on a synthetic dataset demonstrate that the proposed GNN-based method achieves a higher average yield with significantly reduced computational time compared to traditional methods. This work highlights the potential of using GNNs for solving optimization problems in finance and provides a promising approach for real-time arbitrage detection in dynamic financial markets.
2025-02-05T14:13:31Z
The topic selection is biased, and the experiment is incomplete and may have flaws
Di Zhang

http://arxiv.org/abs/2503.01629v2
A New Traders' Game? -- Empirical Analysis of Response Functions in a Historical Perspective
2025-10-12T13:01:12Z
Traders on financial markets generate non-Markovian effects in various ways, particularly through their competition with one another which can be interpreted as a game between different (types of) traders. To quantify the market mechanisms, we empirically analyze self-response functions for pairs of different stocks and the corresponding trade sign correlators.
While the non-Markovian dynamics in the self-responses is liquidity-driven, it is expectation-driven in the cross-responses which is related to the emergence of correlations. We empirically study the non-stationarity of these responses over time. In our previous data analysis, we only investigated the crisis year 2008. We now considerably extend this by also analyzing the years 2007, 2014 and 2021. To improve statistics, we also work out averaged response functions for the different years. We find significant variations over time revealing changes in the traders' game.
2025-03-03T15:04:46Z
Physica A 679, 130981 (2025)
Cedric Schuhmann, Benjamin Köhler, Anton J. Heckens, Thomas Guhr
10.1016/j.physa.2025.130981

http://arxiv.org/abs/2502.15800v3
LLM Agents Do Not Replicate Human Market Traders: Evidence From Experimental Finance
2025-10-11T16:53:05Z
This paper explores how Large Language Models (LLMs) behave in a classic experimental finance paradigm widely known for eliciting bubbles and crashes in human participants. We adapt an established trading design, where traders buy and sell a risky asset with a known fundamental value, and introduce several LLM-based agents, both in single-model markets (all traders are instances of the same LLM) and in mixed-model "battle royale" settings (multiple LLMs competing in the same market). Our findings reveal that LLMs generally exhibit a "textbook-rational" approach, pricing the asset near its fundamental value, and show only a muted tendency toward bubble formation. Further analyses indicate that LLM-based agents display less trading strategy variance in contrast to humans. Taken together, these results highlight the risk of relying on LLM-only data to replicate human-driven market phenomena, as key behavioral features, such as large emergent bubbles, were not robustly reproduced.
While LLMs clearly possess the capacity for strategic decision-making, their relative consistency and rationality suggest that they do not accurately mimic human market dynamics.
2025-02-18T23:05:32Z
51 pages, 33 figures, 12 tables, Preprint
Thomas Henning, Siddhartha M. Ojha, Ross Spoon, Jiatong Han, Colin F. Camerer

http://arxiv.org/abs/2508.17086v2
Detecting Multilevel Manipulation from Limit Order Book via Cascaded Contrastive Representation Learning
2025-10-10T07:32:36Z
Trade-based manipulation (TBM) undermines the fairness and stability of financial markets drastically. Spoofing, one of the most covert and deceptive TBM strategies, exhibits complex anomaly patterns across multilevel prices, while often being simplified as a single-level manipulation. These patterns are usually concealed within the rich, hierarchical information of the Limit Order Book (LOB), which is challenging to leverage due to high dimensionality and noise. To address this, we propose a representation learning framework combining a cascaded LOB representation architecture with supervised contrastive learning. Extensive experiments demonstrate that our framework consistently improves detection performance across diverse models, with Transformer-based architectures achieving state-of-the-art results. In addition, we conduct systematic analyses and ablation studies to investigate multilevel manipulation and the contributions of key components for detection, offering broader insights into representation learning and anomaly detection for complex time series data.
2025-08-23T16:57:32Z
Yushi Lin, Peng Yang

http://arxiv.org/abs/2510.08085v1
A Deterministic Limit Order Book Simulator with Hawkes-Driven Order Flow
2025-10-09T11:17:14Z
We present a reproducible research framework for market microstructure combining a deterministic C++ limit order book (LOB) simulator with stochastic order flow generated by multivariate marked Hawkes processes.
The paper derives full stability and ergodicity proofs for both linear and nonlinear Hawkes models, implements time-rescaling and goodness-of-fit diagnostics, and calibrates exponential and power-law kernels on Binance BTCUSDT and LOBSTER AAPL datasets. Empirical results highlight the nearly-unstable subcritical regime as essential for reproducing realistic clustering in order flow. All code, datasets, and configuration files are publicly available at https://github.com/sohaibelkarmi/High-Frequency-Trading-Simulator
2025-10-09T11:17:14Z
22 pages, 7 figures, includes theoretical proofs, simulator architecture, and calibration results. Code available at https://github.com/sohaibelkarmi/High-Frequency-Trading-Simulator
Sohaib El Karmi

http://arxiv.org/abs/2510.15937v1
Tail-Safe Stochastic-Control SPX-VIX Hedging: A White-Box Bridge Between AI Sensitivities and Arbitrage-Free Market Dynamics
2025-10-09T09:30:17Z
We present a white-box, risk-sensitive framework for jointly hedging SPX and VIX exposures under transaction costs and regime shifts. The approach couples an arbitrage-free market teacher with a control layer that enforces safety as constraints. On the market side, we integrate an SSVI-based implied-volatility surface and a Cboe-compliant VIX computation (including wing pruning and 30-day interpolation), and connect prices to dynamics via a clipped, convexity-preserving Dupire local-volatility extractor. On the control side, we pose hedging as a small quadratic program with control-barrier-function (CBF) boxes for inventory, rate, and tail risk; a sufficient-descent execution gate that trades only when risk drop justifies cost; and three targeted tail-safety upgrades: a correlation/expiry-aware VIX weight, guarded no-trade bands, and expiry-aware micro-trade thresholds with cooldown.
We prove existence/uniqueness and KKT regularity of the per-step QP, forward invariance of safety sets, one-step risk descent when the gate opens, and no chattering with bounded trade rates. For the dynamics layer, we establish positivity and second-order consistency of the discrete Dupire estimator and give an index-coherence bound linking the teacher VIX to a CIR-style proxy with explicit quadrature and projection errors. In a reproducible synthetic environment mirroring exchange rules and execution frictions, the controller reduces expected shortfall while suppressing nuisance turnover, and the teacher-surface construction keeps index-level residuals small and stable.
2025-10-09T09:30:17Z
52 pages; 3 figures; PRIMEarxiv template; fully reproducible artifact (code, configs, plots)
Jian'an Zhang