https://arxiv.org/api/ugg1yWCz6VUyUC/xgUqwQzpu3hY 2026-06-20T08:42:02Z 2263 420 15 http://arxiv.org/abs/2502.07868v1 Minimal Shortfall Strategies for Liquidation of a Basket of Stocks using Reinforcement Learning 2025-02-11T18:55:14Z This paper studies the ubiquitous problem of liquidating large quantities of highly correlated stocks, a task frequently encountered by institutional investors and proprietary trading firms. Traditional methods in this setting suffer from the curse of dimensionality, making them impractical for high-dimensional problems. In this work, we propose a novel method based on stochastic optimal control to optimally tackle this complex multidimensional problem. The proposed method minimizes the overall execution shortfall of highly correlated stocks using a reinforcement learning approach. We rigorously establish the convergence of our optimal trading strategy and present an implementation of our algorithm using intra-day market data. 2025-02-11T18:55:14Z Moustapha Pemy Na Zhang http://arxiv.org/abs/2502.07625v1 Intraday order transition dynamics in high, medium, and low market cap stocks: A Markov chain approach 2025-02-11T15:13:10Z An empirical stochastic analysis of high-frequency, tick-by-tick order data of NASDAQ100 listed stocks is conducted using a first-order discrete-time Markov chain model to explore intraday order transition dynamics. This analysis focuses on three market cap categories: High, Medium, and Low. Time-homogeneous transition probability matrices are estimated and compared across time-zones and market cap categories, and we found that limit orders exhibit higher degree of inertia (DoI), i.e., the probability of placing consecutive limit order is higher, during the opening hour. However, in the subsequent hour, the DoI of limit order decreases, while that of market order increases. Limit order adjustments via additions and deletions of limit orders increases significantly after the opening hour. All the order transitions then stabilize during mid-hours. As the closing hour approaches, consecutive order executions surge, with decreased placement of buy and sell limit orders following sell and buy executions, respectively. In terms of the differences in order transitions between stocks of different market cap, DoI of orders is stronger in high and medium market cap stocks. On the other hand, lower market cap stocks show a higher probability of limit order modifications and greater likelihood of submitting sell/buy limit orders after buy/sell executions. Further, order transitions are clustered across all stocks, except during opening and closing hours. The findings of this study may be useful in understanding intraday order placement dynamics across stocks of varying market cap, thus aiding market participants in making informed order placements at different times of trading hour. 2025-02-11T15:13:10Z 18 pages S. R. Luwang National Institute of Technology Sikkim, India A. Rai National Institute of Technology Sikkim, India Algolabs, Chennai Mathematical Institute, India Md. Nurujjaman National Institute of Technology Sikkim, India F. Petroni University of Chieti-Pescara, Italy http://arxiv.org/abs/2502.07393v1 FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents 2025-02-11T09:23:14Z This paper presents a novel risk-sensitive trading agent combining reinforcement learning and large language models (LLMs). We extend the Conditional Value-at-Risk Proximal Policy Optimization (CPPO) algorithm, by adding risk assessment and trading recommendation signals generated by a LLM from financial news. Our approach is backtested on the Nasdaq-100 index benchmark, using financial news data from the FNSPID dataset and the DeepSeek V3, Qwen 2.5 and Llama 3.3 language models. The code, data, and trading agents are available at: https://github.com/benstaf/FinRL_DeepSeek 2025-02-11T09:23:14Z Mostapha Benhenda LAGA http://arxiv.org/abs/2502.15742v1 Currency Arbitrage Optimization using Quantum Annealing, QAOA and Constraint Mapping 2025-02-08T01:22:31Z Currency arbitrage capitalizes on price discrepancies in currency exchange rates between markets to produce profits with minimal risk. By employing a combinatorial optimization problem, one can ascertain optimal paths within directed graphs, thereby facilitating the efficient identification of profitable trading routes. This research investigates the methodologies of quantum annealing and gate-based quantum computing in relation to the currency arbitrage problem. In this study, we implement the Quantum Approximate Optimization Algorithm (QAOA) utilizing Qiskit version 1.2. In order to optimize the parameters of QAOA, we perform simulations utilizing the AerSimulator and carry out experiments in simulation. Furthermore, we present an NchooseK-based methodology utilizing D-Wave's Ocean suite. This methodology enables a comparison of the effectiveness of quantum techniques in identifying optimal arbitrage paths. The results of our study enhance the existing literature on the application of quantum computing in financial optimization challenges, emphasizing both the prospective benefits and the present limitations of these developing technologies in real-world scenarios. 2025-02-08T01:22:31Z Sangram Deshpande Elin Ranjan Das Frank Mueller http://arxiv.org/abs/2502.04027v1 High-Frequency Market Manipulation Detection with a Markov-modulated Hawkes process 2025-02-06T12:31:17Z This work focuses on a self-exciting point process defined by a Hawkes-like intensity and a switching mechanism based on a hidden Markov chain. Previous works in such a setting assume constant intensities between consecutive events. We extend the model to general Hawkes excitation kernels that are piecewise constant between events. We develop an expectation-maximization algorithm for the statistical inference of the Hawkes intensities parameters as well as the state transition probabilities. The numerical convergence of the estimators is extensively tested on simulated data. Using high-frequency cryptocurrency data on a top centralized exchange, we apply the model to the detection of anomalous bursts of trades. We benchmark the goodness-of-fit of the model with the Markov-modulated Poisson process and demonstrate the relevance of the model in detecting suspicious activities. 2025-02-06T12:31:17Z 35 pages, 15 figures Timothée Fabre Ioane Muni Toke http://arxiv.org/abs/2502.01992v1 FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024 2025-02-04T04:11:09Z In response to Task II of the FinRL Challenge at ACM ICAIF 2024, this study proposes a novel prompt framework for fine-tuning large language models (LLM) with Reinforcement Learning from Market Feedback (RLMF). Our framework incorporates market-specific features and short-term price dynamics to generate more precise trading signals. Traditional LLMs, while competent in sentiment analysis, lack contextual alignment for financial market applications. To bridge this gap, we fine-tune the LLaMA-3.2-3B-Instruct model using a custom RLMF prompt design that integrates historical market data and reward-based feedback. Our evaluation shows that this RLMF-tuned framework outperforms baseline methods in signal consistency and achieving tighter trading outcomes; awarded as winner of Task II. You can find the code for this project on GitHub. 2025-02-04T04:11:09Z Competition Track FinRL, ICAIF 2024 Arnav Grover http://arxiv.org/abs/2502.01931v1 Liquidity provision of utility indifference type in decentralized exchanges 2025-02-04T02:06:28Z We present a mathematical formulation of liquidity provision in decentralized exchanges. We focus on constant function market makers of utility indifference type, which include constant product market makers with concentrated liquidity as a special case. First, we examine no-arbitrage conditions for a liquidity pool and compute an optimal arbitrage strategy when there is an external liquid market. Second, we show that liquidity provision suffers from impermanent loss unless a transaction fee is levied under the general framework with concentrated liquidity. Third, we establish the well-definedness of arbitrage-free reserve processes of a liquidity pool in continuous-time and show that there is no loss-versus-rebalancing under a nonzero fee if the external market price is continuous. We then argue that liquidity provision by multiple liquidity providers can be understood as liquidity provision by a representative liquidity provider, meaning that the analysis boils down to that for a single liquidity provider. Last, but not least, we give an answer to the fundamental question in which sense the very construction of constant function market makers with concentrated liquidity in the popular platform Uniswap v3 is optimal. 2025-02-04T02:06:28Z Masaaki Fukasawa Basile Maire Marcus Wunsch http://arxiv.org/abs/2502.01574v1 An End-To-End LLM Enhanced Trading System 2025-02-03T17:57:04Z This project introduces an end-to-end trading system that leverages Large Language Models (LLMs) for real-time market sentiment analysis. By synthesizing data from financial news and social media, the system integrates sentiment-driven insights with technical indicators to generate actionable trading signals. FinGPT serves as the primary model for sentiment analysis, ensuring domain-specific accuracy, while Kubernetes is used for scalable and efficient deployment. 2025-02-03T17:57:04Z 6 pages, 1 figure Ziyao Zhou Ronitt Mehra http://arxiv.org/abs/2502.01495v1 Supervised Similarity for High-Yield Corporate Bonds with Quantum Cognition Machine Learning 2025-02-03T16:28:44Z We investigate the application of quantum cognition machine learning (QCML), a novel paradigm for both supervised and unsupervised learning tasks rooted in the mathematical formalism of quantum theory, to distance metric learning in corporate bond markets. Compared to equities, corporate bonds are relatively illiquid and both trade and quote data in these securities are relatively sparse. Thus, a measure of distance/similarity among corporate bonds is particularly useful for a variety of practical applications in the trading of illiquid bonds, including the identification of similar tradable alternatives, pricing securities with relatively few recent quotes or trades, and explaining the predictions and performance of ML models based on their training data. Previous research has explored supervised similarity learning based on classical tree-based models in this context; here, we explore the application of the QCML paradigm for supervised distance metric learning in the same context, showing that it outperforms classical tree-based models in high-yield (HY) markets, while giving comparable or better performance (depending on the evaluation metric) in investment grade (IG) markets. 2025-02-03T16:28:44Z Joshua Rosaler Luca Candelori Vahagn Kirakosyan Kharen Musaelian Ryan Samson Martin T. Wells Dhagash Mehta Stefano Pasquali http://arxiv.org/abs/2210.01227v4 Axioms for Automated Market Makers: A Mathematical Framework in FinTech and Decentralized Finance 2025-02-01T11:36:56Z Within this work we consider an axiomatic framework for Automated Market Makers (AMMs). AMMs are smart contracts that set prices for swaps on a pool of assets. By imposing reasonable axioms on the underlying utility function, we are able to characterize the properties of the swap size of the assets and of the resulting pricing oracle. In providing these general axioms, we define a novel measure of price impacts that can be used to quantify those costs between different AMM constructions. We have analyzed many existing AMMs and shown that the vast majority of them satisfy our axioms. We have also considered the question of fees and divergence loss. In doing so, we have proposed a new fee structure so as to make the AMM indifferent to transaction splitting. Finally, we have proposed a novel AMM that has nice analytical properties and provides a large range over which there is no divergence loss. 2022-10-03T21:00:55Z Maxim Bichuch Zachary Feinstein http://arxiv.org/abs/2408.02322v2 Consistent time travel for realistic interactions with historical data: reinforcement learning for market making 2025-01-29T09:43:45Z Reinforcement learning works best when the impact of the agent's actions on its environment can be perfectly simulated or fully appraised from available data. Some systems are however both hard to simulate and very sensitive to small perturbations. An additional difficulty arises when a RL agent is trained offline to be part of a multi-agent system using only anonymous data, which makes it impossible to infer the state of each agent, thus to use data directly. Typical examples are competitive systems without agent-resolved data such as financial markets. We introduce consistent data time travel for offline RL as a remedy for these problems: instead of using historical data in a sequential way, we argue that one needs to perform time travel in historical data, i.e., to adjust the time index so that both the past state and the influence of the RL agent's action on the system coincide with real data. This both alleviates the need to resort to imperfect models and consistently accounts for both the immediate and long-term reactions of the system when using anonymous historical data. We apply this idea to market making in limit order books, a notoriously difficult task for RL; it turns out that the gain of the agent is significantly higher with data time travel than with naive sequential data, which suggests that the difficulty of this task for RL may have been overestimated. 2024-08-05T09:07:36Z 11 pages Vincent Ragel Damien Challet http://arxiv.org/abs/2501.17366v1 Forecasting S&P 500 Using LSTM Models 2025-01-29T01:31:56Z With the volatile and complex nature of financial data influenced by external factors, forecasting the stock market is challenging. Traditional models such as ARIMA and GARCH perform well with linear data but struggle with non-linear dependencies. Machine learning and deep learning models, particularly Long Short-Term Memory (LSTM) networks, address these challenges by capturing intricate patterns and long-term dependencies. This report compares ARIMA and LSTM models in predicting the S&P 500 index, a major financial benchmark. Using historical price data and technical indicators, we evaluated these models using Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). The ARIMA model showed reasonable performance with an MAE of 462.1, RMSE of 614, and 89.8 percent accuracy, effectively capturing short-term trends but limited by its linear assumptions. The LSTM model, leveraging sequential processing capabilities, outperformed ARIMA with an MAE of 369.32, RMSE of 412.84, and 92.46 percent accuracy, capturing both short- and long-term dependencies. Notably, the LSTM model without additional features performed best, achieving an MAE of 175.9, RMSE of 207.34, and 96.41 percent accuracy, showcasing its ability to handle market data efficiently. Accurately predicting stock movements is crucial for investment strategies, risk assessments, and market stability. Our findings confirm the potential of deep learning models in handling volatile financial data compared to traditional ones. The results highlight the effectiveness of LSTM and suggest avenues for further improvements. This study provides insights into financial forecasting, offering a comparative analysis of ARIMA and LSTM while outlining their strengths and limitations. 2025-01-29T01:31:56Z Prashant Pilla Raji Mekonen 10.5281/zenodo.14759118 http://arxiv.org/abs/2501.16488v1 Solvability of the Gaussian Kyle model with imperfect information and risk aversion 2025-01-27T20:45:09Z We investigate a Kyle model under Gaussian assumptions where a risk-averse informed trader has imperfect information on the fundamental price of an asset. We show that an equilibrium can be constructed by considering an optimal transport problem that is solved under a measure that renders the utility of the informed trader martingale and a filtering problem under the historical measure. 2025-01-27T20:45:09Z Reda Chhaibi Ibrahim Ekren Eunjung Noh http://arxiv.org/abs/2301.05157v2 Statistical Learning with Sublinear Regret of Propagator Models 2025-01-22T00:42:52Z We consider a class of learning problems in which an agent liquidates a risky asset while creating both transient price impact driven by an unknown convolution propagator and linear temporary price impact with an unknown parameter. We characterize the trader's performance as maximization of a revenue-risk functional, where the trader also exploits available information on a price predicting signal. We present a trading algorithm that alternates between exploration and exploitation phases and achieves sublinear regrets with high probability. For the exploration phase we propose a novel approach for non-parametric estimation of the price impact kernel by observing only the visible price process and derive sharp bounds on the convergence rate, which are characterised by the singularity of the propagator. These kernel estimation methods extend existing methods from the area of Tikhonov regularisation for inverse problems and are of independent interest. The bound on the regret in the exploitation phase is obtained by deriving stability results for the optimizer and value function of the associated class of infinite-dimensional stochastic control problems. As a complementary result we propose a regression-based algorithm to estimate the conditional expectation of non-Markovian signals and derive its convergence rate. 2023-01-12T17:16:27Z 57 pages, accepted by The Annals of Applied Probability Eyal Neuman Yufei Zhang http://arxiv.org/abs/2410.19107v2 What Drives Liquidity on Decentralized Exchanges? Evidence from the Uniswap Protocol 2025-01-17T18:13:21Z We study liquidity on decentralized exchanges (DEXs), identifying factors at the platform, blockchain, token pair, and liquidity pool levels with predictive power for market depth metrics. We introduce the v2 counterfactual spread metric, a novel criterion which assesses the degree of liquidity concentration in pools using the ``concentrated liquidity'' mechanism, allowing us to decompose the effect of a factor on market depth into two channels: total value locked (TVL) and concentration. We further explore how external liquidity from competing DEXs and private inventory on DEX aggregators influence market depth. We find that (i) gas prices, returns, and a DEX's share of trading volume affect liquidity through concentration, (ii) internalization of order flow by private market makers affects TVL but not the overall market depth, and (iii) volatility, fee revenue, and markout affect liquidity through both channels. 2024-10-24T19:15:17Z Brian Z. Zhu Dingyue Liu Xin Wan Gordon Liao Ciamac C. Moallemi Brad Bachu