Feed updated: 2026-04-05T19:32:33Z

http://arxiv.org/abs/2502.01992v1
Title: FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024
Updated: 2025-02-04T04:11:09Z
Abstract: In response to Task II of the FinRL Challenge at ACM ICAIF 2024, this study proposes a novel prompt framework for fine-tuning large language models (LLMs) with Reinforcement Learning from Market Feedback (RLMF). Our framework incorporates market-specific features and short-term price dynamics to generate more precise trading signals. Traditional LLMs, while competent in sentiment analysis, lack contextual alignment for financial market applications. To bridge this gap, we fine-tune the LLaMA-3.2-3B-Instruct model using a custom RLMF prompt design that integrates historical market data and reward-based feedback. Our evaluation shows that this RLMF-tuned framework outperforms baseline methods in signal consistency and achieves tighter trading outcomes; it was awarded winner of Task II. You can find the code for this project on GitHub.
Published: 2025-02-04T04:11:09Z
Comment: Competition Track FinRL, ICAIF 2024
Authors: Arnav Grover

http://arxiv.org/abs/2502.01931v1
Title: Liquidity provision of utility indifference type in decentralized exchanges
Updated: 2025-02-04T02:06:28Z
Abstract: We present a mathematical formulation of liquidity provision in decentralized exchanges. We focus on constant function market makers of utility indifference type, which include constant product market makers with concentrated liquidity as a special case. First, we examine no-arbitrage conditions for a liquidity pool and compute an optimal arbitrage strategy when there is an external liquid market. Second, we show that liquidity provision suffers from impermanent loss unless a transaction fee is levied under the general framework with concentrated liquidity.
Third, we establish the well-definedness of arbitrage-free reserve processes of a liquidity pool in continuous time and show that there is no loss-versus-rebalancing under a nonzero fee if the external market price is continuous. We then argue that liquidity provision by multiple liquidity providers can be understood as liquidity provision by a representative liquidity provider, meaning that the analysis boils down to that for a single liquidity provider. Last, but not least, we give an answer to the fundamental question of in which sense the very construction of constant function market makers with concentrated liquidity in the popular platform Uniswap v3 is optimal.
Published: 2025-02-04T02:06:28Z
Authors: Masaaki Fukasawa, Basile Maire, Marcus Wunsch

http://arxiv.org/abs/2502.01574v1
Title: An End-To-End LLM Enhanced Trading System
Updated: 2025-02-03T17:57:04Z
Abstract: This project introduces an end-to-end trading system that leverages Large Language Models (LLMs) for real-time market sentiment analysis. By synthesizing data from financial news and social media, the system integrates sentiment-driven insights with technical indicators to generate actionable trading signals. FinGPT serves as the primary model for sentiment analysis, ensuring domain-specific accuracy, while Kubernetes is used for scalable and efficient deployment.
Published: 2025-02-03T17:57:04Z
Comment: 6 pages, 1 figure
Authors: Ziyao Zhou, Ronitt Mehra

http://arxiv.org/abs/2502.01495v1
Title: Supervised Similarity for High-Yield Corporate Bonds with Quantum Cognition Machine Learning
Updated: 2025-02-03T16:28:44Z
Abstract: We investigate the application of quantum cognition machine learning (QCML), a novel paradigm for both supervised and unsupervised learning tasks rooted in the mathematical formalism of quantum theory, to distance metric learning in corporate bond markets. Compared to equities, corporate bonds are relatively illiquid and both trade and quote data in these securities are relatively sparse.
Thus, a measure of distance/similarity among corporate bonds is particularly useful for a variety of practical applications in the trading of illiquid bonds, including the identification of similar tradable alternatives, pricing securities with relatively few recent quotes or trades, and explaining the predictions and performance of ML models based on their training data. Previous research has explored supervised similarity learning based on classical tree-based models in this context; here, we explore the application of the QCML paradigm for supervised distance metric learning in the same context, showing that it outperforms classical tree-based models in high-yield (HY) markets, while giving comparable or better performance (depending on the evaluation metric) in investment grade (IG) markets.
Published: 2025-02-03T16:28:44Z
Authors: Joshua Rosaler, Luca Candelori, Vahagn Kirakosyan, Kharen Musaelian, Ryan Samson, Martin T. Wells, Dhagash Mehta, Stefano Pasquali

http://arxiv.org/abs/2210.01227v4
Title: Axioms for Automated Market Makers: A Mathematical Framework in FinTech and Decentralized Finance
Updated: 2025-02-01T11:36:56Z
Abstract: Within this work we consider an axiomatic framework for Automated Market Makers (AMMs). AMMs are smart contracts that set prices for swaps on a pool of assets. By imposing reasonable axioms on the underlying utility function, we are able to characterize the properties of the swap size of the assets and of the resulting pricing oracle. In providing these general axioms, we define a novel measure of price impacts that can be used to quantify those costs between different AMM constructions. We have analyzed many existing AMMs and shown that the vast majority of them satisfy our axioms. We have also considered the question of fees and divergence loss. In doing so, we have proposed a new fee structure so as to make the AMM indifferent to transaction splitting.
Finally, we have proposed a novel AMM that has nice analytical properties and provides a large range over which there is no divergence loss.
Published: 2022-10-03T21:00:55Z
Authors: Maxim Bichuch, Zachary Feinstein

http://arxiv.org/abs/2408.02322v2
Title: Consistent time travel for realistic interactions with historical data: reinforcement learning for market making
Updated: 2025-01-29T09:43:45Z
Abstract: Reinforcement learning works best when the impact of the agent's actions on its environment can be perfectly simulated or fully appraised from available data. Some systems, however, are both hard to simulate and very sensitive to small perturbations. An additional difficulty arises when an RL agent is trained offline to be part of a multi-agent system using only anonymous data, which makes it impossible to infer the state of each agent, and thus to use the data directly. Typical examples are competitive systems without agent-resolved data such as financial markets. We introduce consistent data time travel for offline RL as a remedy for these problems: instead of using historical data in a sequential way, we argue that one needs to perform time travel in historical data, i.e., to adjust the time index so that both the past state and the influence of the RL agent's action on the system coincide with real data. This both alleviates the need to resort to imperfect models and consistently accounts for both the immediate and long-term reactions of the system when using anonymous historical data.
We apply this idea to market making in limit order books, a notoriously difficult task for RL; it turns out that the gain of the agent is significantly higher with data time travel than with naive sequential data, which suggests that the difficulty of this task for RL may have been overestimated.
Published: 2024-08-05T09:07:36Z
Comment: 11 pages
Authors: Vincent Ragel, Damien Challet

http://arxiv.org/abs/2501.17366v1
Title: Forecasting S&P 500 Using LSTM Models
Updated: 2025-01-29T01:31:56Z
Abstract: With the volatile and complex nature of financial data influenced by external factors, forecasting the stock market is challenging. Traditional models such as ARIMA and GARCH perform well with linear data but struggle with non-linear dependencies. Machine learning and deep learning models, particularly Long Short-Term Memory (LSTM) networks, address these challenges by capturing intricate patterns and long-term dependencies. This report compares ARIMA and LSTM models in predicting the S&P 500 index, a major financial benchmark.
Using historical price data and technical indicators, we evaluated these models using Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). The ARIMA model showed reasonable performance with an MAE of 462.1, RMSE of 614, and 89.8 percent accuracy, effectively capturing short-term trends but limited by its linear assumptions. The LSTM model, leveraging sequential processing capabilities, outperformed ARIMA with an MAE of 369.32, RMSE of 412.84, and 92.46 percent accuracy, capturing both short- and long-term dependencies. Notably, the LSTM model without additional features performed best, achieving an MAE of 175.9, RMSE of 207.34, and 96.41 percent accuracy, showcasing its ability to handle market data efficiently.
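For reference, the two error metrics quoted in the paragraph above can be computed as in the following minimal sketch. The function names and the sample index levels are assumptions made up for illustration; they are not the report's data or code, and the "percent accuracy" figure is not reproduced here because the abstract does not define it.

```python
import math


def mean_absolute_error(actual, predicted):
    """Average of the absolute differences between actual and predicted values."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def root_mean_squared_error(actual, predicted):
    """Square root of the average squared difference; penalizes large misses more."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_sq = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    return math.sqrt(mean_sq)


# Hypothetical index levels, invented purely for illustration.
actual = [4000.0, 4010.0, 4025.0, 4005.0]
predicted = [3990.0, 4030.0, 4020.0, 4005.0]

mae = mean_absolute_error(actual, predicted)
rmse = root_mean_squared_error(actual, predicted)
print(f"MAE={mae:.2f} RMSE={rmse:.2f}")
```

Because RMSE squares each error before averaging, it is never smaller than MAE on the same data, which matches the pattern in every MAE/RMSE pair reported above.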
Accurately predicting stock movements is crucial for investment strategies, risk assessments, and market stability. Our findings confirm the potential of deep learning models in handling volatile financial data compared to traditional ones. The results highlight the effectiveness of LSTM and suggest avenues for further improvements. This study provides insights into financial forecasting, offering a comparative analysis of ARIMA and LSTM while outlining their strengths and limitations.
Published: 2025-01-29T01:31:56Z
Authors: Prashant Pilla, Raji Mekonen
DOI: 10.5281/zenodo.14759118

http://arxiv.org/abs/2501.16488v1
Title: Solvability of the Gaussian Kyle model with imperfect information and risk aversion
Updated: 2025-01-27T20:45:09Z
Abstract: We investigate a Kyle model under Gaussian assumptions where a risk-averse informed trader has imperfect information on the fundamental price of an asset. We show that an equilibrium can be constructed by considering an optimal transport problem that is solved under a measure that renders the utility of the informed trader martingale and a filtering problem under the historical measure.
Published: 2025-01-27T20:45:09Z
Authors: Reda Chhaibi, Ibrahim Ekren, Eunjung Noh

http://arxiv.org/abs/2501.12591v1
Title: Optimal Rebate Design: Incentives, Competition and Efficiency in Auction Markets
Updated: 2025-01-22T02:34:43Z
Abstract: This study explores the design of an efficient rebate policy in auction markets, focusing on a continuous-time setting with competition among market participants. In this model, a stock exchange collects transaction fees from auction investors executing block trades to buy or sell a risky asset, then redistributes these fees as rebates to competing market makers submitting limit orders. Market makers influence both the price at which the asset trades and their arrival intensity in the auction. We frame this problem as a principal-multi-agent problem and provide necessary and sufficient conditions to characterize the Nash equilibrium among market makers.
The exchange's optimization problem is formulated as a high-dimensional Hamilton-Jacobi-Bellman equation with Poisson jump processes, which is solved using a verification result. To numerically compute the optimal rebate and transaction fee policies, we apply the Deep BSDE method. Our results show that optimal transaction fees and rebate structures improve market efficiency by narrowing the spread between the auction clearing price and the asset's fundamental value, while ensuring a minimal gain for both market makers indexed on the price of the asset on a coexisting limit order book.
Published: 2025-01-22T02:34:43Z
Authors: Thibaut Mastrolia, Tianrui Xu

http://arxiv.org/abs/2301.05157v2
Title: Statistical Learning with Sublinear Regret of Propagator Models
Updated: 2025-01-22T00:42:52Z
Abstract: We consider a class of learning problems in which an agent liquidates a risky asset while creating both transient price impact driven by an unknown convolution propagator and linear temporary price impact with an unknown parameter. We characterize the trader's performance as maximization of a revenue-risk functional, where the trader also exploits available information on a price predicting signal. We present a trading algorithm that alternates between exploration and exploitation phases and achieves sublinear regrets with high probability. For the exploration phase we propose a novel approach for non-parametric estimation of the price impact kernel by observing only the visible price process and derive sharp bounds on the convergence rate, which are characterised by the singularity of the propagator. These kernel estimation methods extend existing methods from the area of Tikhonov regularisation for inverse problems and are of independent interest. The bound on the regret in the exploitation phase is obtained by deriving stability results for the optimizer and value function of the associated class of infinite-dimensional stochastic control problems.
As a complementary result we propose a regression-based algorithm to estimate the conditional expectation of non-Markovian signals and derive its convergence rate.
Published: 2023-01-12T17:16:27Z
Comment: 57 pages, accepted by The Annals of Applied Probability
Authors: Eyal Neuman, Yufei Zhang

http://arxiv.org/abs/2410.19107v2
Title: What Drives Liquidity on Decentralized Exchanges? Evidence from the Uniswap Protocol
Updated: 2025-01-17T18:13:21Z
Abstract: We study liquidity on decentralized exchanges (DEXs), identifying factors at the platform, blockchain, token pair, and liquidity pool levels with predictive power for market depth metrics. We introduce the v2 counterfactual spread metric, a novel criterion which assesses the degree of liquidity concentration in pools using the "concentrated liquidity" mechanism, allowing us to decompose the effect of a factor on market depth into two channels: total value locked (TVL) and concentration. We further explore how external liquidity from competing DEXs and private inventory on DEX aggregators influence market depth. We find that (i) gas prices, returns, and a DEX's share of trading volume affect liquidity through concentration, (ii) internalization of order flow by private market makers affects TVL but not the overall market depth, and (iii) volatility, fee revenue, and markout affect liquidity through both channels.
Published: 2024-10-24T19:15:17Z
Authors: Brian Z. Zhu, Dingyue Liu, Xin Wan, Gordon Liao, Ciamac C. Moallemi, Brad Bachu

http://arxiv.org/abs/2501.09636v2
Title: LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading
Updated: 2025-01-17T11:44:53Z
Abstract: Recent advances in deep learning and large language models (LLMs) have facilitated the deployment of the mixture-of-experts (MoE) mechanism in the stock investment domain. While these models have demonstrated promising trading performance, they are often unimodal, neglecting the wealth of information available in other modalities, such as textual data.
Moreover, the traditional neural network-based router selection mechanism fails to consider contextual and real-world nuances, resulting in suboptimal expert selection. To address these limitations, we propose LLMoE, a novel framework that employs LLMs as the router within the MoE architecture. Specifically, we replace the conventional neural network-based router with LLMs, leveraging their extensive world knowledge and reasoning capabilities to select experts based on historical price data and stock news. This approach provides a more effective and interpretable selection mechanism. Our experiments on multimodal real-world stock datasets demonstrate that LLMoE outperforms state-of-the-art MoE models and other deep neural network approaches. Additionally, the flexible architecture of LLMoE allows for easy adaptation to various downstream tasks.
Published: 2025-01-16T16:25:30Z
Comment: Accepted by AAAI 2025 Workshop on AI for Social Impact - Bridging Innovations in Finance, Social Media, and Crime Prevention
Authors: Kuan-Ming Liu, Ming-Chih Lo

http://arxiv.org/abs/2501.09638v1
Title: Optimal Execution among $N$ Traders with Transient Price Impact
Updated: 2025-01-16T16:27:52Z
Abstract: We study $N$-player optimal execution games in an Obizhaeva--Wang model of transient price impact. When the game is regularized by an instantaneous cost on the trading rate, a unique equilibrium exists and we derive its closed form. Without regularization, by contrast, there is no equilibrium. We prove that existence is restored if (and only if) a very particular, time-dependent cost on block trades is added to the model. In that case, the equilibrium is particularly tractable. We show that this equilibrium is the limit of the regularized equilibria as the instantaneous cost parameter $\varepsilon$ tends to zero. Moreover, we explain the seemingly ad-hoc block cost as the limit of the equilibrium instantaneous costs. Notably, in contrast to the single-player problem, the optimal instantaneous costs do not vanish in the limit $\varepsilon\to0$.
We use this tractable equilibrium to study the cost of liquidating in the presence of predators and the cost of anarchy. Our results also give a new interpretation to the erratic behaviors previously observed in discrete-time trading games with transient price impact.
Published: 2025-01-16T16:27:52Z
Comment: 63 pages, 4 figures, 1 table
Authors: Steven Campbell, Marcel Nutz

http://arxiv.org/abs/2501.09404v1
Title: Agent-Based Simulation of a Perpetual Futures Market
Updated: 2025-01-16T09:19:08Z
Abstract: I introduce an agent-based model of a Perpetual Futures market with heterogeneous agents trading via a central limit order book. Perpetual Futures (henceforth Perps) are financial derivatives introduced by the economist Robert Shiller, designed to peg their price to that of the underlying Spot market. This paper extends the limit order book model of Chiarella et al. (2002) by taking their agent and orderbook parameters, designed for a simple stock exchange, and applying it to the more complex environment of a Perp market with long and short traders who exhibit both positional and basis-trading behaviors. I find that despite the simplicity of the agent behavior, the simulation is able to reproduce the most salient feature of a Perp market, the pegging of the Perp price to the underlying Spot price. In contrast to fundamental simulations of stock markets which aim to reproduce empirically observed stylized facts such as the leptokurtosis and heteroscedasticity of returns, volatility clustering and others, in derivatives markets many of these features are provided exogenously by the underlying Spot price signal. This is especially true of Perps since the derivative is designed to mimic the price of the Spot market. Therefore, this paper will focus exclusively on analyzing how market and agent parameters such as order lifetime, trading horizon and spread affect the premiums at which Perps trade with respect to the underlying Spot market.
I show that this simulation provides a simple and robust environment for exploring the dynamics of Perpetual Futures markets and their microstructure in this regard. Lastly, I explore the ability of the model to reproduce the effects of biasing long traders to trade positionally and short traders to basis-trade, which was the original intention behind the market design, and is a tendency observed empirically in real Perp markets.
Published: 2025-01-16T09:19:08Z
Comment: 37 pages, 11 figures, Minerva University Master of Science in Decision Analysis (MDA) Thesis
Authors: Ramshreyas Rao

http://arxiv.org/abs/2501.08822v1
Title: Deep Learning Meets Queue-Reactive: A Framework for Realistic Limit Order Book Simulation
Updated: 2025-01-15T14:19:20Z
Abstract: The Queue-Reactive model introduced by Huang et al. (2015) has become a standard tool for limit order book modeling, widely adopted by both researchers and practitioners for its simplicity and effectiveness. We present the Multidimensional Deep Queue-Reactive (MDQR) model, which extends this framework in three ways: it relaxes the assumption of queue independence, enriches the state space with market features, and models the distribution of order sizes. Through a neural network architecture, the model learns complex dependencies between different price levels and adapts to varying market conditions, while preserving the interpretable point-process foundation of the original framework. Using data from the Bund futures market, we show that MDQR captures key market properties including the square-root law of market impact, cross-queue correlations, and realistic order size patterns. The model demonstrates particular strength in reproducing both conditional and stationary distributions of order sizes, as well as various stylized facts of market microstructure.
The model achieves this while maintaining the computational efficiency needed for practical applications such as strategy development through reinforcement learning or realistic backtesting.
Published: 2025-01-15T14:19:20Z
Authors: Hamza Bodor, Laurent Carlier