https://arxiv.org/api/Im5tiZKZHY7ZBKuDxq2mbFMHKe0 2026-03-28T11:09:11Z 2171 195 15 http://arxiv.org/abs/2510.15883v1 FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance 2025-08-30T02:08:19Z

Traditional stochastic control methods in finance struggle in real world markets due to their reliance on simplifying assumptions and stylized frameworks. Such methods typically perform well in specific, well defined environments but yield suboptimal results in changed, non stationary ones. We introduce FinFlowRL, a novel framework for financial optimal stochastic control. The framework pretrains an adaptive meta policy learning from multiple expert strategies, then finetunes through reinforcement learning in the noise space to optimize the generative process. By employing action chunking generating action sequences rather than single decisions, it addresses the non Markovian nature of markets. FinFlowRL consistently outperforms individually optimized experts across diverse market conditions.

2025-08-30T02:08:19Z 21 pages, 5 algorithms, 4 tables, 5 figures Yang Li Zhi Chen http://arxiv.org/abs/2509.10483v1 Equity Premium Prediction: Taking into Account the Role of Long, even Asymmetric, Swings in Stock Market Behavior 2025-08-29T07:51:01Z

Through a novel approach, this paper shows that substantial change in stock market behavior has a statistically and economically significant impact on equity risk premium predictability both on in-sample and out-of-sample cases. In line with Auer's ''Bullish ratio'', a ''Bullish index'' is introduced to measure the changes in stock market behavior, which we describe through a ''fluctuation detrending moving average analysis'' (FDMAA) for returns. We consider 28 indicators. We find that a ''positive shock'' of the Bullish Index is closely related to strong equity risk premium predictability for forecasts based on macroeconomic variables for up to six months. In contrast, a ''negative shock'' is associated with strong equity risk premium predictability with adequate forecasts for up to nine months when based on technical indicators.

2025-08-29T07:51:01Z 36 pages, 69 references, 7 tables, 3 figures ; as prepared for Physica A Kuok Sin Un Marcel Ausloos http://arxiv.org/abs/2504.06932v3 Maximizing Battery Storage Profits via High-Frequency Intraday Trading 2025-08-26T14:06:52Z

Maximizing revenue for grid-scale battery energy storage systems in continuous intraday electricity markets requires strategies that are able to seize trading opportunities as soon as new information arrives. This paper introduces and evaluates an automated high-frequency trading strategy for battery energy storage systems trading on the intraday market for power while explicitly considering the dynamics of the limit order book, market rules, and technical parameters. The standard rolling intrinsic strategy is adapted for continuous intraday electricity markets and solved using a dynamic programming approximation that is two to three orders of magnitude faster than an exact mixed-integer linear programming solution. A detailed backtest over a full year of German order book data demonstrates that the proposed dynamic programming formulation does not reduce trading profits and enables the policy to react to every relevant order book update, enabling realistic rapid backtesting. Our results show the significant revenue potential of high-frequency trading: our policy earns 58% more than when re-optimizing only once every hour and 14% more than when re-optimizing once per minute, highlighting that profits critically depend on trading speed. Furthermore, we leverage the speed of our algorithm to train a parametric extension of the rolling intrinsic, increasing yearly revenue by 8.4% out of sample.

2025-04-09T14:38:09Z David Schaurecker David Wozabal Nils Löhndorf Thorsten Staake http://arxiv.org/abs/2508.17837v1 Bimodal Dynamics of the Artificial Limit Order Book Stock Exchange with Autonomous Traders 2025-08-25T09:36:00Z

This paper explores the bifurcative dynamics of an artificial stock market exchange (ASME) with endogenous, myopic traders interacting through a limit order book (LOB). We showed that agent-based price dynamics possess intrinsic bistability, which is not a result of randomness but an emergent property of micro-level trading rules, where even identical initial conditions lead to qualitatively different long-run price equilibria: a deterministic zero-price state and a persistent positive-price equilibrium. The study also identifies a metastable region with elevated volatility between the basins of attraction and reveals distinct transient behaviors for trajectories converging to these equilibria. Furthermore, we observe that the system is neither entirely regular nor fully chaotic. By highlighting the emergence of divergent market outcomes from uniform beginnings, this work contributes a novel perspective on the inherent path dependence and complex dynamics of artificial stock markets.

2025-08-25T09:36:00Z Matej Steinbacher Mitja Steinbacher Matjaz Steinbacher http://arxiv.org/abs/2508.14784v1 Graph Learning for Foreign Exchange Rate Prediction and Statistical Arbitrage 2025-08-20T15:29:31Z

We propose a two-step graph learning approach for foreign exchange statistical arbitrages (FXSAs), addressing two key gaps in prior studies: the absence of graph-learning methods for foreign exchange rate prediction (FXRP) that leverage multi-currency and currency-interest rate relationships, and the disregard of the time lag between price observation and trade execution. In the first step, to capture complex multi-currency and currency-interest rate relationships, we formulate FXRP as an edge-level regression problem on a discrete-time spatiotemporal graph. This graph consists of currencies as nodes and exchanges as edges, with interest rates and foreign exchange rates serving as node and edge features, respectively. We then introduce a graph-learning method that leverages the spatiotemporal graph to address the FXRP problem. In the second step, we present a stochastic optimization problem to exploit FXSAs while accounting for the observation-execution time lag. To address this problem, we propose a graph-learning method that enforces constraints through projection and ReLU, maximizes risk-adjusted return by leveraging a graph with exchanges as nodes and influence relationships as edges, and utilizes the predictions from the FXRP method for the constraint parameters and node features. Moreover, we prove that our FXSA method satisfies empirical arbitrage constraints. The experimental results demonstrate that our FXRP method yields statistically significant improvements in mean squared error, and that the FXSA method achieves a 61.89% higher information ratio and a 45.51% higher Sortino ratio than a benchmark. Our approach provides a novel perspective on FXRP and FXSA within the context of graph learning.

2025-08-20T15:29:31Z Yoonsik Hong Diego Klabjan http://arxiv.org/abs/2508.14656v1 Deep Learning for Short Term Equity Trend Forecasting: A Behavior Driven Multi Factor Approach 2025-08-20T12:15:32Z

This study proposes a behaviorally-informed multi-factor stock selection framework that integrates short-cycle technical alpha signals with deep learning. We design a dual-task multilayer perceptron (MLP) that jointly predicts five-day future returns and directional price movements, thereby capturing nonlinear market behaviors such as volume-price divergence, momentum-driven herding, and bottom reversals. The model is trained on 40 carefully constructed factors derived from price-volume patterns and behavioral finance insights. Empirical evaluation demonstrates that the dual-task MLP achieves superior and stable performance across both predictive accuracy and economic relevance, as measured by information coefficient (IC), information ratio (IR), and portfolio backtesting results. Comparative experiments further show that deep learning methods outperform linear baselines by effectively capturing structural interactions between factors. This work highlights the potential of structure-aware deep learning in enhancing multi-factor modeling and provides a practical framework for short-horizon quantitative investment strategies.

2025-08-20T12:15:32Z Yuqi Luan http://arxiv.org/abs/2508.00554v3 ContestTrade: A Multi-Agent Trading System Based on Internal Contest Mechanism 2025-08-18T06:13:10Z

In financial trading, large language model (LLM)-based agents demonstrate significant potential. However, the high sensitivity to market noise undermines the performance of LLM-based trading systems. To address this limitation, we propose a novel multi-agent system featuring an internal competitive mechanism inspired by modern corporate management structures. The system consists of two specialized teams: (1) Data Team - responsible for processing and condensing massive market data into diversified text factors, ensuring they fit the model's constrained context. (2) Research Team - tasked with making parallelized multipath trading decisions based on deep research methods. The core innovation lies in implementing a real-time evaluation and ranking mechanism within each team, driven by authentic market feedback. Each agent's performance undergoes continuous scoring and ranking, with only outputs from top-performing agents being adopted. The design enables the system to adaptively adjust to dynamic environment, enhances robustness against market noise and ultimately delivers superior trading performance. Experimental results demonstrate that our proposed system significantly outperforms prevailing multi-agent systems and traditional quantitative investment methods across diverse evaluation metrics. ContestTrade is open-sourced on GitHub at https://github.com/FinStep-AI/ContestTrade.

2025-08-01T11:48:13Z Li Zhao Rui Sun Zuoyou Jiang Bo Yang Yuxiao Bai Mengting Chen Xinyang Wang Jing Li Zuo Bai http://arxiv.org/abs/2508.08698v1 DiffVolume: Diffusion Models for Volume Generation in Limit Order Books 2025-08-12T07:42:00Z

Modeling limit order books (LOBs) dynamics is a fundamental problem in market microstructure research. In particular, generating high-dimensional volume snapshots with strong temporal and liquidity-dependent patterns remains a challenging task, despite recent work exploring the application of Generative Adversarial Networks to LOBs. In this work, we propose a conditional \textbf{Diff}usion model for the generation of future LOB \textbf{Volume} snapshots (\textbf{DiffVolume}). We evaluate our model across three axes: (1) \textit{Realism}, where we show that DiffVolume, conditioned on past volume history and time of day, better reproduces statistical properties such as marginal distribution, spatial correlation, and autocorrelation decay; (2) \textit{Counterfactual generation}, allowing for controllable generation under hypothetical liquidity scenarios by additionally conditioning on a target future liquidity profile; and (3) \textit{Downstream prediction}, where we show that the synthetic counterfactual data from our model improves the performance of future liquidity forecasting models. Together, these results suggest that DiffVolume provides a powerful and flexible framework for realistic and controllable LOB volume generation.

2025-08-12T07:42:00Z 13 pages, 6 figures, 3 tables Zhuohan Wang Carmine Ventre http://arxiv.org/abs/2508.21075v1 A Stream Pipeline Framework for Digital Payment Programming based on Smart Contracts 2025-08-12T03:58:19Z

Digital payments play a pivotal role in the burgeoning digital economy. Moving forward, the enhancement of digital payment systems necessitates programmability, going beyond just efficiency and convenience, to meet the evolving needs and complexities. Smart contract platforms like Central Bank Digital Currency (CBDC) networks and blockchains support programmable digital payments. However, the prevailing paradigm of programming payment logics involves coding smart contracts with programming languages, leading to high costs and significant security challenges. A novel and versatile method for payment programming on DLTs was presented in this paper - transforming digital currencies into token streams, then pipelining smart contracts to authorize, aggregate, lock, direct, and dispatch these streams efficiently from source to target accounts. By utilizing a small set of configurable templates, a few specialized smart contracts could be generated, and support most of payment logics through configuring and composing them. This approach could substantially reduce the cost of payment programming and enhance security, self-enforcement, adaptability, and controllability, thus hold the potential to become an essential component in the infrastructure of digital economy.

2025-08-12T03:58:19Z 5 pages, 2 figures Zijia Meng Victor Feng http://arxiv.org/abs/2508.08152v1 Optimal Fees for Liquidity Provision in Automated Market Makers 2025-08-11T16:30:02Z

Passive liquidity providers (LPs) in automated market makers (AMMs) face losses due to adverse selection (LVR), which static trading fees often fail to offset in practice. We study the key determinants of LP profitability in a dynamic reduced-form model where an AMM operates in parallel with a centralized exchange (CEX), traders route their orders optimally to the venue offering the better price, and arbitrageurs exploit price discrepancies. Using large-scale simulations and real market data, we analyze how LP profits vary with market conditions such as volatility and trading volume, and characterize the optimal AMM fee as a function of these conditions. We highlight the mechanisms driving these relationships through extensive comparative statics, and confirm the model's relevance through market data calibration. A key trade-off emerges: fees must be low enough to attract volume, yet high enough to earn sufficient revenues and mitigate arbitrage losses. We find that under normal market conditions, the optimal AMM fee is competitive with the trading cost on the CEX and remarkably stable, whereas in periods of very high volatility, a high fee protects passive LPs from severe losses. These findings suggest that a threshold-type dynamic fee schedule is both robust enough to market conditions and improves LP outcomes.

2025-08-11T16:30:02Z 43 pages, 23 figures, 8 tables Steven Campbell Philippe Bergault Jason Milionis Marcel Nutz http://arxiv.org/abs/2508.06914v1 Prediction of high-frequency futures return directions based on the mean uncertainty classification methods: An application in China's future market 2025-08-09T09:56:48Z

In this paper, we mainly focus on the prediction of short-term average return directions in China's high-frequency futures market. As minor fluctuations with limited amplitude and short duration are typically regarded as random noise, only price movements of sufficient magnitude qualify as statistically significant signals. Therefore data imbalance emerges as a key problem during predictive modeling. From the view of data distribution imbalance, we employee the mean-uncertainty logistic regression (mean-uncertainty LR) classification method under the sublinear expectation (SLE) framework, and further propose the mean-uncertainty support vector machines (mean-uncertainty SVM) method for the prediction. Corresponding investment strategies are developed based on the prediction results. For data selection, we utilize trading data and limit order book data of the top 15 liquid products among the most active contracts in China's future market. Empirical results demonstrate that comparing with conventional LR-related and SVM-related imbalanced data classification methods, the two mean-uncertainty approaches yields significant advantages in both classification metrics and average returns per trade.

2025-08-09T09:56:48Z 19 pages, 3 figures Ying Peng Yifan Zhang Xin Wang http://arxiv.org/abs/2508.16598v1 Sizing the Risk: Kelly, VIX, and Hybrid Approaches in Put-Writing on Index Options 2025-08-09T08:31:00Z

This paper examines systematic put-writing strategies applied to S&P 500 Index options, with a focus on position sizing as a key determinant of long-term performance. Despite the well-documented volatility risk premium, where implied volatility exceeds realized volatility, the practical implementation of short-dated volatility-selling strategies remains underdeveloped in the literature. This study evaluates three position sizing approaches: the Kelly criterion, VIX-based volatility regime scaling, and a novel hybrid method combining both. Using SPXW options with expirations from 0 to 5 days, the analysis explores a broad design space, including moneyness levels, volatility estimators, and memory horizons. Results show that ultra-short-dated, far out-of-the-money options deliver superior risk-adjusted returns. The hybrid sizing method consistently balances return generation with robust drawdown control, particularly under low-volatility conditions such as those seen in 2024. The study offers new insights into volatility harvesting, introducing a dynamic sizing framework that adapts to shifting market regimes. It also contributes practical guidance for constructing short-dated option strategies that are robust across market environments. These findings have direct applications for institutional investors seeking to enhance portfolio efficiency through systematic exposure to volatility premia.

2025-08-09T08:31:00Z Maciej Wysocki http://arxiv.org/abs/2508.16589v1 ARL-Based Multi-Action Market Making with Hawkes Processes and Variable Volatility 2025-08-07T21:50:30Z

We advance market-making strategies by integrating Adversarial Reinforcement Learning (ARL), Hawkes Processes, and variable volatility levels while also expanding the action space available to market makers (MMs). To enhance the adaptability and robustness of these strategies -- which can quote always, quote only on one side of the market or not quote at all -- we shift from the commonly used Poisson process to the Hawkes process, which better captures real market dynamics and self-exciting behaviors. We then train and evaluate strategies under volatility levels of 2 and 200. Our findings show that the 4-action MM trained in a low-volatility environment effectively adapts to high-volatility conditions, maintaining stable performance and providing two-sided quotes at least 92\% of the time. This indicates that incorporating flexible quoting mechanisms and realistic market simulations significantly enhances the effectiveness of market-making strategies.

2025-08-07T21:50:30Z ICAIF '24: Proceedings of the 5th ACM International Conference on AI in Finance, November 14--17, 2024, Brooklyn, NY, USA Ziyi Wang Carmine Ventre Maria Polukarov 10.1145/3677052.3698695 http://arxiv.org/abs/2508.16588v1 Robust Market Making: To Quote, or not To Quote 2025-08-07T21:49:24Z

Market making is a popular trading strategy, which aims to generate profit from the spread between the quotes posted at either side of the market. It has been shown that training market makers (MMs) with adversarial reinforcement learning allows to overcome the risks due to changing market conditions and to lead to robust performances. Prior work assumes, however, that MMs keep quoting throughout the trading process, but in practice this is not required, even for ``registered'' MMs (that only need to satisfy quoting ratios defined by the market rules). In this paper, we build on this line of work and enrich the strategy space of the MM by allowing to occasionally not quote or provide single-sided quotes. Towards this end, in addition to the MM agents that provide continuous bid-ask quotes, we have designed two new agents with increasingly richer action spaces. The first has the option to provide bid-ask quotes or refuse to quote. The second has the option to provide bid-ask quotes, refuse to quote, or only provide single-sided ask or bid quotes. We employ a model-driven approach to empirically compare the performance of the continuously quoting MM with the two agents above in various types of adversarial environments. We demonstrate how occasional refusal to provide bid-ask quotes improves returns and/or Sharpe ratios. The quoting ratios of well-trained MMs can basically meet any market requirements, reaching up to 99.9$\%$ in some cases.

2025-08-07T21:49:24Z ICAIF '23: Proceedings of the Fourth ACM International Conference on AI in Finance, November 27--29, 2023, Brooklyn, NY, USA Ziyi Wang Carmine Ventre Maria Polukarov 10.1145/3604237.3626858 http://arxiv.org/abs/2508.02247v2 ByteGen: A Tokenizer-Free Generative Model for Orderbook Events in Byte Space 2025-08-07T04:31:56Z

Generative modeling of high-frequency limit order book (LOB) dynamics is a critical yet unsolved challenge in quantitative finance, essential for robust market simulation and strategy backtesting. Existing approaches are often constrained by simplifying stochastic assumptions or, in the case of modern deep learning models like Transformers, rely on tokenization schemes that affect the high-precision, numerical nature of financial data through discretization and binning. To address these limitations, we introduce ByteGen, a novel generative model that operates directly on the raw byte streams of LOB events. Our approach treats the problem as an autoregressive next-byte prediction task, for which we design a compact and efficient 32-byte packed binary format to represent market messages without information loss. The core novelty of our work is the complete elimination of feature engineering and tokenization, enabling the model to learn market dynamics from its most fundamental representation. We achieve this by adapting the H-Net architecture, a hybrid Mamba-Transformer model that uses a dynamic chunking mechanism to discover the inherent structure of market messages without predefined rules. Our primary contributions are: 1) the first end-to-end, byte-level framework for LOB modeling; 2) an efficient packed data representation; and 3) a comprehensive evaluation on high-frequency data. Trained on over 34 million events from CME Bitcoin futures, ByteGen successfully reproduces key stylized facts of financial markets, generating realistic price distributions, heavy-tailed returns, and bursty event timing. Our findings demonstrate that learning directly from byte space is a promising and highly flexible paradigm for modeling complex financial systems, achieving competitive performance on standard market quality metrics without the biases of tokenization.

2025-08-04T09:48:42Z 21 pages, 3 tables, 5 figures Yang Li Zhi Chen