https://arxiv.org/api/b9FsGaHZPjC4unZaq/ZOstOQNQM2026-04-01T10:23:04Z217425515http://arxiv.org/abs/2501.16772v2Trends and Reversion in Financial Markets on Time Scales from Minutes to Decades2025-05-30T07:51:01ZWe empirically analyze the reversion of financial market trends with time horizons ranging from minutes to decades. The analysis covers equities, interest rates, currencies and commodities and combines 14 years of futures tick data, 30 years of daily futures prices, 330 years of monthly asset prices, and yearly financial data since medieval times.
Across asset classes, we find that markets are in a trending regime on time scales that range from a few hours to a few years, while they are in a reversion regime on shorter and longer time scales. In the trending regime, weak trends tend to persist, which can be explained by herding behavior of investors. However, in this regime trends tend to revert before they become strong enough to be statistically significant, which can be interpreted as a return of asset prices to their intrinsic value. In the reversion regime, we find the opposite pattern: weak trends tend to revert, while those trends that become statistically significant tend to persist.
Our results provide a set of empirical tests of theoretical models of financial markets. We interpret them in the light of a recently proposed lattice gas model, where the lattice represents the social network of traders, the gas molecules represent the shares of financial assets, and efficient markets correspond to the critical point. If this model is accurate, the lattice gas must be near this critical point on time scales from 1 hour to a few days, with a correlation time of a few years.2025-01-28T07:51:41Z38 pages, 10 figures. Added additional explanations, references, and minor correctionsSara A. SafariChristof Schmidhuberhttp://arxiv.org/abs/2502.17906v4Why do financial prices exhibit Brownian motion despite predictable order flow?2025-05-27T09:32:30ZIn financial market microstructure, there are two enigmatic empirical laws: (i) the market-order flow has predictable persistence due to metaorder splitters by institutional investors, well formulated as the Lillo-Mike-Farmer model. However, this phenomenon seems paradoxical given the diffusive and unpredictable price dynamics; (ii) the price impact $I(Q)$ of a large metaorder $Q$ follows the square-root law, $I(Q)\propto \sqrt{Q}$. Here we theoretically reveal why price dynamics follows Brownian motion despite predictable order flow by unifying these enigmas. We generalize the Lillo-Mike-Farmer model to nonlinear price-impact dynamics, which is mapped to an exactly solvable Lévy-walk model. Our exact solution shows that the price dynamics remains diffusive under the square-root law, even under persistent order flow. This work illustrates the crucial role of the square-root law in mitigating large price movements by large metaorders, thereby leading to the Brownian price dynamics, consistently with the efficient market hypothesis over long timescales.2025-02-25T07:12:03ZMain: 7 pages, 4 figures. SI: 6 pages, 3 figures. Minor bugs in simulation codes are fixedYuki SatoKiyoshi Kanazawahttp://arxiv.org/abs/2505.19617v1Hybrid Models for Financial Forecasting: Combining Econometric, Machine Learning, and Deep Learning Models2025-05-26T07:32:23ZThis research systematically develops and evaluates various hybrid modeling approaches by combining traditional econometric models (ARIMA and ARFIMA models) with machine learning and deep learning techniques (SVM, XGBoost, and LSTM models) to forecast financial time series. The empirical analysis is based on two distinct financial assets: the S&P 500 index and Bitcoin. By incorporating over two decades of daily data for the S&P 500 and almost ten years of Bitcoin data, the study provides a comprehensive evaluation of forecasting methodologies across different market conditions and periods of financial distress. Models' training and hyperparameter tuning procedure is performed using a novel three-fold dynamic cross-validation method. The applicability of applied models is evaluated using both forecast error metrics and trading performance indicators. The obtained findings indicate that the proper construction process of hybrid models plays a crucial role in developing profitable trading strategies, outperforming their individual components and the benchmark Buy&Hold strategy. The most effective hybrid model architecture was achieved by combining the econometric ARIMA model with either SVM or LSTM, under the assumption of a non-additive relationship between the linear and nonlinear components.2025-05-26T07:32:23Z30 pages, 9 figures, 7 tablesDominik StempieńRobert Ślepaczukhttp://arxiv.org/abs/2505.19243v1Comparative analysis of financial data differentiation techniques using LSTM neural network2025-05-25T17:49:10ZWe compare traditional approach of computing logarithmic returns with the fractional differencing method and its tempered extension as methods of data preparation before their usage in advanced machine learning models. Differencing parameters are estimated using multiple techniques. The empirical investigation is conducted on data from four major stock indices covering the most recent 10-year period. The set of explanatory variables is additionally extended with technical indicators. The effectiveness of the differencing methods is evaluated using both forecast error metrics and risk-adjusted return trading performance metrics. The findings suggest that fractional differentiation methods provide a suitable data transformation technique, improving the predictive model forecasting performance. Furthermore, the generated predictions appeared to be effective in constructing profitable trading strategies for both individual assets and a portfolio of stock indices. These results underline the importance of appropriate data transformation techniques in financial time series forecasting, supporting the application of memory-preserving techniques.2025-05-25T17:49:10Z71 pages, 21 figures, 14 tablesDominik StempieńJanusz Gajdahttp://arxiv.org/abs/2505.17388v1Stochastic Price Dynamics in Response to Order Flow Imbalance: Evidence from CSI 300 Index Futures2025-05-23T01:53:28ZWe conduct modeling of the price dynamics following order flow imbalance in market microstructure and apply the model to the analysis of Chinese CSI 300 Index Futures. There are three findings. The first is that the order flow imbalance is analogous to a shock to the market. Unlike the common practice of using Hawkes processes, we model the impact of order flow imbalance as an Ornstein-Uhlenbeck process with memory and mean-reverting characteristics driven by a jump-type Lévy process. Motivated by the empirically stable correlation between order flow imbalance and contemporaneous price changes, we propose a modified asset price model where the drift term of canonical geometric Brownian motion is replaced by an Ornstein-Uhlenbeck process. We establish stochastic differential equations and derive the logarithmic return process along with its mean and variance processes under initial boundary conditions, and evolution of cost-effectiveness ratio with order flow imbalance as the trading trigger point, termed as the quasi-Sharpe ratio or response ratio. Secondly, our results demonstrate horizon-dependent heterogeneity in how conventional metrics interact with order flow imbalance. This underscores the critical role of forecast horizon selection for strategies. Thirdly, we identify regime-dependent dynamics in the memory and forecasting power of order flow imbalance. This taxonomy provides both a screening protocol for existing indicators and an ex-ante evaluation paradigm for novel metrics.2025-05-23T01:53:28Z37 pages, 18 figuresChen HuKouxiao Zhanghttp://arxiv.org/abs/2505.05784v3FlowHFT: Imitation Learning via Flow Matching Policy for Optimal High-Frequency Trading under Diverse Market Conditions2025-05-22T04:48:37ZHigh-frequency trading (HFT) is an investing strategy that continuously monitors market states and places bid and ask orders at millisecond speeds. Traditional HFT approaches fit models with historical data and assume that future market states follow similar patterns. This limits the effectiveness of any single model to the specific conditions it was trained for. Additionally, these models achieve optimal solutions only under specific market conditions, such as assumptions about stock price's stochastic process, stable order flow, and the absence of sudden volatility. Real-world markets, however, are dynamic, diverse, and frequently volatile. To address these challenges, we propose the FlowHFT, a novel imitation learning framework based on flow matching policy. FlowHFT simultaneously learns strategies from numerous expert models, each proficient in particular market scenarios. As a result, our framework can adaptively adjust investment decisions according to the prevailing market state. Furthermore, FlowHFT incorporates a grid-search fine-tuning mechanism. This allows it to refine strategies and achieve superior performance even in complex or extreme market scenarios where expert strategies may be suboptimal. We test FlowHFT in multiple market environments. We first show that flow matching policy is applicable in stochastic market environments, thus enabling FlowHFT to learn trading strategies under different market conditions. Notably, our single framework consistently achieves performance superior to the best expert for each market condition.2025-05-09T04:58:14Z16 pages, 6 figures, 7 tables, 2 algorithmsYang LiZhi ChenSteve Yanghttp://arxiv.org/abs/2505.16136v1Interpretable Machine Learning for Macro Alpha: A News Sentiment Case Study2025-05-22T02:24:45ZThis study introduces an interpretable machine learning (ML) framework to extract macroeconomic alpha from global news sentiment. We process the Global Database of Events, Language, and Tone (GDELT) Project's worldwide news feed using FinBERT -- a Bidirectional Encoder Representations from Transformers (BERT) based model pretrained on finance-specific language -- to construct daily sentiment indices incorporating mean tone, dispersion, and event impact. These indices drive an XGBoost classifier, benchmarked against logistic regression, to predict next-day returns for EUR/USD, USD/JPY, and 10-year U.S. Treasury futures (ZN). Rigorous out-of-sample (OOS) backtesting (5-fold expanding-window cross-validation, OOS period: c. 2017-April 2025) demonstrates exceptional, cost-adjusted performance for the XGBoost strategy: Sharpe ratios achieve 5.87 (EUR/USD), 4.65 (USD/JPY), and 4.65 (Treasuries), with respective compound annual growth rates (CAGRs) exceeding 50% in Foreign Exchange (FX) and 22% in bonds. Shapley Additive Explanations (SHAP) affirm that sentiment dispersion and article impact are key predictive features. Our findings establish that integrating domain-specific Natural Language Processing (NLP) with interpretable ML offers a potent and explainable source of macro alpha.2025-05-22T02:24:45Z18 pages (including references), 1 figure, 1 table. Code available at \url{https://github.com/yukepenn/macro-news-sentiment-trading}. Keywords: Macro Sentiment, News Sentiment, Algorithmic Trading, GDELT, FinBERT, NLP, Alternative Data, Foreign Exchange, Treasury Futures, Quantitative Finance, Machine Learning, SHAP, InterpretabilityYuke Zhanghttp://arxiv.org/abs/2505.15611v1Shortermism and excessive risk taking in optimal execution with a target performance2025-05-21T15:02:07ZWe deal with the optimal execution problem when the broker's goal is to reach a performance barrier avoiding a downside barrier. The performance is provided by the wealth accumulated by trading in the market, the shares detained by the broker evaluated at the market price plus a slippage cost yielding a quadratic inventory cost. Over a short horizon, this type of remuneration leads, at the same time, to a more aggressive and less risky strategy compared to the classical one, and over a long horizon the performance turns to be poorer and more dispersed.2025-05-21T15:02:07ZEmilio BarucciYuheng Lanhttp://arxiv.org/abs/2505.15296v1Agent-based Liquidity Risk Modelling for Financial Markets2025-05-21T09:25:32ZIn this paper, we describe a novel agent-based approach for modelling the transaction cost of buying or selling an asset in financial markets, e.g., to liquidate a large position as a result of a margin call to meet financial obligations. The simple act of buying or selling in the market causes a price impact and there is a cost described as liquidity risk. For example, when selling a large order, there is market slippage -- each successive trade will execute at the same or worse price. When the market adjusts to the new information revealed by the execution of such a large order, we observe in the data a permanent price impact that can be attributed to the change in the fundamental value as market participants reassess the value of the asset. In our ABM model, we introduce a novel mechanism where traders assume orderflow is informed and each trade reveals some information about the value of the asset, and traders update their belief of the fundamental value for every trade. The result is emergent, realistic price impact without oversimplifying the problem as most stylised models do, but within a realistic framework that models the exchange with its protocols, its limit orderbook and its auction mechanism and that can calculate the transaction cost of any execution strategy without limitation. Our stochastic ABM model calculates the costs and uncertainties of buying and selling in a market by running Monte-Carlo simulations, for a better understanding of liquidity risk and can be used to optimise for optimal execution under liquidity risk. We demonstrate its practical application in the real world by calculating the liquidity risk for the Hang-Seng Futures Index.2025-05-21T09:25:32ZSimudyne Working Paper 008, 9 pagesPerukrishnen VytelingumRory BaggottNamid StillmanJianfei ZhangDingqiu ZhuTao ChenJustin Lyonhttp://arxiv.org/abs/2407.12683v2Information Flow in the FTX Bankruptcy: A Network Approach2025-05-19T13:11:14ZThis paper investigates the cryptocurrency network of the FTX exchange during the collapse of its native token, FTT, to understand how network structures adapt to significant financial disruptions, by exploiting vertex centrality measures. Using proprietary data on the transactional relationships between various cryptocurrencies, we construct the filtered correlation matrix to identify the most significant relations in the FTX and Binance markets. By using suitable centrality measures - closeness and information centrality - we assess network stability during FTX's bankruptcy. The findings document the appropriateness of such vertex centralities in understanding the resilience and vulnerabilities of financial networks. By tracking the changes in centrality values before and during the FTX crisis, this study provides useful insights into the structural dynamics of the cryptocurrency market. Results reveal how different cryptocurrencies experienced shifts in their network roles due to the crisis. Moreover, our findings highlight the interconnectedness of cryptocurrency markets and how the failure of a single entity can lead to widespread repercussions that destabilize other nodes of the network.2024-07-17T16:02:51Z\Physica A: Statistical Mechanics and its Applications, 655, 130167 (2024)Riccardo De BlasisLuca GalatiRosanna GrassiGiorgio Rizzini10.1016/j.physa.2024.130167http://arxiv.org/abs/2501.07581v2Optimal Execution Strategies Incorporating Internal Liquidity Through Market Making2025-05-15T14:01:48ZThis paper introduces a new algorithmic execution model that integrates interbank limit and market orders with internal liquidity generated through market making. Based on the Cartea et al.\cite{cartea2015algorithmic} framework, we incorporate market impact in interbank orders while excluding it for internal market-making transactions. Our model aims to optimize the balance between interbank and internal liquidity, reducing market impact and improving execution efficiency.2024-12-28T03:07:57Z12 pages, 3 figuresYusuke Morimotohttp://arxiv.org/abs/2505.05113v3Loss-Versus-Rebalancing under Deterministic and Generalized block-times2025-05-15T10:51:35ZAlthough modern blockchains almost universally produce blocks at fixed intervals, existing models still lack an analytical formula for the loss-versus-rebalancing (LVR) incurred by Automated Market Makers (AMMs) liquidity providers in this setting. Leveraging tools from random walk theory, we derive the following closed-form approximation for the per block per unit of liquidity expected LVR under constant block time:
\[ \overline{\mathrm{ARB}}= \frac{\,σ_b^{2}} {\,2+\sqrt{2π}\,γ/(|ζ(1/2)|\,σ_b)\,}+O\!\bigl(e^{-\mathrm{const}\tfracγ{σ_b}}\bigr)\;\approx\; \frac{σ_b^{2}}{\,2 + 1.7164\,γ/σ_b}, \] where $σ_b$ is the intra-block asset volatility, $γ$ the AMM spread and $ζ$ the Riemann Zeta function. Our large Monte Carlo simulations show that this formula is in fact quasi-exact across practical parameter ranges.
Extending our analysis to arbitrary block-time distributions as well, we demonstrate both that--under every admissible inter-block law--the probability that a block carries an arbitrage trade converges to a universal limit, and that only constant block spacing attains the asymptotically minimal LVR. This shows that constant block intervals provide the best possible protection against arbitrage for liquidity providers.2025-05-08T10:30:24Z16 pages, 2 figuresAlex NezlobinMartin Tassyhttp://arxiv.org/abs/2505.09423v1FLUXLAYER: High-Performance Design for Cross-chain Fragmented Liquidity2025-05-14T14:23:56ZAutonomous Market Makers (AMMs) rely on arbitrage to facilitate passive price updates. Liquidity fragmentation poses a complex challenge across different blockchain networks.
This paper proposes FluxLayer, a solution to mitigate fragmented liquidity and capture the maximum extractable value (MEV) in a cross-chain environment. FluxLayer is a three-layer framework that integrates a settlement layer, an intent layer, and an under-collateralised leverage lending vault mechanism. Our evaluation demonstrates that FluxLayer can effectively enhance cross-chain MEV by capturing more arbitrage opportunities, reducing costs, and improving overall liquidity.2025-05-14T14:23:56ZXin LaoShiping ChenQin Wanghttp://arxiv.org/abs/2505.22678v1An Efficient deep learning model to Predict Stock Price Movement Based on Limit Order Book2025-05-14T12:46:21ZIn high-frequency trading (HFT), leveraging limit order books (LOB) to model stock price movements is crucial for achieving profitable outcomes. However, this task is challenging due to the high-dimensional and volatile nature of the original data. Even recent deep learning models often struggle to capture price movement patterns effectively, particularly without well-designed features. We observed that raw LOB data exhibits inherent symmetry between the ask and bid sides, and the bid-ask differences demonstrate greater stability and lower complexity compared to the original data. Building on this insight, we propose a novel approach in which leverages the Siamese architecture to enhance the performance of existing deep learning models. The core idea involves processing the ask and bid sides separately using the same module with shared parameters. We applied our Siamese-based methods to several widely used strong baselines and validated their effectiveness using data from 14 military industry stocks in the Chinese A-share market. Furthermore, we integrated multi-head attention (MHA) mechanisms with the Long Short-Term Memory (LSTM) module to investigate its role in modeling stock price movements. Our experiments used raw data and widely used Order Flow Imbalance (OFI) features as input with some strong baseline models. The results show that our method improves the performance of strong baselines in over 75$% of cases, excluding the Multi-Layer Perception (MLP) baseline, which performed poorly and is not considered practical. Furthermore, we found that Multi-Head Attention can enhance model performance, particularly over shorter forecasting horizons.2025-05-14T12:46:21ZJiahao YangRan FangMing ZhangJun Zhouhttp://arxiv.org/abs/2504.20349v3ClusterLOB: Enhancing Trading Strategies by Clustering Orders in Limit Order Books2025-05-10T02:46:12ZIn the rapidly evolving world of financial markets, understanding the dynamics of limit order book (LOB) is crucial for unraveling market microstructure and participant behavior. We introduce ClusterLOB as a method to cluster individual market events in a stream of market-by-order (MBO) data into different groups. To do so, each market event is augmented with six time-dependent features. By applying the K-means++ clustering algorithm to the resulting order features, we are then able to assign each new order to one of three distinct clusters, which we identify as directional, opportunistic, and market-making participants, each capturing unique trading behaviors. Our experimental results are performed on one year of MBO data containing small-tick, medium-tick, and large-tick stocks from NASDAQ. To validate the usefulness of our clustering, we compute order flow imbalances across each cluster within 30-minute buckets during the trading day. We treat each cluster's imbalance as a signal that provides insights into trading strategies and participants' responses to varying market conditions. To assess the effectiveness of these signals, we identify the trading strategy with the highest Sharpe ratio in the training dataset, and demonstrate that its performance in the test dataset is superior to benchmark trading strategies that do not incorporate clustering. We also evaluate trading strategies based on order flow imbalance decompositions across different market event types, including add, cancel, and trade events, to assess their robustness in various market conditions. This work establishes a robust framework for clustering market participant behavior, which helps us to better understand market microstructure, and inform the development of more effective predictive trading signals with practical applications in algorithmic trading and quantitative finance.2025-04-29T01:37:33ZYichi ZhangMihai CucuringuAlexander Y. ShestopaloffStefan Zohren