https://arxiv.org/api/U8OV8jvYUQ+2VlxFxV3Ser0jz6U 2026-04-06T06:36:55Z 2178 435 15 http://arxiv.org/abs/2409.00270v1 Bitcoin ETF: Opportunities and risk 2024-08-30T21:57:31Z The year 2024 witnessed a major development in the cryptocurrency industry with the long-awaited approval of spot Bitcoin exchange-traded funds (ETFs). This innovation provides investors with a new, regulated path to gain exposure to Bitcoin through a familiar investment vehicle (Kumar et al., 2024). However, unlike traditional ETFs that directly hold underlying assets, Bitcoin ETFs rely on a creation and redemption process managed by authorized participants (APs). This unique structure introduces distinct characteristics in terms of premium/discount behavior compared to traditional ETFs. This paper investigates the premium and discount patterns observed in Bitcoin ETFs during first four-month period (January 11th, 2024, to May 17th, 2024). Our analysis reveals that these patterns differ significantly from those observed in traditional index ETFs, potentially exposing investors to additional risk factors. By identifying and analyzing these risk factors associated with Bitcoin ETF premiums/discounts, this paper aims to achieve two key objectives: Enhance market understanding: Equip and market and investors with a deeper comprehension of the unique liquidity risks inherent in Bitcoin ETFs. Provide a clearer risk management frameworks: Offer a clearer perspective on the risk-return profile of digital asset ETFs, specifically focusing on Bitcoin ETFs. Through a thorough analysis of premium/discount behavior and the underlying factors contributing to it, this paper strives to contribute valuable insights for investors navigating the evolving landscape of digital asset investments 2024-08-30T21:57:31Z Di Wu http://arxiv.org/abs/2404.18470v2 ECC Analyzer: Extract Trading Signal from Earnings Conference Calls using Large Language Model for Stock Performance Prediction 2024-08-29T23:13:56Z In the realm of financial analytics, leveraging unstructured data, such as earnings conference calls (ECCs), to forecast stock volatility is a critical challenge that has attracted both academics and investors. While previous studies have used multimodal deep learning-based models to obtain a general view of ECCs for volatility predicting, they often fail to capture detailed, complex information. Our research introduces a novel framework: \textbf{ECC Analyzer}, which utilizes large language models (LLMs) to extract richer, more predictive content from ECCs to aid the model's prediction performance. We use the pre-trained large models to extract textual and audio features from ECCs and implement a hierarchical information extraction strategy to extract more fine-grained information. This strategy first extracts paragraph-level general information by summarizing the text and then extracts fine-grained focus sentences using Retrieval-Augmented Generation (RAG). These features are then fused through multimodal feature fusion to perform volatility prediction. Experimental results demonstrate that our model outperforms traditional analytical benchmarks, confirming the effectiveness of advanced LLM techniques in financial analysis. 2024-04-29T07:11:39Z 9 pages, 1 figures, 2 tables Yupeng Cao Zhi Chen Qingyun Pei Nathan Jinseok Lee K. P. Subbalakshmi Papa Momar Ndiaye http://arxiv.org/abs/2408.12553v1 Dynamic Pricing for Real Estate 2024-08-22T17:08:23Z We study a mathematical model for the optimization of the price of real estate (RE). This model can be characterised by a limited amount of goods, fixed sales horizon and presence of intermediate sales and revenue goals. We develop it as an enhancement and upgrade of the model presented by Besbes and Maglaras now also taking into account variable demand, time value of money, and growth of the objective value of Real Estate with the development stage. 2024-08-22T17:08:23Z Lev Razumovskiy Mariya Gerasimova Nikolay Karenin http://arxiv.org/abs/2408.11740v1 Less is more: AI Decision-Making using Dynamic Deep Neural Networks for Short-Term Stock Index Prediction 2024-08-21T16:04:31Z In this paper we introduce a multi-agent deep-learning method which trades in the Futures markets based on the US S&P 500 index. The method (referred to as Model A) is an innovation founded on existing well-established machine-learning models which sample market prices and associated derivatives in order to decide whether the investment should be long/short or closed (zero exposure), on a day-to-day decision. We compare the predictions with some conventional machine-learning methods namely, Long Short-Term Memory, Random Forest and Gradient-Boosted-Trees. Results are benchmarked against a passive model in which the Futures contracts are held (long) continuously with the same exposure (level of investment). Historical tests are based on daily daytime trading carried out over a period of 6 calendar years (2018-23). We find that Model A outperforms the passive investment in key performance metrics, placing it within the top quartile performance of US Large Cap active fund managers. Model A also outperforms the three machine-learning classification comparators over this period. We observe that Model A is extremely efficient (doing less and getting more) with an exposure to the market of only 41.95% compared to the 100% market exposure of the passive investment, and thus provides increased profitability with reduced risk. 2024-08-21T16:04:31Z 12 pages, 5 figures CJ Finnegan James F. McCann Salissou Moutari http://arxiv.org/abs/2408.11255v1 MEV Capture and Decentralization in Execution Tickets 2024-08-21T00:34:07Z We provide an economic model of Execution Tickets and use it to study the ability of the Ethereum protocol to capture MEV from block construction. We demonstrate that Execution Tickets extract all MEV when all buyers are homogeneous, risk neutral and face no capital costs. We also show that MEV capture decreases with risk aversion and capital costs. Moreover, when buyers are heterogeneous, MEV capture can be especially low and a single dominant buyer can extract much of the MEV. This adverse effect can be partially mitigated by the presence of a Proposer Builder Separation (PBS) mechanism, which gives ET buyers access to a market of specialized builders, but in practice centralization vectors still persist. With PBS, ETs are concentrated among those with the highest ex-ante MEV extraction ability and lowest cost of capital. We show how it is possible that large investors that are not builders but have substantial advantage in capital cost can come to dominate the ET market. 2024-08-21T00:34:07Z 15 pages, 1 figure. This paper was co-authored by researchers from Blockchain Capital, Ethereum Foundation (Robust Incentives Group), and the University of Florida Jonah Burian Davide Crapis Fahad Saleh http://arxiv.org/abs/2408.10016v1 High-Frequency Trading Liquidity Analysis | Application of Machine Learning Classification 2024-08-19T14:11:46Z This research presents a comprehensive framework for analyzing liquidity in financial markets, particularly in the context of high-frequency trading. By leveraging advanced machine learning classification techniques, including Logistic Regression, Support Vector Machine, and Random Forest, the study aims to predict minute-level price movements using an extensive set of liquidity metrics derived from the Trade and Quote (TAQ) data. The findings reveal that employing a broad spectrum of liquidity measures yields higher predictive accuracy compared to models utilizing a reduced subset of features. Key liquidity metrics, such as Liquidity Ratio, Flow Ratio, and Turnover, consistently emerged as significant predictors across all models, with the Random Forest algorithm demonstrating superior accuracy. This study not only underscores the critical role of liquidity in market stability and transaction costs but also highlights the complexities involved in short-interval market predictions. The research suggests that a comprehensive set of liquidity measures is essential for accurate prediction, and proposes future work to validate these findings across different stock datasets to assess their generalizability. 2024-08-19T14:11:46Z Sid Bhatia Sidharth Peri Sam Friedman Michelle Malen http://arxiv.org/abs/2409.14510v1 Crisis Alpha: A High-Performance Trading Algorithm Tested in Market Downturns 2024-08-18T19:35:07Z Forming quantitative portfolios using statistical risk models presents a significant challenge for hedge funds and portfolio managers. This research investigates three distinct statistical risk models to construct quantitative portfolios of 1,000 floating stocks in the US market. Utilizing five different investment strategies, these models are tested across four periods, encompassing the last three major financial crises: The Dot Com Bubble, Global Financial Crisis, and Covid-19 market downturn. Backtests leverage the CRSP dataset from January 1990 through December 2023. The results demonstrate that the proposed models consistently outperformed market excess returns across all periods. These findings suggest that the developed risk models can serve as valuable tools for asset managers, aiding in strategic decision-making and risk management in various economic conditions. 2024-08-18T19:35:07Z Maysam Khodayari Gharanchaei Reza Babazadeh http://arxiv.org/abs/2408.08866v1 High-Frequency Options Trading | With Portfolio Optimization 2024-08-16T17:49:21Z This paper explores the effectiveness of high-frequency options trading strategies enhanced by advanced portfolio optimization techniques, investigating their ability to consistently generate positive returns compared to traditional long or short positions on options. Utilizing SPY options data recorded in five-minute intervals over a one-month period, we calculate key metrics such as Option Greeks and implied volatility, applying the Binomial Tree model for American options pricing and the Newton-Raphson algorithm for implied volatility calculation. Investment universes are constructed based on criteria like implied volatility and Greeks, followed by the application of various portfolio optimization models, including Standard Mean-Variance and Robust Methods. Our research finds that while basic long-short strategies centered on implied volatility and Greeks generally underperform, more sophisticated strategies incorporating advanced Greeks, such as Vega and Rho, along with dynamic portfolio optimization, show potential in effectively navigating the complexities of the options market. The study highlights the importance of adaptability and responsiveness in dynamic portfolio strategies within the high-frequency trading environment, particularly under volatile market conditions. Future research could refine strategy parameters and explore less frequently traded options, offering new insights into high-frequency options trading and portfolio management. 2024-08-16T17:49:21Z Sid Bhatia http://arxiv.org/abs/2312.08927v5 Limit Order Book Dynamics and Order Size Modelling Using Compound Hawkes Process 2024-08-14T13:14:28Z Hawkes Process has been used to model Limit Order Book (LOB) dynamics in several ways in the literature however the focus has been limited to capturing the inter-event times while the order size is usually assumed to be constant. We propose a novel methodology of using Compound Hawkes Process for the LOB where each event has an order size sampled from a calibrated distribution. The process is formulated in a novel way such that the spread of the process always remains positive. Further, we condition the model parameters on time of day to support empirical observations. We make use of an enhanced non-parametric method to calibrate the Hawkes kernels and allow for inhibitory cross-excitation kernels. We showcase the results and quality of fits for an equity stock's LOB in the NASDAQ exchange and compare them against several baselines. Finally, we conduct a market impact study of the simulator and show the empirical observation of a concave market impact function is indeed replicated. 2023-12-14T13:36:15Z Presented at Market Microstructure 2023, Quantitative Finance Workshop 2024. Oxford SML Finance Seminar 2024 and Submitted to Finance Research Letters journal Konark Jain Nick Firoozye Jonathan Kochems Philip Treleaven http://arxiv.org/abs/2303.07393v4 Many learning agents interacting with an agent-based market model 2024-08-14T11:37:15Z We consider the dynamics and the interactions of multiple reinforcement learning optimal execution trading agents interacting with a reactive Agent-Based Model (ABM) of a financial market in event time. The model represents a market ecology with 3-trophic levels represented by: optimal execution learning agents, minimally intelligent liquidity takers, and fast electronic liquidity providers. The optimal execution agent classes include buying and selling agents that can either use a combination of limit orders and market orders, or only trade using market orders. The reward function explicitly balances trade execution slippage against the penalty of not executing the order timeously. This work demonstrates how multiple competing learning agents impact a minimally intelligent market simulation as functions of the number of agents, the size of agents' initial orders, and the state spaces used for learning. We use phase space plots to examine the dynamics of the ABM, when various specifications of learning agents are included. Further, we examine whether the inclusion of optimal execution agents that can learn is able to produce dynamics with the same complexity as empirical data. We find that the inclusion of optimal execution agents changes the stylised facts produced by ABM to conform more with empirical data, and are a necessary inclusion for ABMs investigating market micro-structure. However, including execution agents to chartist-fundamentalist-noise ABMs is insufficient to recover the complexity observed in empirical data. 2023-03-13T18:15:52Z 16 pages, 8 figures, 5 tables, enhanced discussion and figures Matthew Dicks Andrew Paskaramoorthy Tim Gebbie http://arxiv.org/abs/2407.21025v2 Reinforcement Learning in High-frequency Market Making 2024-08-12T16:51:02Z This paper establishes a new and comprehensive theoretical analysis for the application of reinforcement learning (RL) in high-frequency market making. We bridge the modern RL theory and the continuous-time statistical models in high-frequency financial economics. Different with most existing literature on methodological research about developing various RL methods for market making problem, our work is a pilot to provide the theoretical analysis. We target the effects of sampling frequency, and find an interesting tradeoff between error and complexity of RL algorithm when tweaking the values of the time increment $Δ$ $-$ as $Δ$ becomes smaller, the error will be smaller but the complexity will be larger. We also study the two-player case under the general-sum game framework and establish the convergence of Nash equilibrium to the continuous-time game equilibrium as $Δ\rightarrow0$. The Nash Q-learning algorithm, which is an online multi-agent RL method, is applied to solve the equilibrium. Our theories are not only useful for practitioners to choose the sampling frequency, but also very general and applicable to other high-frequency financial decision making problems, e.g., optimal executions, as long as the time-discretization of a continuous-time markov decision process is adopted. Monte Carlo simulation evidence support all of our theories. 2024-07-14T22:07:48Z Yuheng Zheng Zihan Ding http://arxiv.org/abs/2310.06079v4 Anomalous diffusion and price impact in the fluid-limit of an order book 2024-08-07T08:30:18Z We extend a Discrete Time Random Walk (DTRW) numerical scheme to simulate the anomalous diffusion of financial market orders in a simulated order book. Here using random walks with Sibuya waiting times to include a time-dependent stochastic forcing function with non-uniformly sampled times between order book events in the setting of fractional diffusion. This models the fluid limit of an order book by modelling the continuous arrival, cancellation and diffusion of orders in the presence of information shocks. We study the impulse response and stylised facts of orders undergoing anomalous diffusion for different forcing functions and model parameters. Concretely, we demonstrate the price impact for flash limit-orders and market orders and show how the numerical method generate kinks in the price impact. We use cubic spline interpolation to generate smoothed price impact curves. The work promotes the use of non-uniform sampling in the presence of diffusive dynamics as the preferred simulation method. 2023-10-09T18:36:08Z 36 pages, 23 figures, 4 tables; Accepted, Journal of Computational and Applied Mathematics. Clarified notation and added additional commentary and interpretation of this model Journal of Computational and Applied Mathematics (2024) Derick Diana Tim Gebbie 10.1016/j.cam.2024.116202 http://arxiv.org/abs/2408.03594v1 Forecasting High Frequency Order Flow Imbalance 2024-08-07T07:16:06Z Market information events are generated intermittently and disseminated at high speeds in real-time. Market participants consume this high-frequency data to build limit order books, representing the current bids and offers for a given asset. The arrival processes, or the order flow of bid and offer events, are asymmetric and possibly dependent on each other. The quantum and direction of this asymmetry are often associated with the direction of the traded price movement. The Order Flow Imbalance (OFI) is an indicator commonly used to estimate this asymmetry. This paper uses Hawkes processes to estimate the OFI while accounting for the lagged dependence in the order flow between bids and offers. Secondly, we develop a method to forecast the near-term distribution of the OFI, which can then be used to compare models for forecasting OFI. Thirdly, we propose a method to compare the forecasts of OFI for an arbitrarily large number of models. We apply the approach developed to tick data from the National Stock Exchange and observe that the Hawkes process modeled with a Sum of Exponential's kernel gives the best forecast among all competing models. 2024-08-07T07:16:06Z 21 pages Aditya Nittur Anantha Shashi Jain http://arxiv.org/abs/2408.03181v1 Correlation emergence in two coupled simulated limit order books 2024-08-06T13:29:55Z We use random walks to simulate the fluid limit of two coupled diffusive limit order books to model correlation emergence. The model implements the arrival, cancellation and diffusion of orders coupled by a pairs trader profiting from the mean-reversion between the two order books in the fluid limit for a Lit order book with vanishing boundary conditions and order volume conservation. We are able to demonstrate the recovery of an Epps effect from this. We discuss how various stylised facts depend on the model parameters and the numerical scheme and discuss the various strengths and weaknesses of the approach. We demonstrate how the Epps effect depends on different choices of time and price discretisation. This shows how an Epps effect can emerge without recourse to market microstructure noise relative to a latent model but can rather be viewed as an emergent property arising from trader interactions in a world of asynchronous events. 2024-08-06T13:29:55Z Dominic Bauer Derick Diana Tim Gebbie http://arxiv.org/abs/2408.02355v1 Quantile Regression using Random Forest Proximities 2024-08-05T10:02:33Z Due to the dynamic nature of financial markets, maintaining models that produce precise predictions over time is difficult. Often the goal isn't just point prediction but determining uncertainty. Quantifying uncertainty, especially the aleatoric uncertainty due to the unpredictable nature of market drivers, helps investors understand varying risk levels. Recently, quantile regression forests (QRF) have emerged as a promising solution: Unlike most basic quantile regression methods that need separate models for each quantile, quantile regression forests estimate the entire conditional distribution of the target variable with a single model, while retaining all the salient features of a typical random forest. We introduce a novel approach to compute quantile regressions from random forests that leverages the proximity (i.e., distance metric) learned by the model and infers the conditional distribution of the target variable. We evaluate the proposed methodology using publicly available datasets and then apply it towards the problem of forecasting the average daily volume of corporate bonds. We show that using quantile regression using Random Forest proximities demonstrates superior performance in approximating conditional target distributions and prediction intervals to the original version of QRF. We also demonstrate that the proposed framework is significantly more computationally efficient than traditional approaches to quantile regressions. 2024-08-05T10:02:33Z 9 pages, 5 figures, 3 tables Mingshu Li Bhaskarjit Sarmah Dhruv Desai Joshua Rosaler Snigdha Bhagat Philip Sommer Dhagash Mehta