https://arxiv.org/api/ClWMSkqS85htfnfODW6A13JjaXc2026-06-21T16:39:02Z323713515http://arxiv.org/abs/2604.12197v1Emergence of Statistical Financial Factors by a Diffusion Process2026-04-14T02:06:35ZFactor models characterize the joint behavior of large sets of financial assets through a smaller number of underlying drivers. We develop a network-based framework in which factors emerge naturally from the structure of interactions among assets rather than being imposed statistically. The market is modeled as a system of coupled iterated maps, where assets' return depends on its own past returns and those of related assets. Effectively modeling the influence of irrational traders whose decisions are based on the past movements of a collection of stocks. The interaction structure between stock returns is defined by a coupling matrix derived from an orthogonal transformation of a Laplacian matrix that gradually links initially isolated clusters into a fully connected network. Within this structure, stable patterns of co-movement arise and can be interpreted as financial factors. The relationship between the initial clustering and the number of observed factors is consistent with a center manifold reduction. We identify an optimal regime in which assets' variance is effectively explained by the set of factors produced by the network. Our framework offers a structural perspective based on interaction-based factor formation and dimension reduction in financial markets.2026-04-14T02:06:35Z20 pages, 8 figuresJose NegreteJaime Joel Ramoshttp://arxiv.org/abs/2108.00480v6Realised Volatility Forecasting: Machine Learning via Financial Word Embedding2026-04-13T23:00:19ZWe examine whether news can improve realised volatility forecasting using a modern yet operationally simple NLP framework. News text is transformed into embedding-based representations, and forecasts are evaluated both as a standalone, news-only model and as a complement to standard realised volatility benchmarks. In out-of-sample tests on a cross-section of stocks, news contains useful predictive information, with stronger effects for stock-related content and during high volatility days. Combining the news-based signal with a leading benchmark yields consistent improvements in statistical performance and economically meaningful gains, while explainability analysis highlights the news themes most relevant for volatility.2021-08-01T15:43:57ZEghbal RahimikiaStefan ZohrenSer-Huang Poon10.2139/ssrn.3895272http://arxiv.org/abs/2603.10559v2A Bipartite Graph Approach to U.S.-China Cross-Market Return Forecasting2026-04-13T20:21:52ZThis paper studies cross-market return predictability through a machine learning framework that preserves economic structure. Exploiting the non-overlapping trading hours of the U.S. and Chinese equity markets, we construct a directed bipartite graph that captures time-ordered predictive linkages between stocks across markets. Edges are selected via rolling-window hypothesis testing, and the resulting graph serves as a sparse, economically interpretable feature-selection layer for downstream machine learning models. We apply a range of regularized and ensemble methods to forecast open-to-close returns using lagged foreign-market information. Our results reveal a pronounced directional asymmetry: U.S. previous-close-to-close returns contain substantial predictive information for Chinese intraday returns, whereas the reverse effect is limited. This informational asymmetry translates into economically meaningful performance differences and highlights how structured machine learning frameworks can uncover cross-market dependencies while maintaining interpretability.2026-03-11T09:07:15ZJing LiuMaria GrithXiaowen DongMihai Cucuringuhttp://arxiv.org/abs/2508.14813v2Pricing Options on Forwards in Function-Valued Affine Stochastic Volatility Models2026-04-13T14:57:22ZWe study the pricing of European-style options written on forward contracts within function-valued infinite-dimensional affine stochastic volatility models. The dynamics of the underlying forward price curves are modeled within the Heath-Jarrow-Morton-Musiela framework as solution to a stochastic partial differential equation modulated by a stochastic volatility process. We analyze two classes of affine stochastic volatility models: (i) a Gaussian model governed by a finite-rank Wishart process, and (ii) a pure-jump affine model extending the Barndorff--Nielsen--Shephard framework with state-dependent jumps in the covariance component. For both models, we derive conditions for the existence of exponential moments and develop semi-closed Fourier-based pricing formulas for vanilla call and put options written on forward price curves. Our approach allows for tractable pricing in models with infinitely many risk factors, thereby capturing maturity-specific and term structure risk essential in forward markets.2025-08-20T16:04:56ZJian HeSven KarbachAsma Khedherhttp://arxiv.org/abs/2408.06531v2Adaptive Multilevel Stochastic Approximation of the Value-at-Risk2026-04-12T15:37:39ZCrépey, Frikha, and Louzi (2025) introduced a multilevel stochastic approximation scheme to compute the value-at-risk of a financial loss that is only simulatable by Monte Carlo. The best complexity of the scheme is in O($\varepsilon^{-\frac52}$), $\varepsilon>0$ being a prescribed accuracy, which is suboptimal compared to the canonical multilevel Monte Carlo performance. This suboptimality stems from the discontinuity ofthe Heaviside function involved in the biased stochastic gradient that is recursively evaluated to derive the value-at-risk. To mitigate this issue, this paper proposes and analyzes a multilevel stochastic approximation algorithm that adaptively selects the number of inner samples at each level, and proves that its best complexity is in O($\varepsilon^{-2}|\ln{\varepsilon}|^\frac52$). Our theoretical analysis is exemplified through numerical experiments.2024-08-12T23:32:07Z43 pages, 6 tables, 5 figures, 3 algorithmsStéphane CrépeyNoufel FrikhaAzar LouziJonathan Spencehttp://arxiv.org/abs/2304.01207v4A Multilevel Stochastic Approximation Algorithm for Value-at-Risk and Expected Shortfall Estimation2026-04-12T15:30:17ZWe propose a multilevel stochastic approximation (MLSA) scheme for the computation of the value-at-risk (VaR) and expected shortfall (ES) of a financial loss, which can only be computed via simulations conditionally on the realisation of future risk factors. Thus the problem of estimating its VaR and ES is nested in nature and can be viewed as an instance of stochastic approximation problems with biased innovations. In this framework, for a prescribed accuracy $\varepsilon$, the optimal complexity of a nested stochastic approximation algorithm is shown to be of the order $\varepsilon^{-3}$. To estimate the VaR, our MLSA algorithm attains an optimal complexity of the order $\varepsilon^{-2-δ}$, where $δ\in(0,1)$ is some parameter depending on the integrability degree of the loss, while to estimate the ES, the algorithm achieves an optimal complexity of the order $\varepsilon^{-2}|\ln{\varepsilon}|^2$. Numerical studies of the joint evolution of the error rate and the execution time demonstrate how our MLSA algorithm regains a significant amount of the performance lost due to the nested nature of the problem.2023-03-24T08:49:27Z50 pages, 3 figures, 4 tables, 3 algorithmsStéphane CrépeyLPSMNoufel FrikhaCESAzar LouziLPSM10.1007/s00780-025-00573-5http://arxiv.org/abs/2311.15333v5Asymptotic Error Analysis of Multilevel Stochastic Approximations for the Value-at-Risk and Expected Shortfall2026-04-12T15:23:45ZCrépey, Frikha, and Louzi (2025) introduced a nested stochastic approximation algorithm and its multilevel acceleration to compute the value-at-risk and expected shortfall of a random financial loss. We hereby establish central limit theorems for the renormalized estimation errors associated with both algorithms as well as their averaged versions. Our findings are substantiated through a numerical example.2023-11-26T15:39:22Z56 pages, 1 figure, 4 tablesStéphane CrépeyNoufel FrikhaAzar LouziGilles Pagès10.1214/24-EJP1246http://arxiv.org/abs/2604.08649v1PRAGMA: Revolut Foundation Model2026-04-09T18:00:00ZModern financial systems generate vast quantities of transactional and event-level data that encode rich economic signals. This paper presents PRAGMA, a family of foundation models for multi-source banking event sequences. Our approach pre-trains a Transformer-based architecture with masked modelling on a large-scale, heterogeneous banking event corpus using a self-supervised objective tailored to the discrete, variable-length nature of financial records. The resulting model supports a wide range of downstream tasks such as credit scoring, fraud detection, and lifetime value prediction: strong performance can be achieved by training a simple linear model on top of the extracted embeddings and can be further improved with lightweight fine-tuning. Through extensive evaluation on downstream tasks, we demonstrate that PRAGMA achieves superior performance across multiple domains directly from raw event sequences, providing a general-purpose representation layer for financial applications.2026-04-09T18:00:00ZMaxim OstroukhovRuslan MikhailovVladimir IashinArtem SokolovAndrei AkshonovVitaly ProtasovDmitrii BeloborodovVince MullinRoman Yokunda EnzmannGeorgios KolovosJason RendersPavel NesterovAnton Repushkohttp://arxiv.org/abs/2604.08180v1Quantum Computing for Financial Transformation: A Review of Optimisation, Pricing, Risk, Machine Learning, and Post-Quantum Security2026-04-09T12:35:53ZQuantum computing is becoming strategically relevant to finance because several core financial bottlenecks are already defined by combinatorial search, expectation estimation, rare-event analysis, representation learning, and long-horizon cryptographic resilience. This review examines that landscape across five connected domains: constrained portfolio optimisation, derivative pricing, tail-risk and scenario estimation, quantum machine learning, and post-quantum security. Rather than treating these topics as isolated demonstrations, the article studies them as linked layers of a financial-computation stack. Across all five domains, the review applies a common evaluative logic: identify the financial bottleneck, specify the relevant quantum primitive, compare it with an explicit classical benchmark, and assess the result under realistic implementation and governance constraints. The main conclusion is measured but consequential. The strongest near-term case for quantum finance lies in carefully designed hybrid workflows rather than blanket claims of universal advantage. Quantum optimisation is most credible when constrained search dominates; amplitude-estimation methods matter most when repeated expectation evaluation is the binding cost; quantum machine learning remains task dependent; and post-quantum cryptography is already strategically necessary because financial infrastructures must migrate before fault-tolerant attacks arrive. By combining system-level synthesis with locally reproducible small-scale case studies on simulated qubit registers, the article is intended both as a review of the field and as a handbook-style entry point for future work.2026-04-09T12:35:53Z134 pages, 6 figures. Review articleHui GongAkash SedaiThomas SchroederFrancesca Meddahttp://arxiv.org/abs/2402.17148v2Time series generation for option pricing on quantum computers using tensor network2026-04-09T00:31:14ZFinance, especially option pricing, is a promising industrial field that might benefit from quantum computing. While quantum algorithms for option pricing have been proposed, it is desired to devise more efficient implementations of costly operations in the algorithms, one of which is preparing a quantum state that encodes a probability distribution of the underlying asset price. In particular, in pricing a path-dependent option, we need to generate a state encoding a joint distribution of the underlying asset price at multiple time points, which is more demanding. To address these issues, we propose a novel approach that uses a Matrix Product State (MPS), which can be encoded into a state of qubits, as a generative model for time series generation. We focus on the training of such an MPS and present its procedure in detail. To validate our approach, taking the Heston model as a target, we conduct numerical experiments to generate time series in the model. Our findings demonstrate the capability of the MPS model to generate paths in the Heston model, highlighting its potential for path-dependent option pricing on quantum computers.2024-02-27T02:29:24Z18 pages, 3 figuresQuantum Mach. Intell. 8, 39 (2026)Nozomu KobayashiYoshiyuki SuimonKoichi Miyamoto10.1007/s42484-026-00342-3http://arxiv.org/abs/2512.14735v2PyFi: Toward Pyramid-like Financial Image Understanding for VLMs via Adversarial Agents2026-04-08T06:53:15ZThis paper proposes PyFi, a novel framework for pyramid-like financial image understanding that enables vision language models (VLMs) to reason through question chains in a progressive, simple-to-complex manner. At the core of PyFi is PyFi-600K, a dataset comprising 600K financial question-answer pairs organized into a reasoning pyramid: questions at the base require only basic perception, while those toward the apex demand increasing levels of capability in financial visual understanding and expertise. This data is scalable because it is synthesized without human annotations, using PyFi-adv, a multi-agent adversarial mechanism under the Monte Carlo Tree Search (MCTS) paradigm, in which, for each image, a challenger agent competes with a solver agent by generating question chains that progressively probe deeper capability levels in financial visual reasoning. Leveraging this dataset, we present fine-grained, hierarchical, and comprehensive evaluations of advanced VLMs in the financial domain. Moreover, fine-tuning Qwen2.5-VL-3B and Qwen2.5-VL-7B on the pyramid-structured question chains enables these models to answer complex financial questions by decomposing them into sub-questions with gradually increasing reasoning demands, yielding average accuracy improvements of 19.52% and 8.06%, respectively, on the dataset. All resources of code, dataset and models are available at: https://github.com/AgenticFinLab/PyFi .2025-12-11T06:04:33ZYuqun ZhangYuxuan ZhaoSijia Chenhttp://arxiv.org/abs/2512.00448v2Efficient Simulation and Calibration of the Rough Bergomi Model via Wasserstein Distance2026-04-08T03:07:29ZDespite the empirical success of the rough Bergomi (rBergomi) model in modeling volatility dynamics, its practical use remains challenging due to high computational complexity in both pricing and calibration arising from its non-Markovian structure. To address these difficulties, we develop an efficient computational framework. First, we propose a modified-sum-of-exponentials (mSOE) Monte Carlo scheme within the class of hybrid multifactor approximations. The method combines an exact treatment of the singular kernel over the first time step with a sum-of-exponentials approximation over the remaining time interval, and exact Gaussian simulation of the resulting multifactor components. For a fixed number of exponential terms, the method maintains linear online complexity with respect to the number of time steps. It achieves high pricing accuracy in numerical experiments, particularly for out-of-the-money options. Second, building on this pricing engine, we formulate a calibration approach based on distributional matching of the terminal underlying asset via the Wasserstein-1 distance. Instead of fitting option prices only at selected strikes, this method compares model-generated and market-implied terminal distributions through the Kantorovich-Rubinstein dual representation. Numerical experiments indicate that the mSOE scheme exhibits stable convergence, and the Wasserstein-based calibration scheme improves parameter recovery, optimization stability, and out-of-sample performance relative to conventional MSE-based fitting in the rBergomi setting considered in this paper.2025-11-29T11:25:49ZChangqing TengGuanglian Lihttp://arxiv.org/abs/2604.06068v1Beyond Black-Scholes: A Computational Framework for Option Pricing Using Heston, GARCH, and Jump Diffusion Models2026-04-07T16:48:26ZThis research addresses accurate option pricing by employing models beyond the traditional Black-Scholes framework. While Black-Scholes provides a closed-form solution, it is limited by assumptions of constant volatility, no dividends, and continuous price movements. To overcome these limitations, we use Monte Carlo simulation alongside the GARCH model, Heston stochastic volatility model, and Merton jump-diffusion model. The Black-Scholes-Monte Carlo method simulates diverse stock price paths using geometric Brownian motion. The GARCH model forecasts time-varying volatility from historical data. The Heston model incorporates stochastic volatility to capture volatility clustering and skew. The Merton jump-diffusion model adds sudden price jumps via a Poisson process. Results show the Heston model consistently produces estimates closer to market prices, while the Merton model performs well for volatile assets with sudden price movements. The GARCH model provides improved volatility forecasts for future option price prediction. All experiments used live market data from November 2024.2026-04-07T16:48:26Z10 pages, 7 figuresKarmanpartap Singh SidhuPranshi Saxenahttp://arxiv.org/abs/2510.15205v2Toward Black Scholes for Prediction Markets: A Unified Kernel and Market Maker's Handbook2026-04-06T04:29:37ZPrediction markets, such as Polymarket, aggregate dispersed information into tradable probabilities, but they still lack a unifying stochastic kernel comparable to the one options gained from Black-Scholes. As these markets scale with institutional participation, exchange integrations, and higher volumes around elections and macro prints, market makers face belief volatility, jump, and cross-event risks without standardized tools for quoting or hedging. We propose such a foundation: a logit jump-diffusion with risk-neutral drift that treats the traded probability p_t as a Q-martingale and exposes belief volatility, jump intensity, and dependence as quotable risk factors. On top, we build a calibration pipeline that filters microstructure noise, separates diffusion from jumps using expectation-maximization, enforces the risk-neutral drift, and yields a stable belief-volatility surface. We then define a coherent derivative layer (variance, correlation, corridor, and first-passage instruments) analogous to volatility and correlation products in option markets. In controlled experiments on synthetic risk-neutral paths and real event data, the model reduces short-horizon belief-variance forecast error relative to diffusion-only and probability-space baselines, supporting both causal calibration and economic interpretability. Conceptually, the logit jump-diffusion kernel supplies an implied-volatility analogue for prediction markets: a tractable, tradable language for quoting, hedging, and transferring belief risk across venues such as Polymarket.2025-10-17T00:18:29ZShaw Dalenhttp://arxiv.org/abs/2508.10208v2CATNet: A geometric deep learning approach for CAT bond spread prediction in the primary market2026-04-06T00:42:10ZTraditional models for pricing catastrophe (CAT) bonds struggle to capture the complex, relational data inherent in these instruments. This paper introduces CATNet, a novel framework that applies a geometric deep learning architecture, the Relational Graph Convolutional Network (R-GCN), to model the CAT bond primary market as a graph, leveraging its underlying network structure for spread prediction. Our analysis reveals that the CAT bond market exhibits the characteristics of a scale-free network, a structure dominated by a few highly connected and influential hubs. CATNet demonstrates higher predictive performance, significantly outperforming strong Random Forest and XGBoost benchmarks. Interpretability analysis confirms that the network's topological properties are not mere statistical artifacts; they are quantitative proxies for long-held industry intuition regarding issuer reputation, underwriter influence, and peril concentration. This research provides evidence that network connectivity is a key determinant of price, offering a new paradigm for risk assessment and proving that graph-based models can deliver both state-of-the-art accuracy and deeper, quantifiable market insights.2025-08-13T21:38:25ZDixon DomfehSaeid Safarveisi