https://arxiv.org/api/UyVMUDAcLFGa2l7REz9rswUBHZU2026-03-26T11:02:52Z313027015http://arxiv.org/abs/2510.19126v1An Efficient Calibration Framework for Volatility Derivatives under Rough Volatility with Jumps2025-10-21T23:10:40ZWe present a fast and robust calibration method for stochastic volatility models that admit Fourier-analytic transform-based pricing via characteristic functions. The design is structure-preserving: we keep the original pricing transform and (i) split the pricing formula into data-independent inte- grals and a market-dependent remainder; (ii) precompute those data-independent integrals with GPU acceleration; and (iii) approximate only the remaining, market-dependent pricing map with a small neural network. We instantiate the workflow on a rough volatility model with tempered-stable jumps tailored to power-type volatility derivatives and calibrate it to VIX options with a global-to-local search. We verify that a pure-jump rough volatility model adequately captures the VIX dynamics, consistent with prior empirical findings, and demonstrate that our calibration method achieves high accuracy and speed.2025-10-21T23:10:40ZCode repository: https://github.com/TenghanZhong/GPU-NN-Option-CalibrationKeyuan WuTenghan ZhongYuxuan Ouyanghttp://arxiv.org/abs/2510.18995v1Optimized Multi-Level Monte Carlo Parametrization and Antithetic Sampling for Nested Simulations2025-10-21T18:20:29ZEstimating risk measures such as large loss probabilities and Value-at-Risk is fundamental in financial risk management and often relies on computationally intensive nested Monte Carlo methods. While Multi-Level Monte Carlo (MLMC) techniques and their weighted variants are typically more efficient, their effectiveness tends to deteriorate when dealing with irregular functions, notably indicator functions, which are intrinsic to these risk measures. We address this issue by introducing a novel MLMC parametrization that significantly improves performance in practical, non-asymptotic settings while maintaining theoretical asymptotic guarantees. We also prove that antithetic sampling of MLMC levels enhances efficiency regardless of the regularity of the underlying function. Numerical experiments motivated by the calculation of economic capital in a life insurance context confirm the practical value of our approach for estimating loss probabilities and quantiles, bridging theoretical advances and practical requirements in financial risk estimation.2025-10-21T18:20:29Z37 pages, 4 figuresAlexandre BoumezouedAdel CherchaliVincent LemaireGilles PagèsMathieu Truchttp://arxiv.org/abs/2503.13544v7Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio Optimization2025-10-21T04:53:27ZWe propose Decision by Supervised Learning (DSL), a practical framework for robust portfolio optimization. DSL reframes portfolio construction as a supervised learning problem: models are trained to predict optimal portfolio weights, using cross-entropy loss and portfolios constructed by maximizing the Sharpe or Sortino ratio. To further enhance stability and reliability, DSL employs Deep Ensemble methods, substantially reducing variance in portfolio allocations. Through comprehensive backtesting across diverse market universes and neural architectures, shows superior performance compared to both traditional strategies and leading machine learning-based methods, including Prediction-Focused Learning and End-to-End Learning. We show that increasing the ensemble size leads to higher median returns and more stable risk-adjusted performance. The code is available at https://github.com/DSLwDE/DSLwDE.2025-03-16T10:57:45Z8 pages, 3 figures, Accepted at CIKM 2025 FinAI WorkshopJuhyeong KimSungyoon ChoiYoungbin LeeYejin KimYongmin ChoiYongjae Leehttp://arxiv.org/abs/2510.10343v2Learning the Exact SABR Model2025-10-20T22:32:09ZThe SABR model is a cornerstone of interest rate volatility modeling, but its practical application relies heavily on the analytical approximation by Hagan et al., whose accuracy deteriorates for high volatility, long maturities, and out-of-the-money options, admitting arbitrage. While machine learning approaches have been proposed to overcome these limitations, they have often been limited by simplified SABR dynamics or a lack of systematic validation against the full spectrum of market conditions.
We develop a novel SABR DNN, a specialized Artificial Deep Neural Network (DNN) architecture that learns the true SABR stochastic dynamics using an unprecedented large training dataset (more than 200 million points) of interest rate Cap/Floor volatility surfaces, including very long maturities (30Y) and extreme strikes consistently with market quotations. Our dataset is obtained via high-precision unbiased Monte Carlo simulation of a special scaled shifted-SABR stochastic dynamics, which allows dimensional reduction without any loss of generality.
Our SABR DNN provides arbitrage-free calibration of real market volatility surfaces and Cap/Floor prices for any maturity and strike with negligible computational effort and without retraining across business dates. Our results fully address the gaps in the previous machine learning SABR literature in a systematic and self-consistent way, and can be extended to cover any interest rate European options in different rate tenors and currencies, thus establishing a comprehensive functional SABR framework that can be adopted for daily trading and risk management activities.2025-10-11T21:06:13ZMain paper 23 pages, Appendices 12 pages, 37 references, 10 figures, 14 tables. Revised x-y scales in figure 3 and fixed minor typosGiorgia RensiPietro RossiMarco Bianchettihttp://arxiv.org/abs/2506.20930v2Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market2025-10-20T14:32:07ZWe propose a hybrid quantum-classical reinforcement learning framework for sector rotation in the Taiwan stock market. Our system employs Proximal Policy Optimization (PPO) as the backbone algorithm and integrates both classical architectures (LSTM, Transformer) and quantum-enhanced models (QNN, QRWKV, QASA) as policy and value networks. An automated feature engineering pipeline extracts financial indicators from capital share data to ensure consistent model input across all configurations. Empirical backtesting reveals a key finding: although quantum-enhanced models consistently achieve higher training rewards, they underperform classical models in real-world investment metrics such as cumulative return and Sharpe ratio. This discrepancy highlights a core challenge in applying reinforcement learning to financial domains -- namely, the mismatch between proxy reward signals and true investment objectives. Our analysis suggests that current reward designs may incentivize overfitting to short-term volatility rather than optimizing risk-adjusted returns. This issue is compounded by the inherent expressiveness and optimization instability of quantum circuits under Noisy Intermediate-Scale Quantum (NISQ) constraints. We discuss the implications of this reward-performance gap and propose directions for future improvement, including reward shaping, model regularization, and validation-based early stopping. Our work offers a reproducible benchmark and critical insights into the practical challenges of deploying quantum reinforcement learning in real-world finance.2025-06-26T01:29:19ZChi-Sheng ChenXinyu ZhangYa-Chuan Chenhttp://arxiv.org/abs/2410.23587v4Moments by Integrating the Moment-Generating Function2025-10-20T01:33:59ZWe introduce a novel method for obtaining a wide variety of moments of any random variable with a well-defined moment-generating function (MGF). We derive new expressions for fractional moments and fractional absolute moments, both central and non-central moments. The expressions are relatively simple integrals that involve the MGF, but do not require its derivatives. We label the new method CMGF because it uses a complex extension of the MGF and can be used to obtain complex moments. We illustrate the new method with three applications where the MGF is available in closed-form, while the corresponding densities and the derivatives of the MGF are either unavailable or very difficult to obtain.2024-10-31T02:58:56ZPeter Reinhard HansenChen Tonghttp://arxiv.org/abs/2510.16636v1A three-step machine learning approach to predict market bubbles with financial news2025-10-18T20:31:31ZThis study presents a three-step machine learning framework to predict bubbles in the S&P 500 stock market by combining financial news sentiment with macroeconomic indicators. Building on traditional econometric approaches, the proposed approach predicts bubble formation by integrating textual and quantitative data sources. In the first step, bubble periods in the S&P 500 index are identified using a right-tailed unit root test, a widely recognized real-time bubble detection method. The second step extracts sentiment features from large-scale financial news articles using natural language processing (NLP) techniques, which capture investors' expectations and behavioral patterns. In the final step, ensemble learning methods are applied to predict bubble occurrences based on high sentiment-based and macroeconomic predictors. Model performance is evaluated through k-fold cross-validation and compared against benchmark machine learning algorithms. Empirical results indicate that the proposed three-step ensemble approach significantly improves predictive accuracy and robustness, providing valuable early warning insights for investors, regulators, and policymakers in mitigating systemic financial risks.2025-10-18T20:31:31ZAbraham Atsiwohttp://arxiv.org/abs/2506.09080v2FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making2025-10-17T11:11:12ZFinancial decision-making presents unique challenges for language models, demanding temporal reasoning, adaptive risk assessment, and responsiveness to dynamic events. While large language models (LLMs) show strong general reasoning capabilities, they often fail to capture behavioral patterns central to human financial decisions-such as expert reliance under information asymmetry, loss-averse sensitivity, and feedback-driven temporal adjustment. We propose FinHEAR, a multi-agent framework for Human Expertise and Adaptive Risk-aware reasoning. FinHEAR orchestrates specialized LLM-based agents to analyze historical trends, interpret current events, and retrieve expert-informed precedents within an event-centric pipeline. Grounded in behavioral economics, it incorporates expert-guided retrieval, confidence-adjusted position sizing, and outcome-based refinement to enhance interpretability and robustness. Empirical results on curated financial datasets show that FinHEAR consistently outperforms strong baselines across trend prediction and trading tasks, achieving higher accuracy and better risk-adjusted returns.2025-06-10T04:06:51ZJiaxiang ChenMingxi ZouZhuo WangQifan WangDongning SunChi ZhangZenglin Xuhttp://arxiv.org/abs/2305.09166v10Finite-Difference Solution Ansatz approach in Least-Squares Monte Carlo2025-10-17T02:50:00ZThis article presents a simple but effective and efficient approach to improve the accuracy and stability of Least-Squares Monte Carlo. The key idea is to construct the ansatz of conditional expected continuation payoff using the finite-difference solution from one dimension, to be used in linear regression. This approach bridges between solving backward partial differential equations and Monte Carlo simulation, aiming at achieving the best of both worlds. In a general setting encompassing both local and stochastic volatility models, the ansatz is proven to act as a control variate, reducing the mean squared error, thereby leading to a reduction of the final pricing error. We illustrate the technique with realistic examples including Bermudan options, worst of issuer callable notes and expected positive exposure on European options under valuation adjustments.2023-05-16T04:52:10ZThe Journal of Computational Finance 29(2), 67-121, (2025)Jiawei Huo10.21314/JCF.2025.008http://arxiv.org/abs/2510.15205v1Toward Black Scholes for Prediction Markets: A Unified Kernel and Market Maker's Handbook2025-10-17T00:18:29ZPrediction markets, such as Polymarket, aggregate dispersed information into tradable probabilities, but they still lack a unifying stochastic kernel comparable to the one options gained from Black-Scholes. As these markets scale with institutional participation, exchange integrations, and higher volumes around elections and macro prints, market makers face belief volatility, jump, and cross-event risks without standardized tools for quoting or hedging. We propose such a foundation: a logit jump-diffusion with risk-neutral drift that treats the traded probability p_t as a Q-martingale and exposes belief volatility, jump intensity, and dependence as quotable risk factors. On top, we build a calibration pipeline that filters microstructure noise, separates diffusion from jumps using expectation-maximization, enforces the risk-neutral drift, and yields a stable belief-volatility surface. We then define a coherent derivative layer (variance, correlation, corridor, and first-passage instruments) analogous to volatility and correlation products in option markets. In controlled experiments on synthetic risk-neutral paths and real event data, the model reduces short-horizon belief-variance forecast error relative to diffusion-only and probability-space baselines, supporting both causal calibration and economic interpretability. Conceptually, the logit jump-diffusion kernel supplies an implied-volatility analogue for prediction markets: a tractable, tradable language for quoting, hedging, and transferring belief risk across venues such as Polymarket.2025-10-17T00:18:29ZShaw Dalenhttp://arxiv.org/abs/2510.14418v1Wariness and Poverty Traps2025-10-16T08:22:28ZWe investigate the effects of wariness (defined as individuals' concern for their minimum utility over time) on poverty traps and equilibrium multiplicity in an overlapping generations (OLG) model. We explore conditions under which (i) wariness amplifies or mitigates the likelihood of poverty traps in the economy and (ii) it gives rise to multiple intertemporal equilibria. Furthermore, we conduct comparative statics to characterize these effects and to examine how the interplay between wariness, productivity, and factor substitutability influences the dynamics of the economy.2025-10-16T08:22:28ZHai Ha PhamEM NormandieNgoc-Sang PhamEM Normandiehttp://arxiv.org/abs/2509.24449v2Efficient simulation of prices for European call options under Heston stochastic-local volatility model: a comparison of methods2025-10-15T11:11:03ZThe Heston stochastic-local volatility model, consisting of a asset price process and a Cox--Ingersoll--Ross-type variance process, offers a wide range of applications in the financial industry. The pursuit for efficient model evaluation has been assiduously ongoing and central to which is the numerical simulation of CIR process. Different from the weakly convergent noncentral chi-squared approximation used in 25, this paper considers two strongly convergent and positivity-preserving methods for CIR process under Lamperti transformation, namely, the truncated Euler method and the backward Euler method. It should be noted that these two methods are completely different. The explicit truncated Euler method is computationally effective and remains robust under high volatility, while the implicit backward Euler method provides high computational accuracy and stable performance. Numerical experiments on European call options are presented to show the superiority of different methods.2025-09-29T08:31:52ZMeng caiTianze Lihttp://arxiv.org/abs/2410.07222v2Computing Systemic Risk Measures with Graph Neural Networks2025-10-14T12:49:25ZThis paper investigates systemic risk measures for stochastic financial networks of explicitly modelled bilateral liabilities. We extend the notion of systemic risk measures from Biagini, Fouque, Fritelli and Meyer-Brandis (2019) to graph structured data. In particular, we focus on an aggregation function that is derived from a market clearing algorithm proposed by Eisenberg and Noe (2001). In this setting, we show the existence of an optimal random allocation that distributes the overall minimal bailout capital and secures the network. We study numerical methods for the approximation of systemic risk and optimal random allocations. We propose to use permutation equivariant architectures of neural networks like graph neural networks (GNNs) and a class that we name (extended) permutation equivariant neural networks ((X)PENNs). We compare their performance to several benchmark allocations. The main feature of GNNs and (X)PENNs is that they are permutation equivariant with respect to the underlying graph data. In numerical experiments we find evidence that these permutation equivariant methods are superior to other approaches.2024-09-30T10:18:13Z50 pagesLukas GononThilo Meyer-BrandisNiklas Weberhttp://arxiv.org/abs/2308.01121v2An optimal transport approach for the multiple quantile hedging problem2025-10-14T12:39:52ZWe consider the multiple quantile hedging problem, which is a class of partial hedging problems containing as special examples the quantile hedging problem (F{ö}llmer \& Leukert 1999) and the PnL matching problem (introduced in Bouchard \& Vu 2012). In complete non-linear markets, we show that the problem can be reformulated as a kind of Monge optimal transport problem. Using this observation, we introduce a Kantorovitch version of the problem and prove that the value of both problems coincide. In the linear case, we thus obtain that the multiple quantile hedging problem can be seen as a semi-discrete optimal transport problem, for which we further introduce the dual problem. We then prove that there is no duality gap, allowing us to design a numerical method based on SGA algorithms to compute the multiple quantile hedging price.2023-08-02T13:04:13ZCyril BénézetENSIIE, LaMMEJean-François ChassagneuxLPSMMohan YangADIAhttp://arxiv.org/abs/2408.02477v2Existence, uniqueness and positivity of solutions to the Guyon-Lekeufack path-dependent volatility model with general kernels2025-10-14T09:31:11ZWe show the existence and uniqueness of a continuous solution to a path-dependent volatility model introduced by Guyon and Lekeufack (2023) to model the price of an equity index and its spot volatility. The considered model for the trend and activity features can be written as a Stochastic Volterra Equation (SVE) with non-convolutional and non-bounded kernels as well as non-Lipschitz coefficients. We first prove the existence and uniqueness of a solution to the SVE under integrability and regularity assumptions on the two kernels and under a condition on the second kernel weighting the past squared returns which ensures that the activity feature is bounded from below by a positive constant. Then, assuming in addition that the kernel weighting the past returns is of exponential type and that an inequality relating the logarithmic derivatives of the two kernels with respect to their second variables is satisfied, we show the positivity of the volatility process which is obtained as a non-linear function of the SVE's solution. We show numerically that the choice of an exponential kernel for the kernel weighting the past returns has little impact on the quality of model calibration compared to other choices and the inequality involving the logarithmic derivatives is satisfied by the calibrated kernels. These results extend those of Nutz and Valdevenito (2023).2024-08-05T14:00:55ZHervé AndrèsCERMICSBenjamin JourdainCERMICS, MATHRISK