Exact simulation scheme for the Ornstein-Uhlenbeck driven stochastic volatility model with the Karhunen-Loève expansions

2026-05-05T03:53:05Z

This study proposes a fast exact simulation scheme for the Ornstein-Uhlenbeck driven stochastic volatility model. With the Karhunen-Loève expansions, the stochastic volatility path (Ornstein-Uhlenbeck process) is expressed as a sine series, and the time integrals of volatility and variance are analytically derived as infinite series of independent normal random variables. The new method is several hundred times faster than the existing method using numerical transform inversion. The simulation variance is further reduced with conditional simulation and the control variate.
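
As a rough illustration of the sine-series idea, the sketch below uses the textbook Karhunen-Loève expansion of an Ornstein-Uhlenbeck bridge pinned to zero at both ends of [0, T], whose eigenfunctions are sines. This pinned-bridge setting is an assumption chosen for illustration and is not the scheme of the paper; the names `kappa`, `sigma`, `T`, and `n_terms` are hypothetical. It shows how the time integral of the path becomes a weighted sum of independent normal variables.

```python
import math
import random

def eigenvalue(kappa, sigma, T, n):
    # KL eigenvalue of the OU bridge (pinned at 0 on [0, T]) for sin(n*pi*t/T).
    return sigma**2 / (kappa**2 + (n * math.pi / T) ** 2)

def draw_coefficients(kappa, sigma, T, n_terms, rng):
    # Coefficient n is sqrt(lambda_n) * Z_n with Z_n i.i.d. standard normal.
    return [math.sqrt(eigenvalue(kappa, sigma, T, n)) * rng.gauss(0.0, 1.0)
            for n in range(1, n_terms + 1)]

def path_value(coeffs, t, T):
    # Truncated series X(t) = sum_n c_n * sqrt(2/T) * sin(n*pi*t/T).
    scale = math.sqrt(2.0 / T)
    return sum(c * scale * math.sin(n * math.pi * t / T)
               for n, c in enumerate(coeffs, start=1))

def time_integral(coeffs, T):
    # Integral of X over [0, T], term by term: the integral of sin(n*pi*t/T)
    # is 2T/(n*pi) for odd n and 0 for even n.
    scale = math.sqrt(2.0 / T)
    return sum(c * scale * 2.0 * T / (n * math.pi)
               for n, c in enumerate(coeffs, start=1) if n % 2 == 1)

def series_variance(kappa, sigma, T, t, n_terms):
    # Var[X(t)] implied by the truncated expansion.
    return sum(eigenvalue(kappa, sigma, T, n) * (2.0 / T)
               * math.sin(n * math.pi * t / T) ** 2
               for n in range(1, n_terms + 1))

def exact_variance(kappa, sigma, T, t):
    # Closed-form variance of the OU bridge at time t, for comparison.
    return (sigma**2 * math.sinh(kappa * t) * math.sinh(kappa * (T - t))
            / (kappa * math.sinh(kappa * T)))

if __name__ == "__main__":
    rng = random.Random(0)
    coeffs = draw_coefficients(1.5, 0.4, 2.0, 200, rng)
    print(path_value(coeffs, 1.0, 2.0), time_integral(coeffs, 2.0))
```

Comparing `series_variance` with `exact_variance` gives a quick check on how many terms a chosen truncation level needs.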

Quantum Monte Carlo algorithm for option pricing and its complexity analysis

2026-05-03T15:26:54Z

In this paper we provide a quantum Monte Carlo algorithm to solve multidimensional Black-Scholes PDEs with correlation for option pricing. The payoff function of the option is of general form and is only required to be continuous and piecewise affine, which covers most of the relevant payoff functions used in finance. We provide a rigorous error analysis and complexity analysis of our algorithm. In particular, we prove that the computational complexity of our algorithm is bounded polynomially in the space dimension $d$ of the PDE and the reciprocal of the prescribed accuracy $\varepsilon$. Moreover, we show that for payoff functions which are bounded, our algorithm indeed has a speed-up compared to classical Monte Carlo methods. Furthermore, we provide numerical simulations in two dimensions using our developed package within the Qiskit framework tailored to price continuous piecewise affine options with respect to the Black-Scholes model, as well as discuss the potential extension of the numerical simulations to arbitrary space dimension.
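
For orientation, here is a minimal classical Monte Carlo baseline of the kind such a speed-up would be measured against. It is not the quantum algorithm from the paper and does not use Qiskit. It prices a two-asset basket call, whose payoff is continuous and piecewise affine, under correlated Black-Scholes dynamics. The function names and all parameter values are hypothetical.

```python
import math
import random

def mc_basket_call(s0, sigma, rho, r, T, weights, strike, n_paths, seed=0):
    """Classical Monte Carlo price and standard error of a two-asset basket call."""
    rng = random.Random(seed)
    discount = math.exp(-r * T)
    total = total_sq = 0.0
    for _ in range(n_paths):
        # Correlated standard normals via a 2x2 Cholesky factor.
        z1 = rng.gauss(0.0, 1.0)
        z2 = rho * z1 + math.sqrt(1.0 - rho**2) * rng.gauss(0.0, 1.0)
        basket = 0.0
        for s, vol, w, z in zip(s0, sigma, weights, (z1, z2)):
            # Exact terminal value of geometric Brownian motion.
            basket += w * s * math.exp((r - 0.5 * vol**2) * T
                                       + vol * math.sqrt(T) * z)
        payoff = discount * max(basket - strike, 0.0)
        total += payoff
        total_sq += payoff * payoff
    mean = total / n_paths
    variance = total_sq / n_paths - mean**2
    return mean, math.sqrt(variance / n_paths)

def bs_call(s0, strike, r, vol, T):
    """Closed-form Black-Scholes call, used only as a one-asset sanity check."""
    d1 = (math.log(s0 / strike) + (r + 0.5 * vol**2) * T) / (vol * math.sqrt(T))
    d2 = d1 - vol * math.sqrt(T)
    cdf = lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    return s0 * cdf(d1) - strike * math.exp(-r * T) * cdf(d2)

if __name__ == "__main__":
    price, se = mc_basket_call((100.0, 100.0), (0.2, 0.3), 0.5, 0.05, 1.0,
                               (0.5, 0.5), 100.0, 100_000, seed=1)
    print(price, se)
```

Putting all the weight on the first asset collapses the basket to a single-asset call, so `bs_call` can be used to check the estimator. The returned standard error shrinks like `n_paths ** -0.5`, which is the classical sampling rate.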

ValueBlindBench: Agreement-Gated Stress Testing of LLM-Judged Investment Rationales Before Returns Are Observable

2026-05-03T14:44:55Z

LLM-based financial agents increasingly produce investment rationales before the outcomes needed to evaluate them are observable. This creates a delayed-ground-truth evaluation problem: realized returns remain the eventual arbiter of investment quality, but they arrive too late and are too noisy to guide many model-development and governance decisions. LLM judges offer a tempting shortcut for pre-deployment evaluation of AI-finance systems, but unvalidated judges may reward verbosity, confidence, or rubric mimicry rather than financial judgment. This paper introduces ValueBlindBench, a preregistered agreement-gated stress-test protocol for deciding when LLM-judged investment-rationale claims are publishable, qualified, or invalid. In a controlled market-state capital-allocation prototype with 1,000 honest decision cycles and 100 preregistered adversarial controls (1,100 trajectories, 5,500 judge calls), ValueBlindBench clears the aggregate agreement gate at $\bar{\kappa}_w = 0.7168$ but prevents several overclaims. Lower-rank systems collapse into a tie-class, one rubric dimension fails the per-dimension gate (constraint_awareness, $\bar{\kappa}_w = 0.2022$), single-judge rankings are family-dependent, and terse-correct rationales receive a $\Delta = -2.81$ rubric-point penalty relative to honest rationales. A targeted anchor-specificity probe further shows that financial constructs such as constraint awareness are operationally load-bearing. The scientific object is therefore not a leaderboard and not a claim to measure true investment skill. ValueBlindBench is a pre-calibration metrology layer for AI-finance evaluation: it governs whether a proposed LLM-judge-based investment-rationale claim is stable enough, agreed enough, and uncontaminated enough to be reported at all.
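
To make the agreement statistic concrete, the sketch below computes Cohen's weighted kappa for two judges and then averages it over judge pairs. Two things are assumptions here, since the abstract does not state them: the quadratic disagreement weights, and the reading of the bar as a mean over judge pairs. The function names are hypothetical.

```python
from itertools import combinations

def weighted_kappa(a, b, n_levels):
    """Cohen's weighted kappa for two lists of integer scores in 0..n_levels-1.

    Uses quadratic disagreement weights (an assumption, not taken from the paper).
    """
    if len(a) != len(b) or not a:
        raise ValueError("score lists must be non-empty and of equal length")
    n = len(a)
    weight = lambda i, j: ((i - j) / (n_levels - 1)) ** 2
    # Marginal score distributions of each judge.
    pa = [a.count(c) / n for c in range(n_levels)]
    pb = [b.count(c) / n for c in range(n_levels)]
    # Observed and chance-expected weighted disagreement.
    observed = sum(weight(x, y) for x, y in zip(a, b)) / n
    expected = sum(weight(i, j) * pa[i] * pb[j]
                   for i in range(n_levels) for j in range(n_levels))
    if expected == 0.0:
        # Both judges always give the same single score.
        return 1.0
    return 1.0 - observed / expected

def mean_pairwise_kappa(scores_by_judge, n_levels):
    """Average weighted kappa over all unordered pairs of judges."""
    pairs = list(combinations(scores_by_judge, 2))
    return sum(weighted_kappa(a, b, n_levels) for a, b in pairs) / len(pairs)

if __name__ == "__main__":
    judges = [[0, 1, 2, 2], [0, 1, 1, 2], [0, 1, 2, 2]]
    print(mean_pairwise_kappa(judges, n_levels=3))
```

An agreement gate in this spirit would compare the averaged value with a preregistered threshold, both in aggregate and per rubric dimension.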

First-passage horizons in horizontal visibility graphs: a rank-invariant estimator of path roughness for rough volatility models

2026-05-03T14:30:54Z

Horizontal visibility graphs (HVGs) encode the ordinal structure of time series and provide graph-local summaries of path topology. This article introduces L+(t), the forward visibility horizon at node t, with finite-sample terminal non-crossings treated as right-censored observations. For paths without ties, each uncensored L+(t) is identical to the first-passage time τ+(t) = inf{k ≥ 1 : x_{t+k} ≥ x_t}. For an i.i.d. sequence with a continuous distribution, the survival law is exactly Pr[L+ ≥ k] = 1/k, equivalent to Rényi's record statistic and implying infinite mean and variance. Hence roughness is estimated on a power-law survival scale through a single tail exponent θ. Combining the identity L+ = τ+ with discrete-grid persistence theory for fractional Brownian motion gives the prediction θ(H) = 1 − H. For rough Bergomi-type volatility, the same prediction is derived under an explicit persistence hypothesis for Riemann-Liouville fBm increments and verified numerically. In Monte-Carlo experiments (N = 10,000, T = 2^16), a Hill-MLE with Clauset-Shalizi-Newman threshold selection recovers θ(H) within one cross-replicate standard deviation for H ≤ 0.2 and reveals a positive finite-size bias for smoother paths. The rank-invariant, parameter-free estimator separates rough Bergomi volatility from classical Heston, GARCH, and FIGARCH benchmarks. Applied to daily FRED VIX data from 2000-2026, the rolling estimate is θ̂ = 0.91 ± 0.19 across 45 four-year windows and lies far below an overlapping-window i.i.d. Monte-Carlo null (p < 0.001). The statistic offers an ordinal diagnostic of roughness for financial volatility and other complex time-series systems.
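
The first-passage identity and the i.i.d. survival law 1/k can be checked with a short sketch. The code below computes τ+(t) directly with a monotonic stack, marks right-censored nodes with infinity, and estimates the survival function on nodes that have enough successors for the event to be decided. The function names are hypothetical, and the Hill-type tail estimation described in the abstract is not reproduced.

```python
import math
import random

def forward_horizons(x):
    """First index gap k >= 1 with x[t+k] >= x[t]; math.inf if right-censored."""
    horizons = [math.inf] * len(x)
    stack = []  # indices still waiting for a value that reaches their level
    for j, xj in enumerate(x):
        while stack and xj >= x[stack[-1]]:
            i = stack.pop()
            horizons[i] = j - i
        stack.append(j)
    return horizons

def survival(horizons, k):
    """Empirical Pr[L+ >= k] over nodes with at least k - 1 successors.

    For those nodes the event is decided even when the horizon is censored.
    """
    usable = horizons[: len(horizons) - (k - 1)]
    return sum(h >= k for h in usable) / len(usable)

if __name__ == "__main__":
    rng = random.Random(0)
    series = [rng.random() for _ in range(100_000)]
    h = forward_horizons(series)
    for k in (1, 2, 5, 10):
        print(k, survival(h, k), 1.0 / k)
```

Because only order comparisons between values are used, any strictly increasing transformation of the series leaves the horizons unchanged, which is the rank-invariance the abstract refers to.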

Identifying Risk Variables From Raw ESG Data Using Its Hierarchical Structure

2026-05-03T04:19:51Z

Environmental, Social, and Governance (ESG) data provides non-financial insights into corporations. In this study, we aim to identify relevant ESG raw variables to assess financial risk, measured by logarithmic volatility of return. We propose a framework specifically designed for ESG datasets characterized by a hierarchical data structure and a significantly larger number of variables than observations. We show that raw variables selected by the proposed framework are significantly more relevant to financial risk than aggregated ESG scores. Furthermore, these selected risk variables provide additional insights beyond the traditional financial factors. We validate the robustness of this framework using out-of-sample data. We illustrate our framework using company data from various sectors of the US economy. We further identify the specific ESG risk variables relevant to large and small companies within each sector.

SBCA: Cross-Modal BERT-driven Actor-Critic for Multi-Asset Portfolio Optimization

2026-05-02T11:16:01Z

Portfolio optimization is constrained by linear assumptions and insufficient integration of multi-modal information in traditional models. This paper proposes a cross-modal BERT-driven Actor-Critic framework, SBCA, for multi-asset portfolio optimization to address the deficiencies of existing deep reinforcement learning (DRL) methods in fusing price data and financial text sentiment, as well as their lack of practical trading constraints. The framework adopts a cross-modal gated fusion mechanism to adaptively integrate price time-series features and text semantic features, embeds downside risk and turnover penalty constraints into the reward function, and constructs a complete empirical system for validation. Experiments on 11-year U.S. stock multi-asset datasets show that SBCA outperforms equal weight, buy-and-hold and market benchmark strategies in portfolio value, annual return, Sharpe ratio and maximum drawdown. Ablation studies verify the complementary enhancement of the Actor-Critic mechanism and the cross-modal fusion module. Cost sensitivity analysis confirms the model's robustness under varying transaction costs. SBCA provides an effective and interpretable end-to-end solution for dynamic quantitative portfolio decision-making.

American Options Pricing under Heston Model via Curriculum Learning in Coupled PINNs

2026-05-01T07:38:16Z

In American options, the early exercise feature allows the option to be exercised at any time prior to expiration. However, this flexibility introduces a challenge: the pricing model must value the option while simultaneously determining an unknown, time-varying exercise boundary. The Heston model is one of the most popular ways to model real market behavior because it allows volatility to change over time. However, unlike European options, there is no closed-form solution for American options under the Heston model, so we have to use numerical methods. In this paper, we propose a novel approach to solving the stochastic Heston partial differential equation for American options, using coupled physics-informed neural networks (PINNs) to predict both the option price and the free boundary, while employing curriculum learning and adaptive resampling to stabilize model training. Our work builds on recent deep learning methods but introduces a more effective training strategy to address the limitations of these approaches. The numerical results demonstrate the effectiveness of the proposed learning framework, providing a robust and efficient alternative to pricing American options, enabling rapid inference and accurate estimation under stochastic volatility.

Randomized Kolmogorov-Smirnov Analysis of Volatility Roughness

2026-05-01T06:15:49Z

We introduce a novel distribution-based estimator for the Hurst parameter of log-volatility, leveraging the Kolmogorov-Smirnov statistic to assess the scaling behavior of entire distributions rather than individual moments. To address the temporal dependence of financial volatility, we propose a random permutation procedure that effectively removes serial correlation while preserving marginal distributions, enabling the rigorous application of the KS framework to dependent data. We establish the asymptotic variance of the estimator, useful for inference and confidence interval construction. From a computational standpoint, we show that derivative-free optimization methods, particularly Brent's method and the Nelder-Mead simplex, achieve substantial efficiency gains relative to grid search while maintaining estimation accuracy. Empirical analysis of the CBOE VIX index and the 5-minute realized volatility of the S&P 500 reveals a statistically significant hierarchy of roughness, with implied volatility smoother than realized volatility. Both measures, however, exhibit Hurst exponents well below one-half, reinforcing the rough volatility paradigm and highlighting the open challenge of disentangling local roughness from long-memory effects in fractional modeling.
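
A rough sketch of a distribution-based scaling estimator follows. It is an assumed construction and not the estimator of the paper. Under self-similarity, increments at lag m rescaled by m^(-H) should have the same distribution as lag-1 increments, so H is chosen to minimize the two-sample Kolmogorov-Smirnov distance between them. Golden-section search stands in for Brent's method to keep the example within the standard library. The random permutation step and the asymptotic variance are omitted, and all names are hypothetical.

```python
import math
import random

def ks_distance(sorted_a, sorted_b):
    """Two-sample Kolmogorov-Smirnov statistic for two already sorted samples."""
    i = j = 0
    na, nb = len(sorted_a), len(sorted_b)
    d = 0.0
    while i < na and j < nb:
        v = min(sorted_a[i], sorted_b[j])
        while i < na and sorted_a[i] <= v:
            i += 1
        while j < nb and sorted_b[j] <= v:
            j += 1
        d = max(d, abs(i / na - j / nb))
    return d

def estimate_hurst(path, lag, lo=0.01, hi=0.99, tol=1e-3):
    """Pick H minimizing the KS distance between lag-1 and rescaled lag-m increments."""
    base = sorted(path[t + 1] - path[t] for t in range(len(path) - 1))
    # Non-overlapping coarse increments; a positive rescaling keeps them sorted.
    coarse = sorted(path[t + lag] - path[t]
                    for t in range(0, len(path) - lag, lag))

    def objective(h):
        scale = lag ** (-h)
        return ks_distance(base, [scale * v for v in coarse])

    # Golden-section search: derivative-free, one new evaluation per step.
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c, d = b - inv_phi * (b - a), a + inv_phi * (b - a)
    fc, fd = objective(c), objective(d)
    while b - a > tol:
        if fc < fd:
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = objective(c)
        else:
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = objective(d)
    return 0.5 * (a + b)

if __name__ == "__main__":
    rng = random.Random(3)
    walk = [0.0]
    for _ in range(50_000):
        walk.append(walk[-1] + rng.gauss(0.0, 1.0))
    print(estimate_hurst(walk, lag=10))  # a Gaussian random walk has H = 0.5
```

The search needs roughly fifteen objective evaluations for a tolerance of 1e-3, whereas a grid with the same resolution would need about a thousand.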

Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification

2026-04-30T15:16:04Z

This paper studies whether a lightweight supervised aggregator can combine diverse zero-shot large language model outputs into a stronger downstream signal for corporate disclosure classification. Zero-shot LLMs can read disclosures without task-specific fine-tuning, but their predictions often vary across prompt perspectives, model families, and confidence levels. I examine this problem with a multi-prompt framework in which three fixed zero-shot LLM classifiers read each disclosure from different financial perspectives and output a sentiment label, a confidence score, and a short rationale. A logistic meta-classifier then aggregates these outputs to predict next-day stock return direction. To reduce pretrained-model contamination, I restrict evaluation to a post-release sample of 9,860 U.S. corporate disclosures issued by large publicly traded firms between January 2025 and March 2026, after the release of the frozen base LLMs used in the experiment. Results show that the trained aggregator outperforms single classifiers, majority vote, confidence-weighted voting, a zero-shot LLM judge, and a FinBERT baseline. Balanced accuracy rises from 0.566 for the best single classifier to 0.606 for the trained aggregator. The gain is largest in mixed-signal disclosures where classifiers disagree. The results suggest that zero-shot LLM outputs contain complementary financial signals, while also showing that the strongest gains come from supervised aggregation rather than from zero-shot voting alone.

Improving Bayesian Optimization for Portfolio Management with an Adaptive Scheduling

2026-04-29T12:43:19Z

Existing black-box portfolio management systems are prevalent in the financial industry due to commercial and safety constraints, though their performance can fluctuate dramatically with changing market regimes. Evaluating these non-transparent systems is computationally expensive, as fixed budgets limit the number of possible observations. Therefore, achieving stable and sample-efficient optimization for these systems has become a critical challenge. This work presents a novel Bayesian optimization framework (TPE-AS) that improves search stability and efficiency for black-box portfolio models under these limited observation budgets. Standard Bayesian optimization, which solely maximizes expected return, can yield erratic search trajectories and misalign the surrogate model with the true objective, thereby wasting the limited evaluation budget. To mitigate these issues, we propose a weighted Lagrangian estimator that leverages an adaptive schedule and importance sampling. This estimator dynamically balances exploration and exploitation by incorporating both the maximization of model performance and the minimization of the variance of model observations. It guides the search from broad, performance-seeking exploration towards stable and desirable regions as the optimization progresses. Extensive experiments and ablation studies, which establish our proposed method as the primary approach and other configurations as baselines, demonstrate its effectiveness across four backtest settings with three distinct black-box portfolio management models.

Pricing with Passion: The Local Occupied Volatility (LOV) Model

2026-04-28T22:21:30Z

We introduce the Local Occupied Volatility (LOV) model that sits between Dupire's local volatility and fully path-dependent dynamics. By design, the LOV model ensures automatic calibration to European vanilla options, while offering the flexibility to capture stylized facts of volatility or fit additional instruments. This is achieved by tuning the occupation sensitivity function that quantifies the effect of path-dependent shocks on volatility. We validate the model through the joint American-European calibration of option chains on non-dividend-paying stocks.

Heath-Jarrow-Morton meet lifted Heston in energy markets for joint historical and implied calibration

2026-04-28T08:54:51Z

In energy markets, joint historical and implied calibration is of paramount importance for practitioners, yet notoriously challenging due to the need to align historical correlations of futures contracts with implied volatility smiles from the option market. We address this crucial problem with a multiplicative multi-factor Heath-Jarrow-Morton (HJM) model for forward curves, combined with a stochastic volatility factor coming from the lifted Heston model. We develop a sequential fast calibration procedure leveraging the Kemna-Vorst approximation of futures contracts: (i) historical correlations and the Variance Swap (VS) volatility term structure are captured through Level, Slope, and Curvature factors, (ii) the VS volatility term structure can then be corrected for a perfect match via a fixed-point algorithm, (iii) implied volatility smiles are calibrated using Fourier-based techniques. The main advantage of the proposed calibration framework is the decoupling of the calibration steps: each step tackles a simpler calibration subproblem and guarantees that the previously optimized parameters remain unchanged. Our model displays remarkable joint historical and implied calibration fits on the German power market and enables realistic interpolation within the implied volatility hypercube.

Yau's Affine-Normal Descent for Large-Scale Unrestricted Higher-Moment Portfolio Optimization

2026-04-28T08:42:43Z

Unrestricted mean-variance-skewness-kurtosis portfolio optimization can capture asymmetry and tail risk, but sample-moment formulations become computationally impractical when the asset universe is large: they produce dense nonconvex quartic objectives with prohibitive coskewness and cokurtosis tensors and anisotropic, ill-conditioned level sets. We develop a structure-exploiting algorithm based on Yau's affine-normal descent that follows affine-normal directions of the current level set while working directly with the return matrix. The method avoids explicit higher-order tensors and exploits the quartic structure for exact sample oracles, derivative evaluation, and exact line search. We also provide theory for the reduced simplex formulation, including regularity and convexity conditions that separate data-map geometry from investor preference coefficients. Computational results show a clear implementation split: a direct configuration is effective on the standard small benchmark, whereas a preconditioned conjugate-gradient configuration with stall recovery becomes the preferred large-scale implementation by the upper end of the hundreds and remains competitive as the asset universe moves into the thousands. On a 5-minute A-share panel with 5,440 stocks, the method makes direct full-universe comparisons with exact mean-variance portfolios feasible and shows on the baseline split that the incremental value of higher moments is strongest at moderate return targets.

Volatility time series modeling by single-qubit quantum circuit learning

2026-04-28T04:25:41Z

We employ single-qubit quantum circuit learning (QCL) to model the dynamics of volatility time series. To assess its effectiveness, we generate synthetic data using the Rational GARCH model, which is specifically designed to capture volatility asymmetry. Our results show that QCL-based volatility predictions preserve the negative return-volatility correlation, a hallmark of asymmetric volatility dynamics. Moreover, analysis of the Hurst exponent and multifractal characteristics indicates that the predicted series, like the original synthetic data, exhibits anti-persistent behavior and retains its multifractal structure.

Financial Market as a Self-Organized Ecosystem: Simulation via Learning with Heterogeneous Preferences

2026-04-27T02:38:38Z

Agent-based models provide a constructive approach to studying emergent dynamics in life-like systems composed of interacting, adaptive agents. Financial markets serve as a canonical example of such systems, where collective price dynamics arise from individual decision-making. In this modeling tradition, investor behavior has typically been captured by two distinct mechanisms -- learning and heterogeneous preferences -- which have been explored as separate paradigms in prior studies. However, the impact of their joint modeling on the resulting collective dynamics remains largely unexplored. We develop a multi-agent reinforcement learning framework in which agents endowed with heterogeneous risk aversion, time discounting, and information access learn trading strategies interactively within an artificial market. The experiment reveals that (i) learning under heterogeneous preferences drives agents to develop functionally differentiated strategies through interaction, rather than trait-specific rules, resulting in role specialization, and (ii) the interactions by the differentiated agents are essential for the emergence of realistic market dynamics such as fat-tailed price fluctuations and volatility clustering. Overall, this study demonstrates that the joint design of heterogeneous preferences and learning mechanisms enables the synthesis of an artificial market in which adaptive interactions drive the self-organization of a market ecology, providing a computational realization of the Adaptive Market Hypothesis.