http://arxiv.org/abs/2301.12072v3 Unbiased estimators for the Heston model with stochastic interest rates 2025-11-13T02:38:08Z

We combine the unbiased estimators of Rhee and Glynn (Operations Research: 63(5), 1026-1043, 2015) with the Heston model with stochastic interest rates. Specifically, we first develop a semi-exact log-Euler scheme for the Heston model with stochastic interest rates. Then, under mild assumptions, we show that the convergence rate in the $L^2$ norm is $O(h)$, where $h$ is the step size. The result applies to a large class of models, such as the Heston-Hull-White model, the Heston-CIR model and the Heston-Black-Karasinski model. Numerical experiments support our theoretical convergence rate.
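
The randomized-truncation idea behind the Rhee and Glynn estimators can be illustrated with a minimal sketch. It does not implement the semi-exact log-Euler scheme from the abstract: `level_approximation` is a hypothetical deterministic stand-in for a biased approximation at step size $h = 2^{-n}$, and the geometric level distribution is an assumed choice. The sketch only shows the single-term estimator $Z = (Y_N - Y_{N-1}) / P(N = n)$, whose expectation equals the limit of $Y_n$.

```python
import random


def level_approximation(n):
    """Hypothetical biased approximation Y_n that tends to 1 as n grows."""
    return 1.0 - 2.0 ** (-n)


def level_probability(n, r=0.5):
    """P(N = n) for a geometric law on n = 0, 1, 2, ... (assumed choice)."""
    return (1.0 - r) * r ** n


def sample_level(rng, r=0.5):
    """Draw N with P(N = n) = (1 - r) * r**n."""
    n = 0
    while rng.random() < r:
        n += 1
    return n


def single_term_estimate(n, r=0.5):
    """Single-term estimator Z = (Y_n - Y_{n-1}) / P(N = n), with Y_{-1} = 0."""
    previous = level_approximation(n - 1) if n > 0 else 0.0
    delta = level_approximation(n) - previous
    return delta / level_probability(n, r)


if __name__ == "__main__":
    rng = random.Random(0)
    draws = [single_term_estimate(sample_level(rng)) for _ in range(20000)]
    print(sum(draws) / len(draws))  # close to the limit value 1
```

The estimator has finite variance only if the squared level differences decay faster than the level probabilities, which is why a convergence rate in the $L^2$ norm matters for this construction.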

2023-01-28T03:03:48Z Chao Zheng Jiangtao Pan http://arxiv.org/abs/2511.09175v1 Proof-Carrying No-Arbitrage Surfaces: Constructive PCA-Smolyak Meets Chain-Consistent Diffusion with c-EMOT Certificates 2025-11-12T10:15:23Z

We study the construction of SPX-VIX (multi-product) option surfaces that are simultaneously free of static arbitrage and dynamically chain-consistent across maturities. Our method unifies constructive PCA-Smolyak approximation and a chain-consistent diffusion model with a tri-marginal, martingale-constrained entropic OT (c-EMOT) bridge on a single yardstick $\LtwoW$. We provide computable certificates with explicit constant dependence: a strong-convexity lower bound $\muhat$ controlled by the whitened kernel Gram's $\lambda_{\min}$, the entropic strength $\varepsilon$, and a martingale-moment radius; solver correctness via $\KKT$ and geometric decay $\rgeo$; and a $1$-Lipschitz metric projection guaranteeing Dupire/Greeks stability. Finally, we report an end-to-end log-additive risk bound $\RiskTotal$ and a Gate-V2 decision protocol that uses tolerance bands (from $\alpha$-mixing concentration) and tail-robust summaries, under which all tests pass: for example $\KKT=\CTwoKKT\ (\le 4!\!\times\!10^{-2})$, $\rgeo=\CTworgeo\ (\le 1.05)$, empirical Lipschitz $\CThreelipemp\!\le\!1.01$, and Dupire nonincrease certificate $=\texttt{True}$.

2025-11-12T10:15:23Z 51 pages; includes figures, algorithms, and appendices Jian'an Zhang http://arxiv.org/abs/2511.08571v1 Forecast-to-Fill: Benchmark-Neutral Alpha and Billion-Dollar Capacity in Gold Futures (2015-2025) 2025-11-11T18:52:06Z

We test whether simple, interpretable state variables (trend and momentum) can generate durable out-of-sample alpha in one of the world's most liquid assets, gold. Using a rolling 10-year training and 6-month testing walk-forward from 2015 to 2025 (2,793 trading days), we convert a smoothed trend-momentum regime signal into volatility-targeted, friction-aware positions through fractional, impact-adjusted Kelly sizing and ATR-based exits. Out of sample, the strategy delivers a Sharpe ratio of 2.88 and a maximum drawdown of 0.52 percent, net of 0.7 basis-point linear cost and a square-root impact term (gamma = 0.02). A regression on spot-gold returns yields a 43 percent annualized return (CAGR approximately 43 percent) and a 37 percent alpha (Sharpe = 2.88, IR = 2.09) at a 15 percent volatility target with beta approximately 0.03, confirming benchmark-neutral performance. Bootstrap confidence intervals ([2.49, 3.27]) and SPA tests (p = 0.000) confirm statistical significance and robustness to latency, reversal, and cost stress. We conclude that forecast-to-fill engineering, which links transparent signals to executable trades with explicit risk, cost, and impact control, can transform modest predictability into allocator-grade, billion-dollar-scalable alpha.
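
As a rough illustration of how fractional Kelly sizing and a volatility target can interact, here is a minimal sketch. The abstract does not give the sizing rule, so everything here is an assumption: the continuous-time Kelly fraction $\mu / \sigma^2$, the `kelly_scale` of 0.5, and the cap `vol_target / volatility` are hypothetical choices, and the impact adjustment and ATR-based exits are not modeled. Only the 15 percent volatility target is taken from the abstract.

```python
def kelly_fraction(expected_return, volatility):
    """Textbook continuous-time Kelly fraction mu / sigma**2 (annualized inputs)."""
    return expected_return / volatility ** 2


def target_weight(expected_return, volatility, kelly_scale=0.5, vol_target=0.15):
    """Fractional Kelly weight, capped so position volatility stays at or below vol_target.

    kelly_scale and the capping rule are illustrative assumptions.
    """
    raw = kelly_scale * kelly_fraction(expected_return, volatility)
    cap = vol_target / volatility
    return max(-cap, min(cap, raw))


if __name__ == "__main__":
    # Weak signal: the fractional Kelly weight is below the cap and is used as is.
    print(target_weight(0.02, 0.20))
    # Strong signal: the volatility target binds and caps the weight.
    print(target_weight(0.10, 0.10))
```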

2025-11-11T18:52:06Z Institutional-grade systematic framework: Sharpe 2.88, $1B capacity, benchmark-neutral. Seeking feedback on live deployment considerations, multi-asset extensions, and operational implementation at scale Mainak Singha Jose Aguilera-Toste Vinayak Lahiri http://arxiv.org/abs/2511.08306v1 An extreme Gradient Boosting (XGBoost) Trees approach to Detect and Identify Unlawful Insider Trading (UIT) Transactions 2025-11-11T14:36:30Z

Corporate insiders have control of material non-public preferential information (MNPI). Occasionally, insiders strategically bypass legal and regulatory safeguards to exploit MNPI when trading securities. Because of the large volume of transactions, detecting unlawful insider trading is an arduous task for humans, who must examine the data and identify underlying patterns in insiders' behavior. On the other hand, innovative machine learning architectures have shown promising results for analyzing large-scale and complex data with hidden patterns. One such popular technique is eXtreme Gradient Boosting (XGBoost), a state-of-the-art supervised classifier. We therefore apply XGBoost to the identification and detection of unlawful activities. The results demonstrate that XGBoost can identify unlawful transactions with a high accuracy of 97 percent and can rank the features that play the most important role in detecting fraudulent activities.

2025-11-11T14:36:30Z Krishna Neupane Igor Griva http://arxiv.org/abs/2511.07834v1 Levy-stable scaling of risk and performance functionals 2025-11-11T05:07:09Z

We develop a finite-horizon model in which liquid-asset returns exhibit Levy-stable scaling on a data-driven window [tau_UV, tau_IR] and aggregate into a finite-variance regime outside. The window and the tail index alpha are identified from the log-log slope of the central body and a two-segment fit of scale versus horizon. With an anchor horizon tau_0, we derive horizon-correct formulas for Value-at-Risk, Expected Shortfall, Sharpe and Information ratios, Kelly under a Value-at-Risk constraint, and one-step drawdown, where each admits a closed-form Gaussian-bias term driven by the exponent gap (1/alpha - 1/2). The implementation is nonparametric up to alpha and fixed tail quantiles. The formulas are reproducible across horizons on the Levy window.
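
The role of the exponent gap (1/alpha - 1/2) can be seen from the standard self-similarity of stable laws, sketched below. This is not the paper's closed-form bias term, which the abstract does not state. It only assumes that, on the scaling window, a scale-type risk measure grows like (tau / tau_0)^(1/alpha), so its ratio to square-root-of-time scaling is (tau / tau_0)^(1/alpha - 1/2). Function names are hypothetical.

```python
def stable_scale_ratio(tau, tau_0, alpha):
    """Scale at horizon tau relative to the anchor tau_0 under stable scaling."""
    return (tau / tau_0) ** (1.0 / alpha)


def gaussian_bias(tau, tau_0, alpha):
    """Ratio of stable scaling to sqrt-time scaling; equals 1 when alpha = 2."""
    return (tau / tau_0) ** (1.0 / alpha - 0.5)


def scaled_value_at_risk(var_anchor, tau, tau_0, alpha):
    """Rescale an anchor-horizon VaR to horizon tau (valid only inside the window)."""
    return var_anchor * stable_scale_ratio(tau, tau_0, alpha)


if __name__ == "__main__":
    # With alpha = 1.6, sqrt-time scaling understates a 16-period VaR by this factor.
    print(gaussian_bias(16.0, 1.0, 1.6))
    print(scaled_value_at_risk(1.0, 16.0, 1.0, 1.6))
```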

2025-11-11T05:07:09Z Dmitrii Vlasiuk http://arxiv.org/abs/2511.07571v1 Forecasting implied volatility surface with generative diffusion models 2025-11-10T19:31:04Z

We introduce a conditional Denoising Diffusion Probabilistic Model (DDPM) for generating arbitrage-free implied volatility (IV) surfaces, offering a more stable and accurate alternative to existing GAN-based approaches. To capture the path-dependent nature of volatility dynamics, our model is conditioned on a rich set of market variables, including exponentially weighted moving averages (EWMAs) of historical surfaces, returns and squared returns of the underlying asset, and scalar risk indicators like VIX. Empirical results demonstrate that our model significantly outperforms leading GAN-based models in capturing the stylized facts of IV dynamics. A key challenge is that the earlier historical data used for training often contains small arbitrage opportunities, which conflicts with the goal of generating arbitrage-free surfaces. We address this by incorporating a standard arbitrage penalty into the loss function, applied using a novel, parameter-free weighting scheme based on the signal-to-noise ratio (SNR) that dynamically adjusts the penalty's strength across the diffusion process. We also present a formal analysis of this trade-off and provide a proof of convergence showing that the penalty introduces a small, controllable bias that steers the model toward the manifold of arbitrage-free surfaces while ensuring the generated distribution remains close to the real-world data.
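
To make the SNR-based weighting concrete, the sketch below computes the usual DDPM signal-to-noise ratio, SNR(t) = alpha_bar_t / (1 - alpha_bar_t), on a linear beta schedule and turns it into a bounded penalty weight. The abstract does not give the weighting formula, so `penalty_weights` (SNR / (1 + SNR)) is a hypothetical parameter-free choice used only for illustration, and the schedule endpoints are the common defaults rather than values from the paper.

```python
def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances (common DDPM defaults, assumed here)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def signal_to_noise(betas):
    """SNR(t) = alpha_bar_t / (1 - alpha_bar_t), alpha_bar_t = prod(1 - beta_s)."""
    snr = []
    alpha_bar = 1.0
    for beta in betas:
        alpha_bar *= 1.0 - beta
        snr.append(alpha_bar / (1.0 - alpha_bar))
    return snr


def penalty_weights(snr):
    """Hypothetical bounded weight in (0, 1): strong at low noise, weak at high noise."""
    return [s / (1.0 + s) for s in snr]


def penalized_loss(denoising_loss, arbitrage_penalty, weight):
    """Denoising loss plus the weighted arbitrage penalty for one diffusion step."""
    return denoising_loss + weight * arbitrage_penalty


if __name__ == "__main__":
    weights = penalty_weights(signal_to_noise(linear_beta_schedule(1000)))
    print(weights[0], weights[-1])
```

The design intuition is that an arbitrage penalty is only meaningful when the noisy sample still resembles a surface, so the weight fades as the noise level rises.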

2025-11-10T19:31:04Z Chen Jin Ankush Agarwal http://arxiv.org/abs/2511.07235v1 Deep Neural Operator Learning for Probabilistic Models 2025-11-10T15:52:48Z

We propose a deep neural-operator framework for a general class of probability models. Under global Lipschitz conditions on the operator over the entire Euclidean space, and for a broad class of probabilistic models, we establish a universal approximation theorem with explicit network-size bounds for the proposed architecture. The underlying stochastic processes are required only to satisfy integrability and general tail-probability conditions. We verify these assumptions for both European and American option-pricing problems within the forward-backward SDE (FBSDE) framework, which in turn covers a broad class of operators arising from parabolic PDEs, with or without free boundaries. Finally, we present a numerical example for a basket of American options, demonstrating that the learned model produces optimal stopping boundaries for new strike prices without retraining.

2025-11-10T15:52:48Z 36 pages, 1 figure Erhan Bayraktar Qi Feng Zecheng Zhang Zhaoyu Zhang http://arxiv.org/abs/2511.07045v1 Machine-learning a family of solutions to an optimal pension investment problem 2025-11-10T12:38:32Z

We use a neural network to identify the optimal solution to a family of optimal investment problems, where the parameters determining an investor's risk and consumption preferences are given as inputs to the neural network in addition to economic variables. This is used to develop a practical tool for exploring how pension outcomes vary with preference parameters. We use a Black-Scholes economic model so that we may validate the accuracy of the network using a classical and provably convergent numerical method developed using the duality approach.

2025-11-10T12:38:32Z John Armstrong Cristin Buescu James Dalby Rohan Hobbs http://arxiv.org/abs/2511.06451v1 A Risk-Neutral Neural Operator for Arbitrage-Free SPX-VIX Term Structures 2025-11-09T16:35:37Z

We propose ARBITER, a risk-neutral neural operator for learning joint SPX-VIX term structures under no-arbitrage constraints. ARBITER maps market states to an operator that outputs implied volatility and variance curves while enforcing static no-arbitrage constraints (calendar, vertical, butterfly), Lipschitz bounds, and monotonicity. The model couples operator learning with constrained decoders and is trained with extragradient-style updates plus projection. We introduce evaluation metrics for derivatives term structures (NAS, CNAS, NI, Dual-Gap, Stability Rate) and show gains over Fourier Neural Operator, DeepONet, and state-space sequence models on historical SPX and VIX data. Ablation studies indicate that tying the SPX and VIX legs reduces Dual-Gap and improves NI, Lipschitz projection stabilizes calibration, and selective state updates improve long-horizon generalization. We provide identifiability and approximation results and describe practical recipes for arbitrage-free interpolation and extrapolation across maturities and strikes.
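
The three static conditions named above (calendar, vertical, butterfly) can be checked on a grid of call prices with a few comparisons. The sketch below is a generic textbook check, not ARBITER's constrained decoder: it assumes zero rates and dividends, strikes and maturities sorted in increasing order, and call prices rather than implied volatilities. All function names are hypothetical.

```python
def vertical_ok(calls, tol=1e-12):
    """Call prices must not increase with strike."""
    return all(calls[j + 1] <= calls[j] + tol for j in range(len(calls) - 1))


def butterfly_ok(strikes, calls, tol=1e-12):
    """Call prices must be convex in strike (slopes between strikes non-decreasing)."""
    slopes = [
        (calls[j + 1] - calls[j]) / (strikes[j + 1] - strikes[j])
        for j in range(len(calls) - 1)
    ]
    return all(slopes[j + 1] >= slopes[j] - tol for j in range(len(slopes) - 1))


def calendar_ok(surface, tol=1e-12):
    """At each strike, the longer maturity must be worth at least the shorter one."""
    return all(
        later[j] >= earlier[j] - tol
        for earlier, later in zip(surface, surface[1:])
        for j in range(len(earlier))
    )


def static_arbitrage_free(strikes, surface):
    """surface[i][j] is the call price at maturity i and strike j."""
    return (
        all(vertical_ok(row) and butterfly_ok(strikes, row) for row in surface)
        and calendar_ok(surface)
    )


if __name__ == "__main__":
    strikes = [90.0, 100.0, 110.0]
    surface = [[12.0, 5.0, 1.5], [14.0, 8.0, 4.0]]
    print(static_arbitrage_free(strikes, surface))
```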

2025-11-09T16:35:37Z 46 pages, 9 figures, includes appendices; v11 draft aligned with final outline Jian'an Zhang http://arxiv.org/abs/2511.06177v1 Push-response anomalies in high-frequency S&P 500 price series 2025-11-09T01:34:00Z

We test the hypothesis that consecutive intraday price changes in the most liquid U.S. equity ETF (SPY) are conditionally nonrandom. Using NBBO event-time data for about 1,500 regular trading days, we form for every lag L ordered pairs of a backward price increment ("push") and a forward price increment ("response"), standardize them, and estimate the expected responses on a fine grid of push magnitudes. The resulting lag-by-magnitude maps reveal a persistent structural shift: for short lags (1-5,000 ticks), expected responses cluster near zero across most push magnitudes, suggesting high short-term efficiency; beyond that range, pronounced tails emerge, indicating that larger historical pushes increasingly correlate with nonzero conditional responses. We also find that large negative pushes are followed by stronger positive responses than equally large positive pushes, consistent with asymmetric liquidity replenishment after sell-side shocks. Decomposition into symmetric and antisymmetric components and the associated dominance curves confirm that short-horizon efficiency is restored only partially. The evidence points to an intraday, lag-resolved anomaly that is invisible in unconditional returns and that can be used to define tradable pockets and risk controls.
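
The push-response construction described above can be sketched in a few lines. For a lag L, each time index t contributes the pair (p[t] - p[t-L], p[t+L] - p[t]); responses are then averaged within bins of push magnitude. The abstract does not specify how the increments are standardized or how the grid is chosen, so `scale_by_std` (division by the sample standard deviation) and the bin edges are assumptions.

```python
def push_response_pairs(prices, lag):
    """Pairs of backward increment (push) and forward increment (response) at a given lag."""
    return [
        (prices[t] - prices[t - lag], prices[t + lag] - prices[t])
        for t in range(lag, len(prices) - lag)
    ]


def scale_by_std(values):
    """Divide by the sample standard deviation (assumed standardization)."""
    mean = sum(values) / len(values)
    std = (sum((v - mean) ** 2 for v in values) / len(values)) ** 0.5
    if std == 0.0:
        raise ValueError("cannot standardize a constant series")
    return [v / std for v in values]


def conditional_response(pairs, bin_edges):
    """Mean response for pushes in each half-open bin [edge_k, edge_{k+1}); None if empty."""
    sums = [0.0] * (len(bin_edges) - 1)
    counts = [0] * (len(bin_edges) - 1)
    for push, response in pairs:
        for k in range(len(bin_edges) - 1):
            if bin_edges[k] <= push < bin_edges[k + 1]:
                sums[k] += response
                counts[k] += 1
                break
    return [s / c if c else None for s, c in zip(sums, counts)]


if __name__ == "__main__":
    prices = [0.0, 1.0, 3.0, 2.0, 2.0, 5.0]
    pairs = push_response_pairs(prices, lag=1)
    print(conditional_response(pairs, [-2.0, 0.5, 3.0]))
```

Repeating the last step over many lags gives one row of the lag-by-magnitude map per lag.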

2025-11-09T01:34:00Z Dmitrii Vlasiuk Mikhail Smirnov http://arxiv.org/abs/2502.17493v2 A Novel Loss Function for Deep Learning Based Daily Stock Trading System 2025-11-07T23:27:12Z

Making consistently profitable financial decisions in a continuously evolving and volatile stock market has always been a difficult task. Professionals from different disciplines have developed foundational theories, such as the famed Capital Asset Pricing Model (CAPM), to anticipate price movement and evaluate securities. In recent years, the role of artificial intelligence (AI) in asset pricing has been growing. Although deep learning models are black boxes that lack interpretability, they have continued to solidify their position in the financial industry. We aim to further enhance AI's potential and utility by introducing a return-weighted loss function that will drive top growth while providing the ML models a limited amount of information. Using only publicly accessible stock data (open/close/high/low, trading volume, sector information) and several technical indicators constructed from them, we propose an efficient daily trading system that detects top growth opportunities. Our best models achieve 61.73% annual return on daily rebalancing with an annualized Sharpe Ratio of 1.18 over 1340 testing days from 2019 to 2024, and 37.61% annual return with an annualized Sharpe Ratio of 0.97 over 1360 testing days from 2005 to 2010. The main drivers for success, especially independent of any domain knowledge, are the novel return-weighted loss function, the integration of categorical and continuous data, and the ML model architecture. We also demonstrate the superiority of our novel loss function over traditional loss functions via several performance metrics and statistical evidence.

2025-02-20T21:43:51Z 27 pages, 11 figures, GitHub repo: https://github.com/Tony-Guo-1/daily_trading_strategy Ruoyu Guo Haochen Qiu Xuelun Hou http://arxiv.org/abs/2511.05315v1 Economic uncertainty and exchange rates linkage revisited: modelling tail dependence with high frequency data 2025-11-07T15:14:41Z

This paper takes a closer look at the dependence between exchange rates and uncertainty. Using the novel daily Twitter Uncertainty Index of Baker et al. (2020) and BRICS exchange rates, we investigate their extreme tail dependence within an original time-varying copula framework. Our analysis yields several noteworthy results. Evidence for the Indian, Russian and South African currencies indicates a dominance of elliptical copulas, implying neither asymmetric features nor extreme movements in their dependence structure with global economic uncertainty. Importantly, the tail dependence of the Brazilian and Chinese currencies is trending upward, suggesting a safe-haven role in times of high global economic uncertainty, including the recent COVID-19 pandemic. In such circumstances, these markets offer opportunities for significant gains through portfolio diversification.

2025-11-07T15:14:41Z Nourhaine Nefzi COGI Abir Abid COGI http://arxiv.org/abs/2511.08621v1 The LLM Pro Finance Suite: Multilingual Large Language Models for Financial Applications 2025-11-07T11:08:31Z

The financial industry's growing demand for advanced natural language processing (NLP) capabilities has highlighted the limitations of generalist large language models (LLMs) in handling domain-specific financial tasks. To address this gap, we introduce the LLM Pro Finance Suite, a collection of five instruction-tuned LLMs (ranging from 8B to 70B parameters) specifically designed for financial applications. Our approach focuses on enhancing generalist instruction-tuned models, leveraging their existing strengths in instruction following, reasoning, and toxicity control, while fine-tuning them on a curated, high-quality financial corpus comprising over 50% finance-related data in English, French, and German. We evaluate the LLM Pro Finance Suite on a comprehensive financial benchmark suite, demonstrating consistent improvement over state-of-the-art baselines in finance-oriented tasks and financial translation. Notably, our models maintain the strong general-domain capabilities of their base models, ensuring reliable performance across non-specialized tasks. This dual proficiency, enhanced financial expertise without compromise on general abilities, makes the LLM Pro Finance Suite an ideal drop-in replacement for existing LLMs in financial workflows, offering improved domain-specific performance while preserving overall versatility. We publicly release two 8B-parameter models to foster future research and development in financial NLP applications: https://huggingface.co/collections/DragonLLM/llm-open-finance.

2025-11-07T11:08:31Z Gaëtan Caillaut Raheel Qader Jingshu Liu Mariam Nakhlé Arezki Sadoune Massinissa Ahmim Jean-Gabriel Barthelemy http://arxiv.org/abs/2508.13557v2 Portfolio construction using a sampling-based variational quantum scheme 2025-11-07T08:52:35Z

The efficient and effective construction of portfolios that adhere to real-world constraints is a challenging optimization task in finance. We investigate a concrete representation of the problem with a focus on design proposals of an Exchange Traded Fund. We evaluate the sampling-based CVaR Variational Quantum Algorithm (VQA), combined with a local-search post-processing, for solving problem instances that become classically hard beyond a certain size. We also propose a problem formulation that is suited for sampling-based VQA. Our utility-scale experiments on IBM Heron processors involve 109 qubits and up to 4200 gates, achieving a relative solution error of 0.49%. Results indicate that a combined quantum-classical workflow achieves better accuracy compared to purely classical local search, and that hard-to-simulate quantum circuits may lead to better convergence than simpler circuits. Our work paves the way for further exploration of portfolio construction with quantum computers.
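
In sampling-based variational algorithms, a CVaR objective is commonly computed as the mean of the best alpha-fraction of sampled costs rather than the mean of all samples. The sketch below shows that aggregation for a minimization problem. It is a generic illustration under that assumption: the abstract does not state the paper's exact objective, alpha value, or cost function, and `cvar_objective` is a hypothetical name.

```python
import math


def cvar_objective(costs, alpha):
    """Mean of the lowest ceil(alpha * K) of K sampled costs (minimization)."""
    if not costs:
        raise ValueError("need at least one sampled cost")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    ordered = sorted(costs)
    keep = math.ceil(alpha * len(ordered))
    return sum(ordered[:keep]) / keep


if __name__ == "__main__":
    sampled_costs = [4.0, 1.0, 3.0, 2.0]
    print(cvar_objective(sampled_costs, 0.5))  # mean of the two best samples
    print(cvar_objective(sampled_costs, 1.0))  # plain sample mean
```

Setting alpha to 1 recovers the ordinary expectation, while a small alpha focuses the optimizer on the best bitstrings observed.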

2025-08-19T06:32:54Z Gabriele Agliardi Dimitris Alevras Vaibhaw Kumar Roberto Lo Nardo Gabriele Compostella Sumit Kumar Manuel Proissl Bimal Mehta http://arxiv.org/abs/2507.09601v2 NMIXX: Domain-Adapted Neural Embeddings for Cross-Lingual eXploration of Finance 2025-11-07T05:55:20Z

General-purpose sentence embedding models often struggle to capture specialized financial semantics, especially in low-resource languages like Korean, due to domain-specific jargon, temporal meaning shifts, and misaligned bilingual vocabularies. To address these gaps, we introduce NMIXX (Neural eMbeddings for Cross-lingual eXploration of Finance), a suite of cross-lingual embedding models fine-tuned with 18.8K high-confidence triplets that pair in-domain paraphrases, hard negatives derived from a semantic-shift typology, and exact Korean-English translations. Concurrently, we release KorFinSTS, a 1,921-pair Korean financial STS benchmark spanning news, disclosures, research reports, and regulations, designed to expose nuances that general benchmarks miss. When evaluated against seven open-license baselines, NMIXX's multilingual bge-m3 variant achieves Spearman's rho gains of +0.10 on English FinSTS and +0.22 on KorFinSTS, outperforming its pre-adaptation checkpoint and surpassing other models by the largest margin, while revealing a modest trade-off in general STS performance. Our analysis further shows that models with richer Korean token coverage adapt more effectively, underscoring the importance of tokenizer design in low-resource, cross-lingual settings. By making both models and the benchmark publicly available, we provide the community with robust tools for domain-adapted, multilingual representation learning in finance.

2025-07-13T12:14:57Z Accepted at FinAI@CIKM 2025 Hanwool Lee Sara Yu Yewon Hwang Jonghyun Choi Heejae Ahn Sungbum Jung Youngjae Yu