https://arxiv.org/api/NHDHpc0NyNloiq7iwgJ1CiTXjR4 2026-03-26T08:20:47Z 3130 240 15 http://arxiv.org/abs/2508.13557v2 Portfolio construction using a sampling-based variational quantum scheme 2025-11-07T08:52:35Z

The efficient and effective construction of portfolios that adhere to real-world constraints is a challenging optimization task in finance. We investigate a concrete representation of the problem with a focus on design proposals of an Exchange Traded Fund. We evaluate the sampling-based CVaR Variational Quantum Algorithm (VQA), combined with a local-search post-processing, for solving problem instances that beyond a certain size become classically hard. We also propose a problem formulation that is suited for sampling-based VQA. Our utility-scale experiments on IBM Heron processors involve 109 qubits and up to 4200 gates, achieving a relative solution error of 0.49%. Results indicate that a combined quantum-classical workflow achieves better accuracy compared to purely classical local search, and that hard-to-simulate quantum circuits may lead to better convergence than simpler circuits. Our work paves the path to further explore portfolio construction with quantum computers.

2025-08-19T06:32:54Z Gabriele Agliardi Dimitris Alevras Vaibhaw Kumar Roberto Lo Nardo Gabriele Compostella Sumit Kumar Manuel Proissl Bimal Mehta http://arxiv.org/abs/2507.09601v2 NMIXX: Domain-Adapted Neural Embeddings for Cross-Lingual eXploration of Finance 2025-11-07T05:55:20Z

General-purpose sentence embedding models often struggle to capture specialized financial semantics, especially in low-resource languages like Korean, due to domain-specific jargon, temporal meaning shifts, and misaligned bilingual vocabularies. To address these gaps, we introduce NMIXX (Neural eMbeddings for Cross-lingual eXploration of Finance), a suite of cross-lingual embedding models fine-tuned with 18.8K high-confidence triplets that pair in-domain paraphrases, hard negatives derived from a semantic-shift typology, and exact Korean-English translations. Concurrently, we release KorFinSTS, a 1,921-pair Korean financial STS benchmark spanning news, disclosures, research reports, and regulations, designed to expose nuances that general benchmarks miss. When evaluated against seven open-license baselines, NMIXX's multilingual bge-m3 variant achieves Spearman's rho gains of +0.10 on English FinSTS and +0.22 on KorFinSTS, outperforming its pre-adaptation checkpoint and surpassing other models by the largest margin, while revealing a modest trade-off in general STS performance. Our analysis further shows that models with richer Korean token coverage adapt more effectively, underscoring the importance of tokenizer design in low-resource, cross-lingual settings. By making both models and the benchmark publicly available, we provide the community with robust tools for domain-adapted, multilingual representation learning in finance.

2025-07-13T12:14:57Z Accepted at FinAI@CIKM 2025 Hanwool Lee Sara Yu Yewon Hwang Jonghyun Choi Heejae Ahn Sungbum Jung Youngjae Yu http://arxiv.org/abs/2511.08606v1 Data-driven Feynman-Kac Discovery with Applications to Prediction and Data Generation 2025-11-05T01:57:01Z

In this paper, we propose a novel data-driven framework for discovering probabilistic laws underlying the Feynman-Kac formula. Specifically, we introduce the first stochastic SINDy method formulated under the risk-neutral probability measure to recover the backward stochastic differential equation (BSDE) from a single pair of stock and option trajectories. Unlike existing approaches to identifying stochastic differential equations-which typically require ergodicity-our framework leverages the risk-neutral measure, thereby eliminating the ergodicity assumption and enabling BSDE recovery from limited financial time series data. Using this algorithm, we are able not only to make forward-looking predictions but also to generate new synthetic data paths consistent with the underlying probabilistic law.

2025-11-05T01:57:01Z 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: Generative AI in Finance Qi Feng Guang Lin Purav Matlia Denny Serdarevic http://arxiv.org/abs/2511.02700v1 Numerical valuation of European options under two-asset infinite-activity exponential Lévy models 2025-11-04T16:22:26Z

We propose a numerical method for the valuation of European-style options under two-asset infinite-activity exponential Lévy models. Our method extends the effective approach developed by Wang, Wan & Forsyth (2007) for the 1-dimensional case to the 2-dimensional setting and is applicable for general Lévy measures under mild assumptions. A tailored discretization of the non-local integral term is developed, which can be efficiently evaluated by means of the fast Fourier transform. For the temporal discretization, the semi-Lagrangian theta-method is employed in a convenient splitting fashion, where the diffusion term is treated implicitly and the integral term is handled explicitly by a fixed-point iteration. Numerical experiments for put-on-the-average options under Normal Tempered Stable dynamics reveal favourable second-order convergence of our method whenever the exponential Lévy process has finite-variation.

2025-11-04T16:22:26Z Massimiliano Moda Karel J. in 't Hout Michèle Vanmaele Fred Espen Benth http://arxiv.org/abs/2301.09241v5 Quantum Monte Carlo algorithm for solving Black-Scholes PDEs for high-dimensional option pricing in finance and its complexity analysis 2025-11-04T13:53:43Z

In this paper we provide a quantum Monte Carlo algorithm to solve high-dimensional Black-Scholes PDEs with correlation for high-dimensional option pricing. The payoff function of the option is of general form and is only required to be continuous and piece-wise affine (CPWA), which covers most of the relevant payoff functions used in finance. We provide a rigorous error analysis and complexity analysis of our algorithm. In particular, we prove that the computational complexity of our algorithm is bounded polynomially in the space dimension $d$ of the PDE and the reciprocal of the prescribed accuracy $\varepsilon$. Moreover, we show that for payoff functions which are bounded, our algorithm indeed has a speed-up compared to classical Monte Carlo methods. Furthermore, we provide numerical simulations in one and two dimensions using our developed package within the Qiskit framework tailored to price CPWA options with respect to the Black-Scholes model, as well as discuss the potential extension of the numerical simulations to arbitrary space dimension.

2023-01-23T01:55:01Z Jianjun Chen Yongming Li Ariel Neufeld http://arxiv.org/abs/2511.02469v1 Modeling Hawkish-Dovish Latent Beliefs in Multi-Agent Debate-Based LLMs for Monetary Policy Decision Classification 2025-11-04T10:56:01Z

Accurately forecasting central bank policy decisions, particularly those of the Federal Open Market Committee(FOMC) has become increasingly important amid heightened economic uncertainty. While prior studies have used monetary policy texts to predict rate changes, most rely on static classification models that overlook the deliberative nature of policymaking. This study proposes a novel framework that structurally imitates the FOMC's collective decision-making process by modeling multiple large language models(LLMs) as interacting agents. Each agent begins with a distinct initial belief and produces a prediction based on both qualitative policy texts and quantitative macroeconomic indicators. Through iterative rounds, agents revise their predictions by observing the outputs of others, simulating deliberation and consensus formation. To enhance interpretability, we introduce a latent variable representing each agent's underlying belief(e.g., hawkish or dovish), and we theoretically demonstrate how this belief mediates the perception of input information and interaction dynamics. Empirical results show that this debate-based approach significantly outperforms standard LLMs-based baselines in prediction accuracy. Furthermore, the explicit modeling of beliefs provides insights into how individual perspectives and social influence shape collective policy forecasts.

2025-11-04T10:56:01Z PRIMA2025 Accepted Kaito Takano Masanori Hirano Kei Nakagawa http://arxiv.org/abs/2510.10807v3 Multi-Agent Regime-Conditioned Diffusion (MARCD) for CVaR-Constrained Portfolio Decisions 2025-11-03T12:18:32Z

We examine whether regime-conditioned generative scenarios combined with a convex CVaR allocator improve portfolio decisions under regime shifts. We present MARCD, a generative-to-decision framework with: (i) a Gaussian HMM to infer latent regimes; (ii) a diffusion generator that produces regime-conditioned scenarios; (iii) signal extraction via blended, shrunk moments; and (iv) a governed CVaR epigraph quadratic program. Contributions: Within the Scenario stage we introduce a tail-weighted diffusion objective that up-weights low-quantile outcomes relevant for drawdowns and a regime-expert (MoE) denoiser whose gate increases with crisis posteriors; both are evaluated end-to-end through the allocator. Under strict walk-forward on liquid multi-asset ETFs (2005-2025), MARCD exhibits stronger scenario calibration and materially smaller drawdowns: MaxDD 9.3% versus 14.1% for BL (a 34% reduction) over 2020-2025 out-of-sample. The framework provides an auditable pipeline with explicit budget, box, and turnover constraints, demonstrating the value of decision-aware generative modeling in finance.

2025-10-12T20:56:10Z Code available at: https://github.com/AliAtiah/MARCD Ali Atiah Alzahrani http://arxiv.org/abs/2511.01471v1 Trade Execution Flow as the Underlying Source of Market Dynamics 2025-11-03T11:30:59Z

In this work, we demonstrate experimentally that the execution flow, $I = dV/dt$, is the fundamental driving force of market dynamics. We develop a numerical framework to calculate execution flow from sampled moments using the Radon-Nikodym derivative. A notable feature of this approach is its ability to automatically determine thresholds that can serve as actionable triggers. The technique also determines the characteristic time scale directly from the corresponding eigenproblem. The methodology has been validated on actual market data to support these findings. Additionally, we introduce a framework based on the Christoffel function spectrum, which is invariant under arbitrary non-degenerate linear transformations of input attributes and offers an alternative to traditional principal component analysis (PCA), which is limited to unitary invariance.

2025-11-03T11:30:59Z Mikhail Gennadievich Belov Victor Victorovich Dubov Vadim Konstantinovich Ivanov Alexander Yurievich Maslov Olga Vladimirovna Proshina Vladislav Gennadievich Malyshkin http://arxiv.org/abs/2510.05702v2 Uncovering Representation Bias for Investment Decisions in Open-Source Large Language Models 2025-11-03T01:00:40Z

Large Language Models are increasingly adopted in financial applications to support investment workflows. However, prior studies have seldom examined how these models reflect biases related to firm size, sector, or financial characteristics, which can significantly impact decision-making. This paper addresses this gap by focusing on representation bias in open-source Qwen models. We propose a balanced round-robin prompting method over approximately 150 U.S. equities, applying constrained decoding and token-logit aggregation to derive firm-level confidence scores across financial contexts. Using statistical tests and variance analysis, we find that firm size and valuation consistently increase model confidence, while risk factors tend to decrease it. Confidence varies significantly across sectors, with the Technology sector showing the greatest variability. When models are prompted for specific financial categories, their confidence rankings best align with fundamental data, moderately with technical signals, and least with growth indicators. These results highlight representation bias in Qwen models and motivate sector-aware calibration and category-conditioned evaluation protocols for safe and fair financial LLM deployment.

2025-10-07T09:10:13Z Fabrizio Dimino Krati Saxena Bhaskarjit Sarmah Stefano Pasquali http://arxiv.org/abs/2511.01125v1 One model to solve them all: 2BSDE families via neural operators 2025-11-03T00:27:13Z

We introduce a mild generative variant of the classical neural operator model, which leverages Kolmogorov--Arnold networks to solve infinite families of second-order backward stochastic differential equations ($2$BSDEs) on regular bounded Euclidean domains with random terminal time. Our first main result shows that the solution operator associated with a broad range of $2$BSDE families is approximable by appropriate neural operator models. We then identify a structured subclass of (infinite) families of $2$BSDEs whose neural operator approximation requires only a polynomial number of parameters in the reciprocal approximation rate, as opposed to the exponential requirement in general worst-case neural operator guarantees.

2025-11-03T00:27:13Z Takashi Furuya Anastasis Kratsios Dylan Possamaï Bogdan Raonić http://arxiv.org/abs/2505.17048v2 Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications Globally 2025-11-01T21:51:13Z

Central banks around the world play a crucial role in maintaining economic stability. Deciphering policy implications in their communications is essential, especially as misinterpretations can disproportionately impact vulnerable populations. To address this, we introduce the World Central Banks (WCB) dataset, the most comprehensive monetary policy corpus to date, comprising over 380k sentences from 25 central banks across diverse geographic regions, spanning 28 years of historical data. After uniformly sampling 1k sentences per bank (25k total) across all available years, we annotate and review each sentence using dual annotators, disagreement resolutions, and secondary expert reviews. We define three tasks: Stance Detection, Temporal Classification, and Uncertainty Estimation, with each sentence annotated for all three. We benchmark seven Pretrained Language Models (PLMs) and nine Large Language Models (LLMs) (Zero-Shot, Few-Shot, and with annotation guide) on these tasks, running 15,075 benchmarking experiments. We find that a model trained on aggregated data across banks significantly surpasses a model trained on an individual bank's data, confirming the principle "the whole is greater than the sum of its parts." Additionally, rigorous human evaluations, error analyses, and predictive tasks validate our framework's economic utility. Our artifacts are accessible through the HuggingFace and GitHub under the CC-BY-NC-SA 4.0 license.

2025-05-15T19:49:20Z Accepted at NeurIPS 2025 (main conference) Agam Shah Siddhant Sukhani Huzaifa Pardawala Saketh Budideti Riya Bhadani Rudra Gopal Siddhartha Somani Rutwik Routu Michael Galarnyk Soungmin Lee Arnav Hiray Akshar Ravichandran Eric Kim Pranav Aluru Joshua Zhang Sebastian Jaskowski Veer Guda Meghaj Tarte Liqin Ye Spencer Gosden Rachel Yuh Sloka Chava Sahasra Chava Dylan Patrick Kelly Aiden Chiang Harsit Mittal Sudheer Chava http://arxiv.org/abs/2511.00665v1 Technical Analysis Meets Machine Learning: Bitcoin Evidence 2025-11-01T19:13:07Z

In this note, we compare Bitcoin trading performance using two machine learning models-Light Gradient Boosting Machine (LightGBM) and Long Short-Term Memory (LSTM)-and two technical analysis-based strategies: Exponential Moving Average (EMA) crossover and a combination of Moving Average Convergence/Divergence with the Average Directional Index (MACD+ADX). The objective is to evaluate how trading signals can be used to maximize profits in the Bitcoin market. This comparison was motivated by the U.S. Securities and Exchange Commission's (SEC) approval of the first spot Bitcoin exchange-traded funds (ETFs) on 2024-01-10. Our results show that the LSTM model achieved a cumulative return of approximately 65.23% in under a year, significantly outperforming LightGBM, the EMA and MACD+ADX strategies, as well as the baseline buy-and-hold. This study highlights the potential for deeper integration of machine learning and technical analysis in the rapidly evolving cryptocurrency landscape.

2025-11-01T19:13:07Z José Ángel Islas Anguiano Andrés García-Medina http://arxiv.org/abs/2511.00190v1 Deep reinforcement learning for optimal trading with partial information 2025-10-31T18:48:59Z

Reinforcement Learning (RL) applied to financial problems has been the subject of a lively area of research. The use of RL for optimal trading strategies that exploit latent information in the market is, to the best of our knowledge, not widely tackled. In this paper we study an optimal trading problem, where a trading signal follows an Ornstein-Uhlenbeck process with regime-switching dynamics. We employ a blend of RL and Recurrent Neural Networks (RNN) in order to make the most at extracting underlying information from the trading signal with latent parameters. The latent parameters driving mean reversion, speed, and volatility are filtered from observations of the signal, and trading strategies are derived via RL. To address this problem, we propose three Deep Deterministic Policy Gradient (DDPG)-based algorithms that integrate Gated Recurrent Unit (GRU) networks to capture temporal dependencies in the signal. The first, a one -step approach (hid-DDPG), directly encodes hidden states from the GRU into the RL trader. The second and third are two-step methods: one (prob-DDPG) makes use of posterior regime probability estimates, while the other (reg-DDPG) relies on forecasts of the next signal value. Through extensive simulations with increasingly complex Markovian regime dynamics for the trading signal's parameters, as well as an empirical application to equity pair trading, we find that prob-DDPG achieves superior cumulative rewards and exhibits more interpretable strategies. By contrast, reg-DDPG provides limited benefits, while hid-DDPG offers intermediate performance with less interpretable strategies. Our results show that the quality and structure of the information supplied to the agent are crucial: embedding probabilistic insights into latent regimes substantially improves both profitability and robustness of reinforcement learning-based trading strategies.

2025-10-31T18:48:59Z Andrea Macrì Sebastian Jaimungal Fabrizio Lillo http://arxiv.org/abs/2510.26438v2 An Impulse Control Approach to Market Making in a Hawkes LOB Market 2025-10-31T13:35:38Z

We study the optimal Market Making problem in a Limit Order Book (LOB) market simulated using a high-fidelity, mutually exciting Hawkes process. Departing from traditional Brownian-driven mid-price models, our setup captures key microstructural properties such as queue dynamics, inter-arrival clustering, and endogenous price impact. Recognizing the realistic constraint that market makers cannot update strategies at every LOB event, we formulate the control problem within an impulse control framework, where interventions occur discretely via limit, cancel, or market orders. This leads to a high-dimensional, non-local Hamilton-Jacobi-Bellman Quasi-Variational Inequality (HJB-QVI), whose solution is analytically intractable and computationally expensive due to the curse of dimensionality. To address this, we propose a novel Reinforcement Learning (RL) approximation inspired by auxiliary control formulations. Using a two-network PPO-based architecture with self-imitation learning, we demonstrate strong empirical performance with limited training, achieving Sharpe ratios above 30 in a realistic simulated LOB. In addition to that, we solve the HJB-QVI using a deep learning method inspired by Sirignano and Spiliopoulos 2018 and compare the performance with the RL agent. Our findings highlight the promise of combining impulse control theory with modern deep RL to tackle optimal execution problems in jump-driven microstructural markets.

2025-10-30T12:34:06Z Konark Jain Nick Firoozye Jonathan Kochems Philip Treleaven http://arxiv.org/abs/2510.27132v1 Exact Terminal Condition Neural Network for American Option Pricing Based on the Black-Scholes-Merton Equations 2025-10-31T03:11:30Z

This paper proposes the Exact Terminal Condition Neural Network (ETCNN), a deep learning framework for accurately pricing American options by solving the Black-Scholes-Merton (BSM) equations. The ETCNN incorporates carefully designed functions that ensure the numerical solution not only exactly satisfies the terminal condition of the BSM equations but also matches the non-smooth and singular behavior of the option price near expiration. This method effectively addresses the challenges posed by the inequality constraints in the BSM equations and can be easily extended to high-dimensional scenarios. Additionally, input normalization is employed to maintain the homogeneity. Multiple experiments are conducted to demonstrate that the proposed method achieves high accuracy and exhibits robustness across various situations, outperforming both traditional numerical methods and other machine learning approaches.

2025-10-31T03:11:30Z Wenxuan Zhang Yixiao Guo Benzhuo Lu