https://arxiv.org/api/S8fPrV5JI0Bs0Dc6sr6iNw1xKM4 2026-06-21T15:15:01Z 3237 120 15 http://arxiv.org/abs/2602.07096v2 RealFin: How Well Do LLMs Reason About Finance When Users Leave Things Unsaid? 2026-04-26T09:26:40Z Reliable financial reasoning requires knowing not only how to answer, but also when an answer cannot be justified. In real financial practice, problems often rely on implicit assumptions that are taken for granted rather than stated explicitly, causing problems to appear solvable while lacking enough information for a definite answer. We introduce REALFIN, a bilingual benchmark that evaluates financial reasoning by systematically removing essential premises from exam-style questions while keeping them linguistically plausible. Based on this, we evaluate models under three formulations that test answering, recognizing missing information, and rejecting unjustified options, and find consistent performance drops when key conditions are absent. General-purpose models tend to over-commit and guess, while most finance-specialized models fail to clearly identify missing premises. These results highlight a critical gap in current evaluations and show that reliable financial models must know when a question should not be answered. 2026-02-06T13:47:54Z Yuyang Dai Yan Lin Zhuohan Xie Yuxia Wang http://arxiv.org/abs/2502.17011v2 Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation 2026-04-24T06:19:27Z Financial bond yield forecasting is challenging due to data scarcity, nonlinear macroeconomic dependencies, and evolving market conditions. In this paper, we propose a novel framework that leverages Causal Generative Adversarial Networks (CausalGANs) and Soft Actor-Critic (SAC) reinforcement learning (RL) to generate high-fidelity synthetic bond yield data for four major bond categories (AAA, BAA, US10Y, Junk). 
By incorporating 12 key macroeconomic variables, we ensure statistical fidelity by preserving essential market properties. To transform this market-dependent synthetic data into actionable insights, we employ a fine-tuned Large Language Model (LLM), Qwen2.5-7B, that generates trading signals (BUY/HOLD/SELL), risk assessments, and volatility projections. We use automated, human, and LLM evaluations, all of which demonstrate that our framework improves forecasting performance over existing methods, with statistical validation via predictive accuracy, MAE evaluation (0.103%), profit/loss evaluation (60% profit rate), LLM evaluation (3.37/5), and expert assessments scoring 4.67 out of 5. The reinforcement learning-enhanced synthetic data generation achieves the lowest Mean Absolute Error of 0.103, demonstrating its effectiveness in replicating real-world bond market dynamics. We not only enhance data-driven trading strategies but also provide a scalable, high-fidelity synthetic financial data pipeline for risk & volatility management and investment decision-making. This work establishes a bridge between synthetic data generation, LLM-driven financial forecasting, and language model evaluation, contributing to AI-driven financial decision-making. 2025-02-24T09:46:37Z Jaskaran Singh Walia Aarush Sinha Naman Saraswat Srinitish Srinivasan Srihari Unnikrishnan http://arxiv.org/abs/2604.21672v1 Agentic Artificial Intelligence in Finance: A Comprehensive Survey 2026-04-23T13:37:06Z The emergence of agentic artificial intelligence (AI) represents a fundamental transformation in financial markets, characterized by autonomous systems capable of reasoning, planning, and adaptive decision-making with minimal human intervention. This comprehensive survey synthesizes recent advances in agentic AI across multiple dimensions of financial operations, including system architecture, market applications, regulatory frameworks, and systemic implications. 
We examine how agentic AI differs from traditional algorithmic trading and generative AI through its capacity for goal-oriented autonomy, continuous learning, and multi-agent coordination. Our analysis shows that while agentic AI offers substantial potential for enhanced market efficiency, liquidity provision, and risk management, it also introduces novel challenges related to market stability, regulatory compliance, interpretability, and systemic risk. Through a systematic review of foundational research, technical architectures, market applications, and governance frameworks, this survey provides scholars and practitioners with a structured understanding of how agentic AI is reshaping financial markets and identifies critical research directions for ensuring that these systems enhance both operational efficiency and market resilience. 2026-04-23T13:37:06Z 35 pages Irene Aldridge Jolie An Riley Burke Michael Cao Chia-Yi Chien Kexin Deng Ruipeng Deng Yichen Gao Olivia Guo Shunran He Zheng Li George Lin Weihang Lin Percy Lyu Alex Ng Qi Wang Hanxi Xiao Dora Xu Yuanyuan Xue Sheng Zhang Sirui Zhang Yun Zhang Sirui Zhao Xiaolong Zhao Yihan Zhao Waner Zheng http://arxiv.org/abs/2605.06677v1 Extrema, Barrier Options, and Semi-Analytic Leverage Corrections in Stochastic-Clock Volatility Models 2026-04-22T08:17:33Z Barrier derivatives depend on extrema and first-passage events and are therefore highly sensitive to volatility dynamics -- especially to the instantaneous return-volatility correlation $ρ$, often called ``leverage''. This sensitivity makes accurate and fast pricing under realistic stochastic-volatility specifications difficult: two-dimensional PDE solvers are expensive inside calibration loops, while Monte Carlo methods converge slowly when barrier hits are rare and discretely monitored. In equity markets in particular, the pronounced implied-volatility skew motivates factoring in a negative return-volatility correlation. 
We study a class of continuous-path stochastic-clock volatility models in which the log-price is represented as a Brownian motion run on a random increasing clock. In the baseline independent-clock case ($ρ=0$), a broad family of barrier-relevant objects (maximum distributions, survival probabilities, and killed joint laws) reduces to one-dimensional quantities determined by the Laplace transform of the terminal clock. This yields transform-only pricing formulas for single- and double-barrier contracts that are fast and numerically stable once the clock transform is available, notably for affine and quadratic clocks. To incorporate leverage without forfeiting tractability, we develop a systematic small-$ρ$ expansion around the $ρ=0$ backbone. The expansion produces a hierarchy of forced problems whose forcing terms are semi-analytic and computable from baseline barrier objects. We provide two implementable leverage-correction routes: forced PDEs and a Duhamel-type Monte Carlo representation, and we show how Padé acceleration can extend practical accuracy to equity-like correlations. Calibration then proceeds by: (i) fitting clock parameters from vanillas using only one-dimensional transforms, (ii) precomputing the $ρ=0$ barrier backbone once, and (iii) iterating on $ρ$ (and any remaining parameters) using the fast semi-analytic corrections (optionally Padé-accelerated) inside a standard least-squares loop. 2026-04-22T08:17:33Z Tristan Guillaume CYU http://arxiv.org/abs/2604.00472v3 Valuation of variable annuities under the Volterra mortality and rough Heston models 2026-04-22T03:49:05Z This paper investigates the valuation of variable annuity contracts with an early surrender option under non-Markovian models. Moreover, policyholders are provided with guaranteed minimum maturity and death benefits to protect against the downside risk. 
Unlike the existing literature, our variable annuity account value is linked to two non-Markovian processes: an equity index modeled by a rough Heston model and a force of mortality following a Volterra-type stochastic model. In this case, the early surrender feature introduces an optimal stopping problem where continuation values depend on the entire path history, rendering traditional numerical methods infeasible. We develop a deep signature Least Squares Monte Carlo approach to learn optimal surrender strategies on a discretized time grid. To mitigate the curse of dimensionality arising from the path-dependent model, we use truncated rough-path signatures to encode the historical paths and approximate the continuation values using a neural network. Numerically, we find that the fair fee increases with the Hurst parameters of both the stock volatility and the force of mortality. Finally, a convergence proof is provided to further support the stability of our method. 2026-04-01T04:34:38Z Wenyuan Li Haoqi Lyu http://arxiv.org/abs/2605.00862v1 Replication-Consistent Liquidity Forecasting for Derivatives -- Forward Funding Sensitivities and a Liquidity Valuation Adjustment for Settlement Lags 2026-04-21T14:43:28Z We study cash-flow forecasting for derivatives used in liquidity management and clarify its relation to risk-neutral valuation and replication. While it is well known that expectations under different measures (e.g., $\mathbb{P}$ vs. $\mathbb{Q}$) can yield different undiscounted cash-flows, further inconsistencies arise when payment times are stochastic. We show that using discounting sensitivities (funding-curve hedge ratios) instead of "expected cash-flows" aligns forecasting with the self-financing replication strategy and avoids measure-mixing/aggregation issues. 
We then illustrate how a standard valuation model delivers pathwise funding requirements and propose a simple liquidity valuation adjustment to capture settlement lags and related timing frictions. The note provides implementation hints (American Monte Carlo with adjoint differentiation) and clarifies when "expected cash-flows" are informative and when sensitivities should be used instead. 2026-04-21T14:43:28Z 34 pages Christian P. Fries http://arxiv.org/abs/2604.19290v1 Orthogonal reparametrization of the Nelson-Siegel-Svensson interest rate curve model: conditioning, diagnostics, and identifiability 2026-04-21T09:55:25Z The Nelson-Siegel-Svensson (NSS) interest rate curve model yields a separable nonlinear least-squares problem whose inner linear block is often ill-conditioned because the basis functions become nearly collinear. We analyze this instability via an exact orthogonal reparametrization of the design matrix. A thin QR decomposition produces orthogonal linear parameters for which, conditional on the nonlinear parameters, the Fisher information matrix is diagonal. We also derive a finite-horizon analytical orthogonalization: on $[0,T]$, the $4\times 4$ continuous Gram matrix has closed-form entries involving exponentials, logarithms, and the exponential integral $E_1$, yielding an explicit horizon-dependent orthogonal NSS basis. Together with Jacobian-rank and profile-likelihood arguments, this representation clarifies the degenerate manifold $λ_1=λ_2$, where the Svensson extension loses two degrees of freedom. Orthogonalization leaves the least-squares fit and uncertainty of the original linear parameters unchanged, but isolates the conditioning structure. When the decay parameters are estimated jointly, the full first-order covariance in orthogonal coordinates admits an explicit Schur-complement form. The approach also yields a scalar identifiability diagnostic through the QR element $R_{44}$ and separates model reduction from numerical instability. 
Synthetic experiments confirm that orthogonal parametrization eliminates correlations among the linear parameters and keeps their conditional uncertainty uniform. A daily U.S. Treasury study on a reduced fixed 9-tenor grid from 1981 to 2026 shows smoother orthogonal parameter series than classical NSS parameters while the moving QR basis remains nearly constant. 2026-04-21T09:55:25Z 28 pages, 10 figures Robert Flassig Emrah Gülay Daniel Guterding http://arxiv.org/abs/2508.20467v2 QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning 2026-04-21T03:38:08Z In the highly volatile and uncertain global financial markets, traditional quantitative trading models relying on statistical modeling or empirical rules often fail to adapt to dynamic market changes and black swan events due to rigid assumptions and limited generalization. To address these issues, this paper proposes QTMRL (Quantitative Trading Multi-Indicator Reinforcement Learning), an intelligent trading agent combining multi-dimensional technical indicators with reinforcement learning (RL) for adaptive and stable portfolio management. We first construct a comprehensive multi-indicator dataset using 23 years of S&P 500 daily OHLCV data (2000-2022) for 16 representative stocks across 5 sectors, enriching raw data with trend, volatility, and momentum indicators to capture holistic market dynamics. Then we design a lightweight RL framework based on the Advantage Actor-Critic (A2C) algorithm, including data processing, A2C algorithm, and trading agent modules to support policy learning and actionable trading decisions. Extensive experiments compare QTMRL with 9 baselines (e.g., ARIMA, LSTM, moving average strategies) across diverse market regimes, verifying its superiority in profitability, risk adjustment, and downside risk control. 
The code of QTMRL is publicly available at https://github.com/ChenJiahaoJNU/QTMRL.git 2025-08-28T06:37:41Z Jingfeng Pan Jiahao Chen http://arxiv.org/abs/2507.14808v3 Decoding RWA Tokenized U.S. Treasuries: Functional Dissection and Address Role Inference 2026-04-20T08:28:20Z Tokenized U.S. Treasuries have emerged as a prominent subclass of real-world assets (RWAs), offering cryptographically secured, yield-bearing instruments issued across multi-chain Web3 infrastructures, with growing significance for transparency, accessibility, and financial inclusion. While the market has expanded rapidly, empirical analyses of transaction-level behaviours remain limited. This paper conducts a quantitative, function-level dissection of U.S. Treasury-backed RWA tokens, including BUIDL, BENJI, and USDY, across multiple chains, mostly Ethereum and Layer-2s. Decoded contract calls expose core financial primitives such as issuance, redemption, transfer, and bridging, revealing patterns that distinguish institutional participants from smaller or retail users and that indicate the extent and limits of inclusivity in current RWA adoption. To infer address-level economic roles, we introduce a curvature-aware representation learning model. Our method outperforms baseline models in role inference on our collected U.S. Treasury transaction dataset and generalizes to address classification across broader public blockchain transaction datasets. The decoded transaction-level patterns in tokenized U.S. Treasuries across chains surface the degree of retail participation, and the role inference model enables the distinction between institutional treasuries, arbitrage bots, and retail traders based on behavioral patterns, facilitating more transparent, inclusive, and accountable Web3 finance in the future. 
2025-07-20T03:54:06Z accepted at the 8th edition of the IEEE International Conference on Blockchain and Cryptocurrency (ICBC 2026) Junliang Luo Katrin Tinn Samuel Ferreira Duran Di Wu Xue Liu http://arxiv.org/abs/2601.05290v2 Multi-Period Martingale Optimal Transport: Classical Theory, Neural Acceleration, and Financial Applications 2026-04-19T05:38:40Z This paper develops a computational framework for Multi-Period Martingale Optimal Transport (MMOT), addressing convergence rates, algorithmic efficiency, and financial calibration. Our contributions include: (1) Theoretical analysis: We establish discrete convergence rates of $O(\sqrt{Δt} \log(1/Δt))$ via Donsker's principle and linear algorithmic convergence of $(1-κ)^{2/3}$; (2) Algorithmic improvements: We introduce incremental updates ($O(M^2)$ complexity) and adaptive sparse grids; (3) Numerical implementation: A hybrid neural-projection solver is proposed, combining transformer-based warm-starting with Newton-Raphson projection. Once trained, the pure neural solver achieves a $1{,}597\times$ online inference speedup ($4.7$s $\to 2.9$ms) suitable for real-time applications, while the hybrid solver ensures martingale constraints to $10^{-6}$ precision. Validated on 12,000 synthetic instances (GBM, Merton, Heston) and 120 real market scenarios. 2026-01-07T21:10:29Z This preprint is being withdrawn by the authors. We identified errors in the reference list, including incorrect attribution of works to authors -- references and were cited inaccurately with wrong author arrangements and publication details. We are withdrawing the manuscript to correct these errors before any further dissemination. 
We apologize for the oversight Sri Sairam Gautam B http://arxiv.org/abs/2604.10005v2 What Happens When Institutional Liquidity Enters Prediction Markets: Identification, Measurement, and a Synthetic Proof of Concept 2026-04-17T23:06:18Z Prediction markets are starting to look less like crowd polls and more like electronic markets. The central question is therefore no longer only whether these markets forecast well, but what happens when institutional liquidity enters: do spreads tighten, does price discovery improve, and do those gains actually reach the traders who are slowest to react when information arrives? This paper offers a research design for answering that question. It defines a broad market-quality lens, separates the main channels through which institutional liquidity enters, and maps the identification problems that arise in live venue data. It also uses a synthetic microstructure laboratory as a proof of concept for the measurement pipeline. The main lesson of the synthetic exercise is deliberately narrow. Market-maker coverage, liquidity incentives, and automation do not have to work through the same channel; average liquidity gains do not have to translate into equal gains for all traders; and the sharpest welfare losses are most likely to appear in shock states, when slower takers receive the least pass-through of tighter quoted markets. The synthetic results are useful because they stress-test the design, not because they settle the live empirical question. 2026-04-11T03:27:43Z Shaw Dalen http://arxiv.org/abs/2604.14793v1 LR-Robot: An Human-in-the-Loop LLM Framework for Systematic Literature Reviews with Applications in Financial Research 2026-04-16T08:53:48Z The exponential growth of financial research has rendered traditional systematic literature reviews (SLRs) increasingly impractical, as manual screening and narrative synthesis struggle to keep pace with the scale and complexity of modern scholarship. 
Existing artificial intelligence (AI) and natural language processing (NLP) approaches often produce outputs that are efficient but contextually limited, still requiring substantial expert oversight. To address these challenges, we propose LR-Robot, a novel framework in which domain experts define multidimensional classification taxonomies and prompt constraints that encode conceptual boundaries, large language models (LLMs) execute scalable classification across large corpora, and systematic human-in-the-loop evaluation ensures reliability before full-dataset deployment. The framework further leverages retrieval-augmented generation (RAG) to support downstream analyses including temporal evolution tracking and label-enhanced citation networks. We demonstrate the framework on a corpus of 12,666 option pricing articles spanning 50 years, designing a four-dimensional taxonomy and systematically evaluating up to eleven mainstream LLMs across classification tasks of varying complexity. The results reveal the current capabilities of AI in understanding and synthesizing literature, uncover emerging trends, reveal structural research patterns, and highlight core research directions. By accelerating labor-intensive review stages while preserving interpretive accuracy, LR-Robot provides a practical, customizable, and high-quality approach for AI-assisted SLRs. 2026-04-16T08:53:48Z Wei Wei Jin Zheng Zining Wang Weibin Feng http://arxiv.org/abs/2604.14619v1 The Acoustic Camouflage Phenomenon: Re-evaluating Speech Features for Financial Risk Prediction 2026-04-16T04:59:36Z In computational paralinguistics, detecting cognitive load and deception from speech signals is a heavily researched domain. Recent efforts have attempted to apply these acoustic frameworks to corporate earnings calls to predict catastrophic stock market volatility. 
In this study, we empirically investigate the limits of acoustic feature extraction (pitch, jitter, and hesitation) when applied to highly trained speakers in in-the-wild teleconference environments. Utilizing a two-stream late-fusion architecture, we contrast an acoustic-based stream with a baseline Natural Language Processing (NLP) stream. The isolated NLP model achieved a recall of 66.25% for tail-risk downside events. Surprisingly, integrating acoustic features via late fusion significantly degraded performance, reducing recall to 47.08%. We identify this degradation as Acoustic Camouflage, where media-trained vocal regulation introduces contradictory noise that disrupts multimodal meta-learners. We present these findings as a boundary condition for speech processing applications in high-stakes financial forecasting. 2026-04-16T04:59:36Z Dhruvin Dungrani Disha Dungrani http://arxiv.org/abs/2512.20515v2 Modeling Bank Systemic Risk of Emerging Markets under Geopolitical Shocks: Empirical Evidence from BRICS Countries 2026-04-15T17:16:00Z In this study, we introduce an analytics framework, the Bank Risk Interlinkage with Dynamic Graph and Event Simulations (BRIDGES), to capture the systemic risks associated with the growing economic influence of the BRICS nations. This framework includes a Dynamic Time Warping (DTW) method to construct a dynamic network of 551 BRICS banks with their annual balance sheet data from 2008 to 2024; a trend analysis in risk ratios to detect shifts in banks' behavior; a Temporal Graph Neural Network (TGNN) to detect anomalous changes in the bank network's structural relationships; and Agent-Based Model (ABM) simulations to measure the impact of anomalous changes on network stability and assess the banking system's resilience to internal financial failure and external geopolitical shocks at the individual country level and across BRICS nations. Our simulation results highlight several important insights. 
The failure of the largest BRICS banks can cause more systemic damage than that of financially vulnerable or anomalous banks due to the panic effects. Moreover, compared to the failure of the largest BRICS banks, a geopolitical shock with correlated country-wide propagation can cause more systemic damage, resulting in a near-total systemic collapse. Our findings suggest that the panic over the failure of the largest BRICS banks and large-scale geopolitical shocks are the primary threats to the financial stability of the BRICS nations, which traditional bank risk analysis models might not detect. 2025-12-23T17:03:04Z 22 pages and 7 figures Haibo Wang http://arxiv.org/abs/2604.13311v1 Topological Complexity and Phase Space Stability: A Persistent Homology Approach to Cryptocurrency Risk 2026-04-14T21:24:40Z Traditional risk measures in finance, predominantly based on the second moment of return distributions or tail risk heuristics (VaR/CVaR), fail to account for the intrinsic geometric structure of market dynamics. This paper introduces a rigorous mathematical framework utilizing Topological Data Analysis (TDA) to quantify risk as the structural instability of the reconstructed phase space. By applying Takens' Delay Embedding Theorem to cryptocurrency log-returns, we generate a point cloud representation of the underlying attractor. We analyze the evolution of the filtration of Vietoris-Rips complexes to compute persistent homology groups $H_k$. We define a "Topological Persistence Norm" to characterize market regimes and propose a leverage calibration heuristic based on the persistence of 1-dimensional cycles. This approach provides a coordinate-free, stability-invariant metric for risk assessment that is robust to high-frequency noise. 2026-04-14T21:24:40Z Gabriel Santana Jemirson Ramirez
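The final entry above (arXiv:2604.13311) describes applying Takens' delay embedding to log-returns to obtain a point cloud. A minimal sketch of that one step is given here for orientation, under stated assumptions: the function names `log_returns` and `delay_embedding`, and the parameters `dim` and `tau`, are hypothetical and chosen for illustration; the abstract does not specify the embedding dimension, the delay, or any implementation, and the persistent homology computation is not shown.

```python
import math
from typing import List, Sequence, Tuple


def log_returns(prices: Sequence[float]) -> List[float]:
    """Log-returns r_t = ln(p_t / p_{t-1}) of a positive price series."""
    return [math.log(b / a) for a, b in zip(prices, prices[1:])]


def delay_embedding(
    series: Sequence[float], dim: int, tau: int
) -> List[Tuple[float, ...]]:
    """Delay-coordinate vectors (x_i, x_{i+tau}, ..., x_{i+(dim-1)*tau}).

    dim (embedding dimension) and tau (delay, in samples) are illustrative
    parameters, not values taken from the paper.
    """
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    n_points = len(series) - (dim - 1) * tau
    if n_points <= 0:
        return []  # series too short for the requested embedding
    return [
        tuple(series[i + k * tau] for k in range(dim))
        for i in range(n_points)
    ]


if __name__ == "__main__":
    # Toy prices, purely illustrative (not market data).
    prices = [100.0, 101.0, 99.5, 100.2, 102.0, 101.1]
    cloud = delay_embedding(log_returns(prices), dim=2, tau=1)
    print(len(cloud), "embedded points")
```

The resulting list of tuples is the kind of point cloud on which a Vietoris-Rips filtration could then be built with a dedicated TDA library.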