http://arxiv.org/abs/2510.01526v1 One More Question is Enough, Expert Question Decomposition (EQD) Model for Domain Quantitative Reasoning 2025-10-01T23:45:45Z

Domain-specific quantitative reasoning remains a major challenge for large language models (LLMs), especially in fields requiring expert knowledge and complex question answering (QA). In this work, we propose Expert Question Decomposition (EQD), an approach designed to balance the use of domain knowledge with computational efficiency. EQD is built on a two-step fine-tuning framework and guided by a reward function that measures the effectiveness of generated sub-questions in improving QA outcomes. It requires only a few thousand training examples and a single A100 GPU for fine-tuning, with inference time comparable to zero-shot prompting. Beyond its efficiency, EQD outperforms state-of-the-art domain-tuned models and advanced prompting strategies. We evaluate EQD in the financial domain, characterized by specialized knowledge and complex quantitative reasoning, across four benchmark datasets. Our method consistently improves QA performance by 0.6% to 10.5% across different LLMs. Our analysis reveals an important insight: in domain-specific QA, a single supporting question often provides greater benefit than detailed guidance steps.

2025-10-01T23:45:45Z Accepted by EMNLP 2025 Mengyu Wang Sotirios Sabanis Miguel de Carvalho Shay B. Cohen Tiejun Ma http://arxiv.org/abs/2510.01446v1 Can Machine Learning Algorithms Outperform Traditional Models for Option Pricing? 2025-10-01T20:37:18Z

This study investigates the application of machine learning techniques, specifically Neural Networks, Random Forests, and CatBoost for option pricing, in comparison to traditional models such as Black-Scholes and Heston Model. Using both synthetically generated data and real market option data, each model is evaluated in predicting the option price. The results show that machine learning models can capture complex, non-linear relationships in option prices and, in several cases, outperform both Black-Scholes and Heston models. These findings highlight the potential of data-driven methods to improve pricing accuracy and better reflect market dynamics.
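
For reference, the Black-Scholes baseline named in this abstract has a closed form for European options. The sketch below is a minimal textbook version, assuming a non-dividend-paying underlying and a constant rate and volatility; the function and parameter names are illustrative and are not taken from the paper's code.

```python
import math


def norm_cdf(x: float) -> float:
    """Standard normal CDF, expressed through the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_price(spot: float, strike: float, rate: float,
                        vol: float, maturity: float, kind: str = "call") -> float:
    """Black-Scholes price of a European option (no dividends assumed)."""
    if min(spot, strike, vol, maturity) <= 0:
        raise ValueError("spot, strike, vol and maturity must be positive")
    sqrt_t = math.sqrt(maturity)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol ** 2) * maturity) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    discount = math.exp(-rate * maturity)
    if kind == "call":
        return spot * norm_cdf(d1) - strike * discount * norm_cdf(d2)
    if kind == "put":
        return strike * discount * norm_cdf(-d2) - spot * norm_cdf(-d1)
    raise ValueError("kind must be 'call' or 'put'")
```

With spot 100, strike 100, rate 5%, volatility 20% and one year to maturity, the call price is about 10.45, and the call and put prices satisfy put-call parity. A learned model would typically be benchmarked against prices of this kind.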

2025-10-01T20:37:18Z 9 pages, 1 figure. Code available at: https://github.com/GeorgMil29/OptionPricing Georgy Milyushkov http://arxiv.org/abs/2510.15921v1 Spiking Neural Network for Cross-Market Portfolio Optimization in Financial Markets: A Neuromorphic Computing Approach 2025-10-01T19:13:44Z

Cross-market portfolio optimization has become increasingly complex with the globalization of financial markets and the growth of high-frequency, multi-dimensional datasets. Traditional artificial neural networks, while effective in certain portfolio management tasks, often incur substantial computational overhead and lack the temporal processing capabilities required for large-scale, multi-market data. This study investigates the application of Spiking Neural Networks (SNNs) for cross-market portfolio optimization, leveraging neuromorphic computing principles to process equity data from both the Indian (Nifty 500) and US (S&P 500) markets. A five-year dataset comprising approximately 1,250 trading days of daily stock prices was systematically collected via the Yahoo Finance API. The proposed framework integrates Leaky Integrate-and-Fire neuron dynamics with adaptive thresholding, spike-timing-dependent plasticity, and lateral inhibition to enable event-driven processing of financial time series. Dimensionality reduction is achieved through hierarchical clustering, while population-based spike encoding and multiple decoding strategies support robust portfolio construction under realistic trading constraints, including cardinality limits, transaction costs, and adaptive risk aversion. Experimental evaluation demonstrates that the SNN-based framework delivers superior risk-adjusted returns and reduced volatility compared to ANN benchmarks, while substantially improving computational efficiency. These findings highlight the promise of neuromorphic computation for scalable, efficient, and robust portfolio optimization across global financial markets.
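
To make the neuron model concrete, the following is a minimal sketch of a single Leaky Integrate-and-Fire unit with an adaptive threshold, integrated with forward Euler steps. It is not the paper's implementation: all parameter names and default values are assumptions chosen for illustration, and spike-timing-dependent plasticity and lateral inhibition are omitted.

```python
def simulate_lif(currents, dt=1.0, tau=10.0, v_rest=0.0, v_reset=0.0,
                 base_threshold=1.0, threshold_jump=0.5, tau_threshold=50.0):
    """Simulate one LIF neuron; returns a 0/1 spike train, one entry per input.

    All defaults are illustrative assumptions, not values from the paper.
    """
    v = v_rest
    threshold = base_threshold
    spikes = []
    for current in currents:
        # Leaky integration: the membrane potential decays toward v_rest
        # while being driven by the input current.
        v += (dt / tau) * (-(v - v_rest) + current)
        # Adaptive threshold: relax back toward the baseline value.
        threshold += (dt / tau_threshold) * (base_threshold - threshold)
        if v >= threshold:
            spikes.append(1)
            v = v_reset                  # reset the membrane after a spike
            threshold += threshold_jump  # make the next spike harder
        else:
            spikes.append(0)
    return spikes
```

A constant input below the baseline threshold never produces a spike, while a stronger input yields a spike train whose rate is damped by the rising threshold. This is the event-driven behavior that an encoding of price series into spikes would rely on.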

2025-10-01T19:13:44Z Amarendra Mohan IIT Kharagpur Ameer Tamoor Khan University of Copenhagen Shuai Li University of Oulu Xinwei Cao Jiangnan University Zhibin Li Chengdu University of Information Technology http://arxiv.org/abs/2510.00205v1 Quantifying Semantic Shift in Financial NLP: Robust Metrics for Market Prediction Stability 2025-09-30T19:25:54Z

Financial news is essential for accurate market prediction, but evolving narratives across macroeconomic regimes introduce semantic and causal drift that weaken model reliability. We present an evaluation framework to quantify robustness in financial NLP under regime shifts. The framework defines four metrics: (1) Financial Causal Attribution Score (FCAS) for alignment with causal cues, (2) Patent Cliff Sensitivity (PCS) for sensitivity to semantic perturbations, (3) Temporal Semantic Volatility (TSV) for drift in latent text representations, and (4) NLI-based Logical Consistency Score (NLICS) for entailment coherence. Applied to LSTM and Transformer models across four economic periods (pre-COVID, COVID, post-COVID, and rate hike), the metrics reveal performance degradation during crises. Semantic volatility and Jensen-Shannon divergence correlate with prediction error. Transformers are more affected by drift, while feature-enhanced variants improve generalisation. A GPT-4 case study confirms that alignment-aware models better preserve causal and logical consistency. The framework supports auditability, stress testing, and adaptive retraining in financial AI systems.
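
The Jensen-Shannon divergence mentioned above is a symmetric, bounded measure of distance between two probability distributions. The sketch below is a minimal version using base-2 logarithms, so the result lies in [0, 1]. Which distributions are compared (for example, token or topic histograms from two economic periods) is an assumption made here for illustration; the abstract does not specify it.

```python
import math


def _normalize(weights):
    """Scale non-negative weights so that they sum to one."""
    total = float(sum(weights))
    if total <= 0 or any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative with a positive sum")
    return [w / total for w in weights]


def _kl_divergence(p, q):
    """Kullback-Leibler divergence in bits; terms with p_i = 0 contribute zero."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence between two discrete distributions."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    p, q = _normalize(p), _normalize(q)
    # The mixture is positive wherever p or q is, so the KL terms are finite.
    mixture = [(pi + qi) / 2.0 for pi, qi in zip(p, q)]
    return 0.5 * _kl_divergence(p, mixture) + 0.5 * _kl_divergence(q, mixture)
```

Identical distributions give 0 and distributions with disjoint support give 1, so a rising value between consecutive periods can be read as increasing drift.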

2025-09-30T19:25:54Z The 6th ACM International Conference on AI in Finance Zhongtian Sun Chenghao Xiao Anoushka Harit Jongmin Yu http://arxiv.org/abs/2509.25055v2 AlphaSAGE: Structure-Aware Alpha Mining via GFlowNets for Robust Exploration 2025-09-30T15:20:32Z

The automated mining of predictive signals, or alphas, is a central challenge in quantitative finance. While Reinforcement Learning (RL) has emerged as a promising paradigm for generating formulaic alphas, existing frameworks are fundamentally hampered by a triad of interconnected issues. First, they suffer from reward sparsity, where meaningful feedback is only available upon the completion of a full formula, leading to inefficient and unstable exploration. Second, they rely on semantically inadequate sequential representations of mathematical expressions, failing to capture the structure that determines an alpha's behavior. Third, the standard RL objective of maximizing expected returns inherently drives policies towards a single optimal mode, directly contradicting the practical need for a diverse portfolio of non-correlated alphas. To overcome these challenges, we introduce AlphaSAGE (Structure-Aware Alpha Mining via Generative Flow Networks for Robust Exploration), a novel framework built upon three cornerstone innovations: (1) a structure-aware encoder based on Relational Graph Convolutional Network (RGCN); (2) a new framework with Generative Flow Networks (GFlowNets); and (3) a dense, multi-faceted reward structure. Empirical results demonstrate that AlphaSAGE outperforms existing baselines in mining a more diverse, novel, and highly predictive portfolio of alphas, thereby proposing a new paradigm for automated alpha mining. Our code is available at https://github.com/BerkinChen/AlphaSAGE.

2025-09-29T17:06:07Z Binqi Chen Hongjun Ding Ning Shen Jinsheng Huang Taian Guo Luchen Liu Ming Zhang http://arxiv.org/abs/2308.07029v2 A discretization scheme for path-dependent FBSDEs and PDEs 2025-09-30T12:39:40Z

This study develops a numerical scheme for path-dependent FBSDEs and PDEs. We introduce a Picard iteration method for solving path-dependent FBSDEs, prove its convergence to the true solution, and establish its rate of convergence. A key contribution of our approach is a novel estimator for the martingale integrand in the FBSDE, specifically designed to handle path-dependence more reliably than existing methods. We derive a concentration inequality that quantifies the statistical error of this estimator in a Monte Carlo framework. Based on these results, we investigate a supervised learning method with neural networks for solving path-dependent PDEs. The proposed algorithm is fully implementable and adaptable to a broad class of path-dependent problems.

2023-08-14T09:36:27Z Jiuk Jang Hyungbin Park http://arxiv.org/abs/2509.24144v1 From Headlines to Holdings: Deep Learning for Smarter Portfolio Decisions 2025-09-29T00:42:24Z

Deep learning offers new tools for portfolio optimization. We present an end-to-end framework that directly learns portfolio weights by combining Long Short-Term Memory (LSTM) networks to model temporal patterns, Graph Attention Networks (GAT) to capture evolving inter-stock relationships, and sentiment analysis of financial news to reflect market psychology. Unlike prior approaches, our model unifies these elements in a single pipeline that produces daily allocations. It avoids the traditional two-step process of forecasting asset returns and then applying mean-variance optimization (MVO), a sequence that can introduce instability. We evaluate the framework on nine U.S. stocks spanning six sectors, chosen to balance sector diversity and news coverage. In this setting, the model delivers higher cumulative returns and Sharpe ratios than equal-weighted and CAPM-based MVO benchmarks. Although the stock universe is limited, the results underscore the value of integrating price, relational, and sentiment signals for portfolio management and suggest promising directions for scaling the approach to larger, more diverse asset sets.

2025-09-29T00:42:24Z 22 pages, 9 figures Yun Lin Jiawei Lou Jinghe Zhang http://arxiv.org/abs/2402.18764v2 An Analytical Approach to (Meta)Relational Models Theory, and its Application to Triple Bottom Line (Profit, People, Planet) -- Towards Social Relations Portfolio Management 2025-09-28T09:06:35Z

Investigating the optimal nature of social interactions among actors (e.g., people or firms), who seek to achieve certain mutually-agreed objectives, has been the subject of extensive academic research. Using the relational models theory (describing all social interactions as combinations of four basic sociality ingredients: Communal Sharing, Authority Ranking, Equality Matching, and Market Pricing), the common approach revolves around qualitative arguments for determining sociality configurations most effective in realizing specific purposes, at times supplemented by empirical data. In the current treatment, we formulate this question as a mathematical optimization problem, in order to quantitatively derive the most suitable combination of sociality forms for dyadic actors, which optimizes their mutually-agreed objective. For this purpose, we develop an analytical framework of the (meta)relational models theory, and demonstrate that combining the four sociality forms to define a specific meaningful social situation inevitably prompts an inherent tension among them, codified by a single elementary and universal metarelation. In analogy with financial portfolio management, we subsequently introduce the concept of Social Relations Portfolio (SRP) management, and propose a generalizable methodology capable of quantitatively identifying the efficient SRP, which, in turn, enables effective stakeholder and change management initiatives. As an important illustration, the methodology is applied to the Triple Bottom Line (Profit, People, Planet) paradigm to derive its efficient SRP. This serves as a guide to practitioners for precisely measuring, monitoring, reporting and steering stakeholder and change management efforts concerning Corporate Social Responsibility (CSR) and Environmental, Social and Governance (ESG) within and / or across organizations.

2024-02-29T00:12:55Z Ver. 2: feedback from community experts incorporated, along with refinements in the text and updated acknowledgement. 41 pages, 8 pdf figures and tables Arsham Farzinnia Corine Boon http://arxiv.org/abs/2509.23557v1 SIMPOL Model for Solving Continuous-Time Heterogeneous Agent Problems 2025-09-28T01:37:29Z

This paper presents SIMPOL (Simplified Policy Iteration), a modular numerical framework for solving continuous-time heterogeneous agent models. The core economic problem, the optimization of consumption and savings under idiosyncratic uncertainty, is formulated as a coupled system of partial differential equations: a Hamilton-Jacobi-Bellman (HJB) equation for the agent's optimal policy and a Fokker-Planck-Kolmogorov (FPK) equation for the stationary wealth distribution. SIMPOL addresses this system using Howard's policy iteration with an *upwind* finite difference scheme that guarantees stability. A distinctive contribution is a novel consumption policy post-processing module that imposes regularity through smoothing and a projection onto an economically plausible slope band, improving convergence and model behavior. The robustness and accuracy of SIMPOL are validated through a set of integrated diagnostics, including verification of contraction in the Wasserstein-2 metric and comparison with the analytical solution of the Merton model in the no-volatility case. The framework is shown to be not only computationally efficient but also to produce solutions consistent with economic and mathematical theory, offering a reliable tool for research in quantitative macroeconomics.

2025-09-28T01:37:29Z Code available at https://doi.org/10.5281/zenodo.17216748 Ricardo Alonzo Fernández Salguero http://arxiv.org/abs/2509.22985v1 Forecasting Liquidity Withdraw with Machine Learning Models 2025-09-26T22:35:55Z

Liquidity withdrawal is a critical indicator of market fragility. In this project, we test a framework for forecasting liquidity withdrawal at the individual-stock level, ranging from less liquid stocks to highly liquid large-cap tickers, and evaluate the relative performance of competing model classes in predicting short-horizon order book stress. We introduce the Liquidity Withdrawal Index (LWI) -- defined as the ratio of order cancellations to the sum of standing depth and new additions at the best quotes -- as a bounded, interpretable measure of transient liquidity removal. Using Nasdaq market-by-order (MBO) data, we compare a spectrum of approaches: linear benchmarks (AR, HAR), and non-linear tree ensembles (XGBoost), across horizons ranging from 250 ms to 5 s. Beyond predictive accuracy, our results provide insights into order placement and cancellation dynamics, identify regimes where linear versus non-linear signals dominate, and highlight how early-warning indicators of liquidity withdrawal can inform both market surveillance and execution.
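
The LWI definition above translates directly into a ratio. The sketch below is a minimal reading of that sentence, not the author's code: the argument names, the choice to measure all three quantities in the same volume units over one interval at the best quotes, and the value returned for an empty book are assumptions.

```python
def liquidity_withdrawal_index(cancelled: float, standing_depth: float,
                               added: float) -> float:
    """Ratio of cancellations to (standing depth + new additions).

    Inputs are volumes at the best quotes over one interval (assumed units).
    """
    if min(cancelled, standing_depth, added) < 0:
        raise ValueError("volumes must be non-negative")
    available = standing_depth + added
    if available == 0:
        # Assumption: with nothing resting or added, report no withdrawal.
        return 0.0
    return cancelled / available
```

For example, 30 units cancelled against 80 units of standing depth and 20 units of new additions gives an index of 0.3. Because cancellations cannot exceed what was resting or added during the interval, the value stays between 0 and 1 under these assumptions.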

2025-09-26T22:35:55Z Haochuan Kevin Wang http://arxiv.org/abs/2509.05676v2 Design and hedging of unit linked life insurance with environmental factors 2025-09-26T12:24:31Z

We study the problem of designing and hedging unit-linked life policies whose benefits depend on an investment fund that incorporates environmental criteria in its selection process. Offering these products poses two key challenges: constructing a green investment fund and developing a hedging strategy for policies written on that fund. We address these two problems separately. First, we design a portfolio selection rule driven by firms' carbon intensity that endogenously selects assets and avoids ad hoc pre-screens based on ESG scores. The effectiveness of our new portfolio selection method is tested using real market data. Second, we adopt the perspective of an insurance company issuing unit-linked policies written on this fund. Such contracts are exposed to market, carbon, and mortality risk, which the insurer seeks to hedge. Due to market incompleteness, we address the hedging problem via a quadratic approach aimed at minimizing the tracking error. We also conduct a numerical analysis to assess the performance of the hedging strategy. For our simulation study, we use an efficient weak second-order scheme that allows for variance reduction.

2025-09-06T10:55:09Z 38 pages Katia Colaneri Alessandra Cretarola Edoardo Lombardo Daniele Mancinelli http://arxiv.org/abs/2505.15155v2 R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization 2025-09-25T10:13:08Z

Financial markets pose fundamental challenges for asset return prediction due to their high dimensionality, non-stationarity, and persistent volatility. Despite advances in large language models and multi-agent systems, current quantitative research pipelines suffer from limited automation, weak interpretability, and fragmented coordination across key components such as factor mining and model innovation. In this paper, we propose R&D-Agent for Quantitative Finance, in short RD-Agent(Q), the first data-centric multi-agent framework designed to automate the full-stack research and development of quantitative strategies via coordinated factor-model co-optimization. RD-Agent(Q) decomposes the quant process into two iterative stages: a Research stage that dynamically sets goal-aligned prompts, formulates hypotheses based on domain priors, and maps them to concrete tasks, and a Development stage that employs a code-generation agent, Co-STEER, to implement task-specific code, which is then executed in real-market backtests. The two stages are connected through a feedback stage that thoroughly evaluates experimental outcomes and informs subsequent iterations, with a multi-armed bandit scheduler for adaptive direction selection. Empirically, RD-Agent(Q) achieves up to 2X higher annualized returns than classical factor libraries using 70% fewer factors, and outperforms state-of-the-art deep time-series models on real markets. Its joint factor-model optimization delivers a strong balance between predictive accuracy and strategy robustness. Our code is available at: https://github.com/microsoft/RD-Agent.

2025-05-21T06:20:56Z 42 pages, 11 figures, NeurIPS 2025 Yuante Li Xu Yang Xiao Yang Minrui Xu Xisen Wang Weiqing Liu Jiang Bian http://arxiv.org/abs/2509.20239v1 Error Propagation in Dynamic Programming: From Stochastic Control to Option Pricing 2025-09-24T15:30:19Z

This paper investigates theoretical and methodological foundations for stochastic optimal control (SOC) in discrete time. We start by formulating the control problem in a general dynamic programming framework, introducing the mathematical structure needed for a detailed convergence analysis. The associated value function is estimated through a sequence of approximations combining nonparametric regression methods and Monte Carlo subsampling. The regression step is performed within reproducing kernel Hilbert spaces (RKHSs), exploiting the classical KRR algorithm, while Monte Carlo sampling methods are introduced to estimate the continuation value. To assess the accuracy of our value function estimator, we propose a natural error decomposition and rigorously control the resulting error terms at each time step. We then analyze how this error propagates backward in time, from maturity to the initial stage, a relatively underexplored aspect of the SOC literature. Finally, we illustrate how our analysis naturally applies to a key financial application: the pricing of American options.
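
As a reading aid, the backward recursion and regression step described above can be written in the standard textbook form for an American-style option. The notation is an assumption for illustration and is not taken from the paper: $g$ is the (discounted) payoff, $X_t$ the state process, $\mathcal{H}$ an RKHS, and $\lambda > 0$ the ridge penalty.

```latex
% Dynamic programming recursion: exercise now or continue.
V_T(x) = g(x), \qquad
V_t(x) = \max\Bigl\{ g(x),\; \mathbb{E}\bigl[ V_{t+1}(X_{t+1}) \mid X_t = x \bigr] \Bigr\},
\quad t = T-1, \dots, 0.

% Kernel ridge regression estimate of the continuation value from
% n simulated pairs (x_i, y_i), where y_i approximates V_{t+1}(X_{t+1}).
\hat{f}_{t} = \arg\min_{f \in \mathcal{H}}
\frac{1}{n} \sum_{i=1}^{n} \bigl( y_i - f(x_i) \bigr)^2
+ \lambda \lVert f \rVert_{\mathcal{H}}^2 .
```

Since each $\hat{f}_t$ is fitted to targets built from the estimate at time $t+1$, an error made near maturity feeds into every earlier step. This backward propagation is what the abstract sets out to control.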

2025-09-24T15:30:19Z Andrea Della Vecchia Damir Filipović http://arxiv.org/abs/2509.19663v1 Long-Range Dependence in Financial Markets: Empirical Evidence and Generative Modeling Challenges 2025-09-24T00:41:14Z

This study presents a comprehensive empirical investigation of the presence of long-range dependence (LRD) in the dynamics of major U.S. stock market indexes (S&P 500, Dow Jones, and Nasdaq) at daily, weekly, and monthly frequencies. We employ three distinct methods: the classical rescaled range (R/S) analysis, the more robust detrended fluctuation analysis (DFA), and a sophisticated ARFIMA-FIGARCH model with Student's $t$-distributed innovations. Our results confirm the presence of LRD, primarily driven by long memory in volatility rather than in the mean returns. Building on these findings, we explore the capability of a modern deep learning approach, Quant generative adversarial networks (GANs), to learn and replicate the LRD observed in the empirical data. While Quant GANs effectively capture heavy-tailed distributions and some aspects of volatility clustering, they suffer from significant limitations in reproducing the LRD, particularly at higher frequencies. This work highlights the challenges and opportunities in using data-driven models for generating realistic financial time series that preserve complex temporal dependencies.
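
The classical R/S analysis named above estimates a Hurst exponent from how the rescaled range grows with window length. The sketch below is a minimal textbook version, assuming non-overlapping windows and an ordinary least-squares fit in log-log space. The function names and window sizes are illustrative, and the paper's exact procedure (for example, any small-sample correction) may differ.

```python
import math


def rescaled_range(series):
    """R/S statistic of one window: range of cumulative deviations over std."""
    n = len(series)
    mean = sum(series) / n
    running, cumulative = 0.0, []
    for value in series:
        running += value - mean
        cumulative.append(running)
    spread = max(cumulative) - min(cumulative)
    std = math.sqrt(sum((value - mean) ** 2 for value in series) / n)
    return spread / std if std > 0 else 0.0


def hurst_rs(series, window_sizes):
    """Slope of log(mean R/S) against log(window size), fitted by OLS."""
    points = []
    for size in window_sizes:
        # Split the series into non-overlapping windows of this size.
        windows = [series[i:i + size] for i in range(0, len(series) - size + 1, size)]
        values = [rs for rs in map(rescaled_range, windows) if rs > 0]
        if values:
            points.append((math.log(size), math.log(sum(values) / len(values))))
    if len(points) < 2:
        raise ValueError("need at least two usable window sizes")
    mean_x = sum(x for x, _ in points) / len(points)
    mean_y = sum(y for _, y in points) / len(points)
    slope_num = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope_den = sum((x - mean_x) ** 2 for x, _ in points)
    return slope_num / slope_den
```

An estimate near 0.5 is consistent with no long memory, while values well above 0.5 suggest persistence. Applying the function to absolute or squared returns rather than raw returns is one way to probe long memory in volatility.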

2025-09-24T00:41:14Z 25 pages, 22 figures, 7 tables Yifan He Svetlozar Rachev http://arxiv.org/abs/2509.19628v1 Multimodal Language Models with Modality-Specific Experts for Financial Forecasting from Interleaved Sequences of Text and Time Series 2025-09-23T22:40:31Z

Text and time series data offer complementary views of financial markets: news articles provide narrative context about company events, while stock prices reflect how markets react to those events. However, despite their complementary nature, effectively integrating these interleaved modalities for improved forecasting remains challenging. In this work, we propose a unified neural architecture that models these interleaved sequences using modality-specific experts, allowing the model to learn unique time series patterns, while still enabling joint reasoning across modalities and preserving pretrained language understanding capabilities. To further improve multimodal understanding, we introduce a cross-modal alignment framework with a salient token weighting mechanism that learns to align representations across modalities with a focus on the most informative tokens. We demonstrate the effectiveness of our approach on a large-scale financial forecasting task, achieving state-of-the-art performance against a wide variety of strong unimodal and multimodal baselines. We develop an interpretability method that reveals insights into the value of time series context and reinforces the design of our cross-modal alignment objective. Finally, we demonstrate that these improvements translate to meaningful economic gains in investment simulations.

2025-09-23T22:40:31Z Preprint Ross Koval Nicholas Andrews Xifeng Yan