http://arxiv.org/abs/2505.14420v2 SAE-FiRE: Enhancing Earnings Surprise Predictions Through Sparse Autoencoder Feature Selection 2025-10-07T14:03:55Z

Predicting earnings surprises from financial documents, such as earnings conference calls, regulatory filings, and financial news, has become increasingly important in financial economics. However, these financial documents present significant analytical challenges, typically containing over 5,000 words with substantial redundancy and industry-specific terminology that creates obstacles for language models. In this work, we propose the SAE-FiRE (Sparse Autoencoder for Financial Representation Enhancement) framework to address these limitations by extracting key information while eliminating redundancy. SAE-FiRE employs Sparse Autoencoders (SAEs) to decompose dense neural representations from large language models into interpretable sparse components, then applies statistical feature selection methods, including ANOVA F-tests and tree-based importance scoring, to identify the top-k most discriminative dimensions for classification. By systematically filtering out noise that might otherwise lead to overfitting, we enable more robust and generalizable predictions. Experimental results across three financial datasets demonstrate that SAE-FiRE significantly outperforms baseline approaches.
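The selection step described in the abstract (score each sparse dimension with an ANOVA F-test, keep the top-k) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `anova_f_scores` and `top_k_features` are hypothetical, and the toy matrix `X` merely stands in for SAE activations.

```python
from statistics import mean


def anova_f_scores(X, y):
    """One-way ANOVA F statistic for each column of X, grouped by the labels in y."""
    n, d = len(X), len(X[0])
    classes = sorted(set(y))
    k = len(classes)
    scores = []
    for j in range(d):
        col = [row[j] for row in X]
        grand = mean(col)
        between = 0.0  # between-class sum of squares
        within = 0.0   # within-class sum of squares
        for c in classes:
            grp = [v for v, label in zip(col, y) if label == c]
            m = mean(grp)
            between += len(grp) * (m - grand) ** 2
            within += sum((v - m) ** 2 for v in grp)
        if within == 0.0:
            # Feature is constant inside every class: perfectly separating or useless.
            scores.append(float("inf") if between > 0 else 0.0)
        else:
            scores.append((between / (k - 1)) / (within / (n - k)))
    return scores


def top_k_features(X, y, k):
    """Indices of the k columns with the largest F statistic."""
    scores = anova_f_scores(X, y)
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]


# Toy stand-in for SAE activations (assumption): only column 1 separates the classes.
X = [
    [0.1, 0.0, 0.5],
    [0.2, 0.1, 0.4],
    [0.1, 0.9, 0.5],
    [0.2, 1.0, 0.4],
]
y = [0, 0, 1, 1]
selected = top_k_features(X, y, k=1)
```

In practice the selected columns would be passed to a downstream classifier; the abstract also mentions tree-based importance scoring, which is not shown here.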

2025-05-20T14:31:23Z Huopu Zhang Yanguang Liu Miao Zhang Zirui He Mengnan Du

http://arxiv.org/abs/2508.17906v2 FinReflectKG: Agentic Construction and Evaluation of Financial Knowledge Graphs 2025-10-07T11:01:58Z

The financial domain poses unique challenges for knowledge graph (KG) construction at scale due to the complexity and regulatory nature of financial documents. Despite the critical importance of structured financial knowledge, the field lacks large-scale, open-source datasets capturing rich semantic relationships from corporate disclosures. We introduce an open-source, large-scale financial knowledge graph dataset built from the latest annual SEC 10-K filings of all S&P 100 companies, a comprehensive resource designed to catalyze research in financial AI. We propose a robust and generalizable KG construction framework that integrates intelligent document parsing, table-aware chunking, and schema-guided iterative extraction with a reflection-driven feedback loop. Our system incorporates a comprehensive evaluation pipeline, combining rule-based checks, statistical validation, and LLM-as-a-Judge assessments to holistically measure extraction quality. We support three extraction modes (single-pass, multi-pass, and reflection-agent-based), allowing flexible trade-offs between efficiency, accuracy, and reliability based on user requirements. Empirical evaluations demonstrate that the reflection-agent-based mode consistently achieves the best balance, attaining a 64.8% compliance score against all rule-based policies (CheckRules) and outperforming baseline methods (single-pass and multi-pass) across key metrics such as precision, comprehensiveness, and relevance in LLM-guided evaluations.

2025-08-25T11:24:55Z Abhinav Arun Fabrizio Dimino Tejas Prakash Agarwal Bhaskarjit Sarmah Stefano Pasquali 10.1145/3768292.3770363

http://arxiv.org/abs/2510.05487v1 Smart Contract Adoption under Discrete Overdispersed Demand: A Negative Binomial Optimization Perspective 2025-10-07T01:04:19Z

Effective supply chain management under high-variance demand requires models that jointly address demand uncertainty and digital contracting adoption. Existing research often simplifies demand variability or treats adoption as an exogenous decision, limiting relevance in e-commerce and humanitarian logistics. This study develops an optimization framework combining dynamic Negative Binomial (NB) demand modeling with endogenous smart contract adoption. The NB process incorporates autoregressive dynamics in success probability to capture overdispersion and temporal correlation. Simulation experiments using four real-world datasets, including Delhivery Logistics and the SCMS Global Health Delivery system, apply maximum likelihood estimation and grid search to optimize adoption intensity and order quantity. Across all datasets, the NB specification outperforms Poisson and Gaussian benchmarks, with overdispersion indices exceeding 1.5. Forecasting comparisons show that while ARIMA and Exponential Smoothing achieve similar point accuracy, the NB model provides superior stability under high variance. Scenario analysis reveals that when dispersion exceeds a critical threshold (r > 6), increasing smart contract adoption above 70% significantly enhances profitability and service levels. This framework offers actionable guidance for balancing inventory costs, service levels, and implementation expenses, highlighting the importance of aligning digital adoption strategies with empirically observed demand volatility.
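The overdispersion index cited in the abstract is the variance-to-mean ratio of the demand counts (1 for a Poisson process, above 1 when overdispersed). A minimal sketch, assuming sample variance and a simple moment-matched static Negative Binomial fit; the paper itself uses maximum likelihood with autoregressive dynamics, which is not reproduced here, and the function names and `counts` data are hypothetical.

```python
from statistics import mean, variance


def dispersion_index(counts):
    """Variance-to-mean ratio of count data (sample variance)."""
    m = mean(counts)
    if m == 0:
        raise ValueError("dispersion index is undefined for an all-zero series")
    return variance(counts) / m


def nb_moment_fit(counts):
    """Method-of-moments fit of a static Negative Binomial.

    Parametrisation (assumption): mean = r(1-p)/p, variance = r(1-p)/p^2,
    so p = mean/variance and r = mean^2 / (variance - mean).
    """
    m, v = mean(counts), variance(counts)
    if v <= m:
        raise ValueError("no overdispersion: variance must exceed the mean")
    return m * m / (v - m), m / v  # (r, p)


# Made-up bursty daily order counts, for illustration only.
counts = [0, 1, 0, 7, 2, 0, 12, 1, 0, 5]
index = dispersion_index(counts)
r, p = nb_moment_fit(counts)
```

An index well above 1 is the signal that a Poisson benchmark would understate demand variance.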

2025-10-07T01:04:19Z 39 pages, 12 figures (7 in main manuscript). Under review at PLOS ONE (Manuscript ID: PONE-D-25-43426, submitted August 2025) Jinho Cha Sahng-Min Han Long Pham

http://arxiv.org/abs/2510.05475v1 From Classical Rationality to Contextual Reasoning: Quantum Logic as a New Frontier for Human-Centric AI in Finance 2025-10-07T00:31:19Z

We consider state-of-the-art applications of artificial intelligence (AI) in modelling human financial expectations and explore the potential of quantum logic to drive future advancements in this field. This analysis highlights the application of machine learning techniques, including reinforcement learning and deep neural networks, in financial statement analysis, algorithmic trading, portfolio management, and robo-advisory services. We further discuss the emergence and progress of quantum machine learning (QML) and advocate for broader exploration of the advantages provided by quantum-inspired neural networks.

2025-10-07T00:31:19Z 19 pages, 5 figures, preprint version. Forthcoming in: Journal of Quantum Economics and Finance Fabio Bagarello Francesco Gargano Polina Khrennikova

http://arxiv.org/abs/2510.04357v1 From News to Returns: A Granger-Causal Hypergraph Transformer on the Sphere 2025-10-05T20:51:59Z

We propose the Causal Sphere Hypergraph Transformer (CSHT), a novel architecture for interpretable financial time-series forecasting that unifies Granger-causal hypergraph structure, Riemannian geometry, and causally masked Transformer attention. CSHT models the directional influence of financial news and sentiment on asset returns by extracting multivariate Granger-causal dependencies, which are encoded as directional hyperedges on the surface of a hypersphere. Attention is constrained via angular masks that preserve both temporal directionality and geometric consistency. Evaluated on S&P 500 data from 2018 to 2023, including the 2020 COVID-19 shock, CSHT consistently outperforms baselines across return prediction, regime classification, and top-asset ranking tasks. By enforcing predictive causal structure and embedding variables in a Riemannian manifold, CSHT delivers both robust generalisation across market regimes and transparent attribution pathways from macroeconomic events to stock-level responses. These results suggest that CSHT is a principled and practical solution for trustworthy financial forecasting under uncertainty.

2025-10-05T20:51:59Z 6th ACM International Conference on AI in Finance Anoushka Harit Zhongtian Sun Jongmin Yu

http://arxiv.org/abs/2510.04092v1 Convergence in probability of numerical solutions of a highly non-linear delayed stochastic interest rate model 2025-10-05T08:30:35Z

We examine a delayed stochastic interest rate model with super-linearly growing coefficients and develop several new mathematical tools to establish the properties of its true and truncated Euler-Maruyama (EM) solutions. Moreover, we show that the truncated EM solutions converge to the true solution in probability as the step size tends to zero. Further, we support the convergence result with some illustrative numerical examples and use it to justify the Monte Carlo evaluation of some financial quantities.

2025-10-05T08:30:35Z Emmanuel Coffie

http://arxiv.org/abs/2505.08100v2 DeFi Liquidation Risk Modeling Using Geometric Brownian Motion 2025-10-04T08:01:51Z

In this paper, we propose an analytical method to compute the collateral liquidation probability in decentralized finance (DeFi) stablecoin single-collateral lending. Our approach models the collateral exchange rate as a zero-drift geometric Brownian motion, and derives the probability of it crossing the liquidation threshold. Unlike most existing methods that rely on computationally intensive simulations such as Monte Carlo, our formula provides a lightweight, exact solution. This advancement offers a more efficient alternative for risk assessment in DeFi platforms.
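For orientation, the textbook first-passage result for a zero-drift geometric Brownian motion gives a closed-form barrier-crossing probability of the kind the abstract describes. The sketch below uses that standard reflection-principle formula and is not taken from the paper; it assumes "zero drift" means dS = sigma * S * dW (so the log-price drifts at -sigma^2/2), and the function name and parameters are hypothetical.

```python
from math import erf, log, sqrt


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))


def liquidation_probability(s0, barrier, sigma, horizon):
    """P(min_{t <= horizon} S_t <= barrier) for dS = sigma * S * dW, S_0 = s0.

    With b = ln(barrier / s0) < 0 and T = horizon, the first-passage formula is
        Phi((b + sigma^2 T / 2) / (sigma sqrt(T)))
        + (s0 / barrier) * Phi((b - sigma^2 T / 2) / (sigma sqrt(T))).
    """
    if not (s0 > 0 and barrier > 0 and sigma > 0 and horizon > 0):
        raise ValueError("s0, barrier, sigma and horizon must all be positive")
    if barrier >= s0:
        return 1.0  # position is already at or below the liquidation threshold
    b = log(barrier / s0)
    vol = sigma * sqrt(horizon)
    half_var = 0.5 * sigma * sigma * horizon
    return norm_cdf((b + half_var) / vol) + (s0 / barrier) * norm_cdf((b - half_var) / vol)


# Illustrative inputs (assumptions): collateral at 2000, liquidation at 1500,
# 80% annualised volatility, 30-day horizon.
p_30d = liquidation_probability(2000.0, 1500.0, 0.8, 30 / 365)
p_90d = liquidation_probability(2000.0, 1500.0, 0.8, 90 / 365)
```

Evaluating the formula costs two normal-CDF calls, which is the efficiency argument against Monte Carlo path simulation.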

2025-05-12T22:13:44Z Timofei Belenko Georgii Vosorov

http://arxiv.org/abs/2509.24254v2 Extracting the Structure of Press Releases for Predicting Earnings Announcement Returns 2025-10-04T00:14:19Z

We examine how textual features in earnings press releases predict stock returns on earnings announcement days. Using over 138,000 press releases from 2005 to 2023, we compare traditional bag-of-words and BERT-based embeddings. We find that press release content (soft information) is as informative as earnings surprise (hard information), with FinBERT yielding the highest predictive power. Combining models enhances explanatory strength and the interpretability of press release content. Stock prices fully reflect the content of press releases at market open. A leaked press release would therefore offer a predictive advantage. Topic analysis reveals self-serving bias in managerial narratives. Our framework supports real-time return prediction through the integration of online learning, provides interpretability, and reveals the nuanced role of language in price formation.

2025-09-29T03:57:05Z 9 pages, 4 figures, 6 tables, Accepted by The 6th ACM International Conference on AI in Finance Yuntao Wu Ege Mert Akin Charles Martineau Vincent Grégoire Andreas Veneris 10.1145/3768292.3770344

http://arxiv.org/abs/2510.03209v1 Joint Bidding on Intraday and Frequency Containment Reserve Markets 2025-10-03T17:48:21Z

As renewable energy integration increases supply variability, battery energy storage systems (BESS) present a viable solution for balancing supply and demand. This paper proposes a novel approach for optimizing BESS participation in multiple electricity markets. We develop a joint bidding strategy that combines participation in the primary frequency reserve market with continuous trading in the intraday market, addressing a gap in the extant literature, which typically considers these markets in isolation or simplifies the continuous nature of intraday trading. Our approach utilizes a mixed integer linear programming implementation of the rolling intrinsic algorithm for intraday decisions and state of charge recovery, alongside a learned classifier strategy (LCS) that determines optimal capacity allocation between markets. A comprehensive out-of-sample backtest over more than one year of historical German market data validates our approach: the LCS increases overall profits by over 4% compared to the best-performing static strategy and by more than 3% over a naive dynamic benchmark. Crucially, our method closes the gap to a theoretical perfect foresight strategy to just 4%, demonstrating the effectiveness of dynamic, learning-based allocation in a complex, multi-market environment.

2025-10-03T17:48:21Z Yiming Zhang Wolfgang Ridinger David Wozabal

http://arxiv.org/abs/2510.02910v1 Joint Stochastic Optimal Control and Stopping in Aquaculture: Finite-Difference and PINN-Based Approaches 2025-10-03T11:27:49Z

This paper studies a joint stochastic optimal control and stopping (JCtrlOS) problem motivated by aquaculture operations, where the objective is to maximize farm profit through an optimal feeding strategy and harvesting time under stochastic price dynamics. We introduce a simplified aquaculture model capturing essential biological and economic features, distinguishing between biologically optimal and economically optimal feeding strategies. The problem is formulated as a Hamilton-Jacobi-Bellman variational inequality and corresponding free boundary problem. We develop two numerical solution approaches: First, a finite difference scheme that serves as a benchmark, and second, a Physics-Informed Neural Network (PINN)-based method, combined with a deep optimal stopping (DeepOS) algorithm to improve stopping time accuracy. Numerical experiments demonstrate that while finite differences perform well in medium-dimensional settings, the PINN approach achieves comparable accuracy and is more scalable to higher dimensions where grid-based methods become infeasible. The results confirm that jointly optimizing feeding and harvesting decisions outperforms strategies that neglect either control or stopping.

2025-10-03T11:27:49Z Working Paper Kevin Kamm

http://arxiv.org/abs/2510.02906v1 FinReflectKG -- MultiHop: Financial QA Benchmark for Reasoning with Knowledge Graph Evidence 2025-10-03T11:19:31Z

Multi-hop reasoning over financial disclosures is often a retrieval problem before it becomes a reasoning or generation problem: relevant facts are dispersed across sections, filings, companies, and years, and LLMs often expend excessive tokens navigating noisy context. Without precise Knowledge Graph (KG)-guided selection of relevant context, even strong reasoning models either fail to answer or consume excessive tokens, whereas KG-linked evidence enables models to focus their reasoning on composing already retrieved facts. We present FinReflectKG - MultiHop, a benchmark built on FinReflectKG, a temporally indexed financial KG that links audited triples to source chunks from S&P 100 filings (2022-2024). Mining frequent 2-3 hop subgraph patterns across sectors (via GICS taxonomy), we generate financial analyst style questions with exact supporting evidence from the KG. A two-phase pipeline first creates QA pairs via pattern-specific prompts, followed by a multi-criteria quality control evaluation to ensure QA validity. We then evaluate three controlled retrieval scenarios: (S1) precise KG-linked paths; (S2) text-only page windows centered on relevant text spans; and (S3) relevant page windows with randomizations and distractors. Across both reasoning and non-reasoning models, KG-guided precise retrieval yields substantial gains on the FinReflectKG - MultiHop QA benchmark dataset, boosting correctness scores by approximately 24 percent while reducing token utilization by approximately 84.5 percent compared to the page window setting, which reflects the traditional vector retrieval paradigm. Spanning intra-document, inter-year, and cross-company scopes, our work underscores the pivotal role of knowledge graphs in efficiently connecting evidence for multi-hop financial QA. We also release a curated subset of the benchmark (555 QA Pairs) to catalyze further research.

2025-10-03T11:19:31Z Abhinav Arun Reetu Raj Harsh Bhaskarjit Sarmah Stefano Pasquali

http://arxiv.org/abs/2502.00415v2 MarketSenseAI 2.0: Enhancing Stock Analysis through LLM Agents 2025-10-03T06:17:38Z

MarketSenseAI is a novel framework for holistic stock analysis which leverages Large Language Models (LLMs) to process financial news, historical prices, company fundamentals and the macroeconomic environment to support decision making in stock analysis and selection. In this paper, we present the latest advancements on MarketSenseAI, driven by rapid technological expansion in LLMs. Through a novel architecture combining Retrieval-Augmented Generation and LLM agents, the framework processes SEC filings and earnings calls, while enriching macroeconomic analysis through systematic processing of diverse institutional reports. We demonstrate a significant improvement in fundamental analysis accuracy over the previous version. Empirical evaluation on S&P 100 stocks over two years (2023-2024) shows MarketSenseAI achieving cumulative returns of 125.9% compared to the index return of 73.5%, while maintaining comparable risk profiles. Further validation on S&P 500 stocks during 2024 demonstrates the framework's scalability, delivering a 33.8% higher Sortino ratio than the market. This work marks a significant advancement in applying LLM technology to financial analysis, offering insights into the robustness of LLM-driven investment strategies.

2025-02-01T12:33:23Z 25 pages, 7 figures, Under review at Financial Innovation (FIN) George Fatouros Kostas Metaxas John Soldatos Manos Karathanassis

http://arxiv.org/abs/2408.01898v2 Efficient and accurate simulation of the stochastic-alpha-beta-rho model 2025-10-03T03:19:37Z

We propose an efficient, accurate and reliable simulation scheme for the stochastic-alpha-beta-rho (SABR) model. The two challenges of the SABR simulation lie in sampling (i) integrated variance conditional on terminal volatility and (ii) terminal forward price conditional on terminal volatility and integrated variance. For the first sampling procedure, we sample the conditional integrated variance using the moment-matched shifted lognormal approximation. For the second sampling procedure, we approximate the conditional terminal forward price as a constant-elasticity-of-variance (CEV) distribution. Our CEV approximation preserves the martingale condition and precludes arbitrage, which is a key advantage over Islah's approximation used in most SABR simulation schemes in the literature. We then adopt the exact sampling method of the CEV distribution based on the shifted-Poisson mixture Gamma random variable. Our enhanced procedures avoid the tedious Laplace inversion algorithm for sampling integrated variance and the inefficient inverse transform sampling of the forward price used in some of the earlier simulation schemes. Numerical results demonstrate our simulation scheme to be highly efficient, accurate, and reliable.

2024-08-04T01:48:11Z Jaehyuk Choi Lilian Hu Yue Kuen Kwok 10.1016/j.ejor.2025.09.027

http://arxiv.org/abs/2510.01887v1 FINCH: Financial Intelligence using Natural language for Contextualized SQL Handling 2025-10-02T10:55:11Z

Text-to-SQL, the task of translating natural language questions into SQL queries, has long been a central challenge in NLP. While progress has been significant, applying it to the financial domain remains especially difficult due to complex schema, domain-specific terminology, and high stakes of error. Despite this, there is no dedicated large-scale financial dataset to advance research, creating a critical gap. To address this, we introduce a curated financial dataset (FINCH) comprising 292 tables and 75,725 natural language-SQL pairs, enabling both fine-tuning and rigorous evaluation. Building on this resource, we benchmark reasoning models and language models of varying scales, providing a systematic analysis of their strengths and limitations in financial Text-to-SQL tasks. Finally, we propose a finance-oriented evaluation metric (FINCH Score) that captures nuances overlooked by existing measures, offering a more faithful assessment of model performance.

2025-10-02T10:55:11Z Avinash Kumar Singh Bhaskarjit Sarmah Stefano Pasquali

http://arxiv.org/abs/2510.01814v1 Mean-field theory of the Santa Fe model revisited: a systematic derivation from an exact BBGKY hierarchy for the zero-intelligence limit-order book model 2025-10-02T09:00:43Z

The Santa Fe model is an established econophysics model for describing stochastic dynamics of the limit order book from the viewpoint of the zero-intelligence approach. While its foundation was studied by combining a dimensional analysis and a mean-field theory by E. Smith et al. in Quantitative Finance 2003, their arguments are rather heuristic and lack a solid mathematical foundation; indeed, their mean-field equations were derived with heuristic arguments and their solutions were not explicitly obtained. In this work, we revisit the mean-field theory of the Santa Fe model from the viewpoint of kinetic theory -- a traditional mathematical program in statistical physics. We study the exact master equation for the Santa Fe model and systematically derive the Bogoliubov-Born-Green-Kirkwood-Yvon (BBGKY) hierarchical equation. By applying the mean-field approximation, we derive the mean-field equation for the order-book density profile, parallel to the Boltzmann equation in conventional statistical physics. Furthermore, we obtain explicit closed-form expressions for the mean-field solutions. Our solutions have several implications: (1) Our scaling formulas are available for both $\mu \to 0$ and $\mu \to \infty$ asymptotics, where $\mu$ is the market-order submission intensity. In particular, the mean-field theory works very well for small $\mu$, while its validity is partially limited for large $\mu$. (2) The "method of image" solution, heuristically derived by Bouchaud-Mézard-Potters in Quantitative Finance 2002, is obtained for large $\mu$, serving as a mathematical foundation for their heuristic arguments. (3) Finally, we point out an error in E. Smith et al. 2003 in the scaling law for the diffusion constant due to a misspecification in their dimensional analysis.

2025-10-02T09:00:43Z 40 pages, 10 figures Taiki Wakatsuki Kiyoshi Kanazawa