https://arxiv.org/api/dJYL85tC3j8fG0QwbipuNcny1gE 2026-03-20T08:58:23Z 3124 90 15 http://arxiv.org/abs/2209.10166v5 Chaotic Hedging with Iterated Integrals and Neural Networks 2026-01-27T17:46:03Z In this paper, we derive an $L^p$-chaos expansion based on iterated Stratonovich integrals with respect to a given exponentially integrable continuous semimartingale. By omitting the orthogonality of the expansion, we show that every $p$-integrable functional, $p \in [1,\infty)$, can be approximated by a finite sum of iterated Stratonovich integrals. Using (possibly random) neural networks as integrands, we therefore obtain universal approximation results for $p$-integrable financial derivatives in the $L^p$-sense. Moreover, we can approximately solve the $L^p$-hedging problem (coinciding for $p = 2$ with the quadratic hedging problem), where the approximating hedging strategy can be computed in closed form within short runtime. 2022-09-21T07:57:07Z Ariel Neufeld Philipp Schmocker http://arxiv.org/abs/2602.00121v1 A Prior-Predictive Monte Carlo Framework for Pricing Complex Data Products in Data-Poor Markets 2026-01-27T16:14:04Z Pricing advanced data products - particularly in complex fields such as semiconductor manufacturing - is a fundamentally challenging task due to the sparsity of publicly available transaction data, and its frequent heterogeneity and confidentiality. While data value depends on multiple interacting factors, such as technical sophistication, quality, utility, and licensing rights, traditional pricing methods tend to rely on ad-hoc heuristics or require massive amounts of historical transaction data. In an increasingly data-based economy, we introduce a prior-predictive Monte Carlo framework that enables the generation of fair, consistent, and justified price ranges for data products in the absence of empirical data. 
By simulating many plausible pricing 'worlds' and deal configurations, the framework produces stable probabilistic price bands (e.g., P5/P50/P95) rather than single point estimates, creating an auditable and repeatable probabilistic pricing system with business realism enforced via constraint-truncated priors. The proposed model bridges traditional data pricing rooted in professional experience with a data-based approach that also allows for classical Bayesian updating as more transaction data is accumulated. 2026-01-27T16:14:04Z Adam L. Siemiatkowski Victor Zhirnov Kashyap Yellai Gabriella Bein Terresa Zimmerman http://arxiv.org/abs/2601.19504v1 Generating Alpha: A Hybrid AI-Driven Trading System Integrating Technical Analysis, Machine Learning and Financial Sentiment for Regime-Adaptive Equity Strategies 2026-01-27T11:44:47Z The intricate behavior patterns of financial markets are influenced by fundamental, technical, and psychological factors. Periods of high volatility and regime shifts cause many traditional strategies, such as trend-following or mean-reversion, to fail. This paper proposes a hybrid AI-based trading strategy that combines (1) trend-following and directional momentum capture via EMA and MACD, (2) detection of price normalization through mean-reversion using RSI and Bollinger Bands, (3) market psychological interpretation through sentiment analysis using FinBERT, (4) signal generation through machine learning using XGBoost and (5) dynamically adjusting exposure with market regime filtering based on volatility and return environments. The system achieved a final portfolio value of $235,492.83, yielding a return of 135.49% on initial investment over a period of 24 months. The hybrid model outperformed major benchmark indexes like S&P 500 and NASDAQ-100 over the same period, showing strong flexibility and lower downside risk with superior profits, validating the use of multi-modal AI in algorithmic trading. 2026-01-27T11:44:47Z Preprint. 
Full version of an accepted conference paper (ComSIA 2026) Varun Narayan Kannan Pillai Akshay Ajith Sumesh K J http://arxiv.org/abs/2601.19321v1 Predictive Accuracy versus Interpretability in Energy Markets: A Copula-Enhanced TVP-SVAR Analysis 2026-01-27T08:04:16Z This paper investigates whether structural econometric models can rival machine learning in forecasting energy--macro dynamics while retaining causal interpretability. Using monthly data from 1999 to 2025, we develop a unified framework that integrates Time-Varying Parameter Structural VARs (TVP-SVAR) with advanced dependence structures, including DCC-GARCH, t-copulas, and mixed Clayton--Frank--Gumbel copulas. These models are empirically evaluated against leading machine learning techniques (Gaussian Process Regression (GPR), Artificial Neural Networks, Random Forests, and Support Vector Regression) across seven macro-financial and energy variables, with Brent crude oil as the central asset. The findings reveal three major insights. First, TVP-SVAR consistently outperforms standard VAR models, confirming structural instability in energy transmission channels. Second, copula-based extensions capture non-linear and tail dependence more effectively than symmetric DCC models, particularly during periods of macroeconomic stress. Third, despite their methodological differences, copula-enhanced econometric models and GPR achieve statistically equivalent predictive accuracy (t-test p = 0.8444). However, only the econometric approach provides interpretable impulse responses, regime shifts, and tail-risk diagnostics. We conclude that machine learning can replicate predictive performance but cannot substitute for the explanatory power of structural econometrics. This synthesis offers a pathway where AI accuracy and economic interpretability jointly inform energy policy and risk management. 
2026-01-27T08:04:16Z Fredy Pokou MRE, CRIStAL Jules Sadefo Kamdem MRE Kpante Emmanuel Gnandi ENAC-LAB http://arxiv.org/abs/2507.06345v2 Reinforcement Learning for Trade Execution with Market and Limit Orders 2026-01-26T20:13:21Z In this paper, we introduce a novel reinforcement learning framework for optimal trade execution in a limit order book. We formulate the trade execution problem as a dynamic allocation task whose objective is the optimal placement of market and limit orders to maximize expected revenue. By modeling market and limit order allocations with multivariate logistic-normal distributions, the framework enables efficient training of the reinforcement learning algorithm. Numerical experiments show that the proposed method outperforms traditional benchmark strategies in simulated limit order book environments featuring noise traders submitting random orders, tactical traders responding to order book imbalances, and a strategic trader seeking to acquire or liquidate an asset position. 2025-07-08T19:11:14Z Patrick Cheridito Moritz Weiss http://arxiv.org/abs/2601.18686v1 Optimal strategy and deep hedging for share repurchase programs 2026-01-26T17:01:54Z In recent decades, companies have frequently adopted share repurchase programs to return capital to shareholders or for other strategic purposes, instructing investment banks to rapidly buy back shares on their behalf. When the executing institution is allowed to hedge its exposure, it encounters several challenges due to the intrinsic features of the product. Moreover, contractual clauses or market regulations on trading activity may make it infeasible to rely on Greeks. In this work, we address the hedging of these products by developing a machine-learning framework that determines the optimal execution of the buyback while explicitly accounting for the bank's actual trading capabilities. 
This unified treatment of execution and hedging yields substantial performance improvements, resulting in an optimized policy that provides a feasible and realistic hedging approach. The pricing of these programs can be framed in terms of the discount that banks offer to the client on the price at which the shares are delivered. Since, in our framework, risk measures serve as objective functions, we exploit the concept of indifference pricing to compute this discount, thereby capturing the actual execution performance. 2026-01-26T17:01:54Z Stefano Corti Roberto Daluiso Andrea Pallavicini http://arxiv.org/abs/2501.15828v6 Hybrid Quantum Neural Networks with Amplitude Encoding: Advancing Recovery Rate Predictions 2026-01-25T08:51:29Z Recovery rate prediction plays a pivotal role in bond investment strategies by enhancing risk assessment, optimizing portfolio allocation, improving pricing accuracy, and supporting effective credit risk management. However, accurate forecasting remains challenging due to complex nonlinear dependencies, high-dimensional feature spaces, and limited sample sizes, conditions under which classical machine learning models are prone to overfitting. We propose a hybrid Quantum Machine Learning (QML) model with Amplitude Encoding, leveraging the unitarity constraint of Parametrized Quantum Circuits (PQC) and the exponential data compression capability of qubits. We evaluate the model on a global recovery rate dataset comprising 1,725 observations and 256 features from 1996 to 2023. Our hybrid method significantly outperforms both classical neural networks and QML models using Angle Encoding, achieving a lower Root Mean Squared Error (RMSE) of 0.228, compared to 0.246 and 0.242, respectively. It also performs competitively with ensemble tree methods such as XGBoost. 
While practical implementation challenges remain for Noisy Intermediate-Scale Quantum (NISQ) hardware, our quantum simulation and preliminary results on noisy simulators demonstrate the promise of hybrid quantum-classical architectures in enhancing the accuracy and robustness of recovery rate forecasting. These findings illustrate the potential of quantum machine learning in shaping the future of credit risk prediction. 2025-01-27T07:27:23Z Ying Chen Paul Griffin Paolo Recchia Lei Zhou Hongrui Zhang http://arxiv.org/abs/2602.00097v1 Rough Martingale Optimal Transport: Theory, Implementation, and Regulatory Applications for Non-Modelable Risk Factors 2026-01-24T10:45:00Z The Fundamental Review of the Trading Book (FRTB) poses a significant challenge for exotic derivatives pricing, particularly for non-modelable risk factors (NMRF) where sparse market data leads to infinite audit bounds under classical Martingale Optimal Transport (MOT). We propose a unified Rough Martingale Optimal Transport (RMOT) framework that regularizes the transport plan with a rough volatility prior, yielding finite, explicit, and asymptotically tight extrapolation bounds. We establish an identifiability theorem for rough volatility parameters under sparse data, proving that 50 strikes are sufficient to estimate the Hurst exponent within $\pm 0.05$. For the multi-asset case, we prove that the correlation matrix is locally identifiable from marginal option surfaces provided the Hurst exponents are distinct. Model calibration on SPY and QQQ options (2019--2024) confirms that the optimal martingale measure exhibits stretched exponential tail decay ($\sim\exp(-k^{1-H})$), consistent with rough volatility asymptotics, whereas classical MOT yields trivial bounds. We validate the framework on live SPX/NDX data and scale it to $N = 30$ assets using a block-sparse optimization algorithm. 
Empirical results show that RMOT provides approximately \$880M in capital relief per \$1B exotic book compared to classical methods, while maintaining conservative coverage confirmed by 100-seed cross-validation. This constitutes a pricing framework designed to align with FRTB principles for NMRFs with explicit error quantification. 2026-01-24T10:45:00Z 15 pages, 13 figures, 8 tables. Computational implementation with block-sparse optimization for $N=30$ assets in under 3 minutes Sri Sairam Gautam B. Isha http://arxiv.org/abs/2508.02283v2 An Enhanced Focal Loss Function to Mitigate Class Imbalance in Auto Insurance Fraud Detection with Explainable AI 2026-01-23T05:31:36Z Detecting fraudulent auto-insurance claims remains a challenging classification problem, largely due to the extreme imbalance between legitimate and fraudulent cases. Standard learning algorithms tend to overfit to the majority class, resulting in poor detection of economically significant minority events. This paper proposes a structured three-stage training framework that integrates a convex surrogate of focal loss for stable initialization, a controlled non-convex intermediate loss to improve feature discrimination, and the standard focal loss to refine minority-class sensitivity. We derive conditions under which the surrogate retains convexity in the prediction space and show how this facilitates more reliable optimization when combined with deep sequential models. Using a proprietary auto-insurance dataset, the proposed method improves minority-class F1-scores and AUC relative to conventional focal-loss training and resampling baselines. The approach also provides interpretable feature-attribution patterns through SHAP analysis, offering transparency for actuarial and fraud-analytics applications. 
2025-08-04T10:53:10Z 15 pages, 4 figures, 2 tables Francis Boabang Samuel Asante Gyamerah http://arxiv.org/abs/2601.16446v1 Brownian ReLU(Br-ReLU): A New Activation Function for a Long-Short Term Memory (LSTM) Network 2026-01-23T04:53:16Z Deep learning models are effective for sequential data modeling, yet commonly used activation functions such as ReLU, LeakyReLU, and PReLU often exhibit gradient instability when applied to noisy, non-stationary financial time series. This study introduces BrownianReLU, a stochastic activation function induced by Brownian motion that enhances gradient propagation and learning stability in Long Short-Term Memory (LSTM) networks. Using Monte Carlo simulation, BrownianReLU provides a smooth, adaptive response for negative inputs, mitigating the dying ReLU problem. The proposed activation is evaluated on financial time series from Apple, GCB, and the S&P 500, as well as LendingClub loan data for classification. Results show consistently lower Mean Squared Error and higher $R^2$ values, indicating improved predictive accuracy and generalization. Although the ROC-AUC metric is limited in classification tasks, activation choice significantly affects the trade-off between accuracy and sensitivity, with Brownian ReLU and the selected activation functions yielding practically meaningful performance. 2026-01-23T04:53:16Z 13 pages, 7 figures, 6 tables George Awiakye-Marfo Elijah Agbosu Victoria Mawuena Barns Samuel Asante Gyamerah http://arxiv.org/abs/2508.20097v2 Can LLMs Identify Tax Abuse? 2026-01-21T23:55:51Z We investigate whether large language models can discover and analyze U.S. tax-minimization strategies. This real-world domain challenges even seasoned human experts, and progress can reduce tax revenue lost from well-advised, wealthy taxpayers. 
We evaluate the most advanced LLMs on their ability to (1) interpret and verify tax strategies, (2) fill in gaps in partially specified strategies, and (3) generate complete, end-to-end strategies from scratch. This domain should be of particular interest to the LLM reasoning community: unlike synthetic challenge problems or scientific reasoning tasks, U.S. tax law involves navigating hundreds of thousands of pages of statutes, case law, and administrative guidance, all updated regularly. Notably, LLM-based reasoning identified an entirely novel tax strategy, highlighting these models' potential to revolutionize tax agencies' fight against tax abuse. 2025-08-10T15:15:45Z 9 pages Andrew Blair-Stanek Nils Holzenberger Benjamin Van Durme http://arxiv.org/abs/2511.04361v2 Causal Regime Detection in Energy Markets With Augmented Time Series Structural Causal Models 2026-01-21T13:29:23Z Energy markets exhibit complex causal relationships between weather patterns, generation technologies, and price formation, with regime changes occurring continuously rather than at discrete break points. Current approaches model electricity prices without explicit causal interpretation or counterfactual reasoning capabilities. We introduce Augmented Time Series Causal Models (ATSCM) for energy markets, extending counterfactual reasoning frameworks to multivariate temporal data with learned causal structure. Our approach models energy systems through interpretable factors (weather, generation mix, demand patterns), rich grid dynamics, and observable market variables. We integrate neural causal discovery to learn time-varying causal graphs without requiring ground truth DAGs. Applied to real-world electricity price data, ATSCM enables novel counterfactual queries such as "What would prices be under different renewable generation scenarios?". 
2025-11-06T13:45:15Z EurIPS 2025 Workshop Causality for Impact: Practical challenges for real-world applications of causal methods Dennis Thumm http://arxiv.org/abs/2511.04469v4 Towards Causal Market Simulators 2026-01-21T13:14:57Z Market generators using deep generative models have shown promise for synthetic financial data generation, but existing approaches lack causal reasoning capabilities essential for counterfactual analysis and risk assessment. We propose a Time-series Neural Causal Model VAE (TNCM-VAE) that combines variational autoencoders with structural causal models to generate counterfactual financial time series while preserving both temporal dependencies and causal relationships. Our approach enforces causal constraints through directed acyclic graphs in the decoder architecture and employs the causal Wasserstein distance for training. We validate our method on synthetic autoregressive models inspired by the Ornstein-Uhlenbeck process, demonstrating superior performance in counterfactual probability estimation with L1 distances as low as 0.03-0.10 compared to ground truth. The model enables financial stress testing, scenario analysis, and enhanced backtesting by generating plausible counterfactual market trajectories that respect underlying causal mechanisms. 2025-11-06T15:44:07Z ICAIF 2025 Workshop on Rethinking Financial Time-Series Dennis Thumm Luis Ontaneda Mijares http://arxiv.org/abs/2412.02135v3 Unsupervised Learning-based Calibration Scheme for Rough Volatility Models 2026-01-21T07:45:43Z Existing deep learning-based calibration schemes for rough volatility models predominantly rely on supervised learning frameworks, which incur significant computational costs due to the necessity of generating massive synthetic training datasets. In this work, we propose a novel unsupervised learning-based calibration scheme for rough volatility models that eliminates the data generation bottleneck. 
Our approach leverages the backward stochastic differential equation (BSDE) representation of the pricing function derived by Bayer et al. \cite{bayer2022pricing}. By treating model parameters as trainable variables, we simultaneously approximate the BSDE solution and optimize the parameters within a unified neural network training process, with the terminal misfit as the loss. We theoretically establish that the mean squared error between the model-implied prices and market data is bounded by the loss function. Furthermore, we prove that the loss can be minimized to an arbitrary degree, depending on the model's market fitting capacity and the universal approximation capability of neural networks. Numerical experiments for both simulated and historical S\&P 500 data based on the rough Bergomi (rBergomi) model demonstrate the efficiency and accuracy of the proposed scheme. 2024-12-03T03:48:09Z Changqing Teng Guanglian Li http://arxiv.org/abs/2601.13770v1 Look-Ahead-Bench: a Standardized Benchmark of Look-ahead Bias in Point-in-Time LLMs for Finance 2026-01-20T09:23:51Z We introduce Look-Ahead-Bench, a standardized benchmark measuring look-ahead bias in Point-in-Time (PiT) Large Language Models (LLMs) within realistic and practical financial workflows. Unlike most existing approaches that primarily test inner lookahead knowledge via Q&A, our benchmark evaluates model behavior in practical scenarios. To distinguish genuine predictive capability from memorization-based performance, we analyze performance decay across temporally distinct market regimes, incorporating several quantitative baselines to establish performance thresholds. We evaluate prominent open-source LLMs -- Llama 3.1 (8B and 70B) and DeepSeek 3.2 -- against a family of Point-in-Time LLMs (Pitinf-Small, Pitinf-Medium, and frontier-level model Pitinf-Large) from PiT-Inference. 
Results reveal significant lookahead bias in standard LLMs, as measured with alpha decay, unlike Pitinf models, which demonstrate improved generalization and reasoning abilities as they scale in size. This work establishes a foundation for the standardized evaluation of temporal bias in financial LLMs and provides a practical framework for identifying models suitable for real-world deployment. Code is available on GitHub: https://github.com/benstaf/lookaheadbench 2026-01-20T09:23:51Z Mostapha Benhenda LAGA