https://arxiv.org/api/dJYL85tC3j8fG0QwbipuNcny1gE
2026-03-20T08:58:23Z
31249015

http://arxiv.org/abs/2209.10166v5
Chaotic Hedging with Iterated Integrals and Neural Networks
2026-01-27T17:46:03Z
In this paper, we derive an $L^p$-chaos expansion based on iterated Stratonovich integrals with respect to a given exponentially integrable continuous semimartingale. By omitting the orthogonality of the expansion, we show that every $p$-integrable functional, $p \in [1,\infty)$, can be approximated by a finite sum of iterated Stratonovich integrals. Using (possibly random) neural networks as integrands, we therefore obtain universal approximation results for $p$-integrable financial derivatives in the $L^p$-sense. Moreover, we can approximately solve the $L^p$-hedging problem (coinciding for $p = 2$ with the quadratic hedging problem), where the approximating hedging strategy can be computed in closed form within a short runtime.
2022-09-21T07:57:07Z
Ariel Neufeld, Philipp Schmocker

http://arxiv.org/abs/2602.00121v1
A Prior-Predictive Monte Carlo Framework for Pricing Complex Data Products in Data-Poor Markets
2026-01-27T16:14:04Z
Pricing advanced data products - particularly in complex fields such as semiconductor manufacturing - is a fundamentally challenging task due to the sparsity of publicly available transaction data, and its frequent heterogeneity and confidentiality. While data value depends on multiple interacting factors, such as technical sophistication, quality, utility, and licensing rights, traditional pricing methods tend to rely on ad-hoc heuristics or require massive amounts of historical transaction data. In an increasingly data-based economy, we introduce a prior-predictive Monte Carlo framework that enables the generation of fair, consistent, and justified price ranges for data products in the absence of empirical data.
By simulating many plausible pricing 'worlds' and deal configurations, the framework produces stable probabilistic price bands (e.g., P5/P50/P95) rather than single point estimates, creating an auditable and repeatable probabilistic pricing system with business realism enforced via constraint-truncated priors. The proposed model bridges traditional data pricing rooted in professional experience with a data-based approach that also allows for classical Bayesian updating as more transaction data is accumulated.
2026-01-27T16:14:04Z
Adam L. Siemiatkowski, Victor Zhirnov, Kashyap Yellai, Gabriella Bein, Terresa Zimmerman

http://arxiv.org/abs/2601.19504v1
Generating Alpha: A Hybrid AI-Driven Trading System Integrating Technical Analysis, Machine Learning and Financial Sentiment for Regime-Adaptive Equity Strategies
2026-01-27T11:44:47Z
The intricate behavior patterns of financial markets are influenced by fundamental, technical, and psychological factors. Periods of high volatility and regime shifts cause many traditional strategies, such as trend-following or mean-reversion, to fail. This paper proposes a hybrid AI-based trading strategy that combines (1) trend-following and directional momentum capture via EMA and MACD, (2) detection of price normalization through mean-reversion using RSI and Bollinger Bands, (3) market psychological interpretation through sentiment analysis using FinBERT, (4) signal generation through machine learning using XGBoost, and (5) dynamically adjusting exposure with market regime filtering based on volatility and return environments. The system achieved a final portfolio value of $235,492.83, yielding a return of 135.49% on initial investment over a period of 24 months. The hybrid model outperformed major benchmark indexes such as the S&P 500 and NASDAQ-100 over the same period, showing strong flexibility and lower downside risk with superior profits, validating the use of multi-modal AI in algorithmic trading.
2026-01-27T11:44:47Z
Preprint.
Full version of an accepted conference paper (ComSIA 2026)
Varun Narayan Kannan Pillai, Akshay Ajith, Sumesh K J

http://arxiv.org/abs/2601.19321v1
Predictive Accuracy versus Interpretability in Energy Markets: A Copula-Enhanced TVP-SVAR Analysis
2026-01-27T08:04:16Z
This paper investigates whether structural econometric models can rival machine learning in forecasting energy--macro dynamics while retaining causal interpretability. Using monthly data from 1999 to 2025, we develop a unified framework that integrates Time-Varying Parameter Structural VARs (TVP-SVAR) with advanced dependence structures, including DCC-GARCH, t-copulas, and mixed Clayton--Frank--Gumbel copulas. These models are empirically evaluated against leading machine learning techniques, namely Gaussian Process Regression (GPR), Artificial Neural Networks, Random Forests, and Support Vector Regression, across seven macro-financial and energy variables, with Brent crude oil as the central asset. The findings reveal three major insights. First, TVP-SVAR consistently outperforms standard VAR models, confirming structural instability in energy transmission channels. Second, copula-based extensions capture non-linear and tail dependence more effectively than symmetric DCC models, particularly during periods of macroeconomic stress. Third, despite their methodological differences, copula-enhanced econometric models and GPR achieve statistically equivalent predictive accuracy (t-test p = 0.8444). However, only the econometric approach provides interpretable impulse responses, regime shifts, and tail-risk diagnostics. We conclude that machine learning can replicate predictive performance but cannot substitute for the explanatory power of structural econometrics.
This synthesis offers a pathway where AI accuracy and economic interpretability jointly inform energy policy and risk management.
2026-01-27T08:04:16Z
Fredy Pokou (MRE, CRIStAL), Jules Sadefo Kamdem (MRE), Kpante Emmanuel Gnandi (ENAC-LAB)

http://arxiv.org/abs/2507.06345v2
Reinforcement Learning for Trade Execution with Market and Limit Orders
2026-01-26T20:13:21Z
In this paper, we introduce a novel reinforcement learning framework for optimal trade execution in a limit order book. We formulate the trade execution problem as a dynamic allocation task whose objective is the optimal placement of market and limit orders to maximize expected revenue. By modeling market and limit order allocations with multivariate logistic-normal distributions, the framework enables efficient training of the reinforcement learning algorithm. Numerical experiments show that the proposed method outperforms traditional benchmark strategies in simulated limit order book environments featuring noise traders submitting random orders, tactical traders responding to order book imbalances, and a strategic trader seeking to acquire or liquidate an asset position.
2025-07-08T19:11:14Z
Patrick Cheridito, Moritz Weiss

http://arxiv.org/abs/2601.18686v1
Optimal strategy and deep hedging for share repurchase programs
2026-01-26T17:01:54Z
In recent decades, companies have frequently adopted share repurchase programs to return capital to shareholders or for other strategic purposes, instructing investment banks to rapidly buy back shares on their behalf. When the executing institution is allowed to hedge its exposure, it encounters several challenges due to the intrinsic features of the product. Moreover, contractual clauses or market regulations on trading activity may make it infeasible to rely on Greeks. In this work, we address the hedging of these products by developing a machine-learning framework that determines the optimal execution of the buyback while explicitly accounting for the bank's actual trading capabilities.
This unified treatment of execution and hedging yields substantial performance improvements, resulting in an optimized policy that provides a feasible and realistic hedging approach. The pricing of these programs can be framed in terms of the discount that banks offer to the client on the price at which the shares are delivered. Since, in our framework, risk measures serve as objective functions, we exploit the concept of indifference pricing to compute this discount, thereby capturing the actual execution performance.
2026-01-26T17:01:54Z
Stefano Corti, Roberto Daluiso, Andrea Pallavicini

http://arxiv.org/abs/2501.15828v6
Hybrid Quantum Neural Networks with Amplitude Encoding: Advancing Recovery Rate Predictions
2026-01-25T08:51:29Z
Recovery rate prediction plays a pivotal role in bond investment strategies by enhancing risk assessment, optimizing portfolio allocation, improving pricing accuracy, and supporting effective credit risk management. However, accurate forecasting remains challenging due to complex nonlinear dependencies, high-dimensional feature spaces, and limited sample sizes, conditions under which classical machine learning models are prone to overfitting. We propose a hybrid Quantum Machine Learning (QML) model with Amplitude Encoding, leveraging the unitarity constraint of Parametrized Quantum Circuits (PQC) and the exponential data compression capability of qubits. We evaluate the model on a global recovery rate dataset comprising 1,725 observations and 256 features from 1996 to 2023. Our hybrid method significantly outperforms both classical neural networks and QML models using Angle Encoding, achieving a lower Root Mean Squared Error (RMSE) of 0.228, compared to 0.246 and 0.242, respectively. It also performs competitively with ensemble tree methods such as XGBoost.
While practical implementation challenges remain for Noisy Intermediate-Scale Quantum (NISQ) hardware, our quantum simulation and preliminary results on noisy simulators demonstrate the promise of hybrid quantum-classical architectures in enhancing the accuracy and robustness of recovery rate forecasting. These findings illustrate the potential of quantum machine learning in shaping the future of credit risk prediction.
2025-01-27T07:27:23Z
Ying Chen, Paul Griffin, Paolo Recchia, Lei Zhou, Hongrui Zhang

http://arxiv.org/abs/2602.00097v1
Rough Martingale Optimal Transport: Theory, Implementation, and Regulatory Applications for Non-Modelable Risk Factors
2026-01-24T10:45:00Z
The Fundamental Review of the Trading Book (FRTB) poses a significant challenge for exotic derivatives pricing, particularly for non-modelable risk factors (NMRF) where sparse market data leads to infinite audit bounds under classical Martingale Optimal Transport (MOT). We propose a unified Rough Martingale Optimal Transport (RMOT) framework that regularizes the transport plan with a rough volatility prior, yielding finite, explicit, and asymptotically tight extrapolation bounds. We establish an identifiability theorem for rough volatility parameters under sparse data, proving that 50 strikes are sufficient to estimate the Hurst exponent within $\pm 0.05$. For the multi-asset case, we prove that the correlation matrix is locally identifiable from marginal option surfaces provided the Hurst exponents are distinct. Model calibration on SPY and QQQ options (2019--2024) confirms that the optimal martingale measure exhibits stretched exponential tail decay ($\sim\exp(-k^{1-H})$), consistent with rough volatility asymptotics, whereas classical MOT yields trivial bounds. We validate the framework on live SPX/NDX data and scale it to $N = 30$ assets using a block-sparse optimization algorithm.
Empirical results show that RMOT provides approximately \$880M in capital relief per \$1B exotic book compared to classical methods, while maintaining conservative coverage confirmed by 100-seed cross-validation. This constitutes a pricing framework designed to align with FRTB principles for NMRFs with explicit error quantification.
2026-01-24T10:45:00Z
15 pages, 13 figures, 8 tables. Computational implementation with block-sparse optimization for $N=30$ assets in under 3 minutes
Sri Sairam Gautam B. Isha

http://arxiv.org/abs/2508.02283v2
An Enhanced Focal Loss Function to Mitigate Class Imbalance in Auto Insurance Fraud Detection with Explainable AI
2026-01-23T05:31:36Z
Detecting fraudulent auto-insurance claims remains a challenging classification problem, largely due to the extreme imbalance between legitimate and fraudulent cases. Standard learning algorithms tend to overfit to the majority class, resulting in poor detection of economically significant minority events. This paper proposes a structured three-stage training framework that integrates a convex surrogate of focal loss for stable initialization, a controlled non-convex intermediate loss to improve feature discrimination, and the standard focal loss to refine minority-class sensitivity. We derive conditions under which the surrogate retains convexity in the prediction space and show how this facilitates more reliable optimization when combined with deep sequential models. Using a proprietary auto-insurance dataset, the proposed method improves minority-class F1-scores and AUC relative to conventional focal-loss training and resampling baselines.
The approach also provides interpretable feature-attribution patterns through SHAP analysis, offering transparency for actuarial and fraud-analytics applications.
2025-08-04T10:53:10Z
15 pages, 4 figures, 2 tables
Francis Boabang, Samuel Asante Gyamerah

http://arxiv.org/abs/2601.16446v1
Brownian ReLU (Br-ReLU): A New Activation Function for a Long-Short Term Memory (LSTM) Network
2026-01-23T04:53:16Z
Deep learning models are effective for sequential data modeling, yet commonly used activation functions such as ReLU, LeakyReLU, and PReLU often exhibit gradient instability when applied to noisy, non-stationary financial time series. This study introduces BrownianReLU, a stochastic activation function induced by Brownian motion that enhances gradient propagation and learning stability in Long Short-Term Memory (LSTM) networks. Using Monte Carlo simulation, BrownianReLU provides a smooth, adaptive response for negative inputs, mitigating the dying ReLU problem. The proposed activation is evaluated on financial time series from Apple, GCB, and the S&P 500, as well as LendingClub loan data for classification. Results show consistently lower Mean Squared Error and higher $R^2$ values, indicating improved predictive accuracy and generalization. Although the ROC-AUC metric is limited in classification tasks, activation choice significantly affects the trade-off between accuracy and sensitivity, with Brownian ReLU and the selected activation functions yielding practically meaningful performance.
2026-01-23T04:53:16Z
13 pages, 7 figures, 6 tables
George Awiakye-Marfo, Elijah Agbosu, Victoria Mawuena Barns, Samuel Asante Gyamerah

http://arxiv.org/abs/2508.20097v2
Can LLMs Identify Tax Abuse?
2026-01-21T23:55:51Z
We investigate whether large language models can discover and analyze U.S. tax-minimization strategies. This real-world domain challenges even seasoned human experts, and progress can reduce tax revenue lost from well-advised, wealthy taxpayers.
We evaluate the most advanced LLMs on their ability to (1) interpret and verify tax strategies, (2) fill in gaps in partially specified strategies, and (3) generate complete, end-to-end strategies from scratch. This domain should be of particular interest to the LLM reasoning community: unlike synthetic challenge problems or scientific reasoning tasks, U.S. tax law involves navigating hundreds of thousands of pages of statutes, case law, and administrative guidance, all updated regularly. Notably, LLM-based reasoning identified an entirely novel tax strategy, highlighting these models' potential to revolutionize tax agencies' fight against tax abuse.
2025-08-10T15:15:45Z
9 pages
Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme

http://arxiv.org/abs/2511.04361v2
Causal Regime Detection in Energy Markets With Augmented Time Series Structural Causal Models
2026-01-21T13:29:23Z
Energy markets exhibit complex causal relationships between weather patterns, generation technologies, and price formation, with regime changes occurring continuously rather than at discrete break points. Current approaches model electricity prices without explicit causal interpretation or counterfactual reasoning capabilities. We introduce Augmented Time Series Causal Models (ATSCM) for energy markets, extending counterfactual reasoning frameworks to multivariate temporal data with learned causal structure. Our approach models energy systems through interpretable factors (weather, generation mix, demand patterns), rich grid dynamics, and observable market variables. We integrate neural causal discovery to learn time-varying causal graphs without requiring ground truth DAGs.
Applied to real-world electricity price data, ATSCM enables novel counterfactual queries such as "What would prices be under different renewable generation scenarios?".
2025-11-06T13:45:15Z
EurIPS 2025 Workshop Causality for Impact: Practical challenges for real-world applications of causal methods
Dennis Thumm

http://arxiv.org/abs/2511.04469v4
Towards Causal Market Simulators
2026-01-21T13:14:57Z
Market generators using deep generative models have shown promise for synthetic financial data generation, but existing approaches lack causal reasoning capabilities essential for counterfactual analysis and risk assessment. We propose a Time-series Neural Causal Model VAE (TNCM-VAE) that combines variational autoencoders with structural causal models to generate counterfactual financial time series while preserving both temporal dependencies and causal relationships. Our approach enforces causal constraints through directed acyclic graphs in the decoder architecture and employs the causal Wasserstein distance for training. We validate our method on synthetic autoregressive models inspired by the Ornstein-Uhlenbeck process, demonstrating superior performance in counterfactual probability estimation with L1 distances as low as 0.03-0.10 compared to ground truth. The model enables financial stress testing, scenario analysis, and enhanced backtesting by generating plausible counterfactual market trajectories that respect underlying causal mechanisms.
2025-11-06T15:44:07Z
ICAIF 2025 Workshop on Rethinking Financial Time-Series
Dennis Thumm, Luis Ontaneda Mijares

http://arxiv.org/abs/2412.02135v3
Unsupervised Learning-based Calibration Scheme for Rough Volatility Models
2026-01-21T07:45:43Z
Existing deep learning-based calibration schemes for rough volatility models predominantly rely on supervised learning frameworks, which incur significant computational costs due to the necessity of generating massive synthetic training datasets.
In this work, we propose a novel unsupervised learning-based calibration scheme for rough volatility models that eliminates the data generation bottleneck. Our approach leverages the backward stochastic differential equation (BSDE) representation of the pricing function derived by Bayer et al. \cite{bayer2022pricing}. By treating model parameters as trainable variables, we simultaneously approximate the BSDE solution and optimize the parameters within a unified neural network training process, with the terminal misfit as the loss. We theoretically establish that the mean squared error between the model-implied prices and market data is bounded by the loss function. Furthermore, we prove that the loss can be minimized to an arbitrary degree, depending on the model's market fitting capacity and the universal approximation capability of neural networks. Numerical experiments for both simulated and historical S&P 500 data based on the rough Bergomi (rBergomi) model demonstrate the efficiency and accuracy of the proposed scheme.
2024-12-03T03:48:09Z
Changqing Teng, Guanglian Li

http://arxiv.org/abs/2601.13770v1
Look-Ahead-Bench: a Standardized Benchmark of Look-ahead Bias in Point-in-Time LLMs for Finance
2026-01-20T09:23:51Z
We introduce Look-Ahead-Bench, a standardized benchmark measuring look-ahead bias in Point-in-Time (PiT) Large Language Models (LLMs) within realistic and practical financial workflows. Unlike most existing approaches that primarily test inner lookahead knowledge via Q&A, our benchmark evaluates model behavior in practical scenarios. To distinguish genuine predictive capability from memorization-based performance, we analyze performance decay across temporally distinct market regimes, incorporating several quantitative baselines to establish performance thresholds.
We evaluate prominent open-source LLMs -- Llama 3.1 (8B and 70B) and DeepSeek 3.2 -- against a family of Point-in-Time LLMs (Pitinf-Small, Pitinf-Medium, and frontier-level model Pitinf-Large) from PiT-Inference. Results reveal significant lookahead bias in standard LLMs, as measured with alpha decay, unlike Pitinf models, which demonstrate improved generalization and reasoning abilities as they scale in size. This work establishes a foundation for the standardized evaluation of temporal bias in financial LLMs and provides a practical framework for identifying models suitable for real-world deployment. Code is available on GitHub: https://github.com/benstaf/lookaheadbench
2026-01-20T09:23:51Z
Mostapha Benhenda (LAGA)