https://arxiv.org/api/Hln1sW7zbiyCtAADg+ULERRkqJ4 2026-06-21T17:52:07Z 3237 150 15 http://arxiv.org/abs/2405.11392v2 Deep Penalty Methods: A Class of Deep Learning Algorithms for Solving High Dimensional Optimal Stopping Problems 2026-04-05T11:19:41Z We propose a deep learning algorithm for high dimensional optimal stopping problems. Our method is inspired by the penalty method for solving free boundary PDEs. Within our approach, the penalized PDE is approximated using the Deep BSDE framework proposed by \cite{weinan2017deep}, which leads us to coin the term "Deep Penalty Method (DPM)" to refer to our algorithm. We show that the error of the DPM can be bounded by the loss function and $O(\frac{1}λ)+O(λh) +O(\sqrt{h})$, where $h$ is the step size in time and $λ$ is the penalty parameter. This finding emphasizes the need for careful consideration when selecting the penalization parameter and suggests that the discretization error converges at a rate of order $\frac{1}{2}$. We validate the efficacy of the DPM through numerical tests conducted on a high-dimensional optimal stopping model in the area of American option pricing. The numerical tests confirm both the accuracy and the computational efficiency of our proposed algorithm. 2024-05-18T20:58:49Z Yunfei Peng Pengyu Wei Wei Wei http://arxiv.org/abs/2605.20192v1 Leveraging Large Language Models for Sentiment Analysis: Multi-Modal Analysis of Decentraland's MANA Token 2026-04-04T04:04:13Z Decentraland, a decentralized virtual reality platform operating within the expanding Metaverse ecosystem, utilizes its native MANA token to facilitate virtual asset transactions and governance. This study investigates the integration of Discord community sentiment with multi-modal financial data to enhance cryptocurrency price prediction within virtual world economies. We address: (1) identifying sentiment patterns within Decentraland's Discord community, and (2) evaluating the impact of multi-modal features on token return forecasting. Using a BERT-based large language model for sentiment analysis, we develop two LSTM architectures: a baseline incorporating historical prices and a multi-modal variant integrating sentiment scores, trading volume, and market capitalization. Results indicate predominantly neutral community sentiment with a positive skew. The multi-modal model significantly outperforms the price-only baseline in prediction accuracy. These findings demonstrate the predictive value of community-derived signals for virtual economy forecasting and establish a foundation for future research at the intersection of immersive virtual environments, natural language processing, and cryptocurrency market analysis. 2026-04-04T04:04:13Z Xintong Wu Peiting Tsai Jing Yuan Michael Yu Greg Sun Luyao Zhang http://arxiv.org/abs/2604.14199v1 PolyBench: Benchmarking LLM Forecasting and Trading Capabilities on Live Prediction Market Data 2026-04-03T06:25:21Z Predicting real-world events from live market signals demands systems that fuse qualitative news with quantitative order-book dynamics under strict temporal discipline -- a challenge existing benchmarks fail to capture. We present \textbf{PolyBench}, a multimodal benchmark derived from Polymarket that records point-in-time cross-sections of 38,666 binary prediction markets spanning 4,997 events, synchronously coupling each snapshot with a Central Limit Order Book (CLOB) state and a real-time news stream. Using PolyBench, we evaluate seven state-of-the-art Large Language Models -- spanning open- and closed-source families -- generating 36,165 predictions under identical, timestamp-locked market states collected between February 6 and 12, 2026. Our multidimensional framework assesses directional accuracy, our proposed Confidence-Weighted Return (CWR), Annualized Percentage Yield (APY), and Sharpe ratio via realistic order-book execution simulation. The results reveal a pronounced performance divergence: only two of seven models achieve positive financial returns -- MiMo-V2-Flash at \textbf{17.6%} CWR and Gemini-3-Flash at 6.2% CWR -- while the remaining five incur losses despite uniformly high stated confidence. These findings highlight the gap between surface-level language fluency and genuine probabilistic reasoning under live market uncertainty, and establish PolyBench as a contamination-proof, financially-grounded evaluation standard for future LLM research. Our dataset and code available at \underline{\href{https://github.com/PolyBench/PolyBench}{https://github.com/PolyBench/PolyBench}}. 2026-04-03T06:25:21Z 16 pages, 4 figures, 6 tables Pu Cheng Juncheng Liu Yunshen Long http://arxiv.org/abs/2604.02035v1 Reinforcement Learning for Speculative Trading under Exploratory Framework 2026-04-02T13:41:05Z We study a speculative trading problem within the exploratory reinforcement learning (RL) framework of Wang et al. [2020]. The problem is formulated as a sequential optimal stopping problem over entry and exit times under general utility function and price process. We first consider a relaxed version of the problem in which the stopping times are modeled by the jump times of Cox processes driven by bounded, non-randomized intensity controls. Under the exploratory formulation, the agent's randomized control is characterized via the probability measure over the jump intensities, and their objective function is regularized by Shannon's differential entropy. This yields a system of the exploratory HJB equations and Gibbs distributions in closed-form as the optimal policy. Error estimates and convergence of the RL objective to the value function of the original problem are established. Finally, an RL algorithm is designed, and its implementation is showcased in a pairs-trading application. 2026-04-02T13:41:05Z 37 pages, 14 figures Yun Zhao Alex S. L. Tse Harry Zheng http://arxiv.org/abs/2605.06670v1 Stochastic Policy Gradient Methods in the Uncertain Volatility Model 2026-04-01T08:44:10Z The multidimensional Uncertain Volatility Model leads to robust option pricing problems under joint volatility and correlation uncertainty. Their numerical resolution quickly becomes challenging because the associated stochastic control problem is high-dimensional. We propose a backward actor-critic stochastic policy gradient scheme tailored to this setting. The method combines a discrete dynamic programming principle with Proximal Policy Optimization and shallow neural-network approximations of both the value function and the control policy. A key ingredient is the policy parameterization: continuous controls are represented through a squashed Gaussian policy built on a C-vine representation of correlation matrices, which enforces positive semidefiniteness by construction. Numerical experiments on a range of multidimensional derivatives show that the method yields accurate prices, remains computationally efficient, and compares favorably with existing Monte Carlo and machine-learning-based benchmarks for robust pricing in the Uncertain Volatility Model. 2026-04-01T08:44:10Z Lokman A Abbas-Turki LPSM Jean-François Chassagneux ENSAE Paris Jean-Philippe Lemor LPSM Grégoire Loeper LPSM Simon Sananes LPSM http://arxiv.org/abs/2604.00556v1 HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation 2026-04-01T06:59:50Z Housing selection is a high-stakes and largely irreversible decision problem. We study housing consultation as a decision-support interface for housing selection. Existing housing platforms and many LLM-based assistants often reduce this process to ranking or recommendation, resulting in opaque reasoning, brittle multi-constraint handling, and limited guarantees on factuality. We present HabitatAgent, the first LLM-powered multi-agent architecture for end-to-end housing consultation. HabitatAgent comprises four specialized agent roles: Memory, Retrieval, Generation, and Validation. The Memory Agent maintains multi-layer user memory through internal stages for constraint extraction, memory fusion, and verification-gated updates; the Retrieval Agent performs hybrid vector--graph retrieval (GraphRAG); the Generation Agent produces evidence-referenced recommendations and explanations; and the Validation Agent applies multi-tier verification and targeted remediation. Together, these agents provide an auditable and reliable workflow for end-to-end housing consultation. We evaluate HabitatAgent on 100 real user consultation scenarios (300 multi-turn question--answer pairs) under an end-to-end correctness protocol. A strong single-stage baseline (Dense+Rerank) achieves 75% accuracy, while HabitatAgent reaches 95%. 2026-04-01T06:59:50Z Accepted at the DMO-FinTech Workshop (PAKDD 2026) Hongyang Yang Yanxin Zhang Yang She Yue Xiao Hao Wu Yiyang Zhang Jiapeng Hou Rongshan Zhang http://arxiv.org/abs/2604.00415v1 Dynamic Weight Optimization for Double Linear Policy: A Stochastic Model Predictive Control Approach 2026-04-01T02:59:15Z The Double Linear Policy (DLP) framework guarantees a Robust Positive Expectation (RPE) under optimized constant-weight designs or admissible prespecified time-varying policies. However, the sequential optimization of these time-varying weights remains an open challenge. To address this gap, we propose a Stochastic Model Predictive Control (SMPC) framework. We formulate weight selection as a receding-horizon optimal control problem that explicitly maximizes risk-adjusted returns while enforcing survivability and predicted positive expectation constraints. Notably, an analytical gradient is derived for the non-convex objective function, enabling efficient optimization via the L-BFGS-B algorithm. Empirical results demonstrate that this dynamic, closed-loop approach improves risk-adjusted performance and drawdown control relative to constant-weight and prescribed time-varying DLP baselines. 2026-04-01T02:59:15Z 8 pages. Submitted for possible publication Tan Chin Hong Chung-Han Hsieh http://arxiv.org/abs/2604.00389v1 Pricing Lookback Options on a Quantum Computer 2026-04-01T02:18:58Z We develop a quantum algorithm to price discretely monitored lookback options in the Black-Scholes framework using imaginary time evolution. By rewriting the pricing PDE as a Schrodinger-type equation, the problem becomes the imaginary time evolution of a quantum state under a non-Hermitian Hamiltonian. This evolution is approximated with the Variational Quantum imaginary time evolution (VarQITE) method, which replaces the exact non-unitary dynamics with a parameterized, hardware-efficient quantum circuit. A central challenge arises from jump conditions caused by the discrete updating of the running maximum. This feature is not present in standard quantum treatments of European or Asian options. To address this, we propose two quantum-compatible formulations: (i) a sequential approach that models jumps via dedicated jump Hamiltonians applied at monitoring dates, and (ii) a simultaneous multi-function evolution that removes explicit jumps at the expense of an increased number of dimensions. We compare both approaches in terms of qubit resources, circuit complexity and numerical accuracy, and benchmark them against Monte Carlo simulations. Our results show that discretely monitored, path-dependent options with jump conditions can be handled within a variational quantum framework, paving the way toward the quantum pricing of more complex derivatives with non-smooth dynamics. 2026-04-01T02:18:58Z Florence Paquette Tania Belabbas Emmanuel Hamel Anne MacKay http://arxiv.org/abs/2603.29994v1 Bridging Stochastic Control and Deep Hedging: Structural Priors for No-Transaction Band Networks 2026-03-31T16:56:17Z This paper studies the problem of hedging and pricing a European call option under proportional transaction costs, from two complementary perspectives. We first derive the optimal hedging strategy under CARA utility, following the stochastic control framework of Davis et al. (1993), characterising the no-transaction band via the Hamilton-Jacobi-Bellman Quasi-Variational Inequality (HJBQVI) and the Whalley-Wilmott asymptotic approximation. We then adopt a deep hedging approach, proposing two architectures that build on the No-Transaction Band Network of Imaki et al. (2023): NTBN-Delta, which makes delta-centring explicit, and WW-NTBN, which incorporates the Whalley-Wilmott formula as a structural prior on the bandwidth and replaces the hard clamp with a differentiable soft clamp. Numerical experiments show that WW-NTBN converges faster, matches the stochastic control no-transaction bands more closely, and generalises well across transaction cost regimes. We further apply both frameworks to the bull call spread, documenting the breakdown of price linearity under transaction costs. 2026-03-31T16:56:17Z Jules Arzel Noureddine Lehdili http://arxiv.org/abs/2603.29593v1 Be Water: An Evolutionary Proof for Trend-Following 2026-03-31T11:13:07Z The proliferation of diverse, high-leverage trading instruments in modern financial markets presents a complex, "noisy" environment, leading to a critical question: which trading strategies are evolutionarily viable? To investigate this, we construct a large-scale agent-based model, "MAS-Utopia," comprising 10,000 agents with five distinct archetypes. This society is immersed in five years of high-frequency data under a counterfactual baseline: zero transaction friction and a robust Unconditional Basic Income (UBI) safety net. The simulation reveals a powerful evolutionary convergence. Strategies that attempt to fight the market's current - namely Mean-Reversion ("buy-the-dip") - prove structurally fragile. In contrast, the Trend-Following archetype, which adapts to the market's flow, emerges as the dominant phenotype. Translating this finding, we architect an LLM-driven system that emulates this successful logic. Our findings offer profound implications, echoing the ancient wisdom of "Be Water": for investors, it demonstrates that survival is achieved not by rigid opposition, but by disciplined alignment with the prevailing current; for markets, it critiques tools that encourage contrarian gambling; for society, it underscores the stabilizing power of economic safety nets. 2026-03-31T11:13:07Z Yijia Chen http://arxiv.org/abs/2604.00064v1 Forecast collapse of transformer-based models under squared loss in financial time series 2026-03-31T09:06:52Z We study trajectory forecasting under squared loss for time series with weak conditional structure, using highly expressive prediction models. Building on the classical characterization of squared-loss risk minimization, we emphasize regimes in which the conditional expectation of future trajectories is effectively degenerate, leading to trivial Bayes-optimal predictors (flat for prices and zero for returns in standard financial settings). In this regime, increased model expressivity does not improve predictive accuracy but instead introduces spurious trajectory fluctuations around the optimal predictor. These fluctuations arise from the reuse of noise and result in increased prediction variance without any reduction in bias. This provides a process-level explanation for the degradation of Transformerbased forecasts on financial time series. We complement these theoretical results with numerical experiments on high-frequency EUR/USD exchange rate data, analyzing the distribution of trajectory-level forecasting errors. The results show that Transformer-based models yield larger errors than a simple linear benchmark on a large majority of forecasting windows, consistent with the variance-driven mechanism identified by the theory. 2026-03-31T09:06:52Z Pierre Andreoletti IDP http://arxiv.org/abs/2511.13616v2 Statistical and economic evaluation of forecasts in electricity markets: beyond RMSE and MAE 2026-03-31T08:39:33Z Electricity price forecasts are typically evaluated using accuracy measures such as RMSE and MAE, although these metrics often fail to reflect their economic value in operational decisions. This paper investigates which statistical properties of electricity price forecasts are most relevant for economic performance, using battery energy storage system (BESS) arbitrage as an application. We assess prediction quality along four dimensions: forecast accuracy, intraday error dispersion, association between predicted and realized prices, and the ability to identify daily price extrema. We construct a comprehensive pool of 192 hourly day-ahead electricity price forecasts and use it to evaluate the relationship between proposed quality measures and profits generated for two representative BESS configurations. The results show that traditional accuracy metrics are only weakly correlated with BESS income. At the same time, dispersion- and association-based measures better capture a forecast's economic value by reflecting its ability to reproduce daily price patterns. These findings demonstrate that incorporating complementary evaluation criteria may improve forecast selection and enhance the economic performance of BESS. 2025-11-17T17:21:57Z Energy Conversion and Management, 356, 121408, 2026 Katarzyna Maciejowska Arkadiusz Lipiecki Bartosz Uniejewski 10.1016/j.enconman.2026.121408 http://arxiv.org/abs/2603.29430v1 Ultra-short-term volatility surfaces 2026-03-31T08:36:43Z Options with maturities below one week, hereafter "ultra-short-term" options, have seen a sharp increase in trading activity in recent years. Yet, these instruments are difficult to price jointly using classical pricing models due to the pronounced oscillations observed in the at-the-money implied-volatility term structure across ultra-short-term tenors. We propose Edgeworth++, a parsimonious jump-diffusion model featuring a nonparametric stochastic volatility component, which provides flexibility in capturing implied-volatility smiles for each tenor, combined with a deterministic shift extension, which allows the model to fit rich at-the-money implied-volatility shapes across tenors. We derive a local (in tenor) expansion of the process characteristic function suited to value ultra-short-term options. The expansion leads to fast and accurate option pricing in closed form via standard Fourier inversion. We discuss the benefits of the proposed approach relative to benchmarks. 2026-03-31T08:36:43Z Federico M. Bandi Nicola Fusari Guido Gazzani Roberto Renò http://arxiv.org/abs/2511.02700v2 Numerical valuation of European options under two-asset infinite-activity exponential Lévy models 2026-03-30T18:33:04Z We propose a numerical method for the valuation of European-style options under two-asset infinite-activity exponential Lévy models. Our method extends the effective approach developed by Wang, Wan & Forsyth (2007) for the 1-dimensional case to the 2-dimensional setting and is applicable for general Lévy measures under mild assumptions. A tailored discretization of the non-local integral term is developed, which can be efficiently evaluated by means of the fast Fourier transform. For the temporal discretization, the semi-Lagrangian theta-method is employed in a convenient splitting fashion, where the diffusion term is treated implicitly and the integral term is handled explicitly by a fixed-point iteration. Numerical experiments for put-on-the-average options under Normal Tempered Stable dynamics reveal favourable second-order convergence of our method whenever the exponential Lévy process has finite-variation. 2025-11-04T16:22:26Z Massimiliano Moda Karel J. in 't Hout Michèle Vanmaele Fred Espen Benth http://arxiv.org/abs/2603.27501v1 From Volatility to Variance: A Skew-Enhanced SABR Model and Its Empirical Study in the Chinese Financial Options Market 2026-03-29T03:41:13Z Accurately characterizing the implied volatility curves is a central challenge in option pricing and risk management. The classical SABR model by Hagan et al. has been widely adopted in practice due to its well-defined stochastic volatility structure and its tractable closed-form approximation for Black implied volatility. However, under complex market conditions, its fitting accuracy for implied volatility curves remains limited. To address this issue, this paper proposes an extended model within the SABR framework, referred to as skew-SABR. Specifically, the proposed approach introduces an extension to the stochastic dynamics of the underlying asset price and its variance process, under which a corresponding Black implied volatility expression is derived. By further simplifying and reorganizing the resulting formula, the implied volatility can be expressed in a form that explicitly incorporates a skew parameter, thereby enabling a direct characterization of the asymmetry in the implied volatility curve. The resulting expression preserves the structural simplicity of the Hagan-SABR formula, while significantly enhancing the model's flexibility in capturing complex volatility smile patterns. From a theoretical perspective, the paper provides a systematic analysis of the model specification and the financial interpretation of its parameters. From an empirical perspective, a comprehensive comparison is conducted using data from the Chinese options market over the period 2018--2025. The skew-SABR model is evaluated against the classical Hagan-SABR model, the SVI parameterization, polynomial fitting, and spline-based methods. Numerical results show that, across different market regimes and a wide range of implied volatility curve shapes, the skew-SABR model consistently achieves high and stable fitting accuracy. 2026-03-29T03:41:13Z Wenxuan Zhang Zhouchi Lin Benzhuo Lu