http://arxiv.org/abs/2308.13850v2
Solutions to Equilibrium HJB Equations for Time-Inconsistent Deterministic Linear Quadratic Control: Characterization and Uniqueness
Updated: 2025-05-20T14:33:13Z
In this paper we study a class of HJB equations which solve for equilibria for general time-inconsistent deterministic linear quadratic control problems within the intra-personal game theoretic framework, where the inconsistency arises from non-exponential discount functions. We characterize the solutions to the HJB equations using a class of Riccati equations with integral terms. By studying the uniqueness of solutions to the integro-differential Riccati equations, we prove the uniqueness of solutions to the equilibrium HJB equations.
Published: 2023-08-26T11:12:08Z
Comments: 32 pages
Authors: Yunfei Peng, Wei Wei

http://arxiv.org/abs/2502.19213v2
Framework for asset-liability management with fixed-term securities
Updated: 2025-05-19T19:32:27Z
We consider an optimal investment-consumption problem for a utility-maximizing investor who has access to assets with different liquidity and whose consumption rate as well as terminal wealth are subject to lower-bound constraints. Assuming utility functions that satisfy standard conditions, we develop a methodology for deriving the optimal strategies in semi-closed form. Our methodology is based on the generalized martingale approach and the decomposition of the problem into subproblems. We illustrate our approach by deriving explicit formulas for agents with power-utility functions and discuss potential extensions of the proposed framework.
In numerical studies, we substantiate how the parameters of our framework impact the optimal proportion of initial capital allocated to the illiquid asset, the monetary value that the investor subjectively assigns to the fixed-term asset, and the potential of the illiquid asset to increase the terminal value of liabilities without loss in the investor's expected utility.
Published: 2025-02-26T15:14:33Z
Comments: 44 pages, 15 figures
Authors: Yevhen Havrylenko

http://arxiv.org/abs/2308.13717v3
Portfolios Generated by Contingent Claim Functions, with Applications to Option Pricing
Updated: 2025-05-19T17:35:53Z
This paper presents a synthesis of the theories of portfolio generating functions and option pricing. The theory of portfolio generation is extended to measure the value of portfolios generated by positive C^{2,1} functions of asset prices X_1,... , X_n directly, rather than with respect to a numeraire portfolio. If a portfolio generating function satisfies a specific partial differential equation, then the value of the portfolio generated by that function will replicate the value of the function. This differential equation is a general form of the Black-Scholes equation. Similar results apply to contingent claim functions, which are portfolio generating functions that are homogeneous of degree 1. With the addition of a riskless asset, an inhomogeneous portfolio generating function V : R^{+n} x [0, T] \to R^+ can be extended to an equivalent contingent claim function \hat{V} : R^+ x R^{+n} x [0, T] \to R^+ that generates the same portfolio and is replicable if and only if V is replicable. Several examples are presented.
Published: 2023-08-26T00:42:00Z
Comments: 23 pages
Authors: Ricardo T. Fernholz, Robert Fernholz

http://arxiv.org/abs/2407.17975v2
Recursive Optimal Stopping with Poisson Stopping Constraints
Updated: 2025-05-19T11:14:06Z
This paper solves a recursive optimal stopping problem with Poisson stopping constraints using the penalized backward stochastic differential equation (PBSDE) with jumps.
Stopping in this problem is only allowed at Poisson random intervention times, and jumps play a significant role not only through the stopping times but also in the recursive objective functional and model coefficients. To solve the problem, we propose a decomposition method based on Jacod-Pham that allows us to separate the problem into a series of sub-problems between each pair of consecutive Poisson stopping times. To represent the value function of the recursive optimal stopping problem when the initial time falls between two consecutive Poisson stopping times and the generator is concave/convex, we leverage the comparison theorem of BSDEs with jumps. We then apply the representation result to American option pricing in a nonlinear market with Poisson stopping constraints.
Published: 2024-07-25T12:11:01Z
Comments: 29 pages, 2 figures
Authors: Gechun Liang, Wei Wei, Zhen Wu, Zhenda Xu

http://arxiv.org/abs/2505.05113v3
Loss-Versus-Rebalancing under Deterministic and Generalized block-times
Updated: 2025-05-15T10:51:35Z
Although modern blockchains almost universally produce blocks at fixed intervals, existing models still lack an analytical formula for the loss-versus-rebalancing (LVR) incurred by Automated Market Maker (AMM) liquidity providers in this setting. Leveraging tools from random walk theory, we derive the following closed-form approximation for the expected LVR per block per unit of liquidity under constant block time:
\[ \overline{\mathrm{ARB}} = \frac{\sigma_b^{2}}{2+\sqrt{2\pi}\,\gamma/(|\zeta(1/2)|\,\sigma_b)} + O\!\bigl(e^{-\mathrm{const}\,\tfrac{\gamma}{\sigma_b}}\bigr) \;\approx\; \frac{\sigma_b^{2}}{2 + 1.7164\,\gamma/\sigma_b}, \]
where $\sigma_b$ is the intra-block asset volatility, $\gamma$ the AMM spread and $\zeta$ the Riemann zeta function. Our large Monte Carlo simulations show that this formula is in fact quasi-exact across practical parameter ranges.
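As a rough numerical check of the closed-form approximation above, the following Python sketch recomputes the coefficient $\sqrt{2\pi}/|\zeta(1/2)|$ and evaluates the leading term of the formula. It is only an illustration, not the authors' code: the names `zeta_euler_maclaurin`, `SPREAD_COEFF` and `lvr_approx` are hypothetical, the zeta value is obtained from a truncated Euler-Maclaurin expansion chosen here for self-containment, and the exponentially small remainder term is ignored.

```python
import math


def zeta_euler_maclaurin(s: float, n_terms: int = 1000) -> float:
    """Approximate the Riemann zeta function at a real s != 1.

    Uses a truncated Euler-Maclaurin expansion (an assumption of this
    sketch, not a method taken from the abstract):
        zeta(s) ~ sum_{n=1}^{N-1} n^{-s} + N^{1-s}/(s-1)
                  + N^{-s}/2 + s * N^{-s-1} / 12
    """
    partial_sum = sum(n ** (-s) for n in range(1, n_terms))
    integral_tail = n_terms ** (1.0 - s) / (s - 1.0)
    half_term = 0.5 * n_terms ** (-s)
    first_correction = s * n_terms ** (-s - 1.0) / 12.0
    return partial_sum + integral_tail + half_term + first_correction


# Coefficient multiplying gamma / sigma_b in the denominator of the formula.
ZETA_HALF = zeta_euler_maclaurin(0.5)
SPREAD_COEFF = math.sqrt(2.0 * math.pi) / abs(ZETA_HALF)


def lvr_approx(sigma_b: float, gamma: float) -> float:
    """Leading term of the per-block, per-unit-liquidity expected LVR.

    sigma_b: intra-block asset volatility (must be positive).
    gamma:   AMM spread (non-negative).
    The O(exp(-const * gamma / sigma_b)) remainder is dropped.
    """
    if sigma_b <= 0.0:
        raise ValueError("sigma_b must be positive")
    if gamma < 0.0:
        raise ValueError("gamma must be non-negative")
    return sigma_b ** 2 / (2.0 + SPREAD_COEFF * gamma / sigma_b)


if __name__ == "__main__":
    print("zeta(1/2) approx:", ZETA_HALF)
    print("sqrt(2*pi)/|zeta(1/2)| approx:", SPREAD_COEFF)
    print("LVR leading term for sigma_b=0.01, gamma=0.003:",
          lvr_approx(0.01, 0.003))
```

The computed coefficient comes out at about 1.716, in line with the 1.7164 quoted in the formula, and with zero spread the expression reduces to $\sigma_b^2/2$.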
Extending our analysis to arbitrary block-time distributions as well, we demonstrate both that, under every admissible inter-block law, the probability that a block carries an arbitrage trade converges to a universal limit, and that only constant block spacing attains the asymptotically minimal LVR. This shows that constant block intervals provide the best possible protection against arbitrage for liquidity providers.
Published: 2025-05-08T10:30:24Z
Comments: 16 pages, 2 figures
Authors: Alex Nezlobin, Martin Tassy

http://arxiv.org/abs/2505.04553v2
Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions
Updated: 2025-05-15T10:40:05Z
We propose a reinforcement learning (RL) framework under a broad class of risk objectives, characterized by convex scoring functions. This class covers many common risk measures, such as variance, Expected Shortfall, entropic Value-at-Risk, and mean-risk utility. To resolve the time-inconsistency issue, we consider an augmented state space and an auxiliary variable and recast the problem as a two-state optimization problem. We propose a customized Actor-Critic algorithm and establish some theoretical approximation guarantees. A key theoretical contribution is that our results do not require the Markov decision process to be continuous. Additionally, we propose an auxiliary variable sampling method inspired by the alternating minimization algorithm, which is convergent under certain conditions. We validate our approach in simulation experiments with a financial application in statistical arbitrage trading, demonstrating the effectiveness of the algorithm.
Published: 2025-05-07T16:31:42Z
Comments: 35 pages
Authors: Shanyu Han, Yang Liu, Xiang Yu

http://arxiv.org/abs/2505.09184v1
Gatheral double stochastic volatility model with Skorokhod reflection
Updated: 2025-05-14T06:30:33Z
We investigate the Gatheral model of double mean-reverting stochastic volatility, in which the drift term itself follows a mean-reverting process, and the overall model exhibits mean-reverting behavior.
We demonstrate that such processes can attain values arbitrarily close to zero and remain near zero for extended periods, making them practically and statistically indistinguishable from zero. To address this issue, we propose a modified model incorporating Skorokhod reflection, which preserves the model's flexibility while preventing volatility from approaching zero.
Published: 2025-05-14T06:30:33Z
Comments: 19 pages
Authors: Yuliya Mishura, Andrey Pilipenko, Kostiantyn Ralchenko

http://arxiv.org/abs/2505.08943v1
The value of partial information
Updated: 2025-05-13T20:14:37Z
We investigate a pricing rule that is applicable for streams of income or contingent claim liabilities and study how this rule changes under additional insider-type information that an investor might obtain. Considering a model where the risky asset might have jumps, we obtain an explicit form of the associated state price density for the three different types of agents considered in [ER20]: one who has no information about the jumps, one who knows in advance exactly when each jump will occur, and one who has no information about the size of the jumps but has partial information about the size of each jump. For each of these agents, we provide characterizations of the pricing rule and establish a representation formula, allowing us to quantify the value of partial information for streams of labor income or contingent claim liabilities. Our work is motivated by finding and characterizing a pricing rule that, both with and without partial information about jumps, assigns different values of information for different income streams or contingent claim liabilities.
Published: 2025-05-13T20:14:37Z
Comments: 20 pages, preliminary version
Authors: Philip A. Ernst, Oleksii Mostovyi

http://arxiv.org/abs/2505.08852v1
Measure-Valued CARMA Processes
Updated: 2025-05-13T17:48:45Z
In this paper, we examine continuous-time autoregressive moving-average (CARMA) processes on Banach spaces driven by Lévy subordinators.
We show their existence and cone-invariance, investigate their first and second order moment structure, and derive explicit conditions for their stationarity. Specifically, we define a measure-valued CARMA process as the analytically weak solution of a linear state-space model in the Banach space of finite signed measures. By selecting suitable input, transition, and output operators in the linear state-space model, we show that the resulting solution possesses CARMA dynamics and remains in the cone of positive measures defined on some spatial domain. We also illustrate how positive measure-valued CARMA processes can be used to model the dynamics of functionals of spatio-temporal random fields and connect our framework to existing CARMA-type models from the literature, highlighting its flexibility and broader applicability.
Published: 2025-05-13T17:48:45Z
Authors: Fred Espen Benth, Sven Karbach, Asma Khedher

http://arxiv.org/abs/2505.07537v1
The Exploratory Multi-Asset Mean-Variance Portfolio Selection using Reinforcement Learning
Updated: 2025-05-12T13:18:49Z
In this paper, we study the continuous-time multi-asset mean-variance (MV) portfolio selection using a reinforcement learning (RL) algorithm, specifically the soft actor-critic (SAC) algorithm, in the time-varying financial market. A family of Gaussian portfolio selections is derived, and a policy iteration process is crafted to learn the optimal exploratory portfolio selection. We prove the convergence of the policy iteration process theoretically, based on which the SAC algorithm is developed. To improve the algorithm's stability and the learning accuracy in the multi-asset scenario, we divide the model parameters that influence the optimal portfolio selection into three parts, and learn each part progressively.
Numerical studies in the simulated and real financial markets confirm the superior performance of the proposed SAC algorithm under various criteria.
Published: 2025-05-12T13:18:49Z
Authors: Yu Li, Yuhan Wu, Shuhua Zhang

http://arxiv.org/abs/2409.07159v3
Market information of the fractional stochastic regularity model
Updated: 2025-05-12T10:23:31Z
The Fractional Stochastic Regularity Model (FSRM) is an extension of the Black-Scholes model describing the multifractal nature of prices. It is based on a multifractional process with a random Hurst exponent $H_t$, driven by a fractional Ornstein-Uhlenbeck (fOU) process. When the regularity parameter $H_t$ is equal to $1/2$, the efficient market hypothesis holds, but when $H_t\neq 1/2$ past price returns contain some information on a future trend or mean-reversion of the log-price process. In this paper, we investigate some properties of the fOU process and, thanks to information theory and Shannon's entropy, we determine theoretically the serial information of the regularity process $H_t$ of the FSRM, giving some insight into one's ability to forecast future price increments and to build statistical arbitrages with this model.
Published: 2024-09-11T10:14:02Z
Comments: 23 pages, 12 figures
Authors: Daniele Angelini, Matthieu Garcin

http://arxiv.org/abs/2505.07231v1
Mean Field Portfolio Games with Epstein-Zin Preferences
Updated: 2025-05-12T05:06:59Z
We study mean field portfolio games under Epstein-Zin preferences, which naturally encompass the classical time-additive power utility as a special case. In a general non-Markovian framework, we establish a uniqueness result by proving a one-to-one correspondence between Nash equilibria and the solutions to a class of BSDEs. A key ingredient in our approach is a necessary stochastic maximum principle tailored to Epstein-Zin utility and a nonlinear transformation.
In the deterministic setting, we further derive an explicit closed-form solution for the equilibrium investment and consumption policies.
Published: 2025-05-12T05:06:59Z
Comments: 25 pages; comments are welcome
Authors: Guanxing Fu, Ulrich Horst

http://arxiv.org/abs/2505.05121v1
Error Analysis of Deep PDE Solvers for Option Pricing
Updated: 2025-05-08T10:45:59Z
Option pricing often requires solving partial differential equations (PDEs). Although deep learning-based PDE solvers have recently emerged as quick solutions to this problem, their empirical and quantitative accuracy remains poorly understood, hindering their real-world applicability. In this research, our aim is to offer actionable insights into the utility of deep PDE solvers for practical option pricing implementation. Through comparative experiments in both the Black-Scholes and the Heston model, we assess the empirical performance of two neural network algorithms to solve PDEs: the Deep Galerkin Method and the Time Deep Gradient Flow method (TDGF). We determine their empirical convergence rates and training time as functions of (i) the number of sampling stages, (ii) the number of samples, (iii) the number of layers, and (iv) the number of nodes per layer. For the TDGF, we also consider the order of the discretization scheme and the number of time steps.
Published: 2025-05-08T10:45:59Z
Comments: 15 pages, 19 figures
Authors: Jasper Rou

http://arxiv.org/abs/2305.00200v2
Calibration of Local Volatility Models with Stochastic Interest Rates using Optimal Transport
Updated: 2025-05-07T13:03:06Z
We develop a non-parametric, semimartingale optimal transport, calibration methodology for local volatility models with stochastic interest rate. The method finds a fully calibrated model which is the closest, in a way that can be defined by a general cost function, to a given reference model. We establish a general duality result which allows the problem to be solved by optimising over solutions to a second order fully non-linear Hamilton-Jacobi-Bellman equation.
Our methodology is analogous to Guo, Loeper, and Wang (2022) and Guo, Loeper, Obloj, et al. (2022a) but features a novel element of solving for discounted densities, or sub-probability measures. As an example, we apply the method to a sequential calibration problem, where a Vasicek model is already given for the interest rates and we seek to calibrate a stock price's local volatility model with volatility coefficient depending on time, the underlying and the short rate process, and the two processes driven by possibly correlated Brownian motions. The equity model is calibrated to any number of European option prices.
Published: 2023-04-29T08:54:20Z
Comments: This paper is now accepted in Finance & Stochastics and this submission is our final AAM version
Authors: Benjamin Joseph, Gregoire Loeper, Jan Obloj

http://arxiv.org/abs/2404.05230v2
Non-concave stochastic optimal control in finite discrete time under model uncertainty
Updated: 2025-05-05T16:34:16Z
In this article we present a general framework for non-concave robust stochastic control problems under model uncertainty in a discrete time finite horizon setting. Our framework accommodates a variety of different path-dependent ambiguity sets of probability measures comprising, as a natural example, the ambiguity set defined via Wasserstein-balls around path-dependent reference measures with path-dependent radii, as well as parametric classes of probability distributions. We establish a dynamic programming principle which allows both the optimal control and the worst-case measure to be derived by recursively solving a sequence of one-step optimization problems. Moreover, we derive upper bounds for the difference of the values of the robust and non-robust stochastic control problem in the Wasserstein uncertainty and parameter uncertainty case.
As a concrete application, we study the robust hedging problem of financial derivatives under an asymmetric (and non-convex) loss function accounting for the different preferences of the sell side and the buy side when it comes to the hedging of financial derivatives. As our entirely data-driven ambiguity set of probability measures, we consider Wasserstein-balls around the empirical measure derived from real financial data. We demonstrate that during adverse scenarios such as a financial crisis, our robust approach outperforms typical model-based hedging strategies such as the classical Delta-hedging strategy as well as the hedging strategy obtained in the non-robust setting with respect to the empirical measure and therefore overcomes the problem of model misspecification in such critical periods.
Published: 2024-04-08T06:53:05Z
Authors: Ariel Neufeld, Julian Sester