http://arxiv.org/abs/2602.16862v4 Action-Space Entropy Regularization in Bayesian Markowitz 2026-06-26T03:42:23Z

We solve the entropy-regularized mean--variance portfolio problem under Bayesian drift uncertainty. We combine continuous-time Bayesian filtering with stochastic policy optimization; the main finding is negative: the two mechanisms are orthogonal. Posterior dynamics are policy-independent, so entropy regularization cannot accelerate learning about the unknown drift. The mean control is identical to the deterministic Bayesian Markowitz feedback, and entropy enters only through policy variance. On the technical side, the optimal policy is Gaussian, the value function is quadratic in wealth, and the leading belief-dependent coefficient closes in exponential form. The framework recovers both parent models as limiting cases.
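As a brief illustration of why entropy can act through the policy variance alone, the standard differential entropy of a Gaussian is shown below. The symbols $m(x)$ and $v(x)$ for the policy mean and variance are notation assumed here for illustration, not taken from the paper.

```latex
% Differential entropy of a Gaussian policy \pi(\cdot \mid x) = \mathcal{N}(m(x), v(x)):
\mathcal{H}\bigl(\pi(\cdot \mid x)\bigr)
  = -\int_{\mathbb{R}} \pi(u \mid x)\,\ln \pi(u \mid x)\,\mathrm{d}u
  = \tfrac{1}{2}\,\ln\bigl(2\pi e\, v(x)\bigr).
% The mean m(x) does not appear, so an entropy bonus can only reward a larger variance v(x).
```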

2026-02-18T20:42:32Z 23 pages, 1 figure Andy Au http://arxiv.org/abs/2606.27462v1 The Decision Geometry of Covariance Estimation for the Global Minimum-Variance Portfolio under Heavy Tails 2026-06-25T18:35:30Z

The global minimum-variance portfolio (GMVP) is the canonical decision built from an estimated covariance matrix, yet covariance estimators are universally evaluated by matrix-norm loss, which is not the object the decision depends on. We characterise exactly how covariance-estimation error maps into GMVP suboptimality. We prove an exact regret identity and a non-asymptotic bound showing decision regret depends on the estimation error only through its action on the portfolio weights, scaled by portfolio concentration and the conditioning of the true covariance. From this we derive the decision geometry: GMVP regret is invariant to a (p-1)-dimensional projection of the p^2-dimensional error matrix, with invariance to the covariance-scale direction as an exact special case. We then apply the framework to heavy-tailed returns (tail index kappa in (2,4)), establishing the regret convergence rate implied by the centred operator-norm rate, and confirm the theory on a skew-t/t-copula simulation design with pre-registered analysis. The decision-focused advantage is a sharper constant and a concentration discount rather than a faster rate; we report an honest high-conditioning boundary of the rate prediction. The results complement recent decision-focused learning approaches by supplying the exact estimation geometry and consistency theory they lack.
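The link between covariance error and GMVP suboptimality can be sketched numerically. The snippet below is a minimal illustration using the textbook GMVP formula and a synthetic Gaussian design; the function names, dimensions, and data are assumptions made here, not the paper's identity, estimator, or simulation setup. It checks two elementary facts: regret equals the quadratic form of the weight error under the true covariance, and rescaling the estimate leaves regret unchanged.

```python
import numpy as np

def gmvp_weights(cov):
    """Textbook GMVP weights: Sigma^{-1} 1 / (1' Sigma^{-1} 1)."""
    ones = np.ones(cov.shape[0])
    x = np.linalg.solve(cov, ones)
    return x / x.sum()

def gmvp_regret(true_cov, est_cov):
    """Excess true variance from using weights built on the estimate."""
    w_star = gmvp_weights(true_cov)
    w_hat = gmvp_weights(est_cov)
    return w_hat @ true_cov @ w_hat - w_star @ true_cov @ w_star

# Synthetic example (hypothetical sizes, Gaussian returns for simplicity).
rng = np.random.default_rng(0)
p, n = 5, 60
a = rng.standard_normal((p, p))
true_cov = a @ a.T + p * np.eye(p)
samples = rng.multivariate_normal(np.zeros(p), true_cov, size=n)
est_cov = np.cov(samples, rowvar=False)

regret = gmvp_regret(true_cov, est_cov)

# Both weight vectors sum to one, so the cross term vanishes and
# regret reduces to d' Sigma d with d = w_hat - w_star.
d = gmvp_weights(est_cov) - gmvp_weights(true_cov)
quad_form = d @ true_cov @ d

# Rescaling the estimate does not change the weights, hence not the regret.
rescaled_regret = gmvp_regret(true_cov, 3.0 * est_cov)
```

Here `regret` and `quad_form` agree up to floating-point error, which shows that only the part of the estimation error that moves the weights matters for the decision.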

2026-06-25T18:35:30Z 19 pages, 1 figure Xavier Fonseca http://arxiv.org/abs/2606.26835v1 A sharp order-three obstruction to the aggregation of conditional price-of-risk attribution 2026-06-25T10:20:42Z

We study the squared price-of-risk premium of a portfolio -- an integrated conditional squared Sharpe-ratio functional, not an expected excess return -- and its attribution to causal drivers. Relative to a declared admissible benchmark it decomposes into intervention-stable premium, a signed causal distortion (the confounding wedge), and a nonnegative information loss; the loss is an $L^2$ projection residual, the wedge is not. The decomposition is well posed exactly when the driver filtration is immersed in the price filtration. It need not aggregate across portfolios pooling drivers: we identify an order-three obstruction that is invisible to every singleton and pairwise admissibility screen -- each one- and two-driver sub-book is immersed while the pooled triple reveals a future innovation -- the analogue of Bernstein's pairwise-but-not-mutually-independent triple, and minimal relative to such pairwise diagnostics. We separate its two ingredients, combinatorial masking and anticipative coupling. The failure is one of immersion, not of no-arbitrage. Experiments on synthetic single- and multi-driver panels show the decomposition and its causal correction are estimable, and that a permutation-calibrated screen detects planted order-three leakage with controlled false positives.

2026-06-25T10:20:42Z 18 pages, 4 Figures, All experiments are synthetic and use no proprietary data. The code reproducing every figure is openly available at https://doi.org/10.5281/zenodo.20843643 (archived) and https://github.com/AlejandroRodriguezDominguez/order-three-attribution (development); each script is seeded, so every figure is exactly reproducible Alejandro Rodriguez Dominguez http://arxiv.org/abs/2606.26815v1 Data-Driven Duration Management -- Term Structure Forecasting Using Machine Learning 2026-06-25T09:58:28Z

This paper compares different methods for forecasting the term structure of U.S. and European zero-coupon government bonds using both traditional econometric and Machine Learning (ML) approaches. We compare classical models (e.g., Dynamic Nelson-Siegel (DNS) and Principal Component Analysis (PCA)) with different Neural Network (NN) architectures, including those inspired by the classical models, on the U.S. Treasury market and bonds issued by the European Central Bank (ECB). To enhance predictive performance, macroeconomic variables are incorporated. The findings for both markets are separately analyzed and compared. To this end, we propose a robust model evaluation framework combining statistical accuracy metrics - such as RMSE, MAE, and directional accuracy - with the economic relevance of a quantitative bond trading strategy. Results show that NNs consistently outperform traditional models in both forecasting accuracy and portfolio performance. For the U.S., the most effective approach is a direct-forecasting NN that incorporates DNS factors to reduce the dimensionality of zero-rate data and an Autoencoder (AE) to extract macroeconomic features, while for Europe, the optimal model is a factor-based NN using PCA-derived zero-rate factors without the integration of macroeconomic variables. Overall, the paper demonstrates how combining traditional modeling approaches with modern ML techniques and evaluation can improve yield curve forecasts and support applications in fixed-income portfolio construction.
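For readers unfamiliar with how DNS factors reduce the dimensionality of a zero-rate curve, here is a minimal sketch of the standard Nelson-Siegel factor loadings fitted by least squares. The maturities, the decay value `lam = 0.0609` (a commonly used fixed value for maturities in months), and the synthetic yields are assumptions for illustration and are not taken from the paper.

```python
import numpy as np

def ns_loadings(maturities, lam):
    """Nelson-Siegel loadings for level, slope, and curvature."""
    tau = np.asarray(maturities, dtype=float)
    x = lam * tau
    slope = (1.0 - np.exp(-x)) / x
    curvature = slope - np.exp(-x)
    return np.column_stack([np.ones_like(tau), slope, curvature])

# Hypothetical maturities in months and an assumed decay parameter.
maturities = np.array([3, 6, 12, 24, 36, 60, 84, 120])
lam = 0.0609

# Synthetic curve generated from known factors (illustrative values only).
true_betas = np.array([4.0, -1.5, 0.8])
yields = ns_loadings(maturities, lam) @ true_betas

# With lam fixed, the three factors follow from ordinary least squares.
betas, *_ = np.linalg.lstsq(ns_loadings(maturities, lam), yields, rcond=None)
```

Eight zero rates are summarized by three numbers in `betas`, which is the kind of compression a downstream forecasting model can then work with.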

2026-06-25T09:58:28Z Tobias Lausser Joao Eduardo Vuolo Rudi Zagst http://arxiv.org/abs/2304.07672v2 Optimal Investment and Consumption Strategies with General Cost Structure under CRRA Utility 2026-06-25T07:34:32Z

Transaction costs play a critical role in portfolio allocation and consumption decisions. We study a finite-horizon consumption--investment problem with CRRA utility under a general class of transaction cost functions. Based on dynamic programming and a singular perturbation expansion for a small cost-to-wealth ratio, we derive leading-order asymptotic formulas for the no-trade region, the four trading boundaries, the value function correction, and the optimal consumption rate. We further show how fixed, proportional, fixed-plus-proportional, and nonlinear transaction costs arise as special cases of the general framework. The results show that the leading-order no-trade region is governed by the fixed and proportional components, while the framework still accommodates nonlinear cost structures. Complementing the asymptotic analysis, we prove a verification theorem for the exact impulse-control formulation under a strictly positive fixed cost component, and characterize its limiting transitions to singular and continuous control regimes as the fixed cost vanishes.

2023-04-16T02:17:05Z Yingting Miao Qiang Zhang http://arxiv.org/abs/2606.26625v1 Portfolio Optimization for Commodity ETFs under Heavy-Tailed Returns 2026-06-25T05:40:06Z

This paper examines portfolio optimization for commodity exchange-traded funds (ETFs) under heavy-tailed return behavior. Using daily Bloomberg data for 30 U.S.-listed commodity ETFs from 12 December 2018 to 16 December 2024, we study funds spanning agriculture, energy, metals, and broad commodity index exposure. We compare a passive buy-and-hold portfolio with rolling-window optimized portfolios formed under mean--variance and conditional value-at-risk (CVaR) criteria, considering both long-only and restricted long--short strategies. The results showed substantial heterogeneity across commodity sectors, with energy and broad commodity index funds displaying pronounced volatility, skewness, and excess kurtosis. Historical optimization indicated that minimum-risk and CVaR-based portfolios provided more stable cumulative performance than tangent portfolios and generally improved Sharpe, Calmar, and STARR$_{0.95}$ ratios. Extreme-value diagnostics showed that optimized portfolios remained exposed to heavy downside tails, so improved risk-adjusted performance did not eliminate extreme-loss risk. A dynamic extension based on ARMA--GARCH marginal models, Student--$t$ copula dependence, and one-step-ahead predictive scenarios improved performance mainly when combined with minimum-risk or CVaR-based objectives. Dynamic mean--variance tangent portfolios performed less reliably, reflecting sensitivity to expected-return estimation error. Transaction-cost robustness checks further showed that the practical value of dynamic optimization depended on turnover control, with low-turnover dynamic CVaR tangent portfolios remaining more resilient to implementation costs. Overall, the analysis showed that commodity ETF allocation benefited most from conservative and downside-risk-aware optimization, while optimized portfolios continued to require explicit tail-risk and implementation diagnostics.
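As a rough guide to the downside-risk measures mentioned above, the sketch below computes a historical CVaR and a STARR-style ratio (mean excess return divided by CVaR). The tail convention, the function names, and the sample returns are assumptions for illustration; the paper's exact estimators may differ.

```python
import numpy as np

def historical_cvar(returns, alpha=0.95):
    """Mean loss at or beyond the alpha-quantile of losses (loss = -return)."""
    losses = -np.asarray(returns, dtype=float)
    var = np.quantile(losses, alpha)   # historical VaR at level alpha
    return losses[losses >= var].mean()

def starr(returns, alpha=0.95, risk_free=0.0):
    """Mean excess return per unit of CVaR."""
    r = np.asarray(returns, dtype=float)
    return (r.mean() - risk_free) / historical_cvar(r, alpha)

# Made-up returns: two bad days and eighteen small gains.
sample_returns = np.array([-0.10, -0.04] + [0.01] * 18)
cvar_95 = historical_cvar(sample_returns, alpha=0.95)
starr_95 = starr(sample_returns, alpha=0.95)
```

With only twenty observations the 95% tail contains a single point, so `cvar_95` is just the worst loss. This is a reminder of how noisy tail estimates are in short rolling windows.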

2026-06-25T05:40:06Z Nicholas Appiah Ali Jaffri Dilmi C. W. Hettiachchi-Halpe-Kankanamalage Svetlozar T. Rachev http://arxiv.org/abs/2606.25811v1 Hierarchical Graph Learning for Calendar Spread Strategies in Commodity Futures Markets 2026-06-24T13:25:15Z

Commodity futures can be represented hierarchically, with underlying assets at the upper level and individual futures contracts at the lower level. Entities at each level can be connected by edges reflecting inherent correlations, with cross-level edges capturing contract-to-underlying asset connections. Building on our observations of these structures, we propose a hierarchical graph learning approach for calendar spread (CS) strategies in commodity futures markets, addressing two significant gaps in the machine-learning literature: (i) the absence of learning-based methods for CS strategies in futures markets, and (ii) the lack of consideration of maturity-dependent interrelationships across commodity futures. We first establish the efficacy of CS strategies by analytically showing that CS strategies can possess higher risk-adjusted returns, measured by the information ratio, and lower risk, measured by variance and delta, than long-only strategies. We then introduce a method to convert learning-based predictions into CS positions. Next, we develop a hierarchical graph learning method that predicts futures price movements by utilizing the maturity-dependent interrelationships, thereby yielding a CS trading algorithm. Empirical results on commodity futures markets traded on the Chicago Mercantile Exchange Group demonstrate that our method outperforms benchmark models in both prediction and trading performance. We find that maturity-dependent interrelationships across commodity futures are instrumental in prediction and that CS trading based on hierarchical graph learning is effective for statistical arbitrage.

2026-06-24T13:25:15Z Yoonsik Hong Diego Klabjan http://arxiv.org/abs/2606.25696v1 A Two-Stage Decision Support System for Sustainability-Aware Long Short Portfolio Optimization 2026-06-24T11:07:49Z

This paper proposes a two-stage decision support system for long-short portfolio optimization under environmental, social, and governance (ESG) considerations. In the first stage, assets are evaluated using a multi-criteria procedure based on TODIMSort, with criterion weights derived using the MEREC (Removal Effects of Criteria) method. This allows assets to be assigned to classes ordered according to preferences that respond to market conditions and investor priorities, thus generating sets of long and short opportunities that dynamically adapt to the prevailing regime. In the second stage, we formulate a non-convex portfolio optimization problem that maximizes the Omega ratio while respecting budget, bound and leverage constraints. To solve it, we introduce an adaptive particle swarm solver equipped with a controller that selects, at each iteration, the most suitable recombination operator from a diverse pool of operators and combines it with a projection-based repair mechanism for constraint management. The empirical study, conducted on 421 stocks in the STOXX Europe 600 index, examines both the exploration capabilities and solution quality of the proposed solver compared to state-of-the-art benchmarks, as well as the ex post profitability of the resulting portfolio strategies. The results show that ESG-enhanced long-short portfolios offer competitive and often superior performance compared to their non-ESG counterparts and the market-value-weighted benchmark.
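The second-stage objective can be made concrete with the standard empirical Omega ratio: average gains above a threshold divided by average shortfalls below it. The function name, the threshold default, and the sample returns below are assumptions for illustration; constraints and the swarm solver are not sketched.

```python
import numpy as np

def omega_ratio(returns, threshold=0.0):
    """Empirical Omega ratio: E[(R - tau)^+] / E[(tau - R)^+]."""
    excess = np.asarray(returns, dtype=float) - threshold
    gains = np.clip(excess, 0.0, None).mean()
    shortfalls = np.clip(-excess, 0.0, None).mean()
    if shortfalls == 0.0:
        return float("inf")  # no observation falls below the threshold
    return gains / shortfalls

# Made-up portfolio returns.
omega = omega_ratio([0.02, -0.01, 0.03, -0.02], threshold=0.0)
```

The ratio of two piecewise-linear expectations of portfolio return is not a concave function of the weights, which is one reason heuristic solvers are often used for this objective.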

2026-06-24T11:07:49Z Giacomo di Tollo Massimiliano Kaucic Filippo Piccotto http://arxiv.org/abs/2108.02283v3 Machine Learning Classification and Portfolio Construction: Does the Loss Function Matter? 2026-06-22T22:59:46Z

Classification outperforms regression across matched machine learning models in portfolio construction. A stacking ensemble of gradient boosted tree, random forest, and neural network yields a value-weighted annualized Sharpe ratio of 1.83 for classification and 1.11 for regression. This outperformance persists in multiclass settings, across subsamples, and after transaction costs. Spanning tests show that classification retains economically large alphas after we control for regression, whereas regression alphas shrink substantially once we control for classification. These results indicate that classification extracts more return information than matched regression. Our diagnostics trace classification's advantage to sharper and more precise separation of return deciles.

2021-08-04T20:48:27Z Yang Bai Kuntara Pukthuanthong http://arxiv.org/abs/2606.23367v1 Asymmetry PRISM: A CPU/GPU Portfolio Optimization Engine for Deadline-Bounded Institutional Rebalancing 2026-06-22T14:03:54Z

Institutional rebalancing is a batched optimization workload with a hard operating deadline: hundreds of accounts need new weights under budget, turnover, exposure, exclusion, and tax-aware controls before trading can proceed. This paper evaluates Asymmetry PRISM, a CPU/GPU portfolio optimization engine, through a public evaluation boundary: problem data in, and returned weights, status codes, timings, memory class, external feasibility diagnostics, eligible objective comparisons, and audit records out. Within that boundary, the evaluation protocol fixes hardware and software versions, declares timing lanes, separates cold single calls from repeated workloads, and admits objective-gap claims only where an eligible reference solver completed. On completed multi-solver rows from N=100 to N=2,000, Asymmetry PRISM-CPU is 4.5x to 24.1x faster than the fastest completed reference row in the same lane. In the production queue study, Asymmetry PRISM-GPU completes 500/500 accounts over a 10,000-instrument universe in 109.5 s within a declared 25-minute operating window, with zero missed deadlines and an audit record for every solve; the recorded OSQP queue baseline completes 4/500. On an operationally constrained real-data suite (tax-motivated transition penalties, restriction caps, turnover controls, batches), Asymmetry PRISM clears constrained solves 3.4x to 126.7x faster than the best completing incumbent at certified-equal objectives, and the GPU route widens to 8.8x over the CPU route at N=384,800. Rows without a completed reference are reported as feasibility, timing, memory, and failure-status evidence.

2026-06-22T14:03:54Z 22 pages, 8 figures Debdoot Ghosh http://arxiv.org/abs/2505.07820v3 Revisiting the Excess Volatility Puzzle Through the Lens of the Chiarella Model 2026-06-22T09:32:46Z

We amend and extend the Chiarella model of financial markets to deal with arbitrary long-term value drifts in a consistent way. This allows us to improve upon existing calibration schemes, opening the possibility of calibrating individual monthly time series instead of classes of time series. The technique is employed on spot prices of four asset classes from ca. 1800 onward (stock indices, bonds, commodities, currencies). The so-called fundamental value is a direct output of the calibration, which allows us to (a) quantify the amount of excess volatility in these markets, which we find to be large (e.g. a factor $\approx$ 4 for stock indices) and consistent with previous estimates; and (b) determine the distribution of mispricings (i.e. the difference between market price and value), which we find in many cases to be bimodal. Both findings are strongly at odds with the Efficient Market Hypothesis. We also study in detail the 'sloppiness' of the calibration, that is, the directions in parameter space that are weakly constrained by data. The main conclusions of our study are remarkably consistent across different asset classes, and reinforce the hypothesis that the medium-term fate of financial markets is determined by a tug-of-war between trend followers and fundamentalists.

2025-05-12T17:59:46Z 20 pages plus 11 pages of appendices, 11+12 figures, 2+6 tables PLoS One 21(1): e0340409 (2026) Jutta G. Kurth Adam A. Majewski Jean-Philippe Bouchaud 10.1371/journal.pone.0340409 http://arxiv.org/abs/2205.08614v4 Well Posedness of Utility Maximization Problems Under Partial Information in a Market with Gaussian Drift 2026-06-21T10:17:27Z

This paper investigates well posedness of utility maximization problems for financial markets where stock returns depend on a hidden Gaussian mean-reverting drift process. Since that process is potentially unbounded, well posedness cannot be guaranteed for utility functions that are not bounded from above. For power utility with relative risk aversion smaller than that of log-utility, this leads to restrictions on the choice of model parameters such as the investment horizon and parameters controlling the variance of the asset price and drift processes. We derive sufficient conditions on the model parameters leading to bounded maximum expected utility of terminal wealth for models with full and partial information.

2022-05-17T20:14:04Z 26 pages, 1 figure Abdelali Gabih Hakam Kondakji Ralf Wunderlich http://arxiv.org/abs/2606.20903v1 Reinforcement Learning for Risk-Sensitive Investment Management: a Free Energy--Entropy Duality Approach 2026-06-18T19:58:15Z

This paper develops a reinforcement-learning approach to continuous-time risk-sensitive benchmarked asset allocation in a partly model-based setting. The benchmarked problem does not directly fit the standard Markovian stochastic-control template: the state is uncontrolled, whereas the terminal reward contains a controlled Itô integral. We use free energy-entropy duality to reformulate the problem as a linear-quadratic-Gaussian stochastic differential game under an equivalent probability measure, yielding explicit finite- and infinite-horizon saddle-point solutions. This structure guides a continuous-time $q$-learning actor-critic method: the quadratic value function motivates the critic, while the affine saddle-point controls motivate deterministic actors for the portfolio allocation and adversarial control. The learned allocation admits an economic interpretation through fractional Kelly decompositions. A proof-of-concept implementation calibrated to U.S. equity data shows that the actors learn the optimal policy with high accuracy and reveals a favorable asymmetry: the portfolio actor receives a cleaner learning signal than the auxiliary adversarial actor.

2026-06-18T19:58:15Z Sebastien Lleo Wolfgang Runggaldier http://arxiv.org/abs/2111.14631v2 Model Risk in Credit Portfolio Models 2026-06-16T08:16:51Z

Model risk in credit portfolio models is a serious issue for banks but has so far not been tackled comprehensively. We will demonstrate how to deal with uncertainty in all model parameters in an all-embracing, yet easy-to-implement way.

2021-11-23T13:12:47Z 12 pages, 2 figures. This version: minor corrections, updates, and comments Christian Meyer http://arxiv.org/abs/2606.17032v1 Sharpe Ratio and Return-VaR Ratio Maximization for Option Portfolios with Skew-Elliptical $t$ Underlying Returns 2026-06-15T17:52:47Z

We provide a formulation for optimal option portfolios under Sharpe Ratio maximization when the underlying returns follow a skew-elliptical t-distribution. This departs from the traditional normal returns setting in the context of Sharpe ratio maximization by allowing the modelling of heavy-tailed and skewed dynamics. The novelty of this paper and our main result is to provide explicit formulas for the portfolio weights when maximizing the Sharpe ratio and return-to-Value-at-Risk (VaR) ratio in the skew-elliptical setting. Numerical experiments reveal that the optimal portfolios for the two ratios are different.

2026-06-15T17:52:47Z 14 pages Kyle Sung Traian A. Pirvu