https://arxiv.org/api/U4AuXrJbes7xe0Cxut+yRO8Xun8 2026-03-30T08:44:37Z 60152 135 15 http://arxiv.org/abs/2603.23303v1 Exponential Turnpike Theorems for Nonlinear Deterministic Meanfield Optimal Control Problems 2026-03-24T15:07:08Z

In this article, we establish exponential turnpike theorems for a class of nonlinear deterministic meanfield optimal control problems. We carry out our analysis simultaneously in the so-called Lagrangian and Eulerian frameworks. In the Lagrangian setting, the problem is lifted to a Hilbert space of random variables, and we prove an exponential turnpike theorem by combining first-order optimality conditions, a second-order expansion of the lifted Hamiltonian, and an operator Riccati diagonalization argument. In the Eulerian setting, we derive intrinsic KKT conditions for the static constrained problem, and show how the Eulerian second-order hypotheses split into a horizontal part, transferred by unitary conjugation to the lifted space, and a vertical part which reduces to uniform pointwise stabilizability and detectability conditions on multiplication operators. This yields an exponential turnpike theorem in the Wasserstein space for optimal Pontryagin triples. Along the way, we %provide explicit the link between Wasserstein Hessians and their Lagrangian lifts, and provide several remarks clarifying the role of occupation measures, local Eulerian minimizers, and control constraints in our results.

2026-03-24T15:07:08Z 31 pages Benoît Bonnet-Weill Giovanni Colombo Denis Shishmintsev Emmanuel Trélat http://arxiv.org/abs/2603.23299v1 Pruning for efficient deterministic global optimization over trained ReLU neural networks 2026-03-24T15:01:56Z

Neural networks are increasingly used as surrogates in optimization problems to replace computationally expensive models. However, embedding ReLU neural networks in mathematical programs introduces significant computational challenges, particularly for deep and wide networks, due to both the formulation of the ReLU disjunction and the resulting large-scale optimization problem. This work investigates how pruning techniques can accelerate the solution of optimization problems with embedded neural networks, focusing on the mechanisms underlying the computational gains. We provide theoretical insights into how both unstructured (weight) and structured (node) pruning affect the ReLU big-M formulation, showing that pruning monotonically tightens preactivation bounds. We conduct comprehensive empirical studies across multiple network architectures using an illustrative test function and a realistic chemical process flowsheet optimization case study. Our results show that pruning achieves speedups of up to three to four orders of magnitude, with computational gains attributed to three key factors: (i) reduction in problem size, (ii) decrease in the number of integer variables, and (iii) tightening of big-M bounds. Weight pruning is particularly effective for deep, narrow networks, while node pruning performs better for shallow, wide or medium-sized networks. In the chemical engineering case study, pruning enabled convergence within seconds for problems that were otherwise intractable. We recommend adopting pruning as standard practice when developing neural network surrogates for optimization, especially for engineering applications requiring repeated optimization solves.

2026-03-24T15:01:56Z Giacomo Lastrucci Tanuj Karia Victor Schulte Dominik Bongartz Artur M. Schweidtmann http://arxiv.org/abs/2603.23249v1 A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling 2026-03-24T14:16:08Z

Efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments is challenging due to resource capacities and dependencies. In practice, the need for adaptability across environments with varying resource pools and task types, alongside rapid schedule generation, complicates these challenges. We propose WeCAN, an end-to-end reinforcement learning framework for heterogeneous DAG scheduling that addresses task--pool compatibility coefficients and generation-induced optimality gaps. It adopts a two-stage single-pass design: a single forward pass produces task--pool scores and global parameters, followed by a generation map that constructs schedules without repeated network calls. Its weighted cross-attention encoder models task--pool interactions gated by compatibility coefficients, and is size-agnostic to environment fluctuations. Moreover, widely used list-scheduling maps can incur generation-induced optimality gaps from restricted reachability. We introduce an order-space analysis that characterizes the reachable set of generation maps via feasible schedule orders, explains the mechanism behind generation-induced gaps, and yields sufficient conditions for gap elimination. Guided by these conditions, we design a skip-extended realization with an analytically parameterized decreasing skip rule, which enlarges the reachable order set while preserving single-pass efficiency. Experiments on computation graphs and real-world TPC-H DAGs demonstrate improved makespan over strong baselines, with inference time comparable to classical heuristics and faster than multi-round neural schedulers.

2026-03-24T14:16:08Z 30pages, 8 figures Ruisong Zhou Haijun Zou Li Zhou Chumin Sun Zaiwen Wen http://arxiv.org/abs/2603.23173v1 A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control 2026-03-24T13:15:48Z

High-dimensional stochastic optimal control (SOC) becomes harder with longer planning horizons: existing methods scale linearly in the horizon $T$, with performance often deteriorating exponentially. We overcome these limitations for a subclass of linearly-solvable SOC problems-those whose uncontrolled drift is the gradient of a potential. In this setting, the Hamilton-Jacobi-Bellman equation reduces to a linear PDE governed by an operator $\mathcal{L}$. We prove that, under the gradient drift assumption, $\mathcal{L}$ is unitarily equivalent to a Schrödinger operator $\mathcal{S} = -Δ+ \mathcal{V}$ with purely discrete spectrum, allowing the long-horizon control to be efficiently described via the eigensystem of $\mathcal{L}$. This connection provides two key results: first, for a symmetric linear-quadratic regulator (LQR), $\mathcal{S}$ matches the Hamiltonian of a quantum harmonic oscillator, whose closed-form eigensystem yields an analytic solution to the symmetric LQR with \emph{arbitrary} terminal cost. Second, in a more general setting, we learn the eigensystem of $\mathcal{L}$ using neural networks. We identify implicit reweighting issues with existing eigenfunction learning losses that degrade performance in control tasks, and propose a novel loss function to mitigate this. We evaluate our method on several long-horizon benchmarks, achieving an order-of-magnitude improvement in control accuracy compared to state-of-the-art methods, while reducing memory usage and runtime complexity from $\mathcal{O}(Td)$ to $\mathcal{O}(d)$.

2026-03-24T13:15:48Z Accepted to ICLR 2026, code available in https://github.com/lclaeys/eigenfunction-solver Louis Claeys Artur Goldman Zebang Shen Niao He http://arxiv.org/abs/2603.23156v1 Mean Field Games for Renewable Energy Development 2026-03-24T12:58:12Z

We propose a mean field game (MFG) framework to model the evolution of renewable energy production in competitive electricity markets. Producers interact through the spot price while optimising their profits under production, installation, and capacity adjustment costs, as well as the generation uncertainty. We first formulate the market as an $N$-player stochastic differential game and analyse its mean field game limit as $N\to\infty$. We characterise the representative producer's optimal control via forward-backward stochastic differential equations (FBSDEs) derived from the stochastic maximum principle and determine the corresponding equilibrium spot price. We establish existence and uniqueness of solutions to the FBSDEs and prove that the MFG admits a unique equilibrium. We then extend the model to a Stackelberg mean field game to incorporate the role of a social planner. The planner's optimisation problem leads to an extended Hamilton-Jacobi-Bellman (HJB) system, for which we prove existence and uniqueness of viscosity solutions. Finally, we implement a deep learning-based numerical scheme to approximate the equilibrium and investigate the impact of policy interventions on capacity dynamics. Our results highlight how optimal subsidy design depends on prevailing market conditions and can mitigate both capacity shortages and overproduction.

2026-03-24T12:58:12Z 20 figures Luciano Campi Zhuoshu Wu http://arxiv.org/abs/2603.23135v1 The relief distribution problem with trucks and drones under incomplete demand information 2026-03-24T12:33:48Z

Disaster relief operations often take place under uncertainty regarding the extent of damage across locations. In this paper, we study the delivery of relief aid in the aftermath of disasters when delivery vehicles are assisted by surveillance drones and the demand for relief supplies is initially unknown. We introduce a stylized problem that arises in many emergency supply delivery settings -- the relief distribution problem (RDP). In RDP, emergency vehicles, referred to as trucks, must distribute relief supplies on a network, starting from the depot to potential delivery locations, whose demand is initially unknown. The trucks are assisted by surveillance drones, which cannot deliver relief supplies, but scout delivery locations to see whether relief supplies are needed or not. The objective is to visit all location by any vehicle, deliver supplies to all damaged ones, and minimizing the completion time of the relief operation. We study two natural policies for the online problem RDP which we evaluate in two ways: the competitive ratio quantifies the performance in comparison to an optimal solution obtained under full information on damages, the drone-impact is the ratio of the algorithm's performance to the best outcome achievable without drones. Through theoretical analysis and computational experiments, we characterize the operational trade-offs between these policies and derive insights for the effective deployment of drones in disaster response.

2026-03-24T12:33:48Z Aaron Neugebauer Alena Otto Marie Schmidt http://arxiv.org/abs/2603.23131v1 Optimal Control of Switched Systems Governed by Logical Switching Dynamics 2026-03-24T12:27:47Z

This paper investigates the optimal co-design of logical and continuous controls for switched linear systems governed by controlled logical switching dynamics. Unlike traditional switched systems with arbitrary or state-dependent switching, the switching signals here are generated by an internal logical dynamical system and explicitly integrated into the control synthesis. By leveraging the semi-tensor product (STP) of matrices, we embed the coupled logical and continuous dynamics into a unified algebraic state-space representation, transforming the co-design problem into a tractable linear-quadratic framework. We derive Riccati-type backward recursions for both deterministic and stochastic logical dynamics, which yield optimal state-feedback laws for continuous control alongside value-function-based, state-dependent decision rules for logical switching. To mitigate the combinatorial explosion inherent in logical decision-making, a hierarchical algorithm is developed to decouple offline precomputation from efficient online execution. Numerical simulations demonstrate the efficacy of the proposed framework.

2026-03-24T12:27:47Z 26 pages, 3 figures Xiao Zhang Min Meng Changxi Li Ka-Fai Cedric Yiu http://arxiv.org/abs/2502.15421v2 Bounding the Error of Value Functions in Sobolev Norm Yields Bounds on Suboptimality of Controller Performance 2026-03-24T12:07:16Z

Optimal feedback controllers for nonlinear systems can be derived by solving the Hamilton-Jacobi-Bellman (HJB) equation. However, because the HJB is a nonlinear partial differential equation, numerical methods typically provide only approximate solutions. While numerical error bounds on approximate HJB solutions are often available, these bounds do not necessarily translate into guarantees on the suboptimality of the resulting controllers. In this paper, we establish that the suboptimality of the resulting controller is bounded by the $L^\infty$ norm of the HJB residual, which is, in turn, bounded by numerical error in the value function as measured in the Sobolev $W^{1,\infty}$ norm. This implies that convergence of value functions in $W^{1,\infty}$ result in controllers that yield a cost that is arbitrarily close to the true minimum. In contrast, we demonstrate that such guarantees do not hold when the value function error is measured in weaker norms, such as the Sobolev $W^{1,p}$ norm for finite $p$. These results apply to systems governed by Lipschitz continuous dynamics over a finite time horizon with compact input space.

2025-02-21T12:39:44Z Morgan Jones Matthew Peet http://arxiv.org/abs/2603.23099v1 DSO Led-Bilevel Optimization Framework for TSO-DSO Coordination across Active Distribution Networks 2026-03-24T11:48:01Z

This work presents a bilevel coordination model that captures the hierarchical interaction between the transmission and distribution layers under a Distribution System Operator(DSO)-led configuration. In this scheme, multiple DSOs independently optimize the operation of their active distribution networks (ADNs), including photovoltaic (PV) generation, battery energy storage systems (BESS), and peer-to-peer (P2P) energy exchanges both within and across ADNs through the Transmission Network (TN), before the Transmission System Operator (TSO) performs the global coordination. The proposed formulation combines the Second-Order Cone relaxation of the DistFlow model to represent the distribution networks (DNs) with the classical DC optimal power flow (OPF) model for the transmission layer. The DSO-first decision sequence enables the reformulation of the bi-level problem into an equivalent single-level optimization model using the Karush-Kuhn-Tucker (KKT) conditions, resulting in a Mixed-Integer Second-Order Cone Programming (MISOCP) formulation that captures both the discrete and convex characteristics of the problem, while preserving the binary variables associated with DER and P2P operation, which would otherwise need to be relaxed in traditional TSO-led approaches. The model is tested on a hybrid system composed of the IEEE 30-bus transmission network and five IEEE 33-bus DNs. Results show that the DSO-led coordination leads to a more efficient use of BESS, improves local self-consumption, and reduces imports from the TN compared to the conventional top-down scheme. Furthermore, computational results from the case study reveal that the model exhibits near-linear or quadratic growth in problem size as the number of ADNs increases, suggesting its applicability to large-scale multi-ADN configurations.

2026-03-24T11:48:01Z Fernando García-Muñoz Martín Venegas Escalona http://arxiv.org/abs/2603.23046v1 Convergence analysis of accelerated algorithms via a mixed-order dynamical system for separable nonsmooth convex optimization 2026-03-24T10:32:46Z

For a linear equality constrained convex optimization problem involving two objective functions with a ``nonsmooth" + ``nonsmooth" composite structure, we study two algorithms derived from a mixed-order dynamical system which incorporates time scales and a Tikhonov regularization term. We observe that different types of multipliers lead to distinct algorithms. For the implicit multiplier and semi-implicit multiplier, we develop a new primal-dual joint algorithm and a new splitting algorithm, respectively. Our proposed joint algorithm can reduce to an algorithm for solving the corresponding non-separable linearly constrained convex optimization problem. Then, we establish the nonergodic convergence properties of all our proposed algorithms. Moreover, we derive that the sequences generated by these algorithms strongly converge to the minimal norm solution. Finally, numerical experiments are conducted to validate the practical performance of the proposed algorithms.

2026-03-24T10:32:46Z 27 pages, 5 figures, submitted to Journal of Computational and Applied Mathematics (JCAM) Geng-Hua Li Hai-Yi Zhao Xiangkai Sun http://arxiv.org/abs/2603.23042v1 Minimizing Material Waste in Additive Manufacturing through Online Reel Assignment 2026-03-24T10:29:04Z

We study a variant of the online bin packing problem that arises in filament-based 3D printing systems operating in make-to-order settings, where only a limited number of filament reels of finite capacity can be handled at once. Components are assigned to reels upon arrival and insufficient reels are discarded to be replaced with new ones, resulting in material waste. To minimize the long-run average discarded filament through an online assignment policy, we formulate this problem as an infinite-horizon average-cost Markov Decision Process and analyze the structure of policies under stochastic, sequential demand. We first show that under a random allocation policy, the system decomposes into a collection of identical single-reel processes, allowing us to derive a closed-form expression for the average waste and enabling a tractable baseline analysis. Building on this decomposition, we construct a theoretically grounded index policy that assigns each reel a score reflecting the marginal cost of assignment and prove that it constitutes a one-step policy improvement over random allocation. We embed the index-based structure within a Deep Reinforcement Learning framework using approximate policy iteration. The resulting method achieves near-optimal performance across a range of simulated and real-world scenarios. Our results demonstrate that Reinforcement Learning policy significantly reduces material waste while maintaining real-time feasibility and interpretability.

2026-03-24T10:29:04Z Ilayda Celenk Willem van Jaarsveld Ivo J. B. F. Adan Alp Akcay http://arxiv.org/abs/2603.22996v1 Structure-Aware Optimization of Decision Diagrams for Health Guidance via Integer Programming 2026-03-24T09:39:35Z

In this paper, we consider a structure-aware optimization problem for decision diagrams used for health guidance. In particular, we focus on decision diagrams that decide to whom public sectors suggest consulting a medical worker. Furthermore, these diagrams decide which notification method should be used for each target person. In this paper, we formulate this problem as an integer program. Then we evaluate its practical usefulness through numerical examples.

2026-03-24T09:39:35Z Nanako Shimaoka Naoyuki Kamiyama Shinji Hotta Sayuri Kohmura Yuta Kurume Hiroko Suzuki Akihiro Inomata Eigo Segawa http://arxiv.org/abs/2603.24613v1 Persistence-based topological optimization: a survey 2026-03-24T09:14:46Z

Computational topology provides a tool, persistent homology, to extract quantitative descriptors from structured objects (images, graphs, point clouds, etc). These descriptors can then be involved in optimization problems, typically as a way to incorporate topological priors or to regularize machine learning models. This is usually achieved by minimizing adequate, topologically-informed losses based on these descriptors, which, in turn, naturally raises theoretical and practical questions about the possibility of optimizing such loss functions using gradient-based algorithms. This has been an active research field in the topological data analysis community over the last decade, and various techniques have been developed to enable optimization of persistence-based loss functions with gradient descent schemes. This survey presents the current state of this field, covering its theoretical foundations, the algorithmic aspects, and showcasing practical uses in several applications. It includes a detailed introduction to persistence theory and, as such, aims at being accessible to mathematicians and data scientists newcomers to the field. It is accompanied by an open-source library which implements the different approaches covered in this survey, providing a convenient playground for researchers to get familiar with the field.

2026-03-24T09:14:46Z Mathieu Carriere DATASHAPE Yuichi Ike LIGM Théo Lacombe LIGM Naoki Nishikawa UTokyo | IST http://arxiv.org/abs/2306.14853v4 Near-Optimal Nonconvex-Strongly-Convex Bilevel Optimization with Fully First-Order Oracles 2026-03-24T09:14:23Z

In this work, we consider bilevel optimization when the lower-level problem is strongly convex. Recent works show that with a Hessian-vector product (HVP) oracle, one can provably find an $ε$-stationary point within ${\mathcal{O}}(ε^{-2})$ oracle calls. However, the HVP oracle may be inaccessible or expensive in practice. Kwon et al. (ICML 2023) addressed this issue by proposing a first-order method that can achieve the same goal at a slower rate of $\tilde{\mathcal{O}}(ε^{-3})$. In this paper, we incorporate a two-time-scale update to improve their method to achieve the near-optimal $\tilde {\mathcal{O}}(ε^{-2})$ first-order oracle complexity. Our analysis is highly extensible. In the stochastic setting, our algorithm can achieve the stochastic first-order oracle complexity of $\tilde {\mathcal{O}}(ε^{-4})$ and $\tilde {\mathcal{O}}(ε^{-6})$ when the stochastic noises are only in the upper-level objective and in both level objectives, respectively. When the objectives have higher-order smoothness conditions, our deterministic method can escape saddle points by injecting noise, and can be accelerated to achieve a faster rate of $\tilde {\mathcal{O}}(ε^{-1.75})$ using Nesterov's momentum.

2023-06-26T17:07:54Z JMLR 2025 Lesi Chen Yaohua Ma Jingzhao Zhang http://arxiv.org/abs/2603.22924v1 Positive Observers Revisited 2026-03-24T08:14:55Z

The paper shows that positive linear systems can be stabilized using positive Luenberger-type observers, contradicting previous conclusions. This is achieved by structuring the observer as monotonically converging upper and lower bounds on the state. Analysis of the closed-loop properties under linear observer feedback gives conditions that cover a larger class than previous observer designs. The results are applied to nonpositive systems by enforcing positivity of the dynamics using feedback from the upper bound observer. The setting is expanded to include stochastic noise, giving conditions for convergence in expectation using feedback from positive observers.

2026-03-24T08:14:55Z Accepted for publication at the 2026 European Control Conference David Ohlin Anders Rantzer Emma Tegling