https://arxiv.org/api/NouUE/5tgMT+mBhpQ6JX9PvLQhs2026-03-30T10:21:59Z4840919515http://arxiv.org/abs/2603.12167v2Operator Splitting, Policy Iteration, and Machine Learning for Stochastic Optimal Control2026-03-19T22:26:54ZWe propose a splitting approach to solve the second-order Hamilton--Jacobi equation, reducing it to a heat step and a purely first-order step. The latter is implemented using a gradient value policy iteration algorithm, enabling efficient characteristic-based machine learning methods. We establish convergence rates for the splitting method. In particular, with $h$ the splitting step, the $L^\infty$ error is bounded between $\mathcal{O}(h)$ and $\mathcal{O}(h^{1/5})$ for Lipschitz data, improving to $\mathcal{O}(h^{1/3})$ for semiconcave data. In the periodic setting, we also obtain an $L^1$ error of order $\mathcal{O}(h^{1/2})$. For the first-order step, we provide a weighted $L^2$ error analysis that shows exponential convergence. Each iteration solves linear characteristic equations and learns the value function by minimizing a weighted value gradient loss. The approach yields stable and accurate numerical results.2026-03-12T17:02:46Z37 pages, with improved results for Theorem 1.1Alain BensoussanThien P. B. NguyenMinh-Binh TranSon N. T. Tuhttp://arxiv.org/abs/2603.19463v1Deep Hilbert--Galerkin Methods for Infinite-Dimensional PDEs and Optimal Control2026-03-19T20:53:20ZWe develop deep learning-based approximation methods for fully nonlinear second-order PDEs on separable Hilbert spaces, such as HJB equations for infinite-dimensional control, by parameterizing solutions via Hilbert--Galerkin Neural Operators (HGNOs). We prove the first Universal Approximation Theorems (UATs) which are sufficiently powerful to address these problems, based on novel topologies for Hessian terms and corresponding novel continuity assumptions on the fully nonlinear operator. These topologies are non-sequential and non-metrizable, making the problem delicate. In particular, we prove UATs for functions on Hilbert spaces, together with their Fréchet derivatives up to second order, and for unbounded operators applied to the first derivative, ensuring that HGNOs are able to approximate all the PDE terms. For control problems, we further prove UATs for optimal feedback controls in terms of our approximating value function HGNO.
We develop numerical training methods, which we call Deep Hilbert--Galerkin and Hilbert Actor-Critic (reinforcement learning) Methods, for these problems by minimizing the $L^2_μ(H)$-norm of the residual of the PDE on the whole Hilbert space, not just a projected PDE to finite dimensions. This is the first paper to propose such an approach. The models considered arise in many applied sciences, such as functional differential equations in physics and Kolmogorov and HJB PDEs related to controlled PDEs, SPDEs, path-dependent systems, partially observed stochastic systems, and mean-field SDEs. We numerically solve examples of Kolmogorov and HJB PDEs related to the optimal control of deterministic and stochastic heat and Burgers' equations, demonstrating the promise of our deep learning-based approach.2026-03-19T20:53:20ZSamuel N. CohenFilippo de FeoJackson HebnerJustin Sirignanohttp://arxiv.org/abs/2603.19395v1Numerical Analysis of a Coupled 3D-1D Transport Problem2026-03-19T18:36:58ZA finite element solution coupled with an interior penalty discontinuous Galerkin solution are defined for the approximation of the coupled 3D-1D solute transport problem. Under sufficient regularity for the weak solutions, optimal error bounds are derived for the 3D concentration and 1D concentration, that are optimal with respect to the time step size and the mesh sizes. Numerical results verify the theoretical results.2026-03-19T18:36:58ZAlyssa Taylor-LaPoleUzochi GideonBeatrice RiviereDuygu Vargunhttp://arxiv.org/abs/2603.19130v1Quantum block encoding for semiseparable matrices2026-03-19T16:49:13ZQuantum block encoding (QBE) is a crucial step in the development of most quantum algorithms, as it provides an embedding of a given matrix into a suitable larger unitary matrix. Historically, the development of efficient techniques for QBE has mostly focused on sparse matrices; less effort has been devoted to data-sparse (e.g., rank-structured) matrices.
In this work we examine a particular case of rank structure, namely, one-pair semiseparable matrices. We present a new block encoding approach that relies on a suitable factorization of the given matrix as the product of triangular and diagonal factors. To encode the matrix, the algorithm needs $2\log(N)+7$ ancillary qubits. This process takes polylogarithmic time and has an error of $\mathcal{O}(N^2)$, where $N$ is the matrix size.2026-03-19T16:49:13ZGiacomo AntonioliPaola BoitoGianna M. Del CorsoMargherita Porcellihttp://arxiv.org/abs/2603.19113v1A stable and fast method for solving multibody scattering problems via the method of fundamental solutions2026-03-19T16:37:03ZThe paper describes a numerical method for solving acoustic multibody scattering problems in two and three dimensions. The idea is to compute a highly accurate approximation to the scattering operator for each body through a local computation, and then use these scattering matrices to form a global linear system. The resulting coefficient matrix is relatively well-conditioned, even for problems involving a very large number of scatterers. The linear system is amenable to iterative solvers, and can readily be accelerated via fast algorithms for the matrix-vector multiplication such as the fast multipole method. The key point of the work is that the local scattering matrices can be constructed using potentially ill-conditioned techniques such as the method of fundamental solutions (MFS), while still maintaining scalability and numerical stability of the global solver. The resulting algorithm is simple, as the MFS is far simpler to implement than alternative techniques based on discretizing boundary integral equations using Nyström or Galerkin.2026-03-19T16:37:03Z31 pages, 9 figuresYunhui CaiJoar BaggePer-Gunnar Martinssonhttp://arxiv.org/abs/2603.19108v1Numerical Considerations for the Construction of Karhunen-Loève Expansions2026-03-19T16:34:46ZThis report examines numerical aspects of constructing Karhunen-Loève expansions (KLEs) for second-order stochastic processes. The KLE relies on the spectral decomposition of the covariance operator via the Fredholm integral equation of the second kind, which is then discretized on a computational grid, leading to an eigendecomposition task. We derive the algebraic equivalence between this Fredholm-based eigensolution and the singular value decomposition of the weight-scaled sample matrix, yielding consistent solutions for both model-based and data-driven KLE construction. Analytical eigensolutions for exponential and squared-exponential covariance kernels serve as reference benchmarks to assess numerical consistency and accuracy in 1D settings. The convergence of SVD-based eigenvalue estimates and of the empirical distributions of the KL coefficients to their theoretical $\mathcal{N}(0,1)$ target are characterized as a function of sample count. Higher-dimensional configurations include a two-dimensional irregular domain discretized by unstructured triangular meshes with two refinement levels, and a three-dimensional toroidal domain whose non-simply-connected topology motivates a comparison between Euclidean and shortest interior path distances between the grid points. The numerical results highlight the interplay between the discretization strategy, quadrature rule, and sample count, and their impact on the KLE results.2026-03-19T16:34:46ZCosmin SaftaHabib N. Najmhttp://arxiv.org/abs/2601.08709v2Multi-Preconditioned LBFGS for Training Finite-Basis PINNs2026-03-19T16:25:50ZA multi-preconditioned LBFGS (MP-LBFGS) algorithm is introduced for training finite-basis physics-informed neural networks (FBPINNs). The algorithm is motivated by the nonlinear additive Schwarz method and exploits the domain-decomposition-inspired additive architecture of FBPINNs, in which local neural networks are defined on subdomains, thereby localizing the network representation. Parallel, subdomain-local quasi-Newton corrections are then constructed on the corresponding local parts of the architecture. A key feature is a novel nonlinear multi-preconditioning mechanism, in which subdomain corrections are optimally combined through the solution of a low-dimensional subspace minimization problem. Numerical experiments indicate that MP-LBFGS can improve convergence speed, as well as model accuracy over standard LBFGS while incurring lower communication overhead.2026-01-13T16:38:15Z13 pagesMarc Salvadó-BenascoAymane KssimAlexander HeinleinRolf KrauseSerge GrattonAlena Kopaničákováhttp://arxiv.org/abs/2402.02917v3Construction of Optimal Algorithms for Function Approximation in Gaussian Sobolev Spaces2026-03-19T16:22:41ZThis paper studies function approximation in Gaussian Sobolev spaces over the real line and measures the error in a Gaussian-weighted $L^p$-norm. We construct two linear approximation algorithms using $n$ function evaluations that achieve the optimal or almost optimal rate of worst-case convergence in a Gaussian Sobolev space of order $α$. The first algorithm is based on scaled trigonometric interpolation and achieves the optimal rate $n^{-α}$ up to a logarithmic factor. This algorithm can be constructed in almost-linear time with the fast Fourier transform. The second algorithm is more complicated, being based on spline smoothing, but attains the optimal rate $n^{-α}$.2024-02-05T11:37:07Z19 pages, 2 figures, to appear on BIT Numerical MathematicsYuya SuzukiToni Karvonenhttp://arxiv.org/abs/2603.19096v1GLENN: Neural network-enhanced computation of Ginzburg-Landau energy minimizers2026-03-19T16:21:56ZIn this work, we propose a neural network-enhanced finite element strategy to compute the minimizer of the Ginzburg--Landau energy based on an unsupervised deep Ritz-type strategy. We treat the parameter $κ$ as a variable input parameter to obtain possible minimizers for a large range of $κ$-values. This allows for two possible strategies: 1) The neural network may be extensively trained to work as a stand-alone solver. 2) Neural network results are used as starting values for a subsequent classical iterative minimization procedure. The latter strategy particularly circumvents the missing reliability of the neural network-based approach. Numerical examples are presented that show the potential of the proposed strategy.2026-03-19T16:21:56ZMichael CrocollChristian DödingBenjamin DörichRoland Maierhttp://arxiv.org/abs/2603.19080v1Reduced order computation of 2D elastodynamic Green's functions in layered soil using a low-rank tensor approximation2026-03-19T16:07:40ZThe evaluation of elastodynamic Green's functions across numerous source-receiver locations, frequencies, and material properties, particularly in the context of parametric studies or boundary element computations, is computationally demanding and memory intensive. This paper presents a reduced order modeling strategy based on the Greedy Tucker Approximation (GTA), which incrementally constructs a low-rank representation of the Green's tensor through rank-one enrichments obtained via a Proper Generalized Decomposition (PGD)-type alternating least squares procedure. A Petrov-Galerkin formulation is employed to improve convergence and approximation accuracy. The resulting multi-dimensional tensor, expressed in terms of one-dimensional basis functions and a compact core, achieves substantial reductions in memory requirements. The methodology is demonstrated for two cases: a soil layer on rigid bedrock and a layered halfspace. Different separable dimensions are considered to capture various combinations of source and receiver configurations, frequencies, and material parameters. Results are validated against those obtained with the direct stiffness method and computation times and memory requirements are compared.2026-03-19T16:07:40ZPreprint submitted to Computers & StructuresZainab FarooqAmar PashovPieter ReumersStijn FrançoisGeert Degrandehttp://arxiv.org/abs/2511.22422v2Weyl distributions, spectral properties, and circulant approximation results for quaternion block multilevel Toeplitz matrix sequences2026-03-19T16:02:07ZThe present work contains a comprehensive treatment of Weyl eigenvalue and singular value distributions for single-axis quaternion block multilevel Toeplitz matrix sequences generated by $s\times t$ quaternion matrix-valued, $d$-variate, Lebesgue integrable generating functions. Furthermore, in view of concrete applications, we are interested in preconditioning and matrix approximation results. To this end, a crucial step is the extension of the notion of an approximating class of sequences (a.c.s.) to the case of matrix sequences with quaternion entries, since it allows us to decompose the difference between a matrix and its preconditioner into low-norm plus (relatively) low-rank terms. As a specific example, we consider classes of quaternion block multilevel circulant matrix sequences as an a.c.s. for quaternion block multilevel Toeplitz matrix sequences. These approximation results lay the foundations for fast preconditioning methods when dealing with large quaternion linear systems stemming from modern applications. We conclude our study with numerical experiments and directions for future research.2025-11-27T13:01:00ZAyoub LailouneValerio LoiStefano Serra-Capizzanohttp://arxiv.org/abs/2603.19075v1A conservative, discontinuous Galerkin, tracer transport scheme using compatible finite elements2026-03-19T16:00:47ZThis paper outlines a conservative transport scheme for scalar tracers within a compatible finite element model for geophysical fluid equations. Instead of using the advective transport equation for a mixing ratio, a conservative transport equation is solved for the tracer density of the mixing ratio multiplied by the dry density. This ensures mass conservation in the continuous equations, which can be preserved in the discrete equations with a discontinuous Galerkin transport scheme. Our method is designed to work for two placements of the mixing ratio in a Charney-Phillips vertical staggering: either co-located with the dry density or vertically staggered from it. The new scheme is designed to conserve the tracer density and ensure consistency by maintaining a constant mixing ratio. Additionally, a mass-conserving limiter is developed to ensure non-negativity in the co-located configuration. Tests with terminator toy chemistry and a moist rising bubble show the use of the new transport scheme with physics terms and its ability to accurately model mass conservation of moisture species in a dynamical core setup.2026-03-19T16:00:47ZTimothy C. AndrewsThomas M. Bendallhttp://arxiv.org/abs/2603.19071v1Quantifying the effect of noise perturbation for the stochastic Burgers equation with additive trace-class noise2026-03-19T15:58:36ZWe establish upper bounds for the weak and strong error resulting from a perturbation of the noise driving the stochastic Burgers equation, where we assume the noise to be additive and of trace class and the initial value to be sufficiently regular. More specifically, replacing the covariance operator of the driving noise $Q_1 \in \mathcal{L}_1(L^2)$ in the Burgers equation by a covariance operator $Q_2 \in \mathcal{L}_1(L^2)$ results in a weak error of $\mathcal{O}\big(\| (-A)^{-1^{-} } (Q_1-Q_2) \|_{\mathcal{L}_1(L^2)}\big)$ and a strong error of $\mathcal{O}\big(\big\| (-A)^{-1/2^{-}}\big|Q_1^{1/2} -Q_2^{1/2}\big| \big\|_{\mathcal{L}_2(L^2)}\big)$. Here $\|\cdot \|_{\mathcal{L}_1}$ is the trace class norm, $\|\cdot \|_{\mathcal{L}_2}$ is the Hilbert-Schmidt norm, and $A$ is the one-dimensional Dirichlet Laplacian that represents the leading term in the Burgers equation. In particular, our results provide upper bounds for the weak and strong error arising when approximating the trace class noise by finite-dimensional noise; the rates we obtain reflect the general philosophy that the weak convergence rate should be twice the strong rate.2026-03-19T15:58:36ZSonja CoxMatas Urbonashttp://arxiv.org/abs/2603.19043v1Complexity bounds on neural networks for the solution of structured linear systems of equations2026-03-19T15:39:32ZWe derive upper bounds on the complexity of ReLU neural networks approximating the solution of a linear system given the matrix and the right-hand side. We focus on matrices which are symmetric positive definite and sparse, as they appear in the context of finite difference and finite element methods. For such matrices, we extend available results for the matrix inversion to the task of solving a linear system, where we leverage favorable properties of classical methods such as the modified Richardson and the conjugate gradient method. Our bounds on the number of layers and neurons are not only explicit with respect to the size of the matrices, but also with respect to their condition numbers.2026-03-19T15:39:32ZBenjamin DörichRoland MaierLukas Ullmerhttp://arxiv.org/abs/2603.18980v1A bilinear inverse problem with forward operator inaccuracy applied to neonatal atlas-based diffuse optical tomography2026-03-19T14:43:30ZLinear inverse problems are highly common in practical real-world applications from industry to medical imaging. The forward operator is often built on some approximations of the studied system. Handling inaccuracies in the forward operator in the context of inverse problems is a relatively unstudied problem. In this work, we assume that we have a set of candidate forward operator matrices and suggest principal component analysis (PCA) for modeling their variation from the mean. We combine the original linear problem with the included forward operator inaccuracy into a bilinear tensor inverse problem and present two optimization algorithms and Gibbs sampling for approximately solving the problem. As a real-world test case, we apply the algorithms to account for the inaccuracy that is present in the sensitivity profiles or Jacobian matrices in diffuse optical tomography when an atlas-based model of the head anatomy is used instead of the subject's own anatomical model in neonates over a wide range of gestational ages (29--44 weeks). We report visual and numerical improvements in the spatial localization and contrast-to-noise-ratio in reconstructions of simulated hemodynamic activity.2026-03-19T14:43:30Z36 pages, 8 figuresAada HakulaPauliina HirviNuutti Hyvönen