https://arxiv.org/api/zKQzYfPee0MT6lgvvvPkHmN6hRQ
2026-06-22T08:22:54Z

http://arxiv.org/abs/2606.08883v1
Accelerating GMRES with Matrix-Free Multiscale Robin Preconditioners
2026-06-07T23:54:23Z
We propose a matrix-free right-preconditioning strategy for the Generalized Minimal Residual (GMRES) method based on the Multiscale Robin Coupled Method with oversampling (MRCM-OS) for the numerical solution of elliptic problems arising in subsurface flow. The resulting preconditioner is constructed through local subdomain solves with oversampling and smoothing, and can be applied without explicit assembly of the global operator.
After a careful presentation of the new procedure, it is used in extensive numerical experiments. Our results demonstrate that the proposed approach substantially reduces iteration counts across a range of challenging, high-contrast subsurface flow problems. In many cases, convergence is obtained in one or two GMRES iterations when oversampling and smoothing are employed.
The results indicate that combining GMRES with multiscale Robin-based operators is a promising direction for the construction of rapidly convergent preconditioning strategies.
2026-06-07T23:54:23Z
36 pages, 8 figures, 7 tables
Dilong Zhou, Rafael T. Guiraldello, Felipe Pereira, Fabrício S. Sousa

http://arxiv.org/abs/2606.08871v1
Fourier Neural Operators with rank-1 lattice points and hyperbolic cross
2026-06-07T22:55:39Z
The \emph{Fourier neural operator} (FNO) is a neural network architecture that learns mappings between function spaces. Its efficient implementation is based on the multi-dimensional Fourier transform. By deriving general regularity bounds for the FNO with respect to both the spatial and parametric variables, we prove that the generalization error of the FNO can be improved by replacing spatial tensor product grids with purpose-built rank-1 lattice points, and by using a second lattice carefully constructed as training points in the parametric space. We achieve more accurate and efficient approximations from fewer network parameters, fewer spatial points, and fewer training samples. In addition, the architecture is simplified, because the high-dimensional Fourier transform on rank-1 lattices requires only a \emph{one-dimensional fast Fourier transform}, and we can use a \emph{hyperbolic cross} frequency index set with lattice points. We demonstrate the benefits of our \emph{lattice-based hyperbolic-cross FNOs} for an elliptic PDE on the torus.
2026-06-07T22:55:39Z
Jakob Dilen, Alexander Keller, Frances Y. Kuo, Dirk Nuyens

http://arxiv.org/abs/2605.29213v2
Multifidelity Proper Orthogonal Decomposition
2026-06-07T22:18:21Z
This paper introduces a multifidelity formulation that reduces the computational cost of the proper orthogonal decomposition (POD) of a high-fidelity model by leveraging data from cheaper, lower-fidelity models.
POD is a prevalent technique for extracting a low-dimensional basis from training data to achieve subsequent dimension reduction or reduced-order modeling. In scientific and engineering applications, the training data are typically numerical snapshot solutions of a high-fidelity model, and computation of a sufficiently rich snapshot set can be prohibitively expensive, especially when sampling over a high-dimensional parameter space. Insufficient snapshot training data risks overfitting and poor generalizability of the POD basis outside the training regime. Our multifidelity POD (MFPOD) formulation reallocates computational budget to cheaper, low-fidelity models that can be sampled more extensively. MFPOD then weights high- and low-fidelity snapshot data via a control-variate formulation to guarantee an unbiased estimate of the expected high-fidelity least-squares projection error. The MFPOD subspace is chosen to minimize the estimate of this projection error, and converges in probability to the same subspace as single-fidelity POD in the limit of an arbitrarily large budget. For restrictive computational budgets, the MFPOD cost function has (under some assumptions) lower variance than the POD cost function, which makes the MFPOD subspace more robust against variations in the training data and thus less prone to overfitting. For a numerical example modeling the velocity of the Pine Island glacier, MFPOD achieves the same accuracy as single-fidelity POD with an order of magnitude reduction in the offline computational cost of snapshot generation.
2026-05-28T00:52:27Z
Nicole Aretz, Karen Willcox

http://arxiv.org/abs/2509.08070v2
Subdivision Schemes in Metric Spaces
2026-06-07T21:11:18Z
We develop a unified framework for nonlinear subdivision schemes on complete metric spaces (CMS). We begin with CMS preliminaries and formalize refinement in CMS, retaining key structural properties, such as locality.
We prove a convergence theorem under contractivity and demonstrate its applicability. To address schemes where contractivity is unknown, we introduce two notions of proximity. Our proximity methods relate a nonlinear scheme to another nonlinear scheme with known contractivity, rather than to a linear scheme, as in much of the literature. Specifically, proximity of the first type compares the two schemes after a single refinement step and, as in the classical theory, yields convergence from sufficiently dense initial data. Proximity of the second type monitors alignment across all refinement levels and provides strong convergence without density assumptions. We formulate and prove the corresponding theorems, and illustrate them with various examples, such as schemes over metric spaces of compact sets in $\R^n$ and schemes over the Wasserstein space, as well as a geometric Hermite metric space. These results extend subdivision theory beyond Euclidean and manifold-valued data to data in metric spaces.
2025-09-09T18:17:28Z
To appear in Foundations of Computational Mathematics
Nira Dyn, Nir Sharon

http://arxiv.org/abs/2606.08817v1
Low-Variance Randomised Numerical Linear Algebra for Finite Element Simulation
2026-06-07T20:11:47Z
We present a low-variance randomised numerical linear algebra approach for multi-query finite element systems arising from parametric elliptic partial differential equations with applications to digital twins and online model calibration. The method relies on Galerkin subspace projection for reducing the dimensionality, and then combines parameter-oblivious leverage-score Bernoulli sampling with a control variates scheme to yield a reduced-variance `forward' sketch and an invertible `inverse' sketch that are then fused to a single efficient regularised estimator. Effectively, this reduces the computational cost in computing the projected system of equations while preserving the structure, stability, and accuracy of the underlying FEM formulation.
We derive probabilistic bounds for the sketching error, invertibility, and estimator variance, and then validate the method on large-scale example problems. The results show that when the parameter fields do not vary too sharply, the synergy of control variates together with the sketch fusion can largely offset the loss incurred by the sub-optimal parameter-oblivious sampling. In this regime, our method achieves substantial savings in time, memory, and communication while maintaining accuracy levels that are acceptable for scientific simulation.
2026-06-07T20:11:47Z
N. Polydorides, Y. Wu, H. Noori, H. Vandierendonck, R. Woods

http://arxiv.org/abs/2606.08783v1
OptMuon: Closed-Loop Orthogonalized Momentum Methods for Stochastic Optimization with Zero-Noise Optimality
2026-06-07T18:59:24Z
Orthogonalized momentum updates, as used in Muon-style optimizers, have recently shown strong empirical stability in large-scale deep learning. However, existing orthogonalized methods are typically paired with constant or open-loop magnitude rules, and therefore do not explicitly calibrate their update magnitudes from the observed optimization trajectory. Motivated by the closed-loop perspective behind Lipschitz-free and noise-adaptive methods, we propose OptMuon, a family of adaptive momentum orthogonalization methods for stochastic nonconvex optimization. OptMuon combines Muon-style polar-factor directions with a trajectory-dependent AdaGrad-Norm-type coefficient schedule, so that the update magnitude is determined by the observed gradient and momentum history rather than by a prescribed Lipschitz-dependent rule. The schedule does not use the smoothness constant, the variance level, or the bounded-gradient constant in parameter selection, and its running-maximum correction prevents isolated gradient spikes from causing excessive coefficient collapse.
Under lower-boundedness, unbiased stochastic gradients with bounded variance, smoothness, and an almost-sure bounded stochastic-gradient condition, we prove two complementary guarantees. OptMuon-A achieves the noise-adaptive rate \(\tilde{\mathcal O}(T^{-1/2}+\sigma^{1/2}T^{-1/4})\) under average smoothness, while OptMuon-I achieves \(\tilde{\mathcal O}(T^{-1/2}+\sigma^{1/3}T^{-1/3})\) under individual smoothness. In the zero-noise regime, both bounds automatically reduce to a nearly optimal deterministic first-order rate \(\tilde{\mathcal O}(T^{-1/2})\) without manual hyperparameter retuning. These results show that closed-loop scalar adaptation can be combined with Muon-style momentum orthogonalization while retaining noise adaptivity and zero-noise optimality up to logarithmic factors.
2026-06-07T18:59:24Z
Ganzhao Yuan

http://arxiv.org/abs/2606.08731v1
Numerical Analysis on Backward Stochastic Differential Equations by Finite Transposition Method
2026-06-07T17:00:54Z
In this paper, we propose a finite transposition method to solve backward stochastic differential equations (BSDEs, for short). Based on the transposition solution theory for BSDEs, our method offers a promising way of efficiently computing solutions, and can be regarded as the analogue for BSDEs of the classical finite element method for partial differential equations. Our method has the advantage of easily computable conditional expectations.
2026-06-07T17:00:54Z
47 pp
Penghui Wang, Yanqing Wang, Xu Zhang

http://arxiv.org/abs/2606.08727v1
Compositional Approximation Can Strictly Outperform Superpositional Approximation
2026-06-07T16:50:04Z
Many classically studied function classes are known to be approximated optimally by superpositional methods, i.e. with approximants constructed as the linear combination of elements in some dictionary.
Here optimality means that the uniform approximation error viewed as a function of the number of parameters used has polynomial decay of the highest order achievable by any parametrized method whose parameters can be encoded as a bit string of length proportional, up to logarithmic factors, to the number of parameters. While compositional methods like neural networks are structurally different, their approximation rates can be made comparable by imposing constraints that ensure such a proportional bit string encoding. In this work we study function classes exhibiting structural properties that limit superpositional approximation rates to be strictly lower than compositional approximation rates. In particular, we construct explicit examples for which there is an arbitrarily large gap.
2026-06-07T16:50:04Z
Dennis Elbrächter, Philipp Petersen

http://arxiv.org/abs/2502.09550v2
A Nitsche method for incompressible fluids with general dynamic boundary conditions
2026-06-07T16:49:08Z
Both Newtonian and non-Newtonian fluids may exhibit complex slip behaviour at the boundary. We examine a broad class of slip boundary conditions that generalises the commonly used Navier slip, perfect slip, stick-slip and Tresca friction boundary conditions. In particular, set-valued, nonmonotone, noncoercive and dynamic relations may occur. For a unifying framework of such relations, we present a fully discrete numerical scheme for the time-dependent Navier-Stokes equations subject to impermeability and general slip-type boundary conditions on polyhedral domains. Based on compactness arguments, we prove convergence of subsequences, finally ensuring the existence of a weak solution. The numerical scheme uses a general inf-sup stable pair of finite element spaces for the velocity and pressure, a regularisation approach for the implicit slip boundary condition and, most importantly, a general Nitsche method to impose the impermeability and a backward Euler time stepping.
One of the key tools in the convergence proof is an inhomogeneous Korn inequality that includes a normal trace term.
2025-02-13T18:03:10Z
59 pages, 9 figures, 1 table
Pablo Alexei Gazca-Orozco, Franz Gmeineder, Erika Maringová Kokavcová, Tabea Tscherpel

http://arxiv.org/abs/2606.08699v1
Simultaneous recovery of multiple parameters in nonlocal diffusion equations from internal measurements
2026-06-07T15:53:40Z
This paper is devoted to simultaneously recovering multiple parameters from internal measurements for nonlocal diffusion equations. The uniqueness of the inverse problem is established by employing the asymptotic behavior of solutions, analytic continuation, the Laplace transform, and properties of analytic functions. For numerical reconstruction, we apply the Levenberg-Marquardt method to obtain a stable approximate solution of the inverse problem. Numerical examples are provided to demonstrate the efficiency of the proposed algorithm and to validate our theoretical findings.
2026-06-07T15:53:40Z
18 pages, 2 figures, 2 tables
Kai Yu, Zhiyuan Li, Yikan Liu

http://arxiv.org/abs/2606.08685v1
ND-TNN: Tensor-Neural-Network Approximation for High-Dimensional Nonlocal Diffusion Models
2026-06-07T15:37:55Z
We study a numerical method, built on the tensor neural network (TNN) architecture introduced in \cite{wang2022tensor}, for solving nonlocal diffusion models in high-dimensional spaces. The tensor-product structure of the TNN ansatz, combined with the separability of the Gaussian kernel, reduces the high-dimensional integrals in the nonlocal energy to products of low-dimensional integrals, which are evaluated by Gauss--Legendre quadrature; nonseparable source and boundary data are handled by a TNN-based preconditioning step.
For the Dirichlet boundary condition, we establish the asymptotically compatible $L^2$ error estimate \[ \|u_{\mathrm{loc}}-u_{\delta,p}\|_{L^2(\Omega)} \le C\!\left(\frac{\varepsilon_f}{\sqrt{\delta}} +\frac{\varepsilon_g}{\delta} +\frac{\varepsilon_u}{\sqrt{\delta}} +\eta_{\mathrm{opt}}\right) +C\sqrt{\delta}, \] where $\varepsilon_f$, $\varepsilon_g$ and $\varepsilon_u$ are the data and trial-class approximation errors and $\eta_{\mathrm{opt}}$ is the optimization residual. For the Neumann boundary condition, the $L^2$ estimate is improved to $O(\varepsilon_f+\varepsilon_g/\sqrt{\delta}+\varepsilon_u +\eta_{\mathrm{opt}}+\delta)$, and an $H^1$ gradient estimate is further obtained through a smoothing post-processing step. Numerical experiments on tensor-product domains up to $d=20$ support the theoretical results, and additional tests on two- and three-dimensional $L$-shaped domains demonstrate the practical robustness of the method beyond the smooth-domain setting covered by the analysis.
2026-06-07T15:37:55Z
Ziyue Cai, Zuoqiang Shi

http://arxiv.org/abs/2606.08654v1
Operator learning for the 2D incompressible Navier-Stokes equations: a conformal prediction approach in the data-scarce regime
2026-06-07T14:49:37Z
In this paper, we propose a perturbation-based conformal prediction framework for uncertainty quantification in operator learning, with a focus on the 2D Navier--Stokes equations. While neural operators provide fast surrogates for expensive PDE solvers, they do not by themselves provide calibrated uncertainty for spatiotemporal field predictions. Our approach wraps a trained Fourier Neural Operator (FNO) with split conformal prediction and constructs the local uncertainty scale by comparing the predictions of two operators trained on nearly identical datasets: one on the original labels and one on labels perturbed by small Gaussian noise.
We consider this procedure in the data-scarce regime, where the total label budget is fixed and methods that require a separate uncertainty network must divide training data between multiple models. On the 2D Navier--Stokes benchmark, the perturbation-based method produces substantially narrower conformal bands than existing methods under matched total data budgets while maintaining the target simultaneous coverage. These results suggest that perturbation sensitivity is a practical and sample-efficient uncertainty proxy for conformalized neural operators.
2026-06-07T14:49:37Z
Weinan Wang, Bowen Gang, Hao Deng

http://arxiv.org/abs/2606.08585v1
Novel physical property preserved methods for stochastic Schrödinger--KdV equation
2026-06-07T11:48:07Z
In this work, we study the stochastic Schrödinger--KdV equation driven by additive noise from both analytical and numerical viewpoints. We first establish the evolution laws for the averaged plasmon number, momentum, and energy, together with the conservation of the averaged particle number. Motivated by these intrinsic structures, we develop two temporal discretizations. One is constructed based on the splitting strategy and Crank--Nicolson scheme, and is shown to preserve the discrete evolution laws of the averaged plasmon number and momentum, as well as the discrete conservation law of the averaged particle number. The other is proposed within the constant scalar auxiliary variable framework, in which the nonlinear energy functional is reformulated so that a modified averaged energy law can be preserved at the discrete level. Combining these temporal discretizations with a local discontinuous Galerkin approximation in space yields structure-preserving full discretizations inheriting the corresponding discrete physical laws.
Numerical experiments are presented to validate the theoretical results and to demonstrate the accuracy, robustness, and effectiveness of the proposed methods.
2026-06-07T11:48:07Z
24 pages, 12 figures
Ziheng Chen, Jialin Hong, Liying Sun

http://arxiv.org/abs/2606.08526v1
A Mixed Extended Virtual Element Method for Elliptic Interface Problems on Polygonal Meshes
2026-06-07T09:06:02Z
We propose a lowest-order \(H(\operatorname{div})\)-conforming mixed extended virtual element method for elliptic interface problems on interface-unfitted polygonal meshes. The flux and pressure are approximated by subdomain-wise extended \(H(\operatorname{div})\)-VEM spaces and by piecewise constants, respectively. On cut elements, the computable polynomial projection is defined on the whole background element and then restricted to the two subdomains. Compared with BDM-type polynomial spaces, the mixed VEM space contains a non-polynomial component, which gives rise to additional consistency terms on cut elements. To control these terms, we use an enhanced kernel stabilization on cut elements and an interface normal-flux average in the mixed coupling. A corrected interface-flux penalty and a local divergence ghost penalty are added to obtain cut-position-independent stability without using a volume div-div augmentation. We prove continuity, a discrete inf-sup condition, and an optimal first-order error estimate in a mesh-dependent norm. The constants are independent of the mesh size and of the position of the interface relative to the background mesh, but may depend on the coefficient contrast.
2026-06-07T09:06:02Z
29 pages
Xianyan Zheng, Jinru Chen, Feng Wang

http://arxiv.org/abs/2606.08516v1
Stable Triangle Projections for Variable-Degree Tetrahedral Spaces and Uniform IPDG Preconditioning
2026-06-07T08:55:15Z
The main ingredient of this paper is an edge-local variable-degree projection on a triangle that is uniformly stable in both L2 and H1.
We use this two-dimensional operator in two tetrahedral constructions. First, on a reference tetrahedron, we build an H1-stable projection from a high order polynomial space onto a variable-degree space whose degrees are prescribed independently on edges, faces, and in the volume. Since the tetrahedral projection is local and trace-compatible, it also gives an h- and p-uniform stable decomposition, in the weighted energy norm, for conforming hp spaces, and hence a uniform additive Schwarz preconditioner for the conforming Laplace operator. Second, on a uniformly regular mapped tetrahedral mesh with elementwise variable polynomial degrees, the same triangular projection gives the finite-layer edge truncation needed in a p-uniform stable DG-to-CG decomposition for the symmetric IPDG norm. The DG-to-CG decomposition, combined with the conforming splitting, gives the IPDG preconditioner. The constants depend only on reference shapes, the local degree-spread bound within each tetrahedron, the neighbor-degree bound across mesh faces, uniform map-regularity, patch cardinalities, and the coefficient path constants; they are independent of h, of the local polynomial degrees, and of the coefficient contrast.
2026-06-07T08:55:15Z
43 pages, 2 tables
Situan Li, Weiying Zheng