https://arxiv.org/api/hHGdrySBYpLYsfTesztCKolJ/U0 2026-06-21T08:06:18Z 49870 30 15 http://arxiv.org/abs/2606.19614v1 On a class of modified Cayley--Magnus methods 2026-06-17T21:38:27Z We introduce a new class of numerical integrators for the time integration of non-autonomous linear ordinary differential equations whose coefficient matrix is sparse and evolves within a quadratic matrix Lie group. In contrast to standard Lie group integrators, the proposed methods avoid the evaluation of matrix exponentials acting on vectors and instead rely on solving a sequence of linear systems with sparse coefficient matrices. Moreover, they are well suited for problems arising from unbounded operators, as they inherently produce bounded solutions. We construct optimised schemes of orders four and six and assess their performance on a representative numerical example, demonstrating clear advantages over existing Lie-group integrators. 2026-06-17T21:38:27Z Sergio Blanes Fernando Casas Arieh Iserles http://arxiv.org/abs/2606.19611v1 Bregman-projected mirror methods for regularized stationary mean-field games 2026-06-17T21:33:47Z We develop and analyze a Bregman-projected mirror iteration for low-order regularizations of stationary mean-field game (MFG) systems in their natural Banach space setting. For separable Hamiltonians of the form \(H(x,p,m)=H_0(x,p)-g(m)\), with quadratic or super-quadratic Hamiltonian growth and linear or super-linear density couplings, we formulate a low-order \(\barγ\)-Laplacian regularization of the stationary MFG system as a variational inequality on \(L^{\barβ}(\mathbb T^d)\times W^{1,\barγ}(\mathbb T^d)\). To approximate solutions of this regularized variational inequality, we introduce a Bregman geometry matched to the mixed Lebesgue--Sobolev exponents of the problem and analyze a constrained two-step mirror method with frozen operator evaluation. For the exact constrained iteration and each fixed regularization parameter \(\epsi>0\), we derive a one-step Bregman inequality and use it to prove that the constrained iteration converges strongly to the unique solution of the regularized variational inequality under natural summability conditions on the step sizes. Numerical experiments on one- and two-dimensional models, validated against exact test solutions, illustrate residual decay under mesh refinement and suggest improved practical performance of the two-step implementation in the tested discretizations. 2026-06-17T21:33:47Z Hussain Al Abdulaziz Yuri Ashrafyan Yeva Gevorgyan Diogo Gomes http://arxiv.org/abs/2506.11719v3 Automatic differentiation for performing the Cauchy-Kovalevskaya procedure in Lax-Wendroff type discretizations 2026-06-17T19:20:18Z Lax-Wendroff methods combined with discontinuous Galerkin/flux reconstruction spatial discretization provide a high-order, single-stage, quadrature-free method for solving hyperbolic conservation laws. In this work, we introduce automatic differentiation (AD) for performing the Cauchy-Kowalewski procedure used in the element-local time average flux computation step (the predictor step) of Lax-Wendroff methods. The application of AD is similar for methods of any order and does not need positivity corrections during the predictor step. This contrasts with the approximate Lax-Wendroff procedure, which requires different finite difference formulas for different orders of the method and positivity corrections in the predictor step for fluxes that can only be computed on admissible states. The method is Jacobian-free and problem-independent, allowing direct application to any physical flux function. Numerical experiments demonstrate the order and positivity preservation of the method. Additionally, performance comparisons indicate that the wall-clock time of automatic differentiation is always on par with the approximate Lax-Wendroff method. 2025-06-13T12:34:30Z Journal of Computational Physics, 15 October 2026, article 115101, Volume 563 Arpit Babbar Valentin Churavy Michael Schlottke-Lakemper Hendrik Ranocha 10.1016/j.jcp.2026.115101 http://arxiv.org/abs/2606.19508v1 Higher Accuracy Modular Data Assimilation for the Navier-Stokes Equations 2026-06-17T18:46:49Z This paper develops an accurate and effective combination of second order backward differentiation time discretization (BDF2) with modular, 2-step nudging-based data assimilation \begin{align} \text{Forecast step: } \quad &\frac{3\widetilde{v}^{n+2}-4v^{n+1}+v^n}{2Δt}+\widetilde{v}^{n+2} \cdot \nabla \widetilde{v}^{n+2} - νΔ\widetilde{v}^{n+2} + \nabla q^{n+2}=f(x) \notag \\ &\nabla \cdot \widetilde{v}^{n+2} = 0 \notag \\ \text{Analysis step: } \quad &\frac{3v^{n+2}-3\widetilde{v}^{n+2}}{2Δt}-χI_H(u(t^{n+2})-v^{n+2})=0. \notag \end{align} If $I_H=I_H^2$, the analysis step can be made explicit, taking the form \begin{align} v^{n+2}=\widetilde{v}^{n+2}+\frac{2Δtχ}{3+2Δtχ}I_H(u^{n+2}-\widetilde{v}^{n+2}). \notag \end{align} This implies the analysis step has the stability property of an implicit step and lower complexity than an explicit analysis step. Stability and error estimates for the BDF2 scheme are presented along with their proofs. Numerical experiments are conducted to assess the performance of BDF2 modular assimilation algorithm. The results of the experiments support the conclusion that modular data assimilation has comparable accuracy to standard, fully coupled data assimilation while greatly reducing computational complexity and cost. 2026-06-17T18:46:49Z 27 pages, 7 figures, 3 tables Troy Yang http://arxiv.org/abs/2606.19471v1 Moreau-Yosida-based Kohn-Sham Inversion for Periodic Systems 2026-06-17T18:04:33Z Density-potential inversion for periodic systems within Moreau-Yosida-regularised density-functional theory is investigated, both theoretically and numerically. We develop the framework in a periodic homogeneous Sobolev space and use it to recover the exchange-correlation potential of Kohn-Sham theory through a limiting procedure. A key analytical ingredient is the proof of lower semicontinuity of the non-interacting kinetic-energy functional in the chosen topology. The proximal mapping, together with its algorithmic evaluation, plays a central role in the resulting inversion scheme. Numerical experiments illustrate the performance and properties of the method for both the Kohn-Sham and Gross-Pitaevskii equations. 2026-06-17T18:04:33Z Vebjørn H. Bakkestuen Michael F. Herbst Vegard Falmår Markus Penz Andre Laestadius http://arxiv.org/abs/2602.03322v2 Weighted finite difference methods for a nonlinear Klein-Gordon equation with high oscillations in space and time 2026-06-17T16:30:02Z We consider a nonlinear Klein-Gordon equation in the nonrelativistic limit regime with initial data in the form of a modulated highly oscillatory exponential. In this regime of a small scaling parameter $\varepsilon\ll 1$, the solution exhibits rapid oscillations in both time and space. The solution is approximated, up to $\mathcal{O}(\varepsilon)$, by a superposition of two polarized solutions, which are wave packets that move with opposite group velocities proportional to $\varepsilon^{-1}$. The equations for polarized solutions are formulated in co-moving coordinates and are then discretized by an explicit and an implicit exponentially weighted finite difference method. While the explicit weighted leapfrog method needs to satisfy a CFL-type stability condition, the implicit weighted Crank-Nicolson method is unconditionally stable. Both methods achieve second-order accuracy with time steps and mesh sizes that are not restricted in magnitude by $\varepsilon$. For the approximation of polarized solutions, the methods are uniformly convergent in the range from arbitrarily small to moderately bounded $\varepsilon$. Numerical experiments illustrate the theoretical results. 2026-02-03T09:49:09Z Yanyan Shi Christian Lubich http://arxiv.org/abs/2606.19224v1 A Conjugate Gradient Formulation of the EnKF Algorithm 2026-06-17T16:00:39Z Ensemble Kalman Filter (EnKF) based data assimilation algorithms synthesize predictive numerical forecast models with accumulated data as time evolves and account for model uncertainty and noisy measurements. The computational cost of these algorithms can be expensive, in particular for highly dimensional dynamical systems. Often, EnKF based algorithms have traded accuracy for reduced computational cost. In this paper, we present a novel parallelizable Conjugate Gradient-based Ensemble Kalman Filter (CGD-EnKF) algorithm that maintains comparable computational cost to efficient algorithms while realizing better state estimation accuracy in select cases. Here, we established the new approach by reformulating a matrix inverse calculation with a classical Conjugate Gradient (CGD) method. In addition, we discuss the upper error bound under CGD, error convergence to the classical EnKF result, and the computational complexity of the algorithm. We also showcase the CGD-EnKF-Reduced algorithm that is shown to be further computationally efficient for highly dimensional dynamical systems under small ensemble formulation. Numerical examples demonstrate the performance of our proposed algorithms and analytical properties, highlighting their comparability and advantages with respect to some benchmark EnKF algorithms. 2026-06-17T16:00:39Z Sanghyun Lee Zhengqi Liu Jonathan Valyou Ludmil Zikatanov http://arxiv.org/abs/2606.19213v1 Evaluating Rust for Sparse Matrix Kernels in Scientific Computing 2026-06-17T15:49:34Z Sparse matrix kernels form the computational backbone of scientific computing, traditionally relying on C/C++ and Fortran implementations that prioritize performance over memory safety. This work evaluates Rust as a systems-level alternative for sparse linear algebra by implementing and benchmarking three core workloads: sparse matrix-vector multiplication (SpMV), Lanczos-based Krylov methods, and matrix-exponential evaluation. We compare native Rust code against established baselines (Intel oneMKL, Eigen, PETSc, and PSBLAS) across a suite of representative matrices. Our results show that Rust's sparse kernels achieve performance comparable to Eigen and PSBLAS, tracking the state-of-the-art for CSC formats, while trailing PETSc's advanced blocked CSR optimizations. By analyzing compile-time monomorphization, SIMD vectorization, and FFI boundaries, we assess the practical impact of Rust's safety model and ecosystem readiness. The study provides concrete, evidence-based guidance for modernizing high-performance numerical software stacks. 2026-06-17T15:49:34Z Luca Lombardo Fabio Durastante http://arxiv.org/abs/2604.08002v2 Invariant Guided PINN for Fluid Flow Computation 2026-06-17T14:38:47Z Physics-informed neural networks (PINNs) often become difficult to optimize for incompressible flow problems with large spatial domains, multiscale stresses, or long-time invariant dynamics. We propose an invariant-guided PINN (IG-PINN) framework that uses partitioned training as a conservative preconditioning stage rather than as the final piecewise representation. A globally defined architecture is trained successively on spatial subdomains or temporal slabs; selected field traces, structural information, and conservative diagnostics are then transferred to a final global correction, yielding a single neural field on the full spatial or space-time domain. The framework is tested on two incompressible flow problems: steady Oldroyd--B flow past a confined cylinder and a rotational Newtonian flow with helicity diagnostics. In the Oldroyd--B case, IG-PINN transfers velocity, polymeric stress, and mass-flux information while avoiding pressure traces at artificial interfaces. In the helicity case, endpoint velocity is transferred through a hard temporal constraint and kinetic energy is controlled during slab training and residual global correction. The experiments demonstrate improved optimization robustness, reduced conservation errors for the cylinder wake, and controlled energy and helicity diagnostics for the transient rotational flow. 2026-04-09T09:07:39Z Zheng Lu Jiwei Jia Bora Aniruddha Xingyu An Young Ju Lee http://arxiv.org/abs/2604.00861v2 Error Estimates for Nitsche's Method on Approximate Domains 2026-06-17T14:34:28Z We derive a priori error estimates for Nitsche's method applied to elliptic problems on approximate domains. Such approximations arise, for example, in unfitted finite element methods, data-driven simulations, and evolving domain problems, where the computational domain does not coincide exactly with the physical one. We quantify geometric errors in terms of boundary location and normal perturbations and carry out the analysis in an abstract CutFEM framework under standard stability assumptions. In the energy norm, we obtain an estimate exhibiting an $h^{-1/2}$ amplification of the boundary location error. We then prove a refined $H^1$-seminorm estimate that removes this amplification, yielding a sharper bound with additive contributions from boundary location and normal errors. Finally, we establish an optimal order $L^2$-error estimate based on a refined duality argument, where the geometry contribution appears as a separate additive term, decoupled from the mesh size $h$. The results reveal a fundamental distinction between the norms: the energy norm amplifies boundary location errors while remaining insensitive to normal perturbations, the $H^1$-seminorm separates location and normal errors, and the $L^2$-norm is insensitive to normal perturbations. This provides a clear characterization of how geometric approximation affects convergence in Nitsche-based finite element methods, with particular relevance for unfitted discretizations. 2026-04-01T13:10:47Z Mats G. Larson Karl Larsson Shantiram Mahata http://arxiv.org/abs/2603.27714v2 Releasing the pressure: High-order surface flow discretizations via discrete Helmholtz-Hodge decompositions 2026-06-17T14:10:52Z We present a discrete Helmholtz--Hodge decomposition for H(div)-conforming Brezzi--Douglas--Marini (BDM) finite elements on triangulated surfaces of arbitrary topology. The divergence-free BDM subspace is split L2-orthogonally into rotated gradients of a continuous streamfunction space and a finite-dimensional space of discrete harmonic fields whose dimension equals the first Betti number of the surface. Consequently, any incompressible flow discretized on this subspace can be reformulated with a scalar streamfunction and finitely many harmonic coefficients as the only unknowns. This eliminates the pressure and the saddle-point structure while ensuring exact tangentiality, pointwise divergence-freeness, and pressure-robustness. We present a randomized algorithm for constructing the harmonic basis and discuss implementation aspects including hybridization, efficient treatment of the harmonic unknowns, and pressure reconstruction. Numerical experiments for unsteady surface Navier--Stokes equations on a trefoil knot and a multiply-connected sculpture surface demonstrate the method and illustrate the physical role of the harmonic velocity component. 2026-03-29T14:34:27Z 28 pages, 7 figures, 2 table Tim Brüers Christoph Lehrenfeld Tim van Beeck Max Wardetzky http://arxiv.org/abs/2606.19059v1 A performance portable fast Ewald summation for Stokes flow 2026-06-17T13:28:38Z We present GPU algorithms for Ewald summation methods for accelerating N-body Stokes flow problems in periodic domains. Like most N-body codes, Ewald sums use a near-field/far-field decomposition. The near field involves particle-to-particle (P2P) interactions. The far field primarily involves particle-to-grid (P2G) and grid-to-particle (G2P) interactions, as well as Fast Fourier Transforms. For each interaction, we investigate several algorithmic variants. Our implementation uses PyKokkos, a Python interface for the Kokkos C++ parallel programming framework, which supports portability to AMD/NVIDIA GPU and ARM/x86 CPU architectures. Double and single-precision numerical results, alongside analytical performance models, confirm the efficiency of our algorithms on AMD and NVIDIA GPU and on ARM and AMD CPU architectures. The P2P interaction achieves around 73% compute efficiency on NVIDIA H200, 84% on NVIDIA A100, 60% on AMD MI300, 52% on Grace CPU, and 68% on AMD Epyc CPU. A straightforward implementation of the P2G kernel can become a computational bottleneck. We introduce a novel P2G algorithm that achieves up to 16$\times$ speedup compared to a baseline GPU implementation. The overall Ewald sum code processes approximately 8 million particles per second on a H200 GPU, and about a half-million particles per second on a Grace CPU, for nine digits of accuracy. We also perform a multi-GPU weak scaling test on up to 256 million particles (64 GPUs) that shows bounded communication cost for all stages except the all-to-all particle sorting, which can be reduced to neighbor communication in the relevant time-stepping regime. 2026-06-17T13:28:38Z 28 pages, 11 figures Gabriel Kosmacher Ziyu Du Joar Bagge George Biros http://arxiv.org/abs/2512.19647v4 Milstein-type Schemes for Hyperbolic SPDEs 2026-06-17T13:15:33Z This article studies the temporal approximation of hyperbolic semilinear stochastic evolution equations with multiplicative Gaussian noise by Milstein-type schemes. We take the term hyperbolic to mean that the leading operator generates a contractive, not necessarily analytic $C_0$-semigroup. Optimal convergence rates are derived for the pathwise uniform strong error \[ E_h^\infty := \Big(\mathbb{E}\Big[\max_{1\le j \le M}\|U_{t_j}-u_j\|_X^p\Big]\Big)^{1/p} \] on a Hilbert space $X$ for $p\in [2,\infty)$. Here, $U$ is the mild solution and $u_j$ its Milstein approximation at time $t_j=jh$ with step size $h>0$ and final time $T=Mh>0$. For sufficiently regular nonlinearity and noise, we establish strong convergence of order one, with the error satisfying $E_h^\infty\lesssim h\sqrt{\log(T/h)}$ for rational Milstein schemes and $E_h^\infty \lesssim h$ for exponential Milstein schemes. This extends previous results from parabolic to hyperbolic SPDEs and from exponential to rational Milstein schemes. Moreover, root-mean-square error estimates are strengthened to pathwise uniform estimates. Numerical experiments validate the convergence rates for the stochastic Schrödinger equation. Further applications to Maxwell's and transport equations are included. 2025-12-22T18:19:45Z 44 pages, 1 figure, 3 tables. Added Subsection 5.2 on an extension of the error estimate to the full time interval and did minor corrections. Comments are welcome! Felix Kastner Katharina Klioba http://arxiv.org/abs/2606.19045v1 Structure-Preserving Schemes for a Fractional SVIR Epidemic Model with a Hybrid Mittag-Leffler-Caputo-Fabrizio Operator 2026-06-17T13:11:50Z This paper proposes and analyzes a fractional-order SVIR epidemic model based on a hybrid Mittag-Leffler-Caputo-Fabrizio (MLCF) fractional operator with a nonsingular kernel. This model captures short- and long-term memory effects in epidemic transmission dynamics. The positivity and boundedness of the solutions are proven through an integrated formulation of the MLCF operator and a fractional Gronwall inequality. The basic reproduction number $\mathcal{R}_0$, equilibrium points, and their local and global stability properties are rigorously investigated through Jacobian analysis, logarithmic Lyapunov functionals, and a fractional LaSalle invariance principle. To approximate the model, a $θ$-weighted nonstandard finite difference (NSFD) method is developed. This method preserves the continuous system's key qualitative properties, including positivity and boundedness, and is unconditionally stable in the fully implicit case. Consistency and first-order convergence are also proven. Numerical experiments, together with sensitivity and bifurcation analyses, illustrate the impact of fractional memory parameters on epidemic evolution and demonstrate the effectiveness of the proposed approach. 2026-06-17T13:11:50Z 31 pages, 10 figures Seham M. Al-Mekhlafi Ahmed Boudaoui Matthias Ehrhardt http://arxiv.org/abs/2510.14805v4 An Augmented Lagrangian Method-Based Framework in the Adjoint Space for Sparse Reconstruction of Acoustic Sources 2026-06-17T12:37:14Z We propose a semismooth Newton-based augmented Lagrangian framework for reconstructing sparse sources in inverse acoustic scattering problems. Rather than working in the unknown source space, our semismooth Newton updates operate in the measurement (adjoint) space, which is especially efficient when the number of measurements is much smaller than the discretized source dimension. The source is then recovered via Fenchel-Rockafellar duality. Our approach substantially accelerates computation and reduces costs. Numerical experiments in two and three dimensions demonstrate the high efficiency of the proposed method. 2025-10-16T15:39:52Z Nirui Tan Hongpeng Sun