https://arxiv.org/api/hHGdrySBYpLYsfTesztCKolJ/U0
2026-06-21T08:06:18Z
498703015

http://arxiv.org/abs/2606.19614v1
On a class of modified Cayley--Magnus methods
2026-06-17T21:38:27Z
We introduce a new class of numerical integrators for the time integration of non-autonomous linear ordinary differential equations whose coefficient matrix is sparse and evolves within a quadratic matrix Lie group. In contrast to standard Lie group integrators, the proposed methods avoid the evaluation of matrix exponentials acting on vectors and instead rely on solving a sequence of linear systems with sparse coefficient matrices. Moreover, they are well suited for problems arising from unbounded operators, as they inherently produce bounded solutions. We construct optimised schemes of orders four and six and assess their performance on a representative numerical example, demonstrating clear advantages over existing Lie-group integrators.
2026-06-17T21:38:27Z
Sergio Blanes, Fernando Casas, Arieh Iserles

http://arxiv.org/abs/2606.19611v1
Bregman-projected mirror methods for regularized stationary mean-field games
2026-06-17T21:33:47Z
We develop and analyze a Bregman-projected mirror iteration for low-order regularizations of stationary mean-field game (MFG) systems in their natural Banach space setting. For separable Hamiltonians of the form \(H(x,p,m)=H_0(x,p)-g(m)\), with quadratic or super-quadratic Hamiltonian growth and linear or super-linear density couplings, we formulate a low-order \(\barγ\)-Laplacian regularization of the stationary MFG system as a variational inequality on \(L^{\barβ}(\mathbb T^d)\times W^{1,\barγ}(\mathbb T^d)\). To approximate solutions of this regularized variational inequality, we introduce a Bregman geometry matched to the mixed Lebesgue--Sobolev exponents of the problem and analyze a constrained two-step mirror method with frozen operator evaluation.
For the exact constrained iteration and each fixed regularization parameter \(\epsilon>0\), we derive a one-step Bregman inequality and use it to prove that the constrained iteration converges strongly to the unique solution of the regularized variational inequality under natural summability conditions on the step sizes. Numerical experiments on one- and two-dimensional models, validated against exact test solutions, illustrate residual decay under mesh refinement and suggest improved practical performance of the two-step implementation in the tested discretizations.
2026-06-17T21:33:47Z
Hussain Al Abdulaziz, Yuri Ashrafyan, Yeva Gevorgyan, Diogo Gomes

http://arxiv.org/abs/2506.11719v3
Automatic differentiation for performing the Cauchy-Kovalevskaya procedure in Lax-Wendroff type discretizations
2026-06-17T19:20:18Z
Lax-Wendroff methods combined with discontinuous Galerkin/flux reconstruction spatial discretization provide a high-order, single-stage, quadrature-free method for solving hyperbolic conservation laws. In this work, we introduce automatic differentiation (AD) for performing the Cauchy-Kowalewski procedure used in the element-local time average flux computation step (the predictor step) of Lax-Wendroff methods. The application of AD is similar for methods of any order and does not need positivity corrections during the predictor step. This contrasts with the approximate Lax-Wendroff procedure, which requires different finite difference formulas for different orders of the method and positivity corrections in the predictor step for fluxes that can only be computed on admissible states. The method is Jacobian-free and problem-independent, allowing direct application to any physical flux function. Numerical experiments demonstrate the order and positivity preservation of the method.
Additionally, performance comparisons indicate that the wall-clock time of automatic differentiation is always on par with the approximate Lax-Wendroff method.
2025-06-13T12:34:30Z
Journal of Computational Physics, 15 October 2026, article 115101, Volume 563
Arpit Babbar, Valentin Churavy, Michael Schlottke-Lakemper, Hendrik Ranocha
10.1016/j.jcp.2026.115101

http://arxiv.org/abs/2606.19508v1
Higher Accuracy Modular Data Assimilation for the Navier-Stokes Equations
2026-06-17T18:46:49Z
This paper develops an accurate and effective combination of second order backward differentiation time discretization (BDF2) with modular, 2-step nudging-based data assimilation \begin{align} \text{Forecast step: } \quad &\frac{3\widetilde{v}^{n+2}-4v^{n+1}+v^n}{2Δt}+\widetilde{v}^{n+2} \cdot \nabla \widetilde{v}^{n+2} - νΔ\widetilde{v}^{n+2} + \nabla q^{n+2}=f(x) \notag \\ &\nabla \cdot \widetilde{v}^{n+2} = 0 \notag \\ \text{Analysis step: } \quad &\frac{3v^{n+2}-3\widetilde{v}^{n+2}}{2Δt}-χI_H(u(t^{n+2})-v^{n+2})=0. \notag \end{align} If $I_H=I_H^2$, the analysis step can be made explicit, taking the form \begin{align} v^{n+2}=\widetilde{v}^{n+2}+\frac{2Δtχ}{3+2Δtχ}I_H(u^{n+2}-\widetilde{v}^{n+2}). \notag \end{align} This implies the analysis step has the stability property of an implicit step and lower complexity than an explicit analysis step. Stability and error estimates for the BDF2 scheme are presented along with their proofs. Numerical experiments are conducted to assess the performance of the BDF2 modular assimilation algorithm.
The results of the experiments support the conclusion that modular data assimilation has comparable accuracy to standard, fully coupled data assimilation while greatly reducing computational complexity and cost.
2026-06-17T18:46:49Z
27 pages, 7 figures, 3 tables
Troy Yang

http://arxiv.org/abs/2606.19471v1
Moreau-Yosida-based Kohn-Sham Inversion for Periodic Systems
2026-06-17T18:04:33Z
Density-potential inversion for periodic systems within Moreau-Yosida-regularised density-functional theory is investigated, both theoretically and numerically. We develop the framework in a periodic homogeneous Sobolev space and use it to recover the exchange-correlation potential of Kohn-Sham theory through a limiting procedure. A key analytical ingredient is the proof of lower semicontinuity of the non-interacting kinetic-energy functional in the chosen topology. The proximal mapping, together with its algorithmic evaluation, plays a central role in the resulting inversion scheme. Numerical experiments illustrate the performance and properties of the method for both the Kohn-Sham and Gross-Pitaevskii equations.
2026-06-17T18:04:33Z
Vebjørn H. Bakkestuen, Michael F. Herbst, Vegard Falmår, Markus Penz, Andre Laestadius

http://arxiv.org/abs/2602.03322v2
Weighted finite difference methods for a nonlinear Klein-Gordon equation with high oscillations in space and time
2026-06-17T16:30:02Z
We consider a nonlinear Klein-Gordon equation in the nonrelativistic limit regime with initial data in the form of a modulated highly oscillatory exponential. In this regime of a small scaling parameter $\varepsilon\ll 1$, the solution exhibits rapid oscillations in both time and space. The solution is approximated, up to $\mathcal{O}(\varepsilon)$, by a superposition of two polarized solutions, which are wave packets that move with opposite group velocities proportional to $\varepsilon^{-1}$.
The equations for polarized solutions are formulated in co-moving coordinates and are then discretized by an explicit and an implicit exponentially weighted finite difference method. While the explicit weighted leapfrog method needs to satisfy a CFL-type stability condition, the implicit weighted Crank-Nicolson method is unconditionally stable. Both methods achieve second-order accuracy with time steps and mesh sizes that are not restricted in magnitude by $\varepsilon$. For the approximation of polarized solutions, the methods are uniformly convergent in the range from arbitrarily small to moderately bounded $\varepsilon$. Numerical experiments illustrate the theoretical results.
2026-02-03T09:49:09Z
Yanyan Shi, Christian Lubich

http://arxiv.org/abs/2606.19224v1
A Conjugate Gradient Formulation of the EnKF Algorithm
2026-06-17T16:00:39Z
Ensemble Kalman Filter (EnKF) based data assimilation algorithms synthesize predictive numerical forecast models with accumulated data as time evolves and account for model uncertainty and noisy measurements. The computational cost of these algorithms can be high, in particular for high-dimensional dynamical systems. Often, EnKF based algorithms have traded accuracy for reduced computational cost. In this paper, we present a novel parallelizable Conjugate Gradient-based Ensemble Kalman Filter (CGD-EnKF) algorithm that maintains comparable computational cost to efficient algorithms while realizing better state estimation accuracy in select cases. Here, we establish the new approach by reformulating a matrix inverse calculation with a classical Conjugate Gradient (CGD) method. In addition, we discuss the upper error bound under CGD, error convergence to the classical EnKF result, and the computational complexity of the algorithm. We also showcase the CGD-EnKF-Reduced algorithm, which is shown to be even more computationally efficient for high-dimensional dynamical systems under a small-ensemble formulation.
Numerical examples demonstrate the performance of our proposed algorithms and analytical properties, highlighting their comparability and advantages with respect to some benchmark EnKF algorithms.
2026-06-17T16:00:39Z
Sanghyun Lee, Zhengqi Liu, Jonathan Valyou, Ludmil Zikatanov

http://arxiv.org/abs/2606.19213v1
Evaluating Rust for Sparse Matrix Kernels in Scientific Computing
2026-06-17T15:49:34Z
Sparse matrix kernels form the computational backbone of scientific computing, traditionally relying on C/C++ and Fortran implementations that prioritize performance over memory safety. This work evaluates Rust as a systems-level alternative for sparse linear algebra by implementing and benchmarking three core workloads: sparse matrix-vector multiplication (SpMV), Lanczos-based Krylov methods, and matrix-exponential evaluation. We compare native Rust code against established baselines (Intel oneMKL, Eigen, PETSc, and PSBLAS) across a suite of representative matrices. Our results show that Rust's sparse kernels achieve performance comparable to Eigen and PSBLAS, tracking the state-of-the-art for CSC formats, while trailing PETSc's advanced blocked CSR optimizations. By analyzing compile-time monomorphization, SIMD vectorization, and FFI boundaries, we assess the practical impact of Rust's safety model and ecosystem readiness. The study provides concrete, evidence-based guidance for modernizing high-performance numerical software stacks.
2026-06-17T15:49:34Z
Luca Lombardo, Fabio Durastante

http://arxiv.org/abs/2604.08002v2
Invariant Guided PINN for Fluid Flow Computation
2026-06-17T14:38:47Z
Physics-informed neural networks (PINNs) often become difficult to optimize for incompressible flow problems with large spatial domains, multiscale stresses, or long-time invariant dynamics. We propose an invariant-guided PINN (IG-PINN) framework that uses partitioned training as a conservative preconditioning stage rather than as the final piecewise representation.
A globally defined architecture is trained successively on spatial subdomains or temporal slabs; selected field traces, structural information, and conservative diagnostics are then transferred to a final global correction, yielding a single neural field on the full spatial or space-time domain. The framework is tested on two incompressible flow problems: steady Oldroyd--B flow past a confined cylinder and a rotational Newtonian flow with helicity diagnostics. In the Oldroyd--B case, IG-PINN transfers velocity, polymeric stress, and mass-flux information while avoiding pressure traces at artificial interfaces. In the helicity case, endpoint velocity is transferred through a hard temporal constraint and kinetic energy is controlled during slab training and residual global correction. The experiments demonstrate improved optimization robustness, reduced conservation errors for the cylinder wake, and controlled energy and helicity diagnostics for the transient rotational flow.
2026-04-09T09:07:39Z
Zheng Lu, Jiwei Jia, Bora Aniruddha, Xingyu An, Young Ju Lee

http://arxiv.org/abs/2604.00861v2
Error Estimates for Nitsche's Method on Approximate Domains
2026-06-17T14:34:28Z
We derive a priori error estimates for Nitsche's method applied to elliptic problems on approximate domains. Such approximations arise, for example, in unfitted finite element methods, data-driven simulations, and evolving domain problems, where the computational domain does not coincide exactly with the physical one.
We quantify geometric errors in terms of boundary location and normal perturbations and carry out the analysis in an abstract CutFEM framework under standard stability assumptions. In the energy norm, we obtain an estimate exhibiting an $h^{-1/2}$ amplification of the boundary location error. We then prove a refined $H^1$-seminorm estimate that removes this amplification, yielding a sharper bound with additive contributions from boundary location and normal errors. Finally, we establish an optimal order $L^2$-error estimate based on a refined duality argument, where the geometry contribution appears as a separate additive term, decoupled from the mesh size $h$.
The results reveal a fundamental distinction between the norms: the energy norm amplifies boundary location errors while remaining insensitive to normal perturbations, the $H^1$-seminorm separates location and normal errors, and the $L^2$-norm is insensitive to normal perturbations. This provides a clear characterization of how geometric approximation affects convergence in Nitsche-based finite element methods, with particular relevance for unfitted discretizations.
2026-04-01T13:10:47Z
Mats G. Larson, Karl Larsson, Shantiram Mahata

http://arxiv.org/abs/2603.27714v2
Releasing the pressure: High-order surface flow discretizations via discrete Helmholtz-Hodge decompositions
2026-06-17T14:10:52Z
We present a discrete Helmholtz--Hodge decomposition for H(div)-conforming Brezzi--Douglas--Marini (BDM) finite elements on triangulated surfaces of arbitrary topology. The divergence-free BDM subspace is split L2-orthogonally into rotated gradients of a continuous streamfunction space and a finite-dimensional space of discrete harmonic fields whose dimension equals the first Betti number of the surface. Consequently, any incompressible flow discretized on this subspace can be reformulated with a scalar streamfunction and finitely many harmonic coefficients as the only unknowns. This eliminates the pressure and the saddle-point structure while ensuring exact tangentiality, pointwise divergence-freeness, and pressure-robustness. We present a randomized algorithm for constructing the harmonic basis and discuss implementation aspects including hybridization, efficient treatment of the harmonic unknowns, and pressure reconstruction.
Numerical experiments for unsteady surface Navier--Stokes equations on a trefoil knot and a multiply-connected sculpture surface demonstrate the method and illustrate the physical role of the harmonic velocity component.
2026-03-29T14:34:27Z
28 pages, 7 figures, 2 tables
Tim Brüers, Christoph Lehrenfeld, Tim van Beeck, Max Wardetzky

http://arxiv.org/abs/2606.19059v1
A performance portable fast Ewald summation for Stokes flow
2026-06-17T13:28:38Z
We present GPU algorithms for Ewald summation methods for accelerating N-body Stokes flow problems in periodic domains. Like most N-body codes, Ewald sums use a near-field/far-field decomposition. The near field involves particle-to-particle (P2P) interactions. The far field primarily involves particle-to-grid (P2G) and grid-to-particle (G2P) interactions, as well as Fast Fourier Transforms. For each interaction, we investigate several algorithmic variants. Our implementation uses PyKokkos, a Python interface for the Kokkos C++ parallel programming framework, which supports portability to AMD/NVIDIA GPU and ARM/x86 CPU architectures. Double and single-precision numerical results, alongside analytical performance models, confirm the efficiency of our algorithms on AMD and NVIDIA GPU and on ARM and AMD CPU architectures. The P2P interaction achieves around 73% compute efficiency on NVIDIA H200, 84% on NVIDIA A100, 60% on AMD MI300, 52% on Grace CPU, and 68% on AMD Epyc CPU. A straightforward implementation of the P2G kernel can become a computational bottleneck. We introduce a novel P2G algorithm that achieves up to 16$\times$ speedup compared to a baseline GPU implementation. The overall Ewald sum code processes approximately 8 million particles per second on an H200 GPU, and about a half-million particles per second on a Grace CPU, for nine digits of accuracy.
We also perform a multi-GPU weak scaling test on up to 256 million particles (64 GPUs) that shows bounded communication cost for all stages except the all-to-all particle sorting, which can be reduced to neighbor communication in the relevant time-stepping regime.
2026-06-17T13:28:38Z
28 pages, 11 figures
Gabriel Kosmacher, Ziyu Du, Joar Bagge, George Biros

http://arxiv.org/abs/2512.19647v4
Milstein-type Schemes for Hyperbolic SPDEs
2026-06-17T13:15:33Z
This article studies the temporal approximation of hyperbolic semilinear stochastic evolution equations with multiplicative Gaussian noise by Milstein-type schemes. We take the term hyperbolic to mean that the leading operator generates a contractive, not necessarily analytic $C_0$-semigroup. Optimal convergence rates are derived for the pathwise uniform strong error \[
E_h^\infty := \Big(\mathbb{E}\Big[\max_{1\le j \le M}\|U_{t_j}-u_j\|_X^p\Big]\Big)^{1/p} \] on a Hilbert space $X$ for $p\in [2,\infty)$. Here, $U$ is the mild solution and $u_j$ its Milstein approximation at time $t_j=jh$ with step size $h>0$ and final time $T=Mh>0$. For sufficiently regular nonlinearity and noise, we establish strong convergence of order one, with the error satisfying $E_h^\infty\lesssim h\sqrt{\log(T/h)}$ for rational Milstein schemes and $E_h^\infty \lesssim h$ for exponential Milstein schemes. This extends previous results from parabolic to hyperbolic SPDEs and from exponential to rational Milstein schemes. Moreover, root-mean-square error estimates are strengthened to pathwise uniform estimates. Numerical experiments validate the convergence rates for the stochastic Schrödinger equation. Further applications to Maxwell's and transport equations are included.
2025-12-22T18:19:45Z
44 pages, 1 figure, 3 tables. Added Subsection 5.2 on an extension of the error estimate to the full time interval and made minor corrections. Comments are welcome!
Felix Kastner, Katharina Klioba

http://arxiv.org/abs/2606.19045v1
Structure-Preserving Schemes for a Fractional SVIR Epidemic Model with a Hybrid Mittag-Leffler-Caputo-Fabrizio Operator
2026-06-17T13:11:50Z
This paper proposes and analyzes a fractional-order SVIR epidemic model based on a hybrid Mittag-Leffler-Caputo-Fabrizio (MLCF) fractional operator with a nonsingular kernel. This model captures short- and long-term memory effects in epidemic transmission dynamics. The positivity and boundedness of the solutions are proven through an integrated formulation of the MLCF operator and a fractional Gronwall inequality. The basic reproduction number $\mathcal{R}_0$, equilibrium points, and their local and global stability properties are rigorously investigated through Jacobian analysis, logarithmic Lyapunov functionals, and a fractional LaSalle invariance principle.
To approximate the model, a $θ$-weighted nonstandard finite difference (NSFD) method is developed. This method preserves the continuous system's key qualitative properties, including positivity and boundedness, and is unconditionally stable in the fully implicit case. Consistency and first-order convergence are also proven. Numerical experiments, together with sensitivity and bifurcation analyses, illustrate the impact of fractional memory parameters on epidemic evolution and demonstrate the effectiveness of the proposed approach.
2026-06-17T13:11:50Z
31 pages, 10 figures
Seham M. Al-Mekhlafi, Ahmed Boudaoui, Matthias Ehrhardt

http://arxiv.org/abs/2510.14805v4
An Augmented Lagrangian Method-Based Framework in the Adjoint Space for Sparse Reconstruction of Acoustic Sources
2026-06-17T12:37:14Z
We propose a semismooth Newton-based augmented Lagrangian framework for reconstructing sparse sources in inverse acoustic scattering problems. Rather than working in the unknown source space, our semismooth Newton updates operate in the measurement (adjoint) space, which is especially efficient when the number of measurements is much smaller than the discretized source dimension. The source is then recovered via Fenchel-Rockafellar duality. Our approach substantially accelerates computation and reduces costs. Numerical experiments in two and three dimensions demonstrate the high efficiency of the proposed method.
2025-10-16T15:39:52Z
Nirui Tan, Hongpeng Sun