https://arxiv.org/api/sv+WLScpEK7IhXGa1wGThF/YjQg 2026-03-22T11:41:23Z 48280 45 15 http://arxiv.org/abs/2603.17523v1 Translation Invariance of Neural Operators for the FitzHugh-Nagumo Model 2026-03-18T09:28:48Z

Neural Operators (NOs) are a powerful deep learning framework designed to learn the solution operator that arise from partial differential equations. This study investigates NOs ability to capture the stiff spatio-temporal dynamics of the FitzHugh-Nagumo model, which describes excitable cells. A key contribution of this work is evaluating the translation invariance using a novel training strategy. NOs are trained using an applied current with varying spatial locations and intensities at a fixed time, and the test set introduces a more challenging out-of-distribution scenario in which the applied current is translated in both time and space. This approach significantly reduces the computational cost of dataset generation. Moreover we benchmark seven NOs architectures: Convolutional Neural Operators (CNOs), Deep Operator Networks (DONs), DONs with CNN encoder (DONs-CNN), Proper Orthogonal Decomposition DONs (POD-DONs), Fourier Neural Operators (FNOs), Tucker Tensorized FNOs (TFNOs), Localized Neural Operators (LocalNOs). We evaluated these models based on training and test accuracy, efficiency, and inference speed. Our results reveal that CNOs performs well on translated test dynamics. However, they require higher training costs, though their performance on the training set is similar to that of the other considered architectures. In contrast, FNOs achieve the lowest training error, but have the highest inference time. Regarding the translated dynamics, FNOs and their variants provide less accurate predictions. Finally, DONs and their variants demonstrate high efficiency in both training and inference, however they do not generalize well to the test set. These findings highlight the current capabilities and limitations of NOs in capturing complex ionic model dynamics and provide a comprehensive benchmark including their application to scenarios involving translated dynamics.

2026-03-18T09:28:48Z Luca Pellegrini http://arxiv.org/abs/2502.07341v2 Operator splitting algorithm for structured population models on metric spaces 2026-03-18T08:53:39Z

In this paper, we propose a numerical scheme for structured population models defined on a separable and complete metric space. In particular, we consider a generalized version of a transport equation with additional growth and non-local interaction terms in the space of nonnegative Radon measures equipped with the flat metric. The solutions, given by families of Radon measures, are approximated by linear combinations of Dirac measures. For this purpose, we introduce a finite-range approximation of the measure-valued model functions, provided that they are linear. By applying an operator splitting technique, we are able to separate the effects of the transport from those of growth and the non-local interaction. We derive the order of convergence of the numerical scheme, which is linear in the spatial discretization parameters and polynomial of order $α$ in the time step size, assuming that the model functions are $α$ Hölder regular in time. In a second step, we show that our proposed algorithm can approximate the posterior measure of Bayesian inverse models, which will allow us to link model parameters to measured data in the future.

2025-02-11T08:08:51Z Updated Acknowledgements, shortened calculations without changing results Carolin Lindow Institute for Mathematics, Heidelberg University, Germany Christian Düll Institute for Mathematics, Heidelberg University, Germany Piotr Gwiazda Institute of Mathematics of Polish Academy of Sciences, Warsaw, Poland Interdisciplinary Centre for Mathematical and Computational Modelling Błażej Miasojedow Institute of Applied Mathematics and Mechanics, University of Warsaw, Poland Anna Marciniak-Czochra Institute for Mathematics, Heidelberg University, Germany Interdisciplinary Center for Scientific Computing http://arxiv.org/abs/2603.17477v1 A New Fractional Step Structure Preserving Method for The Landau-Lifshitz-Gilbert Equation 2026-03-18T08:32:20Z

In this paper, we propose a structure preserving method using a Crank-Nicolson's type method with an implicit Gauss-Seidel fractional iteration. Such a method is of first-order accuracy in time and second-order accuracy in space, stable and length preserving. Such a proposed method brings great benefits for the theoretical analysis. The numerical accuracy, norm preserving and stability are verified for 1D and 3D tests.

2026-03-18T08:32:20Z Changjian Xie http://arxiv.org/abs/2603.17467v1 Wavenumber-explicit $hp$-FEM analysis of Maxwell's equations with impedance boundary conditions in piecewise smooth media 2026-03-18T08:20:18Z

We consider the time-harmonic Maxwell equations with impedance boundary conditions on a bounded Lipschitz domain $Ω$ with analytic boundary $Γ$. We suppose that $Ω$ consists of multiple subdomains, and that the permeability and permittivity tensors are analytic on every subdomain, but may jump across subdomain interfaces. Under these conditions we show that for any wavenumber $k\in\mathbb{C}$ with $|k|\geq 1$ for which Maxwell's equations are polynomially well-posed, a Galerkin discretization based on Nédélec elements of order $p$ on a mesh with mesh width $h$ is quasi-optimal, provided that there holds the wavenumber-explicit scale resolution condition a) that $|k|h/p$ is sufficiently small and b) that $p/\log |k|$ is bounded from below.

2026-03-18T08:20:18Z Jens Markus Melenk David Wörgötter http://arxiv.org/abs/2603.17466v1 A Full-Density Approach to Simulating Random Iteration Equations with Applications 2026-03-18T08:19:41Z

The goal of this study is to introduce a unified computational framework for simulating random iteration equations (RIE), understood as iteration equations containing random variables. The novelty of this work is that full probability densities of the state vectors are propagated stepwise through the iterations avoiding the need of repetitive pathwise Monte Carlo simulations of the iteration equation. The presentation of the methodology is conceptually efficient based on recent work on static random equations and intentionally accessible. The technical requirements on the RIE are minimal based on the previous work, allowing for potential nonlinearities, discontinuities and stochasticities in the transfer function, as well as nonstandard densities and diffusion processes. As results, illustrative applications of random and stochastic differential equation simulations, a novel full-density gradient descent method (FDGD) for global optimization under uncertainty and examples of chaotic mappings are presented in order to demonstrate the breadth of the utility of this framework. In total, the character of the presentation is explorative and encourages new applications and theoretical studies.

2026-03-18T08:19:41Z Wolfgang Hoegele http://arxiv.org/abs/2603.17448v1 Modified Halley's method for computation of zeros of solution of second order ODEs 2026-03-18T07:42:47Z

This paper develops an efficient iterative method for computing all zeros of solutions of second order ordinary differential equations. A third order Halleys method is first derived by approximating the solution of an associated Riccati differential equation. To improve computational efficiency, a modified Halleys method is proposed by fixing one of the functions in Halleys scheme as a constant. The modified Halleys method also retains third order convergence. Based on the behavior of the coefficients of the second order ODE, nonlocal convergence results are established for both Halleys and modified Halleys methods. Suitable initial guesses for computing all zeros of solutions of second order ODEs in a given interval are also presented for both methods. Furthermore, algorithms based on the modified Halleys method are developed for to compute all nodes and weights for Gauss Legendre and Gauss Hermite quadratures. A comparative numerical study with recent methods demonstrates the efficiency of the proposed algorithms.

2026-03-18T07:42:47Z Dhivya Prabhu K Sanjeev Singh Antony Vijesh http://arxiv.org/abs/2510.10322v4 A Spatio-temporal CP decomposition analysis of New England region in the US 2026-03-18T07:21:14Z

Spatio temporal data consist of measurement for one or more raster fields such as weather, traffic volume, crime rate, or disease incidents. Advances in modern technology have increased the number of available information for this type of data hence the rise of multidimensional data. In this paper we take advantage of the multidimensional structure of the data but also its temporal and spatial structure. In fact, we will be using the NCAR Climate Data Gateway website which provides data discovery and access services for global and regional climate model data. The daily values of total precipitation (prec), maximum (tmax), and minimum (tmin) temperature are combined to create a multidimensional data called tensor (a multidimensional array). In this paper, we propose a spatio temporal principal component analysis to initialize CP decomposition component. We take full advantage of the spatial and temporal structure of the data in the initialization step for cp component analysis. The performance of our method is tested via comparison with most popular initialization method. We also run a clustering analysis to further show the performance of our analysis.

2025-10-11T19:34:16Z 14 pages, 3 figures Fatoumata Sanogo http://arxiv.org/abs/2508.16898v2 Enhanced shape recovery in advection--diffusion problems via a novel ADMM-based CCBM optimization 2026-03-18T03:00:33Z

This work proposes a novel shape optimization framework for geometric inverse problems governed by the advection--diffusion equation, based on the coupled complex boundary method (CCBM). Building on recent developments [Afr22, Rab23, Rab25, RAN25, RN24], we aim to recover the shape of an unknown inclusion via shape optimization driven by a cost functional constructed from the imaginary part of the complex-valued state variable over the entire domain. We rigorously derive the associated shape derivative in variational form and provide explicit expressions for the gradient and second-order information. Optimization is carried out using a Sobolev gradient method within a finite element framework. To address difficulties in reconstructing obstacles with concave boundaries, particularly under measurement noise and the combined effects of advection and diffusion, we introduce a state-of-the-art numerical scheme inspired by the Alternating Direction Method of Multipliers (ADMM). In addition to implementing this non-conventional approach, we demonstrate how the adjoint method can be efficiently applied and utilize partial gradients todevelop a more efficient CCBM-ADMM scheme. The accuracy and robustness of the proposed computational approach are validated through various numerical experiments.

2025-08-23T05:02:23Z 24 pages Elmehdi Cherrat Lekbir Afraites Julius Fergy Tiongson Rabago http://arxiv.org/abs/2603.16239v2 Neural Pushforward Samplers for the Fokker-Planck Equation on Embedded Riemannian Manifolds 2026-03-18T01:00:59Z

In this paper, we extend the Weak Adversarial Neural Pushforward Method to the Fokker--Planck equation on compact embedded Riemannian manifolds. The method represents the solution as a probability distribution via a neural pushforward map that is constrained to the manifold by a retraction layer, enforcing manifold membership and probability conservation by construction. Training is guided by a weak adversarial objective using ambient plane-wave test functions, whose intrinsic differential operators are derived in closed form from the geometry of the embedding, yielding a fully mesh-free and chart-free algorithm. Both steady-state and time-dependent formulations are developed, and numerical results on a double-well problem on the two-sphere demonstrate the capability of the method in capturing multimodal invariant distributions on curved spaces.

2026-03-17T08:24:52Z 13 pages, 2 figures, 1 table, 1 algorithm Andrew Qing He Wei Cai http://arxiv.org/abs/2405.00891v7 An interacting particle consensus method for constrained global optimization 2026-03-18T00:40:49Z

This paper presents a particle-based optimization method designed for addressing minimization problems with equality constraints, particularly in cases where the loss function exhibits non-differentiability or non-convexity. The proposed method combines components from consensus-based optimization algorithm with a newly introduced forcing term directed at the constraint set. A rigorous mean-field limit of the particle system is derived, and the convergence of the mean-field limit to the constrained minimizer is established. Additionally, we introduce a stable discretized algorithm and conduct various numerical experiments to demonstrate the performance of the proposed method.

2024-05-01T22:32:03Z José A. Carrillo Shi Jin Haoyu Zhang Yuhua Zhu http://arxiv.org/abs/2603.17143v1 On the role of relaxation and acceleration in the non-overlapping Schwarz alternating method for coupling 2026-03-17T21:14:42Z

The purpose of this paper is to study the influence of relaxation and acceleration techniques on the convergence behavior of the non-overlapping Schwarz algorithm with alternating Dirichlet-Neumann transmission conditions in the context of domain decomposition- (DD-) based coupling. After demonstrating that the multiplicative Schwarz scheme can be formulated as a fixed-point iteration, we explore, both theoretically and numerically, two promising techniques for speeding up the method: (i) Aitken acceleration and (ii) Anderson acceleration. In the process, we derive a robust and efficient adaptive variant of Anderson acceleration, termed "Anderson with memory adaptation". We compare the proposed acceleration strategies to the well-known classical relaxed Dirichlet-Neumann Schwarz alternating method. Our results suggest that, while Aitken-accelerated Schwarz is the best approach in terms efficiency and robustness when considering two sub-domain DDs, Anderson-accelerated Schwarz is the method of choice in larger multi-domain setting.

2026-03-17T21:14:42Z Giulia Sambataro Irina Tezaur http://arxiv.org/abs/2603.17014v1 A space-time dual-pairing summation-by-parts framework for forward and adjoint wave equations 2026-03-17T18:01:18Z

In this paper, we propose the first of its kind space-time dual-pairing summation by parts (DP-SBP) numerical framework for forward and adjoint wave propagation problems. This novel approach enables us to achieve spatial and temporal high order accuracy while naturally introducing dissipation in time. Within this framework, initial and boundary conditions are weakly imposed using the simultaneous approximation term (SAT) technique. Fully discrete energy estimates are derived, ensuring the stability of the resulting numerical scheme. Furthermore, the proposed space-time numerical framework allows us to construct adjoint consistent fully discrete numerical approximations, which can be applied to solve inverse wave propagation problems. We provide numerical experiments in one and two spatial dimensions to verify the theoretical analysis and demonstrate convergence of numerical errors.

2026-03-17T18:01:18Z Kenny Wiratama Kenneth Duru Yunho Kim http://arxiv.org/abs/2603.16850v1 Unifying Optimization and Dynamics to Parallelize Sequential Computation: A Guide to Parallel Newton Methods for Breaking Sequential Bottlenecks 2026-03-17T17:55:01Z

Massively parallel hardware (GPUs) and long sequence data have made parallel algorithms essential for machine learning at scale. Yet dynamical systems, like recurrent neural networks and Markov chain Monte Carlo, were thought to suffer from sequential bottlenecks. Recent work showed that dynamical systems can in fact be parallelized across the sequence length by reframing their evaluation as a system of nonlinear equations, which can be solved with Newton's method using a parallel associative scan. However, these parallel Newton methods struggled with limitations, primarily inefficiency, instability, and lack of convergence guarantees. This thesis addresses these limitations with methodological and theoretical contributions, drawing particularly from optimization. Methodologically, we develop scalable and stable parallel Newton methods, based on quasi-Newton and trust-region approaches. The quasi-Newton methods are faster and more memory efficient, while the trust-region approaches are significantly more stable. Theoretically, we unify many fixed-point methods into our parallel Newton framework, including Picard and Jacobi iterations. We establish a linear convergence rate for these techniques that depends on the method's approximation accuracy and stability. Moreover, we give a precise condition, rooted in dynamical stability, that characterizes when parallelization provably accelerates a dynamical system and when it cannot. Specifically, the sign of the Largest Lyapunov Exponent of a dynamical system determines whether or not parallel Newton methods converge quickly. In sum, this thesis unlocks scalable and stable methods for parallelizing sequential computation, and provides a firm theoretical basis for when such techniques will and will not work. This thesis also serves as a guide to parallel Newton methods for researchers who want to write the next chapter in this ongoing story.

2026-03-17T17:55:01Z PhD Dissertation; Stanford University Xavier Gonzalez 10.25740/vf943fc9855 http://arxiv.org/abs/2510.14759v2 On the convergence of stochastic variance reduced gradient for linear inverse problems 2026-03-17T16:21:01Z

Stochastic variance reduced gradient (SVRG) is an accelerated version of stochastic gradient descent based on variance reduction, and is promising for solving large-scale inverse problems. In this work, we analyze SVRG and a regularized version that incorporates a priori knowledge of the problem, for solving linear inverse problems in Hilbert spaces. We prove that, with suitable constant step size schedules and regularity conditions, the regularized SVRG can achieve optimal convergence rates in terms of the noise level without any early stopping rules, provided that the truncation level is chosen suitably, and standard SVRG is also optimal for problems with nonsmooth solutions under a priori stopping rules. The analysis is based on an explicit error recursion and suitable a priori estimates on the inner loop updates with respect to the anchor point. Numerical experiments are provided to complement the theoretical analysis.

2025-10-16T14:59:11Z 29 pages, 2 figures Bangti Jin Zehui Zhou http://arxiv.org/abs/2603.16980v1 Interpretable AI-Assisted Early Reliability Prediction for a Two-Parameter Parallel Root-Finding Scheme 2026-03-17T15:17:35Z

We propose an interpretable AI-assisted reliability diagnostic framework for parameterized root-finding schemes based on kNN-LLE proxy stability profiling and multi-horizon early prediction. The approach augments a numerical solver with a lightweight predictive layer that estimates solver reliability from short prefixes of iteration dynamics, enabling early identification of stable and unstable parameter regimes. For each configuration in the parameter space, raw and smoothed proxy profiles of a largest Lyapunov exponent (LLE) estimator are constructed, from which contractivity-based reliability scores summarizing finite-time convergence are derived. Machine learning models predict the reliability score from early segments of the proxy profile, allowing the framework to determine when solver dynamics become diagnostically informative. Experiments on a two-parameter parallel root-finding scheme show reliable prediction after only a few iterations: the best models achieve R^2=0.48 at horizon T=1, improve to R^2=0.67 by T=3, and exceed R^2=0.89 before the characteristic minimum-location scale of the stability profile. Prediction accuracy increases to R^2=0.96 at larger horizons, with mean absolute errors around 0.03, while inference costs remain negligible (microseconds per sample). The framework provides interpretable stability indicators and supports early decisions during solver execution, such as continuing, restarting, or adjusting parameters.

2026-03-17T15:17:35Z 23 pages, 9 figures Bruno Carpentieri Andrei Velichko Mudassir Shams Paola Lecca