Nonlinear Incompressible Shear Wave Models in Hyperelasticity and Viscoelasticity Frameworks, with Applications to Love Waves

2026-03-18T21:35:47Z

General equations describing shear displacements in incompressible hyperelastic materials, holding for an arbitrary form of strain energy density function, are presented and applied to the description of nonlinear Love-type waves propagating on an interface between materials with different mechanical properties. The model is valid for a broad class of hyper-viscoelastic materials. For a cubic Yeoh model, shear wave equations contain cubic and quintic differential polynomial terms, including viscoelasticity contributions in terms of dispersion terms that include mixed derivatives $u_{xxt}$ of the material displacement. Full (2+1)-dimensional numerical simulations of waves propagating in the bulk of a two-layered solid are undertaken and analyzed with respect to the source position and mechanical properties of the layers. Interfacial nonlinear Love waves and free upper surface shear waves are tracked; it is demonstrated that in the fully nonlinear case, the variable wave speed of interface and surface waves generally satisfies the linear Love wave existence condition $c_1 < \abs{v} < c_2$, while tending to the larger material wave speed $c_1$ or $c_2$ for large times.

Christoffel Adaptive Sampling for Sparse Random Feature Expansions

2026-03-18T20:19:49Z

Random Feature Models (RFMs) have become a powerful tool for approximating multivariate functions and solving partial differential equations efficiently. Sparse Random Feature Expansions (SRFE) improve traditional RFMs by incorporating sparsity, making it particularly effective in data-scarce settings. In this work, we integrate active learning with sparse random feature approximations to improve sampling efficiency. Specifically, we incorporate the Christoffel function to guide an adaptive sampling process, dynamically selecting informative sample points based on their contribution to the function space. This approach optimizes the distribution of sample points by leveraging the Christoffel function associated with an iteratively-chosen basis obtained by the sparse recovery solver. We conduct numerical experiments comparing adaptive and nonadaptive sampling strategies with the SRFE framework and examine their accuracy for various function approximation tasks. Overall, our results demonstrate the advantages of adaptive sampling in maintaining high accuracy while reducing sample complexity for SRFE, highlighting its potential for scientific computing tasks where data is expensive to acquire.

Splitting-strategies for arbitrary-order fully mixed finite element discretizations of the Biot equations

2026-03-18T19:49:58Z

We study the fully mixed formulation of the Biot equations, which is characterized by a symmetric coupling between flow and deformation. This structure enables the use of stable mixed finite elements for each subproblem without a strong compatibility condition across the two subphysics. To exploit this flexibility while preserving the conservation structure of both subproblems, we consider fully mixed finite element methods in which the symmetry of the elastic stress tensor is enforced weakly. The resulting mixed formulation exhibits a saddle-point structure whose stability is determined by suitable inf--sup conditions. Inf--sup stability is established for several families of discrete spaces of arbitrary order, leading to optimal a priori error estimates. Iterative splitting strategies following the classical fixed-stress split with additional tuning are specifically investigated for the fully mixed formulation, with proof of convergence and rates depending on the coupling strength. Contrary to previous analyses on coupled problems with a symmetric structure, we theoretically prove the efficacy of negative stabilization, consistent with Schur-complement ideas. Numerical results based on analytical solutions and the classical Mandel problem support the theory.

On the equivalence of semi-discrete Active Flux and Discontinuous Galerkin methods and a comparison of their performance

2026-03-18T18:01:42Z

The Active Flux (AF) method employs a globally continuous approximation, like continuous Finite Element methods. This is achieved through the placement of point values at cell interfaces which are shared between adjacent cells. With, on average, K+1 degrees of freedom per cell, Active Flux achieves a polynomial approximation of degree K+1, while the Discontinuous Galerkin (DG) method uses only polynomials of degree K, i.e. one degree less with the same number of degrees of freedom. Despite all the differences, in this paper we show, however, that for linear problems in one and several dimensions as well as -- in some sense -- for nonlinear ones, semi-discrete AF and DG are the same method. We identify a mapping between their respective degrees of freedom, upon which the updates of these degrees of freedom turn out to agree. On the one hand, AF therefore seems more economical then DG for a given value of the error, and we confirm this in numerical experiments. On the other hand, this is a way to understand superconvergence of DG in a natural way, and we show how Radau polynomials and their zeros appear in the mapping between DG and AF: In the Radau points, AF "shines through" as the background high-order scheme behind DG.

Quantum linear system algorithm with optimal queries to initial state preparation

2026-03-18T18:00:06Z

Quantum algorithms for linear systems produce the solution state $A^{-1}|b\rangle$ by querying two oracles: $O_A$ that block encodes the coefficient matrix and $O_b$ that prepares the initial state. We present a quantum linear system algorithm making $\mathbfΘ\left(1/\sqrt{p}\right)$ queries to $O_b$, which is optimal in the success probability, and $\mathbf{O}\left(κ\log\left(1/p\right)\left(\log\log\left(1/p\right)+\log\left({1}/ε\right)\right)\right)$ queries to $O_A$, nearly optimal in all parameters including the condition number and accuracy. Notably, our complexity scaling of initial state preparation holds even when $p$ is not known $\textit{a priori}$. This contrasts with recent results achieving $\mathbf{O}\left(κ\log\left({1}/ε\right)\right)$ complexity to both oracles, which, while optimal in $O_A$, is highly suboptimal in $O_b$ as $κ$ can be arbitrarily larger than $1/\sqrt{p}$. In various applications such as solving differential equations, preparing ground states of operators with real spectra, and estimating and transforming eigenvalues of non-normal matrices, we can further improve the dependence on $p$ using a block preconditioning scheme to nearly match or outperform best previous results based on other methods, which also furnishes an extremely simple quantum linear system algorithm with an optimal query complexity to $O_A$. Underlying our results is a new Variable Time Amplitude Amplification algorithm with Tunable thresholds (Tunable VTAA), which fully characterizes generic nested amplitude amplifications, improves the $\ell_1$-norm input cost scaling of Ambainis to an $\ell_{\frac{2}{3}}$-quasinorm scaling, and admits a deterministic amplification schedule for the quantum linear system problem.

Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training

2026-03-18T17:37:31Z

Orthogonalized-momentum optimizers such as Muon improve transformer training by approximately whitening/orthogonalizing matrix-valued momentum updates via a short polar-decomposition iteration. However, polar-factor approximations typically require multiple large matrix multiplications, and the resulting overhead can be substantial and hardware-dependent. We introduce MUD (MomentUm Decorrelation), a complementary whitening approach that replaces Muon's polar update with a triangular (Cholesky-like) whitening surrogate inspired by classical Gram--Schmidt and Gauss-Seidel ideas. We show that row-orthonormal matrices are fixed points of the MUD map, relate the inner step to symmetric Gauss-Seidel preconditioning of the Gram matrix, and prove quadratic local convergence near the fixed point. In terms of time-to-perplexity, MUD yields consistent 10-50\% wall-clock improvements over tuned AdamW and Muon in time-to-perplexity, typically converging slightly slower per step than Muon but with substantially lower optimizer overhead -- relative to Muon, MUD improves peak tokens/s by roughly $1.3-2.6\times$ across most settings and up to nearly $3\times$ on GPT-2 large on an A100. We also demonstrate training a ESM-2 150M protein language model, where MUD matches Muon-level validation perplexity in significantly less wall-clock time.

State-dependent temperature control in Langevin diffusions using numerical exploratory Hamiltonian-Jacobi-Bellman equations

2026-03-18T17:09:04Z

Choosing how much noise to add in Langevin dynamics is essential for making these algorithms effective in challenging optimization problems. One promising approach is to determine this noise by solving Hamilton-Jacobi-Bellman (HJB) equations and their exploratory variants. Though these ideas have been demonstrated to work well in one dimension, extension to high-dimensional minimization has been limited by two unresolved numerical challenges: setting reliable control bounds and stably computing the second-order information (Hessians) required by the equations. These issues and the broader impact of HJB parameters have not been systematically examined. This work provides the first such investigation. We introduce principled control bounds and develop a physics-informed neural network framework that embeds the structure of exploratory HJB equations directly into training, stabilizing computation, and enabling accurate estimation of state-dependent noise in high-dimensional problems. Numerical experiments demonstrate that the resulting method remains robust and effective well beyond low-dimensional test cases.

Mathematical and numerical modeling of coupled oxygen dynamics and neuronal electrophysiology

2026-03-18T17:00:48Z

Modeling and simulating how oxygen supply shapes neuronal excitability is crucial for advancing the understanding of brain function in pathological scenarios, such as ischemia. This condition is caused by a reduced blood supply, leading to the deprivation of oxygen and other metabolites; this energy deficit impairs ionic pumps and causes cellular swelling. In this work, this phenomenon is modeled through a volumetric variation law that links cell swelling to local oxygen concentration and the percentage of blood flow reduction. The swelling law links volume changes to local oxygen and the degree of blood-flow depression, providing a simple mechanistic pathway from hypoxia to tortuosity-driven transport impairment. The interplay between oxygen supply and excitability in brain tissue is described by coupling the monodomain model with specific neuronal ionic and metabolic models that characterize ion and metabolite concentration dynamics. The numerical approximation of this coupled multiscale problem is particularly challenging, owing to the presence of sharp and fast-propagating wavefronts and complex geometrical domains. To address these challenges, suitable space- and time-adaptive schemes are employed to capture the action potential dynamics accurately. This multiscale model is discretized in space with the high-order p-adaptive discontinuous Galerkin method on polygonal and polyhedral grids (PolyDG) and integrated in time with a Crank-Nicolson scheme. We numerically investigate different pathological scenarios on a two-dimensional idealized square domain and on a realistic geometry, both discretized with a polygonal grid, analyzing how subclinical and severe ischemia can affect brain electrophysiology and metabolic concentrations.

Decoupled Divergence-Free Neural Networks Basis Method for Incompressible Fluid Problems

2026-03-18T16:43:20Z

We propose a decoupled divergence-free neural networks basis (Decoupled-DFNN) method for solving incompressible flow problems, including the Stokes and Navier-Stokes equations. To ensure the divergence free property exactly, the velocity field is represented as the curl of a stream function in two dimensions and as the curl of a vector potential in three dimensions. Beyond classical stream-function or velocity-vorticity formulations, we further utilize the properties of the curl operator to derive two specific decoupled subproblems for the velocity (through the stream function or vector potential) and the pressure, respectively. The proposed formulations enable a sequential solution strategy, in which the velocity and pressure are solved independently. To resolve the inherent nonlinearity of the Navier-Stokes equations, we employ a Gauss-Newton linearization strategy, transforming the nonlinear velocity subproblem into a sequence of linear subproblems. These decoupled subproblems for velocity and pressure are subsequently solved using the TransNet framework. Compared with existing methods, the proposed approach reduces computational cost while strictly preserving the incompressibility constraint.

Global Asymptotic Rates Under Randomization: Gauss-Seidel and Kaczmarz

2026-03-18T16:32:33Z

Current performance bounds for randomized iterative methods are often considered tight under per-iteration analyses, yet they are notoriously loose in practice. We derive asymptotic performance bounds that narrow this theory-practice gap, leveraging a new technique for bounding the spectral radii of operators arising in randomized iterations and a connection we establish to Perron-Frobenius theory for noncommutative algebras. The asymptotic analysis also uncovers and quantifies the previously unexplained role of relaxation in improving performance, thereby resolving an open problem posed by Strohmer and Vershynin in 2007.

A Flux-Correction Form of the Third-Order Edge-Based Scheme for a General Numerical Flux Function

2026-03-18T12:38:54Z

In this short note, we present a flux-correction form of the third-order edge-based scheme for the Euler equations that enables the direct use of a general flux function. The core idea is to replace, without loss of accuracy, the arithmetic average of the flux extrapolations by a general numerical flux evaluated at the edge midpoint, together with a correction term. We show that the proposed flux-correction form preserves third-order accuracy, provided that the general numerical flux is evaluated with the left and right states that are computed exactly for a quadratic function, which can be achieved effectively by the U-MUSCL scheme with κ = 1/2. Numerical results are presented to verify third-order accuracy with the HLLC and LDFSS flux functions on irregular tetrahedral grids.

Automated Grammar-based Algebraic Multigrid Design With Evolutionary Algorithms

2026-03-18T12:02:26Z

Although multigrid is asymptotically optimal for solving many important partial differential equations, its efficiency relies heavily on the careful selection of the individual algorithmic components. In contrast to recent approaches that can optimize certain multigrid components using deep learning techniques, we adopt a complementary strategy, employing evolutionary algorithms to construct efficient multigrid cycles from proven algorithmic building blocks. Here, we will present its application to generate efficient algebraic multigrid methods with so-called \emph{flexible cycling}, that is, level-specific smoothing sequences and non-recursive cycling patterns. The search space with such non-standard cycles is intractable to navigate manually, and is generated using genetic programming (GP) guided by context-free grammars. Numerical experiments with the linear algebra library, \emph{hypre}, demonstrate the potential of these non-standard GP cycles to improve multigrid performance both as a solver and a preconditioner.

On the validity limits of the parametrisation method for invariant manifolds: an assessment of practical criteria for vibrating systems

2026-03-18T11:24:48Z

The parametrisation method for invariant manifolds is a powerful technique for deriving reduced-order models in the context of nonlinear vibrating systems, allowing accurate computations of nonlinear normal modes. Thanks to arbitrary order asymptotic expansions, converged results are within reach and directly applicable to finite element structures. However, since it relies on a local theory and asymptotic expansions, the results are only valid up to a given amplitude, which defines the convergence radius of the approximation. The aim of this contribution is to investigate the validity limits of the approach and review the existing error estimates, with the concrete objective of proposing a practical approach to estimate the validity range during the computation, thus producing safe bounds within which the reduced-order model can be used. Three different criteria are assessed. The first one uses the error in the invariance equation as the distance to the fixed point increases. The second one is adapted from an upper bound criterion derived for normal form transforms and based on the potential singularities of the homological operator. The third one uses Cauchy and d'Alembert convergence rules for series. The criteria are tested on a number of different examples that are representative of the situations encountered when dealing with nonlinear vibrations. The Duffing equation serves as a first benchmark that allows considering conservative oscillations, forced systems at primary resonance, and superharmonic resonance. The investigations are then extended to a vibrating system with two degrees of freedom. Finally, the different criteria are assessed on a finite element beam structure, and guidelines are formulated to generalise their practical use and produce accurate and easy-to-use error bounds in the context of model order reduction for nonlinear vibrating structures.

Novel technique based on Léja Points Approximation for Log-determinant Estimation of Large matrices

2026-03-18T10:30:03Z

The computation of the Log-determinant of large, sparse, symmetric positive definite (SPD) matrices is essential in many scientific computational fields such as numerical linear algebra and machine learning. In low dimensions, Cholesky is preferred, but in high dimensions, its computation may be prohibitive due to memory limitation. To circumvent this, Krylov subspace techniques have proven to be efficient but may be computationally expensive due to the required orthogonalization processes. In this paper, we introduce a novel technique to estimate the Log-determinant of a matrix using Léja points, where the implementation is only based on matrix multiplications and a rough estimation of eigenvalue bounds of the matrix. By coupling Léja points interpolation with a randomized algorithm called Hutch++, we achieve substantial reductions in computational complexity while preserving significant accuracy compared to the stochastic Lanczos quadrature. We establish the approximation errors of the matrix function together with multiplicative error bounds for the approximations obtained by this method. The effectiveness and scalability of the proposed method on both large sparse synthetic matrices (maximum likelihood in Gaussian Markov Random fields) and large-scale real-world matrices are confirmed through numerical experiments.

Optimal preconditioning techniques for finite volume approximation of three-dimensional conservative space-fractional diffusion equations

2026-03-18T09:55:53Z

A Crank-Nicolson finite volume approximation for three-dimensional conservative space-fractional diffusion equation results in large and dense three-level Toeplitz discrete linear systems. Preconditioned Krylov subspace methods with sine transform-based preconditioners are developed to solve these systems, including the preconditioned conjugate gradient (PCG) method for the symmetric case and the preconditioned generalized minimal residual (PGMRES) method for the non-symmetric case. Moreover, we provide detailed analysis of the convergence of these Krylov subspace methods. Specifically, for the symmetric case, we prove the spectra of the preconditioned matrices are uniformly bounded in the open interval (1/2, 3/2), which results in a linear convergence rate of the PCG method. For the non-symmetric case, we demonstrate that the PGMRES method also achieves a linear convergence rate independent of discretization stepsizes from the residual point of view. These results imply that the iteration counts of the PCG and PGMRES methods are uniformly bounded and independent of the matrix sizes. Numerical experiments in both symmetric and non-symmetric cases in two- and three-dimensions are conducted to confirm the optimal performance of the proposed preconditioned Krylov subspace methods.