http://arxiv.org/abs/2511.02153v4 A Joint Variational Framework for Multimodal X-ray Ptychography and Fluorescence Reconstruction 2026-06-12T02:14:16Z

Recovering high-resolution structural and compositional information from coherent X-ray measurements involves solving coupled, nonlinear, and ill-posed inverse problems. Ptychography reconstructs a complex transmission function from overlapping diffraction patterns, while X-ray fluorescence provides quantitative, element-specific contrast at lower spatial resolution. We formulate a joint variational framework that integrates these two modalities into a single nonlinear least-squares problem with shared spatial variables. This formulation enforces cross-modal consistency between structural and compositional estimates, improving conditioning and promoting stable convergence. The resulting optimization couples complementary contrast mechanisms (i.e., phase and absorption from ptychography, elemental composition from fluorescence) within a unified inverse model. Numerical experiments on simulated data demonstrate that the joint reconstruction achieves faster convergence, sharper and more quantitative reconstructions, and lower relative error compared with separate inversions. The proposed approach illustrates how multimodal variational formulations can enhance stability, resolution, and interpretability in computational X-ray imaging.

2025-11-04T00:44:46Z Keywords: inverse problems, x-ray imaging science, ill-posedness, joint reconstruction. This work is sponsored by NSF DMS-2338904 Chengru Eric Zou Elle Buser Zichao Wendy Di Yuanzhe Xi http://arxiv.org/abs/2606.13998v1 Analytic First Derivatives of SIDER Interpolation 2026-06-12T00:36:30Z

Spherical interpolation is required in numerical and geometric applications in which the unknowns are constrained to remain on the unit sphere. Spherical Interpolation of orDER $n$ (SIDER-$n$) was introduced as the high-order reconstruction component of spherical essentially non-oscillatory interpolation, where the reconstruction is built entirely from spherical linear interpolation (SLERP) operations and therefore preserves the spherical constraint exactly. This paper develops analytic first-derivative formulas for SIDER curves of arbitrary order. The central observation is that the recursive definition of SIDER can be differentiated by direct chain-rule propagation through its binary tree of SLERP operations. After deriving the total derivative of SLERP with moving endpoints, we obtain compact recursions for the derivative of SIDER-$n$, including simplified formulas at interpolation nodes and practical formulas at middle points between consecutive sampling locations. The latter are relevant when a reconstruction is evaluated halfway between data samples, as occurs in several high-order reconstruction-based numerical algorithms. The base case SIDER2 is treated explicitly, and SIDER3 and SIDER4 are used to illustrate the recursive mechanism. We also prove that the derivative is tangent to the sphere at every reconstructed point, including both sampling points and middle points. The resulting formulas extend the original SIDER/SENO framework by supplying differential information for sphere-valued reconstructions, with potential use in high-order finite-volume, ENO/WENO, SENO-type, and related methods for conservation laws and evolution problems.
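As a minimal sketch of the building block named in this abstract, the code below implements SLERP and its derivative with respect to the parameter for fixed endpoints, then checks numerically that the derivative is tangent to the sphere. This covers only the simplest case: the moving-endpoint total derivative and the SIDER recursion from the paper are not reproduced here, and the function names (`slerp`, `slerp_dt`) are illustrative choices, not taken from the paper.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def _angle(p, q):
    # Clamp to guard against round-off pushing the dot product outside [-1, 1].
    return math.acos(max(-1.0, min(1.0, dot(p, q))))

def slerp(p, q, t):
    """SLERP between unit vectors p and q (assumed not antipodal)."""
    theta = _angle(p, q)
    if theta < 1e-12:              # endpoints (nearly) coincide
        return list(p)
    s = math.sin(theta)
    a = math.sin((1.0 - t) * theta) / s
    b = math.sin(t * theta) / s
    return [a * pk + b * qk for pk, qk in zip(p, q)]

def slerp_dt(p, q, t):
    """d/dt of slerp(p, q, t) with both endpoints held fixed."""
    theta = _angle(p, q)
    if theta < 1e-12:
        return [0.0] * len(p)
    s = math.sin(theta)
    da = -theta * math.cos((1.0 - t) * theta) / s
    db = theta * math.cos(t * theta) / s
    return [da * pk + db * qk for pk, qk in zip(p, q)]

p = [1.0, 0.0, 0.0]
q = [0.0, 0.6, 0.8]
x = slerp(p, q, 0.5)
v = slerp_dt(p, q, 0.5)
print("|x|^2 =", dot(x, x), " x . dx/dt =", dot(x, v))
```

Because the interpolant has unit norm for every t, differentiating |x(t)|^2 = 1 gives x . x' = 0, which is the tangency property the printed values illustrate.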

2026-06-12T00:36:30Z Shingyu Leung http://arxiv.org/abs/2606.13946v1 Chosen-Plaintext Attacks of Double Random Phase Encryption with Nonlinear Optical Media 2026-06-11T22:11:31Z

This paper studies an inverse problem in nonlinear optical encryption. We examine chosen-plaintext attacks (CPA) on a nonlinear optical encryption strategy that integrates double random phase encryption (DRPE) into a nonlinear optical propagation model to enhance the security of the combined system. We first demonstrate that the system's phase information can be decoded from carefully designed differential CPA data. We then demonstrate that the strength of the optical device's nonlinearity can also be recovered from CPA data, indicating that including this parameter as an additional security key does not enhance protection against CPA, although numerical simulations show that strong nonlinearity still poses significant challenges for such attacks. Finally, we provide a stability analysis to demonstrate that small errors in decoded security keys result in only small errors in the decrypted text, even though the encryption process is nonlinear.

2026-06-11T22:11:31Z 30 pages, 11 figures Frontiers in Applied Mathematics 1 (2026), 22-47 Yan Cheng Yiwei Chen Kui Ren Nathan Soedjak 10.3934/fam.2026002 http://arxiv.org/abs/2506.03311v2 Demystifying Tubal Tensor Algebra 2026-06-11T20:31:44Z

Developed in a series of seminal papers in the early 2010s, the tubal tensor framework provides a clean and effective algebraic setting for tensor computations, supporting matrix-mimetic features such as a tensor Singular Value Decomposition and Eckart-Young-like optimality results. It has proven to be a powerful tool for analyzing inherently multilinear data arising in hyperspectral imaging, medical imaging, neural dynamics, scientific simulations, and more. At the heart of tubal tensor algebra lies a special tensor-tensor product: originally the t-product, later generalized into a full family of products via the $\star_M$-product. Though initially defined through the multiplication of a block-circulant unfolding of one tensor by a matricization of another, it was soon observed that the t-product can be interpreted as standard matrix multiplication where the scalars are tubes, i.e., real vectors twisted ``inward.'' Yet a fundamental question remains: why is this the ``right'' way to define a tensor-tensor product in the tubal setting? In this paper, we show that the t-product and its $\star_M$ generalization arise naturally when viewing third-order tensors as matrices of tubes, together with a small set of desired algebraic properties. Furthermore, we prove that the $\star_M$-product is, in fact, the only way to define a tubal product satisfying these properties. Thus, while partly expository in nature (aimed at presenting the foundations of tubal tensor algebra in a cohesive and accessible way), this paper also addresses theoretical gaps in the tubal tensor framework, proves new results, and justifies the framework's central constructions, thereby shedding new light on it.
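The "matrix of tubes" reading of the t-product can be sketched in a few lines. The code below treats a third-order tensor as a nested list whose entries are tubes (length-n lists), multiplies tubes by circular convolution, and otherwise follows the ordinary matrix-multiplication loop. This is only the classical t-product, not the $\star_M$ generalization, and the helper names (`tube_mul`, `tube_add`, `t_product`) are illustrative.

```python
def tube_mul(a, b):
    """Product of two tubes: circular convolution of length-n vectors."""
    n = len(a)
    return [sum(a[s] * b[(k - s) % n] for s in range(n)) for k in range(n)]

def tube_add(a, b):
    return [x + y for x, y in zip(a, b)]

def t_product(A, B):
    """t-product of A (m x p tubes) and B (p x q tubes), all tubes of length n.

    Same triple loop as matrix multiplication, with scalar multiplication
    replaced by tube_mul and scalar addition by tube_add.
    """
    m, p, q = len(A), len(B), len(B[0])
    n = len(A[0][0])
    C = [[[0.0] * n for _ in range(q)] for _ in range(m)]
    for i in range(m):
        for j in range(q):
            for k in range(p):
                C[i][j] = tube_add(C[i][j], tube_mul(A[i][k], B[k][j]))
    return C

# The identity tube e = (1, 0, ..., 0) plays the role of the scalar 1.
e, z = [1, 0, 0], [0, 0, 0]
I = [[e, z], [z, e]]                 # 2 x 2 identity "matrix of tubes"
B = [[[1, 2, 3]], [[4, 5, 6]]]       # 2 x 1 matrix of tubes
print(t_product(I, B))
```

Multiplying by the tube (0, 1, 0) cyclically shifts the other tube by one position, which is the same action that the block-circulant unfolding encodes.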

2025-06-03T18:51:29Z Haim Avron Uria Mor http://arxiv.org/abs/2606.13807v1 A WKB-related time-stepping scheme for differential equations describing oscillatory systems 2026-06-11T18:21:00Z

In this study, we present a novel time-stepping scheme for multiscale differential equations describing oscillatory systems with well-separated scales, where the scale separation is controlled by a small parameter $ε$. The time-stepping method is related to a multi-modal WKB approximation and relies on a transformation of variables derived in this work. The analysis reveals that, in the transformed formulation, the leading-order oscillations are either eliminated or appear only at higher asymptotic order. The method is applied to ordinary differential equations, including the well-known van der Pol oscillator. We investigate the accuracy of the proposed method numerically for different parameter regimes, in particular for decreasing values of $ε$, and study how the parameters of the numerical scheme must be adapted as $ε$ is reduced. In the presented numerical tests, the computational cost remains bounded as $ε$ is decreased.

2026-06-11T18:21:00Z Juliane Rosemeier Rupert Klein http://arxiv.org/abs/2412.08059v7 Parameter optimization for restarted mixed precision iterative sparse solver 2026-06-11T17:50:16Z

The problem of optimal precision switching for the conjugate gradient (CG) method applied to sparse linear systems is considered. A sparse matrix is defined as an $n\!\times\!n$ matrix with $m\!=\!O(n)$ nonzero entries. The algorithm first computes an approximate solution in single precision with tolerance $\varepsilon_1$, then switches to double precision to refine the solution to the required stopping tolerance $\varepsilon_2$. Based on estimates of system matrix parameters -- computed in time which does not exceed $1\%$ of the time needed to solve the system in double precision -- we determine the optimal value of $\varepsilon_1$ that minimizes total computation time. This value is obtained by classifying the matrix using the $k$-nearest neighbors method on a small precomputed sample. Classification relies on a feature vector comprising: the matrix size $n$, the number of nonzeros $m$, the pseudo-diameter of the matrix sparsity graph, and the average rate of residual norm decay during the early CG iterations in single precision. We show that, in addition to the matrix condition number, the diameter of the sparsity graph influences the growth of rounding errors during iterative computations. The proposed algorithm reduces the computational complexity of the CG -- expressed in equivalent double-precision iterations -- by more than $17\%$ on average across the considered matrix types in a sequential setting. The resulting speedup is at most $1.5\%$ worse than that achieved with the optimal (oracle) choice of $\varepsilon_1$. While the impact of matrix structure on Krylov subspace method convergence is well understood, the use of the sparsity graph diameter as a predictive feature for rounding error growth in mixed-precision CG appears to be novel. To the best of our knowledge, no prior work employs graph diameter to guide precision switching in iterative linear solvers.
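The two-stage strategy described in this abstract (a single-precision CG solve to tolerance $\varepsilon_1$, then a restart in double precision down to $\varepsilon_2$) can be sketched as follows. Everything specific here is an assumption made for illustration: the test matrix is the 1D Laplacian tridiag(-1, 2, -1), single precision is crudely emulated by rounding each stored value to float32 with `struct`, and $\varepsilon_1 = 10^{-3}$ is picked by hand rather than by the $k$-nearest-neighbors classifier the paper proposes.

```python
import math
import struct

def to_f32(x):
    """Round a Python float (double) to the nearest float32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]

def identity(x):
    return x

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def laplace_matvec(x, rnd):
    """y = A x for A = tridiag(-1, 2, -1); each entry is rounded with rnd."""
    n = len(x)
    y = []
    for i in range(n):
        v = 2.0 * x[i]
        if i > 0:
            v -= x[i - 1]
        if i < n - 1:
            v -= x[i + 1]
        y.append(rnd(v))
    return y

def cg(b, x0, tol, rnd, max_iter=1000):
    """Conjugate gradients with emulated working precision given by rnd."""
    x = [rnd(xk) for xk in x0]
    r = [rnd(bk - ak) for bk, ak in zip(b, laplace_matvec(x, rnd))]
    p = list(r)
    rr = rnd(dot(r, r))
    b_norm = math.sqrt(dot(b, b))
    its = 0
    while math.sqrt(rr) > tol * b_norm and its < max_iter:
        Ap = laplace_matvec(p, rnd)
        alpha = rnd(rr / dot(p, Ap))
        x = [rnd(xk + alpha * pk) for xk, pk in zip(x, p)]
        r = [rnd(rk - alpha * ak) for rk, ak in zip(r, Ap)]
        rr_new = rnd(dot(r, r))
        beta = rnd(rr_new / rr)
        p = [rnd(rk + beta * pk) for rk, pk in zip(r, p)]
        rr = rr_new
        its += 1
    return x, its

n = 30
b = [1.0] * n
eps1, eps2 = 1e-3, 1e-10                                      # hand-picked
x_single, its_single = cg(b, [0.0] * n, eps1, to_f32)         # stage 1
x, its_double = cg(b, x_single, eps2, identity)               # stage 2
print("single-precision iterations:", its_single)
print("double-precision iterations:", its_double)
```

The restart in stage 2 recomputes the residual from scratch in double precision, so any drift accumulated in the low-precision stage is discarded rather than carried forward.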

2024-12-11T03:02:58Z 51 pages, 5 figures Alexander V. Prolubnikov http://arxiv.org/abs/2512.07004v4 Accurate Models of NVIDIA Tensor Cores 2026-06-11T17:47:01Z

Matrix multiplication is a fundamental operation in both training of neural networks and inference. To accelerate matrix multiplication, Graphics Processing Units (GPUs) implement it in hardware. Because of their higher throughput compared with software-based matrix multiplication, these hardware multipliers are increasingly used outside of AI to accelerate various applications in scientific computing. However, matrix multipliers targeted at AI are at present not compliant with IEEE 754 floating-point arithmetic behaviour, with different vendors offering different numerical features. This leads to non-reproducible results across different generations of GPU architectures at the matrix multiply-accumulate instruction level. To study numerical characteristics of matrix multipliers (such as rounding behaviour, accumulator width, normalization points, extra carry bits, and others), test vectors are typically constructed. Yet these vectors may or may not distinguish between different hardware models, and due to limited hardware availability, their reliability across many different platforms remains largely untested. We present software models for emulating the inner product behavior of low- and mixed-precision matrix multipliers in the V100, A100, H100 and B200 data center GPUs in most supported input formats of interest to mixed-precision algorithm developers: 8-, 16-, and 19-bit floating point. These matrix multiplier models are first approximated by determining the numerical features via test vectors designed to trigger outputs sensitive to bit-level differences in the implementation, followed by semi-exhaustive comparison (randomised input vectors of $10^7$ values) between the models and the actual GPU matrix multipliers; this process is repeated until the model is bit accurate.

2025-12-07T21:13:18Z Faizan A. Khattak Mantas Mikaitis http://arxiv.org/abs/2606.13549v1 A general-purpose global regularization method for 3D volume integral operators 2026-06-11T16:29:43Z

Singular volume integral operators associated with constant-coefficient partial differential operators extend the applicability of potential theory to inhomogeneous problems, for example those arising from nonlinearities or variable coefficients. Typically the PDE kernels in these operators give rise to singularities at all $\mathcal{O}(1/h^3)$ volume discretization/evaluation points in a mesh of characteristic size $h$, while the slowly decaying nature of such kernels gives rise to long-range interactions that require coupling to fast summation algorithms. The presented method uses Green's identities to regularize a wide variety of both scalar-valued and vector-valued volume integral operators by use of a certain regularizing volume density interpolant. The analysis shows how the regularizing effect of the interpolant is global in the sense that the interpolation quality increases in an exactly compensatory fashion as the distance to the Green's function singularity decreases. High-order convergence estimates with tabulated simplex quadratures are established, including with exact representation of curved domains.

2026-06-11T16:29:43Z 28 pages, 4 figures Thomas G. Anderson Marc Bonnet Luiz M. Faria Carlos Pérez-Arancibia http://arxiv.org/abs/2603.08415v2 Discontinuous Galerkin approximation of a nonlinear multiphysics problem arising in ultrasound-enhanced drug delivery 2026-06-11T15:46:04Z

Motivated by simulations of ultrasound-enhanced drug delivery, this work presents the numerical analysis of a mathematical model that captures the influence of ultrasound waves on the diffusivity of the drug. The system under study consists of the Westervelt wave equation, accounting for the nonlinear propagation of ultrasound, coupled to a convection-diffusion equation modeling the drug concentration. In particular, drug delivery is affected by ultrasound through a pressure-dependent diffusion coefficient. The Westervelt equation is supplemented by linear absorbing boundary conditions as a means of reducing spurious reflections off the boundaries of computational domains. For spatial discretization of this multiphysics system, we employ a discontinuous Galerkin approach on simplicial meshes. Under suitable assumptions on the exact pressure and the mesh size, we first establish well-posedness, non-degeneracy, and optimal convergence rates in the energy norm for the semi-discrete pressure subproblem. The smallness of the semi-discrete pressure is then used to establish the well-posedness and convergence of the wave--convection-diffusion system under suitable regularity of the exact concentration. Finally, theoretical findings are illustrated through numerical experiments.

2026-03-09T14:13:07Z Femke de Wit Vanja Nikolić http://arxiv.org/abs/2606.13482v1 A Stabilized Multilevel B-Spline-Based Fast Integral Method for the Solution of the Electric Field Integral Equation 2026-06-11T15:32:30Z

We present a multilevel B-spline-based fast integral method for the solution of the electric field integral equation (EFIE), combining fast Fourier transform (FFT)-compatible kernel interpolation with robust high-order interpolation. Existing FFT-accelerated global Lagrange-based approaches rely on equidistant interpolation points and can, therefore, suffer from Runge-type instabilities at high interpolation orders, limiting robust high-accuracy compression. In contrast, B-splines on equidistant knot vectors overcome these instabilities and enable robust high-order interpolation for accurate matrix compression. Replacing Lagrange interpolation by B-spline interpolation is, however, non-trivial: B-spline coefficients do not coincide with function values at the interpolation points, and the associated sampling matrices can become ill-conditioned. To address these challenges, we introduce a knot-removal stabilization strategy, combined with exact interlevel transfers based on knot insertion, yielding accurate, well-conditioned multilevel interpolation. Moreover, we propose a factorization strategy that preserves the null space of the scalar potential operator up to machine precision and is compatible with low-frequency preconditioning techniques. Numerical results for both canonical and realistic geometries demonstrate robust high-order interpolation without the breakdown observed for Lagrange-based approaches and confirm $\mathcal{O}(N)$ complexity.
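The Runge-type instability of global Lagrange interpolation on equidistant points, which motivates the B-spline approach in this abstract, is easy to reproduce. The sketch below interpolates the classical Runge function $1/(1+25x^2)$ on $[-1,1]$ and reports the maximum error for two polynomial degrees. The test function and the degrees are illustrative assumptions, not taken from the paper, and the B-spline remedy itself is not implemented here.

```python
def runge(x):
    return 1.0 / (1.0 + 25.0 * x * x)

def equispaced(n_points):
    """n_points equidistant nodes on [-1, 1], endpoints included."""
    return [-1.0 + 2.0 * i / (n_points - 1) for i in range(n_points)]

def lagrange_eval(xs, ys, x):
    """Evaluate the Lagrange interpolant through (xs, ys) at x."""
    total = 0.0
    for i, xi in enumerate(xs):
        w = 1.0
        for j, xj in enumerate(xs):
            if j != i:
                w *= (x - xj) / (xi - xj)
        total += ys[i] * w
    return total

def max_interp_error(degree, n_test=401):
    """Maximum error of the degree-`degree` equispaced interpolant on a test grid."""
    xs = equispaced(degree + 1)
    ys = [runge(xk) for xk in xs]
    return max(abs(lagrange_eval(xs, ys, x) - runge(x)) for x in equispaced(n_test))

err10 = max_interp_error(10)
err20 = max_interp_error(20)
print("max error, degree 10:", err10)
print("max error, degree 20:", err20)
```

Raising the degree makes the error near the interval ends grow rather than shrink, which is the breakdown that limits high-order compression with equidistant Lagrange points.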

2026-06-11T15:32:30Z Danijel Jukić Bernd Hofmann Thomas F. Eibert Simon B. Adrian http://arxiv.org/abs/2510.02111v2 Coarse scrambling for Sobol' and Niederreiter sequences 2026-06-11T15:19:25Z

We introduce coarse scrambling, a novel randomization for digital sequences that permutes blocks of digits in a mixed-radix representation. This construction is designed to preserve the powerful $(0,\mathbb{e},d)$-sequence property of the underlying points. For sufficiently smooth integrands, we prove that this method achieves the canonical $O(n^{-3+ε})$ variance decay rate, matching that of standard Owen's scrambling. Crucially, we show that its maximal gain coefficient grows only logarithmically with dimension, $O(\log d)$, thus providing theoretical robustness against the curse of dimensionality affecting scrambled Sobol' sequences. Numerical experiments validate these findings and illustrate a practical trade-off: while Owen's scrambling is superior for integrands sensitive to low-dimensional projections, coarse scrambling is competitive for functions with low effective truncation dimension.

2025-10-02T15:20:49Z Kosuke Suzuki http://arxiv.org/abs/2606.13457v1 Reduced basis algorithm for solving nonlinear differential equations on quantum computers 2026-06-11T15:13:38Z

As quantum computing moves toward scientific computing applications, nonlinear differential equations remain a central challenge since quantum evolution is intrinsically linear. In this work, we introduce a reduced basis algorithm (RBA) for polynomial nonlinear ordinary differential equations (ODEs) and spatially discretized partial differential equations (PDEs). After time discretization, the method composes the resulting polynomial update map over $m$ timesteps, identifies the reduced monomial basis appearing in this composed map, and constructs a linear RBA operator whose action recovers the exact $m$-timestep nonlinear dynamics. Thus, at the level of the chosen discrete update rule, the method introduces no additional approximation error beyond the time discretization error. The qubit number requirement is governed by the size of the reduced monomial basis. For an $n$-dimensional polynomial ODE system of degree $p>1$, the lifted register requires at most $q_m^{\mathrm{ODE}} = O(nm\log p)$ qubits in the full basis scenario. For PDEs discretized on $N^D$ grid points, a locality-based construction requires at most $q_m^{\mathrm{PDE}} = O(D\log N + n m^{D+1}\log p)$ qubits. Hence, the dependence on the grid size remains logarithmic, while the nonlinear overhead is controlled by local reduced basis size. The main computational burden is moved from the quantum computer to a classical preprocessing step, where the reduced monomial basis and RBA operator are constructed for the chosen timestep window. Through numerical tests on the Lorenz system and the one-dimensional Burgers equation, we verify that the RBA reproduces the corresponding discrete time nonlinear dynamics exactly, while exposing the trade-off between timestep composition, reduced basis growth, and locality.
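A toy version of the composition idea in this abstract: for a scalar quadratic update map, composing the map over $m$ timesteps gives a polynomial in the initial value, and the row of its coefficients acts linearly on the vector of monomials to reproduce the $m$-step nonlinear dynamics exactly. The example below is an assumption-laden sketch, not the paper's RBA construction: it uses forward Euler for the logistic ODE ($n=1$, $p=2$), exact rational arithmetic via `fractions.Fraction`, and a simple count of monomials to estimate the register size.

```python
from fractions import Fraction
import math

def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (lowest degree first)."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def poly_add(a, b):
    n = max(len(a), len(b))
    a = a + [Fraction(0)] * (n - len(a))
    b = b + [Fraction(0)] * (n - len(b))
    return [x + y for x, y in zip(a, b)]

def poly_compose(outer, inner):
    """Coefficients of outer(inner(x)), built with Horner's rule."""
    result = [outer[-1]]
    for c in reversed(outer[:-1]):
        result = poly_add(poly_mul(result, inner), [c])
    return result

h = Fraction(1, 10)                    # timestep (illustrative value)
step = [Fraction(0), 1 + h, -h]        # f(x) = x + h*x*(1 - x), degree p = 2

def composed_map(m):
    """Coefficients of f composed with itself m times."""
    coeffs = [Fraction(0), Fraction(1)]    # identity map
    for _ in range(m):
        coeffs = poly_compose(step, coeffs)
    return coeffs

def iterate(x0, m):
    """Reference: apply the nonlinear update m times."""
    x = x0
    for _ in range(m):
        x = x + h * x * (1 - x)
    return x

m = 3
row = composed_map(m)                          # one row of a linear operator
x0 = Fraction(1, 5)
lifted = [x0 ** k for k in range(len(row))]    # monomial vector (1, x0, x0^2, ...)
x_m = sum(c * z for c, z in zip(row, lifted))  # linear action on the lifted state
qubits = math.ceil(math.log2(len(row)))        # register size for this basis
print(len(row), "monomials,", qubits, "qubits,", "x_m =", x_m)
```

For this scalar example the composed map has degree $p^m = 8$, so the monomial basis has 9 elements and the register size grows linearly in $m$, in line with the $O(nm\log p)$ scaling quoted for the full-basis ODE case.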

2026-06-11T15:13:38Z Monica Lăcătuş Matthias Möller Sauro Succi http://arxiv.org/abs/2505.16345v2 Convergence analysis of GMRES applied to Helmholtz problems near resonances 2026-06-11T15:06:04Z

The finite element solution of Helmholtz problems near resonant or quasi-resonant frequencies poses significant challenges, as iterative solvers typically suffer from severely degraded convergence. We analyze the convergence behavior of GMRES applied to linear systems arising from such configurations. Theoretical convergence estimates are derived based on harmonic Ritz values, highlighting their proximity to small eigenvalues as a key determining factor. We further examine deflation strategies and their interplay with preconditioning techniques, using the Complex Shifted Laplacian preconditioner as a case study. Numerical experiments on resonant and quasi-resonant test cases validate the theoretical framework and demonstrate the effectiveness of deflation strategies. This study provides new insights and practical guidance for analyzing and improving iterative solvers for time-harmonic problems near resonances.

2025-05-22T07:59:18Z Victorita Dolean Pierre Marchand Axel Modave Timothée Raynaud http://arxiv.org/abs/2606.13434v1 Momentum Space Algorithm for Electronic Structure of Double-Incommensurate Trilayer Graphene 2026-06-11T14:59:34Z

Numerical algorithms for computing the electronic structure of incommensurate 2D materials using ab initio models are critical for predicting material properties and guiding experiment. For bilayers, momentum space and continuum models have been introduced to approximate observables of ab initio tight-binding models using a momentum description, despite the lack of periodicity in the tight-binding model required for Bloch theory. A similar structure has been introduced for double-incommensurate trilayers using a continuum model, where the three lattices are all mutually incommensurate. However, this description leads to a four-dimensional lattice space, and the density of states was observed to converge poorly. In this work, we introduce a momentum space framework for double-incommensurate trilayer graphene, together with an efficient truncation scheme of the four-dimensional lattice that drastically improves convergence of the density of states and momentum local density of states (a parallel object to classical band structure). We implement this algorithm on an ab initio model of twisted trilayer graphene and validate convergence estimates. We further verify numerically that the momentum space algorithm, inherently higher order than the continuum model as it is an exact transformation of the tight-binding model, captures altered band behavior near the flat bands at magic angles.

2026-06-11T14:59:34Z 54 pages, 8 figures Ken Beard Daniel Massatt http://arxiv.org/abs/2606.13429v1 A Scalable Deflated Conjugate Gradient Solver for the Time-Dependent Pseudo-Stress Stokes Problem 2026-06-11T14:57:25Z

We propose a novel iterative solution framework for the unsteady Stokes equations in the pseudo-stress formulation. When this class of problems is solved with implicit time-integration schemes, standard solvers suffer from deteriorating convergence properties for small time steps, independently of the chosen space discretisation method. This is due to the singular modes of the dev-dev operator. For this reason, we introduce a computational framework obtained by combining a deflated Conjugate Gradient method with a W-cycle multigrid scheme that employs a Restricted Additive Schwarz smoother. The key point is to choose the deflation subspace so that the inner system to be solved within a deflated Conjugate Gradient scheme corresponds to a Laplace problem defined on the singular modes of the original dev-dev operator. This construction is independent of the spatial discretisation method and allows one to use efficient multigrid iterative solvers. Numerical experiments show that the proposed strategy significantly accelerates the Conjugate Gradient convergence and provides stable performance with respect to the time step, confirming its robustness for solving linear systems in the pseudo-stress framework.

2026-06-11T14:57:25Z Alessandra Cancrini Gabriele Ciaramella Paola F. Antonietti