https://arxiv.org/api/DRMNie2O2aWTJz4s54WPPGnFlfk 2026-06-22T19:06:49Z 49870 480 15 http://arxiv.org/abs/2606.04366v1 MeshTok: Efficient Multi-Scale Tokenization for Scalable PDE Transformers 2026-06-03T02:29:04Z

Conventional patchified Transformers operate on uniform spatial partitions, distributing computational effort evenly across the domain irrespective of local features. This inflexible tokenization scheme is inherently limited in its ability to efficiently represent and process solutions to complex PDEs. To address this, we propose MeshTok, an adaptive mesh refinement (AMR)-inspired tokenization and sequence modeling framework. This method selectively refines spatial regions exhibiting sharp gradients, transient features, or multiscale structures, generating a heterogeneous set of multiscale tokens defined on a fixed simulation grid. These tokens are processed within a unified Transformer sequence, enabling the model to simultaneously capture coarse-grained global context and fine-grained local details without requiring specialized architectural components. Although adaptive refinement moderately increases token count, it promotes a more targeted allocation of computational resources to physically informative regions, which we view as a practical inductive bias rather than a formal optimality guarantee. Experimental evaluations across multiple PDE families and benchmark datasets demonstrate that MeshTok consistently improves the efficiency-accuracy trade-off compared to uniform-grid baselines. This suggests adaptive multiscale tokenization as a scalable and generalizable design principle for neural PDE modeling. Code is available at https://github.com/SCAILab-USTC/MeshTok.

2026-06-03T02:29:04Z ICML2026 Yanshun Zhao Xiaoyu Peng Jiamin Jiang Congcong Zhu Jingrun Chen http://arxiv.org/abs/2606.04348v1 A Perturbed q-Tsallis Self-Concordant Barrier for Spectrally Robust Semidefinite Programming 2026-06-03T02:04:59Z

We introduce and analyse a perturbed $q$-Tsallis barrier for semidefinite programming (SDP), defined as a spectral perturbation of the classical log-det barrier on the cone of positive definite matrices. The barrier introduces eigenvalue-adaptive stiffening through a Tsallis-type matrix-power term controlled by parameters $q>1$ and $η\geq0$. Our main theoretical contribution is a sharp characterisation of the differential self-concordance regime of the barrier. We prove that the barrier is differentially self-concordant on the interior of the positive semidefinite cone for all $η\geq0$ if and only if $q\in(1,2]$, establishing the exact threshold at $q=2$. For $q>2$, uniform self-concordance fails globally, although local sufficient conditions remain valid on compact spectral domains. On compact feasible sets, the effective local barrier parameter remains $O(n)$, preserving the same asymptotic iteration complexity class as the classical log-det barrier. We further establish a spectral robustness result showing that the sensitivity of the central path to perturbations is selectively damped in small-eigenvalue directions according to the scaling $κ(X^*)^{-(q-1)}$, where $κ(X^*)$ denotes the spectral condition number. This yields improved robustness relative to the classical log-det barrier for ill-conditioned SDP solutions. Finally, we develop a Mehrotra-type primal--dual predictor--corrector interior-point method equipped with a Lanczos-based Krylov kernel for evaluating matrix powers efficiently. Numerical experiments validate the theoretical predictions and demonstrate improved robustness together with significant computational acceleration.

2026-06-03T02:04:59Z Sergio Assuncao Monteiro Fabricio Alves Barbosa da Silva http://arxiv.org/abs/2606.04347v1 Exponential Time Differencing Schemes for a Phase-Field Model of Multicomponent Membranes 2026-06-03T02:04:46Z

In this paper, we develop and analyze exponential time differencing (ETD) schemes for a phase-field model of multicomponent membranes proposed in our previous work \cite{luo2025ohta}, in which membrane deformation is governed by a force-balance phase-field equation and protein segregation is described by a membrane-associated Ohta-Kawasaki (OK) dynamics. For a fixed phase-field membrane, we introduce a geometry-adapted operator splitting method based on the localization function, which reformulates the surface OK dynamics into a form suitable for ETD integration. The resulting first- and second-order ETD schemes, combined with finite-difference spatial discretization, are rigorously proved to satisfy a discrete maximum-bound principle and unconditional energy stability. For the coupled system, we construct stabilized ETD schemes in an FFT-based spectral framework, treating stiff linear terms exactly and nonlinear mechanochemical couplings explicitly. A narrow-band implementation further reduces the computational cost by restricting surface calculations to the diffuse membrane region. Numerical experiments confirm the predicted temporal accuracy, maximum-bound preservation, and energy decay for the fixed-membrane OK problem, and demonstrate stable and efficient three-dimensional simulations of protein-driven pattern formation and membrane deformation.

2026-06-03T02:04:46Z 25 pages, 6 figures Wangbo Luo Zhonghua Qiao Yanxiang Zhao http://arxiv.org/abs/2512.23965v2 Multimodal sampling via Schrödinger-Föllmer samplers with temperatures 2026-06-03T01:42:52Z

Generating samples from complex and high-dimensional distributions is ubiquitous in various scientific fields of statistical physics, Bayesian inference, scientific computing and machine learning. Very recently, Huang et al. (IEEE Trans. Inform. Theory, 2025) proposed new Schrödinger-Föllmer samplers (SFS), based on the Euler discretization of the Schrödinger-Föllmer diffusion evolving on the unit interval $[0, 1]$. There, a convergence rate of order $\mathcal{O}(\sqrt{h})$ in the $L^2$-Wasserstein distance was obtained for the Euler discretization with a uniform time step-size $h>0$. By introducing a temperature parameter, different samplers are proposed in this paper, based on the Euler discretization of the Schrödinger-Föllmer process with temperatures. As revealed by numerical experiments, high temperatures are vital, particularly in sampling from multimodal distributions. Further, a novel approach of error analysis is developed for the time discretization and an enhanced convergence rate of order $\mathcal{O}(h)$ is obtained in the $L^2$-Wasserstein distance, under certain smoothness conditions on the drift. This significantly improves the existing order-half convergence in the aforementioned paper. Unlike Langevin samplers, SFS is gradient-free, works in a unit interval $[0, 1]$ and does not require any ergodicity. Numerical experiments confirm the convergence rate and show that, the SFS substantially outperforms vanilla Langevin samplers, particularly in sampling from multimodal distributions.

2025-12-30T03:37:40Z J. Complexity 96 (2026), Paper No. 102052 Xiaojie Wang Xiaoyan Zhang 10.1016/j.jco.2026.102052 http://arxiv.org/abs/2507.21757v2 Non-periodic Fourier propagation algorithms for partial differential equations 2026-06-02T23:45:33Z

Spectral methods for partial differential equations (PDEs) with non-periodic boundary conditions arising in computational physics often use polynomial expansions on non-uniform grids. Here, we implement a Fourier method that employs fast trigonometric expansions on a uniform grid with non-periodic boundaries using fast discrete sine transforms (DST) or/and discrete cosine transforms (DCT) to solve parabolic PDEs. We implement this method in two ways: either using a Fourier spectral derivative or a Fourier interaction picture. Both methods can treat vector fields with a combination of Dirichlet and/or Neumann boundary conditions in one or more space dimensions. As examples, we use them to solve a variety of computational physics PDEs with analytical solutions, including the Peregrine solitary wave solution. For the 1D heat equation problem, our method with an interaction picture is accurate up to machine precision. Soluble examples of stochastic partial differential equation (SPDE) with non-periodic boundaries in one and two space dimensions, with physics and interdisciplinary applications are also treated. We compare the results obtained from these algorithms with publicly available solvers that use polynomial spectral methods, and study their relative performance and error scaling. Polynomial methods with non-uniform spatial grids have lower spatial discretization errors when the solutions change slowly in space, typically with large spatial grids. For problems with rapid spatial variation, Fourier methods can outperform polynomial expansions, owing to their smaller maximum space interval, and are generally faster due to the computational efficiency of discrete Fourier transform methods. We verified this by making a complexity analysis in which we studied the total error at the optimum combination of time and space steps for a given resource use.

2025-07-29T12:38:31Z Revision: complexity analysis, higher dimensions, SPDEs Channa Hatharasinghe Run Yan Teh Jesse van Rhijn Peter D. Drummond Margaret D. Reid http://arxiv.org/abs/2606.04292v1 An Efficient Solver for the Richards Equation for Variably Saturated Flows in Porous Media 2026-06-02T23:44:51Z

We present a nonlinear multigrid solver for the Richards equation in variably saturated porous media with strongly nonlinear hydraulic conductivity and water-retention relationships. The governing equation is discretized using a second-order conservative finite-difference scheme in space and an implicit backward differentiation formula in time. The core component of the solver is a nonlinear Gauss--Seidel (NGS) smoother based on a triangular splitting of the diffusion operator combined with diagonal stabilization. This construction yields a sequence of locally decoupled scalar nonlinear problems that can be solved efficiently and robustly using only a few Newton iterations. Under suitable monotonicity assumptions, we establish the convergence of the NGS iteration in the $L^\infty$ norm and derive explicit conditions on the stabilization parameters. Numerical experiments for benchmark infiltration, drainage, and root-uptake problems demonstrate that the proposed NGS-based multigrid framework is both computationally efficient and robust.

2026-06-02T23:44:51Z 20 pages Xuelong Gu Qi Wang http://arxiv.org/abs/2602.15202v2 Tomography by Design: An Algebraic Approach to Low-Rank Quantum States 2026-06-02T22:38:14Z

We present an algebraic algorithm for quantum state tomography that leverages measurements of certain observables to estimate structured entries of the underlying density matrix. Under low-rank assumptions, the remaining entries can be obtained solely using standard numerical linear algebra operations. The proposed algebraic matrix completion framework applies to a broad class of generic, low-rank mixed quantum states and, compared with state-of-the-art methods, is computationally efficient while providing deterministic recovery guarantees.

2026-02-16T21:31:47Z 5 pages, Accepted to EUSIPCO 2026 Shakir Showkat Sofi Charlotte Vermeylen Lieven De Lathauwer http://arxiv.org/abs/2606.04267v1 Unbiased estimation of squared concentration in the Fisher-von Mises-Langevin distribution and the impossibility of unbiased concentration 2026-06-02T22:35:54Z

The estimation of concentration parameter in Fisher-von Mises-Langevin distribution is the directional statistics analogue of the estimation of the precision matrix for the Gaussian distribution. In this work we show that unbiased estimation of this parameter is impossible. With this realization in hand, we provide an alternative parameterization of the Fisher-von Mises-Langevin distribution in terms of the squared concentration, which we term the intensity. We fruther show that unbiased estimation of thereof is possible, and provide (almost) unbiased estimators thereof in terms of a partial sum U-statistic. We showcase our new estimator on synthetic data, New York taxi trip data, and on spherical word embeddings.

2026-06-02T22:35:54Z Zain Jabbar Yuqin Jiang Andrey A. Popov http://arxiv.org/abs/2606.04265v1 Nonlocal Mean Field Schrödinger Bridge with Learned Interactions 2026-06-02T22:30:46Z

The Schrödinger Bridge Problem constructs a stochastic process that connects an initial distribution to a terminal distribution with minimum energy. This work considers its mean-field extension, the Mean-Field Schrödinger Bridge, for interacting particle systems. With nonlocal interactions, evaluating the resulting particle-dependent distributional terms can scale quadratically with the population size, which makes large-scale problems intractable. We address this bottleneck by approximating the nonlocal interactions with neural network surrogates. The resulting four-stage alternating algorithm reduces the per-step cost from quadratic to linear in the population size at inference. We also derive Grönwall-type stability bounds that show how surrogate errors propagate to the generated trajectories. In numerical experiments on navigation and opinion-dynamics tasks, the proposed method reproduces trajectories obtained with analytical evaluation and reduces training time.

2026-06-02T22:30:46Z 31 pages, 15 figures Daisuke Inoue Mathieu Laurière Dante Kalise http://arxiv.org/abs/2507.13492v4 An efficient and energy stable framework for phase field simulations of grain growth in additive manufacturing 2026-06-02T22:29:18Z

Phase field simulations play a key role in the understanding of microstructure evolution in additive manufacturing. However, they have been found extremely computationally expensive. One of the reasons is the small time step requirement to resolve the complex microstructure evolution during the rapid solidification process. This paper investigates the possibility of using a class of stabilized time integration algorithms to accelerate such phase field simulations by increasing the time steps, based on a phase field model dedicated to simulating the solidification of 316L stainless steel during additive manufacturing, particularly in a regime where the solid-liquid interface is moving fast and there is absolute interfacial stability with negligible composition variations. The specific computational framework, incorporating the finite element method and the stabilized time integration algorithms, was developed. A theoretical analysis on energy stability was conducted, based on a revisited energy law derived for the phase field model. The numerical results confirmed that the proposed framework can effectively enforce the numerical stability and a decreasing energy requirement for the phase field simulations with at least two orders-of-magnitude larger time steps over conventional explicit methods. 2D and 3D phase field simulations have been conducted with relevant physical and kinetic parameters for 316L stainless steel. This computational framework can be easily adapted for different phase field models and open numerous opportunities for efficient phase field simulations.

2025-07-17T19:02:07Z Chaoqian Yuan Chinnapat Panwisawas Ye Lu 10.1016/j.jcp.2026.115108 http://arxiv.org/abs/2606.04259v1 An indefinite LOBPCG type of algorithm for detecting a definite Hermitian matrix pair 2026-06-02T22:24:49Z

A Hermitian matrix pair $(A,B)$ is called definite if some real linear combination of the matrices $A$ and $B$ is a positive definite matrix. Detection of the definiteness is not straightforward. We propose a basic subspace algorithm for detecting a large definite matrix pair $(A,B)$ with indefinite $B$. The proposed subspace algorithm is based on iterative testing of small projected Hermitian matrix pairs formed by using subspaces of small dimensions. Furthermore, we propose a specialized algorithm with parameter $m$, and its preconditioned variant. In the specialized algorithm with $m=3$ we choose the subspaces like in the indefinite locally optimal block preconditioned conjugate gradient (LOBPCG) method. Numerical experiments demonstrate the efficiency of our specialized algorithm, applied on medium-sized pairs, as well as, on large and banded pairs. Our algorithm very quickly detects (in)definiteness; much faster than some other algorithms.

2026-06-02T22:24:49Z Linear Algebra and its Applications, v.746 (2026), pp.111-139 Marija Miloloža Pandur 10.1016/j.laa.2026.05.020 http://arxiv.org/abs/2407.00809v4 Kernel Neural Operators (KNOs) for Scalable, Memory-efficient, Geometrically-flexible Operator Learning 2026-06-02T21:20:03Z

This paper introduces the Kernel Neural Operator (KNO), a provably convergent operator-learning architecture that utilizes compositions of deep kernel-based integral operators for function-space approximation of operators (maps from functions to functions). The KNO decouples the choice of kernel from the numerical integration scheme (quadrature), thereby naturally allowing for operator learning with explicitly-chosen trainable kernels on irregular geometries. On irregular domains, this allows the KNO to utilize domain-specific quadrature rules. To help ameliorate the curse of dimensionality, we also leverage an efficient dimension-wise factorization algorithm on regular domains. More importantly, the ability to explicitly specify kernels also allows the use of highly expressive, non-stationary, neural anisotropic kernels whose parameters are computed by training neural networks. We present universal approximation theorems showing that both the continuous and fully discretized KNO are universal approximators on operator learning problems. Numerical results demonstrate that on existing benchmarks the training and test accuracy of KNOs is closely comparable to or higher than that of popular neural operators while typically using an order of magnitude fewer trainable parameters, with the more expressive kernels proving important to attaining high accuracy. KNOs thus facilitate low-memory, geometrically-flexible, deep operator learning, while retaining the implementation simplicity and transparency of traditional kernel methods from both scientific computing and machine learning.

2024-06-30T19:28:12Z 14 pages + 15 page appendix, 7 figures Transactions on Machine Learning Research, ISSN 2835-8856, 2026 Matthew Lowery John Turnage Zachary Morrow John D. Jakeman Akil Narayan Shandian Zhe Varun Shankar http://arxiv.org/abs/2606.04204v1 An Energy-Stable Implicit Convex-Splitting BDF2 Scheme for the Cahn-Hilliard-Navier-Stokes Equations 2026-06-02T20:49:14Z

We develop an energy-stable implicit convex-splitting BDF2 discretization (CS-BDF2) of the Cahn--Hilliard--Navier--Stokes equations. For the Cahn--Hilliard equation, BDF2 analyses can establish energy stability by testing the phase equation in the (H^{-1}) metric. For CHNS, this test is not compatible with the coupled energy estimate: the momentum equation is tested by (\bfu^{n+1}), while the transported phase equation is tested by (μ^{n+1}) so that transport cancels capillary work. The chemical-potential relation must then be paired with the BDF2 phase increment ((3φ^{n+1}-4φ^n+φ^{n-1})/2); its nonlinear part must produce a BDF2 bulk-energy difference, up to nonnegative higher-order history terms. To overcome this difficulty, we introduce a new BDF2-compatible convex-splitting approximation of the nonlinear bulk force that directly yields a discrete bulk-energy identity and enables a discrete energy analysis for the CHNS system. Specifically, we discretize the bulk force (f(φ)=φ^3-φ) by (χ(φ^{\dagger,n+1},φ^{\dagger,n})-φ^{*,n+1}), where (χ(a,b)=\tfrac14(a^2+b^2)(a+b)), (φ^{\dagger,n+1}=\tfrac{3φ^{n+1}-φ^n}{2}), (φ^{\dagger,n}=\tfrac{3φ^n-φ^{n-1}}{2}), and (φ^{*,n+1}=2φ^n-φ^{n-1}). This discretization is based on the shifted BDF2 identity ((3φ^{n+1}-4φ^n+φ^{n-1})/2=φ^{\dagger,n+1}-φ^{\dagger,n}). With a matching discretization of the reversible coupling terms in CHNS, the scheme is mass conservative, uniquely solvable, and unconditionally energy stable. We prove second-order convergence for the phase variable, chemical potential, velocity, and pressure.

2026-06-02T20:49:14Z 32 pages Xuelong Gu Qi Wang http://arxiv.org/abs/2606.04190v1 Sampling and reconstruction of convex functions 2026-06-02T20:19:38Z

We discuss optimal recovery for classes of multivariate convex functions from given point samples, as well as the sampling numbers of these classes, corresponding to optimal sample choices. Upper and lower bounds for either variant are established when the reconstruction error is measured in $L_p$ for $1\leq p\leq \infty$. These bounds match, sometimes up to logarithmic factors, and therefore characterize the respective optimal rate of decay. For classical smoothness classes such as Sobolev, Hölder or Besov spaces, it is well known that the optimal decay rate of sampling numbers can be achieved by sampling on uniform tensor product grids and using linear methods of reconstruction, such as piecewise polynomial interpolation. One of the main findings in this paper is that for classes of convex functions, these procedures generally produce suboptimal rates, except when $p=1$ and $p=\infty$, and are outperformed by nonlinear reconstruction methods that do not employ tensor product grids.

2026-06-02T20:19:38Z Andrea Bonito Albert Cohen Wolfgang Dahmen Ronald Devore Guergana Petrova Jonathan W. Siegel http://arxiv.org/abs/2510.20116v2 Interpolatory Approximations of PMU Data: Dimension Reduction and Pilot Selection 2026-06-02T18:58:56Z

This work investigates the reduction of phasor measurement unit (PMU) data through low-rank matrix approximations. To reconstruct a PMU data matrix from fewer measurements, we propose the framework of interpolatory matrix decompositions (IDs). In contrast to methods relying on principal component analysis or singular value decomposition, IDs recover the complete data matrix using only a few of its rows (PMU datastreams) and/or a few of its columns (snapshots in time). This row-/column-based compression enables real-time monitoring of power transmission systems using measurements from a smaller subset of pilot datastreams, thereby minimizing communication bandwidth. The ID perspective gives a rigorous error bound on the quality of the data compression. We propose selecting the pilot measurements used in an ID via the discrete empirical interpolation method (DEIM), a greedy algorithm that aims to control the error bound. This bound yields a computable estimate of the reconstruction error during online operations. A violation of this estimate suggests a change in the system's operating conditions and thus serves as a tool for fault detection. Following a disturbance, DEIM can be used to localize the event source across all buses with high accuracy. Numerical tests on synthetic PMU data demonstrate DEIM's excellent performance in data compression and validate the proposed DEIM-based fault-detection and localization method.

2025-10-23T01:36:44Z Sean Reiter Mark Embree Serkan Gugercin Vassilis Kekatos