https://arxiv.org/api/bSjFdoZ9ucdT7KS+rkQS6/yap282026-06-22T09:24:20Z4987036015http://arxiv.org/abs/2603.29237v2Stochastic Dimension Implicit Functional Projections for Global Integral Conservation in High-Dimensional PINNs2026-06-07T07:51:44ZEnforcing prescribed global integral constraints in mesh-free neural PDE solvers is challenging in high-dimensional domains. Existing projection methods for spatial integrals are often tied to fixed grids or uniform quadrature, which can conflict with randomly sampled physics-informed neural networks (PINNs) and scale poorly with dimension. High-order differential operators also increase reverse-mode automatic differentiation memory costs. We propose Stochastic Dimension Implicit Functional Projection (SDIFP), a quadrature-level framework for enforcing prescribed first and second spatial moments. SDIFP replaces tensor-product nodal projection by a global affine correction of the neural-network output, with two scalar coefficients determined from a weighted quadrature rule. Under positive target variance and nonzero empirical raw variance, this correction is the nearest-point projection, in the weighted quadrature norm, onto the empirical two-moment constraint set. Thus, the prescribed moments are exact for the selected quadrature rule, while continuum errors are quadrature errors of the corrected field. For decomposable high-dimensional linear operators, SDIFP combines affine moment correction with stochastic operator-subset sampling. With independent residual and derivative sampling and conditionally unbiased coefficient-gradient estimation, the resulting estimator is unbiased for the specified quadrature-based residual objective; the shared-subset fast mode is biased in general. SDIFP avoids tensor-product quadrature for moment enforcement, separates forward quadrature evaluation from the reverse-mode graph, and retains pointwise inference efficiency once the affine coefficients are fixed or precomputed.2026-03-31T04:07:51ZZhangyong LiangHuanhuan Gaohttp://arxiv.org/abs/2503.20921v2The MINI mixed virtual element for the Stokes equation2026-06-07T07:02:18ZWe present and discuss a generalization of the popular MINI mixed finite element for the 2D Stokes equation by means of conforming virtual elements on polygonal meshes. We prove optimal error estimates for both velocity and pressure. Theoretical results are confirmed by several numerical tests performed with different choices of polynomial accuracy and meshes.2025-03-26T18:55:06ZTo appear on Computational Methods in Applied Mathematics; 36 pages, 11 figures, 1 tableSilvia BertoluzzaFabio CredaliDaniele Pradahttp://arxiv.org/abs/2606.08448v1Multiscale Fourier Neural Operator for Inverse Wave Scattering in Highly Oscillatory Media2026-06-07T04:36:49ZIn this paper, we propose an operator learning method based on the multiscale Fourier neural operator (MscaleFNO) for inverse medium problems of Helmholtz equations. The MscaleFNO provides a neural surrogate model with reduced spectral bias for the Helmholtz equations, mapping highly oscillatory medium profiles to scattered wavefields. A plug-and-play inversion using elucidated diffusion model is introduced to regularize the inverse solver based on least squares of data misfits. Numerical results for partial aperture inversion of oscillatory two-dimensional media demonstrate the advantage and effectiveness of MscaleFNO for accurate reconstruction of highly oscillatory medium properties.2026-06-07T04:36:49Z33 pages, 15 figuresZilin YouZhenli XuWei Caihttp://arxiv.org/abs/1705.00729v4New Combinations of Polynomial Root-Finding Iterations2026-06-07T00:52:31ZSome near-optimal polynomial root-finders of 2024-25, based on subdivision iterations, approximate all complex roots of a polynomial or all roots lying in a fixed Region of Interest in the complex plane. We combine these iterations with Newton's and/or Schroeder's to yield significant empirical acceleration versus each approach standing alone. Like the cited recent algorithms, our root-finders can be applied not only to a polynomial represented in monomial basis by its coefficients but also to a black box polynomial represented by an oracle (black box subroutine) for its evaluation. Some by-products of our study such as an extension of the Gauss-Lucas theorem and a fast black box estimator for root radius can be of independent interest.2017-05-01T22:29:14Z16 pages, 7 figuresVictor Y. Panhttp://arxiv.org/abs/2604.26993v2State-Dependent Lyapunov Analysis of Rank-1 Matrix Factorization2026-06-06T22:29:59ZWe study gradient descent for rank-1 matrix factorization through a state-dependent Lyapunov perspective. The central object is a parameterized quadratic certificate $I(δ;\,\cdot)$ whose boundary-inward property induces a monotone state parameter $δ_t$, thereby certifying that the trajectory is confined to a shrinking family of level sets. For certified initializations below the critical step size, this mechanism proves convergence to global minimizers. Above the critical step size, the same monotone-state mechanism instead leads to a balanced terminal regime; for a range of post-critical step sizes, the reduced dynamics exhibit period-2 behavior consistent with edge-of-stability phenomena.
We further show that the scalar certificate is not an ad hoc algebraic construction: under structural axioms and a natural state-parameter normalization, it is uniquely determined by the monotonicity mechanism. Numerical experiments suggest that this state-dependent Lyapunov mechanism persists beyond the proved cases, including two-dimensional rank-1 approximation and quartic augmentations of scalar factorization.2026-04-28T22:43:16ZJaehong Moonhttp://arxiv.org/abs/2604.26280v2Structure-Aware Tensorial Model Reduction2026-06-06T21:30:53ZThis work investigates a two-stage method for constructing projection-based reduced-order models (ROMs) of parameterized partial differential equations (PDEs). Based on established tensorial ROM methodology, the proposed approach reduces dimensionality offline by encoding solution snapshots using a multi-linear Tucker factorization, so that a reduced basis which varies nonlinearly with PDE parameters can be rapidly constructed online and used in a Galerkin ROM. Two novel extensions of this strategy, tailored to the cases of structured PDEs and sparse parameter sampling, are presented: the construction of reduced bases orthonormalized with respect to a general discrete inner product, and the interpolation of encoded states via radial basis functions. Basic representation and ROM error estimates are presented demonstrating the validity of these modifications, and the approach is challenged on examples where monolithic-basis ROMs are known to struggle, including a realistic instance of Maxwell's equations in 3D. Results suggest that the proposed nonlinear basis ROM can effectively mitigate linear restrictions on Kolmogorov $n$-width while improving upon previous tensorial ROM technology, particularly in the highly nonlinear and data-limited regimes characteristic of practical use cases.2026-04-29T04:21:03ZArjun VijaywargiyaEric C. CyrAnthony Gruberhttp://arxiv.org/abs/2601.09900v4Nonlinear numerical schemes using specular differentiation for initial value problems of first-order ordinary differential equations2026-06-06T20:03:26ZThis paper proposes specular differentiation in one-dimensional Euclidean space and provides its fundamental analysis, including a quasi-Fermat theorem and a quasi-Mean Value Theorem. As an application, this paper develops several numerical schemes for solving initial value problems for first-order ordinary differential equations. Based on numerical simulations, we select one scheme and prove its second-order consistency and convergence. By modifying this scheme, we also obtain a numerical scheme with zero local truncation error for ODEs whose solution trajectories are ellipses.2026-01-14T22:14:24ZKiyuob Junghttp://arxiv.org/abs/2606.08293v1A Second-order Structure-preserving Parametric FEM for Surface Evolution2026-06-06T18:29:15ZIn this paper, we propose a second-order-in-time, structure-preserving, and mesh-robust parametric finite element method for surface diffusion and volume-preserving mean curvature flow. We first reformulate the original evolution equations into new systems in which the tangential motion is governed by a harmonic map heat flow. This heat flow maps a fixed reference surface onto the unknown evolving surface and drives points on the evolving surface to move in their tangent spaces so as to reduce the associated harmonic energy. As a result, in the discrete setting, the mesh quality can be maintained at a level comparable to that of the reference surface, unless singularities occur. The volume-preserving property is theoretically guaranteed by the careful design of the scheme, while energy dissipation is enforced through a Lagrange multiplier. We present several numerical experiments to demonstrate second-order convergence in time and the advantage of the proposed method in preserving mesh quality. The structure-preserving properties are further confirmed by the numerical results. Finally, the proposed framework can be readily extended to other geometric flows.2026-06-06T18:29:15Z23 pages, 12 pagesBeiping DuanZongze Yanghttp://arxiv.org/abs/2606.08203v1Stable and Scalable Probabilistic Numerical Solvers for Stiff and High-Dimensional ODEs2026-06-06T14:42:03ZFiltering-based probabilistic numerical solvers for ordinary differential equations (ODEs) have been established as a flexible and efficient simulation framework with built-in numerical uncertainty quantification. However, problems that are both stiff and high-dimensional remain a challenge, as current methods are either stable and have cubic cost in the ODE dimension, or scale linearly at the expense of stability. In this paper, we close this gap and develop probabilistic ODE solvers that are both stable and scalable. We propose two complementary strategies. First, we develop a matrix-free update step that uses Jacobian-vector products, iterative linear solvers, and stochastic covariance estimation to enable linear scaling, all while retaining stability. Second, we propose iterative re-linearization to further improve stability without sacrificing scalability, turning probabilistic ODE solvers into fully implicit methods. We evaluate the proposed approaches on a range of stiff and high-dimensional problems and demonstrate improved stability and scalability over established probabilistic solvers.2026-06-06T14:42:03ZNathanael Boschhttp://arxiv.org/abs/2509.18712v3Optimality of quasi-Monte Carlo methods and suboptimality of the sparse-grid Gauss--Hermite rule in Gaussian Sobolev spaces2026-06-06T14:34:44ZOptimality of several quasi-Monte Carlo methods and suboptimality of the sparse-grid quadrature based on the univariate Gauss--Hermite rule is proved in the Sobolev spaces of mixed dominating smoothness of order $α$, where the optimality is in the sense of worst-case convergence rate. For sparse-grid Gauss--Hermite quadrature, lower and upper bounds are established, with rates coinciding up to a logarithmic factor. The dominant rate is found to be only $N^{-α/2}$ with $N$ function evaluations, although the optimal rate is known to be $N^{-α}(\ln N)^{(d-1)/2}$. The lower bound is obtained by exploiting the structure of the Gauss--Hermite nodes and is independent of the quadrature weights; consequently, no modification of the weights can improve the rate $N^{-α/2}$. In contrast, several quasi-Monte Carlo methods with a change of variables are shown to achieve the optimal rate, some up to, and one including, the logarithmic factor.2025-09-23T06:52:34ZYoshihito KazashiYuya SuzukiTakashi Godahttp://arxiv.org/abs/2602.05869v2Wedge Sampling: Efficient Tensor Completion with Nearly-Linear Sample Complexity2026-06-06T14:29:26ZWe introduce Wedge Sampling, a new non-adaptive sampling scheme for low-rank tensor completion. We study recovery of an order-$k$ low-rank tensor of dimension $n \times \cdots \times n$ from a subset of its entries. Unlike the standard uniform entry model (i.e., i.i.d. samples from $[n]^k$), wedge sampling allocates observations to structured length-two patterns (wedges) in an associated bipartite sampling graph. By directly promoting these length-two connections, the sampling design strengthens the spectral signal that underlies efficient initialization, in regimes where uniform sampling is too sparse to generate enough informative correlations.
Our main result shows that this change in sampling paradigm enables polynomial-time algorithms to achieve both weak and exact recovery with nearly linear sample complexity in $n$. The approach is also plug-and-play: wedge-sampling-based spectral initialization can be combined with existing refinement procedures (e.g., spectral or gradient-based methods) using only an additional $\tilde{O}(n)$ uniformly sampled entries, substantially improving over the $\tilde{O}(n^{k/2})$ sample complexity typically required under uniform entry sampling for efficient methods. Overall, our results suggest that the statistical-to-computational gap highlighted in Barak and Moitra (2022) is, to a large extent, a consequence of the uniform entry sampling model for tensor completion, and that alternative non-adaptive measurement designs that guarantee a strong initialization can overcome this barrier.2026-02-05T16:47:13ZCOLT 2026 arXiv version. 65 pages, 3 figuresHengrui LuoAnna MaLudovic StephanYizhe Zhuhttp://arxiv.org/abs/2606.08175v1Quaternion Maximum-Volume Submatrix Selection with Applications to Multichannel Imaging and Visual Data2026-06-06T13:42:19ZLow-rank approximation based on selected rows and columns is a useful alternative to singular value decompositions when the goal is an interpretable and compact matrix representation. A standard way to choose these rows and columns is the maximum-volume principle: it selects submatrices with large volume, which usually leads to stable interpolation coefficients and accurate CUR-type approximations. In this paper, we study this idea for quaternion matrices. This setting is natural for color images, three-dimensional motion data, and multi-channel signals, but requires care because quaternion multiplication is noncommutative. We define quaternion maximum-volume submatrix selection using quaternion singular values and the Study determinant. We then derive quaternion rank-one update formulas and use them to build two selection procedures: a greedy square-core method for row and column replacement, and a rectangular method that enlarges a selected row set until the interpolation coefficients are controlled. We prove that successful row and column swaps increase the quaternion volume of the selected square core when the exact quaternion inverse is used. We also connect the stopping criterion with quasi-dominance, prove an exact quaternion CUR identity in the full-rank case, and derive an interpolation stability bound. For the rectangular case, we derive an append-row pseudoinverse update and show how it gives a natural right preconditioner for overdetermined quaternion least-squares problems. Finally, we illustrate the methods on three applications: quaternion CUR approximation of RGB images, RectMaxVol-based preconditioning for ill-conditioned quaternion least-squares systems, and row selection in quaternion motion-capture data. The experiments show that the proposed quaternion MaxVol and RectMaxVol methods provide stable and efficient selection routines.2026-06-06T13:42:19ZVsevolod KliushevJunjun PanValentin Leplathttp://arxiv.org/abs/2606.08161v1AttentionCap: Transformer Based Capacitance Matrix Learning Toward Full-Chip Extraction2026-06-06T13:20:09ZAs capacitance extraction accuracy of rule-based pattern matching becomes difficult to sustain at advanced nodes, a growing trend emerges to develop deep-learning-based 2D capacitance models. However, existing MLP- and CNN-based methods constrain their input to fixed metal-layer combinations in a specific process node, limiting their usability in practice. Recognizing the inherent similarity between capacitance matrix and the prevailing attention mechanism, we propose AttentionCap, a customized Transformer for capacitance matrix learning, with a Gram representation framework, a physics-aligned symmetric-attention output layer, and a novel normalized Laplacian loss. We also introduce a process-node embedding to enable multi-node learning. Trained on synthetic data, AttentionCap attains 0.67\%/3.99\% self/coupling-capacitance error on unseen real designs under a multi-layer and multi-node setting, surpassing the CNN-Cap baseline with 4.6$\times$/5.7$\times$ lower self/coupling error and 192$\times$ faster inference speed. A pretrained AttentionCap accurately transfers to an unseen node with only 5K samples and 4K finetuning steps. With sufficient accuracy on unseen real designs and strong transferability to new process nodes, AttentionCap offers highly practical value for modern EDA workflows. Code and data are available at https://github.com/THU-numbda/AttentionCap.2026-06-06T13:20:09ZAccepted at the 63rd ACM/IEEE Design Automation Conference (DAC '26)Jiechen HuangHector R. RodriguezDingcheng YangZuochang YeYibo LinWenjian Yuhttp://arxiv.org/abs/2311.00554v2A higher order numerical method for singularly perturbed elliptic problems with characteristic boundary layers2026-06-06T12:17:58ZA Petrov-Galerkin finite element method is constructed for a singularly perturbed elliptic problem in two space dimensions. The solution contains a regular boundary layer and two characteristic boundary layers. Exponential splines are used as test functions in one coordinate direction and are combined with bilinear trial functions defined on a Shishkin mesh. The resulting numerical method is shown to be a stable parameter-uniform numerical method that achieves a higher order of convergence compared to upwinding on the same mesh.2023-11-01T14:42:01Z27 pages, 3 figuresAlan F. HegartyEugene O'Riordanhttp://arxiv.org/abs/2606.08095v1Strain localization in softening plasticity without modifying standard constitutive models: a deformable Cosserat approach2026-06-06T10:46:24ZThis paper presents a formulation for strain localization in softening plasticity based on a deformable Cosserat model. The approach enables the direct use of standard elastoplastic constitutive models formulated for a classical Cauchy continuum, without modifying the stress update algorithm or consistent tangent operator. A key feature of the framework is the strict separation of dissipative and energetic mechanisms: all dissipation is confined to the macro-continuum, while the micro-continuum contributes only through linear elastic terms associated with the director field. As a result, the constitutive structure of the elastoplastic model is preserved, and existing models can be employed as black-box components. The internal length scale arises naturally from the micro-continuum and governs the development, interaction and selection of localization patterns, rather than acting as a diffusive parameter. The formulation is easy to implement within standard finite element frameworks, requiring only additional linear contributions to the residual and tangent operators. The performance of the approach is assessed through benchmark problems involving shallow foundations on soil, a demanding test due to complex and unstable localization mechanisms. Both Tresca and Matsuoka-Nakai plasticity models are considered, including cases with highly unstable post-peak responses. Numerical results show convergence of load-displacement responses, dissipated energy and shear-band patterns upon mesh refinement, even in the presence of nonlinear interacting localization processes. These findings demonstrate a robust and physically consistent approach for the analysis of strain localization in softening plasticity.2026-06-06T10:46:24ZAndrea PanteghiniM. B. Rubin