http://arxiv.org/api/uGXyJ9rkEpzUtPLGgYKCYdOtFGw2025-04-22T00:00:00-04:0041976015http://arxiv.org/abs/2504.15201v12025-04-21T16:17:39Z2025-04-21T16:17:39ZPhase-separated lipid vesicles: continuum modeling, simulation, and
validation The paper presents a complete research cycle comprising continuum-based
modeling, computational framework development, and validation setup to predict
phase separation and surface hydrodynamics in lipid bilayer membranes. We
starting with an overview of the key physical characteristics of lipid
bilayers, including their composition, mechanical properties, and
thermodynamics, and then discuss continuum models of multi-component bilayers.
The most complex model is a Navier--Stokes--Cahn--Hilliard (NSCH) type system,
describing the coupling of incompressible surface fluid dynamics with
phase-field dynamics on arbitrarily curved geometries. It is discretized using
trace finite element methods, which offer geometric flexibility and stability
in representing surface PDEs. Numerical studies are conducted to examine
physical features such as coarsening rates and interfacial dynamics. The
computational results obtained from the NSCH model are compared against
experimental data for membrane compositions with distinct phase behaviors,
demonstrating that including both phase-field models and surface hydrodynamics
is essential to accurately reproduce domain evolution observed in
epi-fluorescence microscopy. Lastly, we extend the model to incorporate
external forces that enable the simulation of vesicles containing cationic
lipids, used to enhance membrane fusion.
Maxim OlshanskiiAnnalisa Quainihttp://arxiv.org/abs/2504.15177v12025-04-21T15:39:42Z2025-04-21T15:39:42ZAn $rp$-adaptive method for accurate resolution of shock-dominated
viscous flow based on implicit shock tracking This work introduces an optimization-based $rp$-adaptive numerical method to
approximate solutions of viscous, shock-dominated flows using implicit shock
tracking and a high-order discontinuous Galerkin discretization on
traditionally coarse grids without nonlinear stabilization (e.g., artificial
viscosity or limiting). The proposed method adapts implicit shock tracking
methods, originally developed to align mesh faces with solution
discontinuities, to compress elements into viscous shocks and boundary layers,
functioning as a novel approach to aggressive $r$-adaptation. This form of
$r$-adaptation is achieved naturally as the minimizer of the enriched residual
with respect to the discrete flow variables and coordinates of the nodes of the
grid. Several innovations to the shock tracking optimization solver are
proposed to ensure sufficient mesh compression at viscous features to render
stabilization unnecessary, including residual weighting, step constraints and
modifications, and viscosity-based continuation. Finally, $p$-adaptivity is
used to locally increase the polynomial degree with three clear benefits: (1)
lessens the mesh compression requirements near shock waves and boundary layers,
(2) reduces the error in regions where $r$-adaptivity is not sufficient with
the given grid topology, and (3) reduces computational cost by performing a
majority of the $r$-adaptivity iterations on the coarsest discretization. A
series of numerical experiments show the proposed method effectively resolves
viscous, shock-dominated flows, including accurate prediction of heat flux
profiles produced by hypersonic flow over a cylinder, and compares favorably in
terms of accuracy per degree of freedom to $h$-adaptation with a high-order
discretization.
Huijing DongMasayuki YanoTianci HuangMatthew J. Zahr43 pages, 35 figures,http://arxiv.org/abs/2504.15173v12025-04-21T15:26:35Z2025-04-21T15:26:35ZPoroelastic flow across a permeable interface: a Hamilton's principle
approach and its finite element implementation We consider fluid flow across a permeable interface within a deformable
porous medium. We use mixture theory. The mixture's constituents are assumed to
be incompressible in their pure form. We use Hamilton's principle to obtain the
governing equations, and we propose a corresponding finite element
implementation. The filtration velocity and the pore pressure are allowed to be
discontinuous across the interface while some control of these discontinuities
is built into the interfacial constitutive behavior. To facilitate the
practical implementation of the formulation in a finite element scheme, we
introduce a Lagrange multiplier field over the interface for the explicit
enforcement of the jump condition of the balance of mass. Our formulation
appears to recover some basic results from the literature. The novelty of the
work is the formulation of an approach that can accommodate specific
constitutive assumptions pertaining to the behavior of the interface that do
not necessarily imply the continuity of the filtration velocity and/or of the
pore pressure across it.
Francesco CostanzoCenter for Neural Engineering, Engineering Science and Mechanics Department, Penn State UniversityMohammad JannesariCenter for Neural Engineering, Engineering Science and Mechanics Department, Penn State UniversityBeatrice GhittiCenter for Neural Engineering, Engineering Science and Mechanics Department, Penn State UniversityAuckland Bioengineering Institute, The University of Aucklandhttp://arxiv.org/abs/2504.15151v12025-04-21T14:55:24Z2025-04-21T14:55:24ZArtificial compressibility method for the incompressible Navier-Stokes
equations with variable density We introduce a novel artificial compressibility technique to approximate the
incompressible Navier-Stokes equations with variable fluid properties such as
density and dynamical viscosity. The proposed scheme used the couple pressure
and momentum, equal to the density times the velocity, as primary unknowns. It
also involves an adequate treatment of the diffusive operator such that
treating the nonlinear convective term explicitly leads to a scheme with time
independent stiffness matrices that is suitable for pseudo-spectral methods.
The stability and temporal convergence of the semi-implicit version of the
scheme is established under the hypothesis that the density is approximated
with a method that conserves the minimum-maximum principle. Numerical
illustrations confirm that both the semi-implicit and explicit scheme are
stable and converge with order one under classic CFL condition. Moreover, the
proposed scheme is shown to perform better than a momentum based pressure
projection method, previously introduced by one of the authors, on setups
involving gravitational waves and immiscible multi-fluids in a cylinder.
Cappanera LoicGiordano Salvatorehttp://arxiv.org/abs/2407.14413v22025-04-21T14:26:15Z2024-07-19T15:40:08ZUniqueness of the inverse source problem for fractional diffusion-wave
equations This study addresses the inverse source problem for the fractional
diffusion-wave equation, characterized by a source comprising spatial and
temporal components. The investigation is primarily concerned with practical
scenarios where data is collected subsequent to an incident. We establish the
uniqueness of either the spatial or the temporal component of the source,
provided that the temporal component exhibits an asymptotic expansion at
infinity. Taking anomalous diffusion as a typical example, we gather the
asymptotic behavior of one of the following quantities: the concentration on
partial interior region or at a point inside the region, or the flux on partial
boundary or at a point on the boundary. The proof is based on the asymptotic
expansion of the solution to the fractional diffusion-wave equation. Notably,
our approach does not rely on the conventional vanishing conditions for the
source components. We also observe that the extent of uniqueness is dependent
on the fractional order.
Lingyun QiuJiwoon SimRevision submitted to arxiv on 2025/4/21http://arxiv.org/abs/2504.15110v12025-04-21T14:02:59Z2025-04-21T14:02:59ZKolmogorov-Arnold Networks: Approximation and Learning Guarantees for
Functions and their Derivatives Inspired by the Kolmogorov-Arnold superposition theorem, Kolmogorov-Arnold
Networks (KANs) have recently emerged as an improved backbone for most deep
learning frameworks, promising more adaptivity than their multilayer perception
(MLP) predecessor by allowing for trainable spline-based activation functions.
In this paper, we probe the theoretical foundations of the KAN architecture by
showing that it can optimally approximate any Besov function in
$B^{s}_{p,q}(\mathcal{X})$ on a bounded open, or even fractal, domain
$\mathcal{X}$ in $\mathbb{R}^d$ at the optimal approximation rate with respect
to any weaker Besov norm $B^{\alpha}_{p,q}(\mathcal{X})$; where $\alpha < s$.
We complement our approximation guarantee with a dimension-free estimate on the
sample complexity of a residual KAN model when learning a function of Besov
regularity from $N$ i.i.d. noiseless samples. Our KAN architecture incorporates
contemporary deep learning wisdom by leveraging residual/skip connections
between layers.
Anastasis KratsiosTakashi Furuyahttp://arxiv.org/abs/2409.04297v22025-04-21T13:42:27Z2024-09-06T14:13:57ZMinimization of the Pseudospectral Abscissa of a Quadratic Matrix
Polynomial For a quadratic matrix polynomial dependent on parameters and a given
tolerance $\epsilon > 0$, the minimization of the $\epsilon$-pseudospectral
abscissa over the set of permissible parameter values is discussed, with
applications in damping optimization and brake squeal reductions in mind. An
approach is introduced that is based on nonsmooth and global optimization (or
smooth optimization techniques such as BFGS if there are many parameters)
equipped with a globally convergent criss-cross algorithm to compute the
$\epsilon$-pseudospectral abscissa objective when the matrix polynomial is of
small size. For the setting when the matrix polynomial is large, a subspace
framework is introduced, and it is argued formally that it solves the
minimization problem globally. The subspace framework restricts the
parameter-dependent matrix polynomial to small subspaces, and thus solves the
minimization problem for such restricted small matrix polynomials. It then
expands the subspaces using the minimizers for the restricted polynomials. The
proposed approach makes the global minimization of the
$\epsilon$-pseudospectral abscissa possible for a quadratic matrix polynomial
dependent on a few parameters and for sizes up to at least a few hundreds. This
is illustrated on several examples originating from damping optimization.
Volker MehrmannEmre Mengi29 pages, 5 figureshttp://arxiv.org/abs/2504.15100v12025-04-21T13:41:20Z2025-04-21T13:41:20ZApplication of Sensitivity Analysis Methods for Studying Neural Network
Models This study demonstrates the capabilities of several methods for analyzing the
sensitivity of neural networks to perturbations of the input data and
interpreting their underlying mechanisms. The investigated approaches include
the Sobol global sensitivity analysis, the local sensitivity method for input
pixel perturbations and the activation maximization technique. As examples, in
this study we consider a small feedforward neural network for analyzing an open
tabular dataset of clinical diabetes data, as well as two classical
convolutional architectures, VGG-16 and ResNet-18, which are widely used in
image processing and classification. Utilization of the global sensitivity
analysis allows us to identify the leading input parameters of the chosen tiny
neural network and reduce their number without significant loss of the
accuracy. As far as global sensitivity analysis is not applicable to larger
models we try the local sensitivity analysis and activation maximization method
in application to the convolutional neural networks. These methods show
interesting patterns for the convolutional models solving the image
classification problem. All in all, we compare the results of the activation
maximization method with popular Grad-CAM technique in the context of
ultrasound data analysis.
Jiaxuan MiaoSergey Matveev11 pages, 16 figures, 32 referenceshttp://arxiv.org/abs/2504.15073v12025-04-21T13:03:21Z2025-04-21T13:03:21ZHermitian Quaternion Toeplitz Matrices by Quaternion-valued Generating
Functions In this paper, we study Hermitian quaternion Toeplitz matrices generated by
quaternion-valued functions. We show that such generating function must be the
sum of a real-valued function and an odd function with imaginary component.
This setting is different from the case of Hermitian complex Toeplitz matrices
generated by real-valued functions only. By using of 2-by-2 block complex
representation of quaternion matrices, we give a quaternion version of
Grenander-Szeg\"{o} theorem stating the distribution of eigenvalues of
Hermitian quaternion Toeplitz matrices in terms of its generating function. As
an application, we investigate Strang's circulant preconditioners for Hermitian
quaternion Toeplitz linear systems arising from quaternion signal processing.
We show that Strang's circulant preconditioners can be diagionalized by
discrete quaternion Fourier transform matrices whereas general quaternion
circulant matrices cannot be diagonalized by them. Also we verify the
theoretical and numerical convergence results of Strang's circulant
preconditioned conjugate gradient method for solving Hermitian quaternion
Toeplitz systems.
Xue-lei LinMichael K. NgJunjun Panhttp://arxiv.org/abs/2504.14939v12025-04-21T08:02:07Z2025-04-21T08:02:07ZFull Discretization of Stochastic Semilinear Schrödinger equation
driven by multiplicative Wiener noise In this article, we have analyzed the full discretization of the Stochastic
semilinear Schr\"{o}dinger equation in a bounded convex polygonal domain driven
by multiplicative Wiener noise. We use the finite element method for spatial
discretization and the stochastic trigonometric method for time discretization
and derive a strong convergence rate with respect to both parameters (temporal
and spatial). Numerical experiments have also been performed to support
theoretical bounds.
Suprio BharMrinmay BiswasMangala Prasad27 pages, Comments are welcomehttp://arxiv.org/abs/2504.14930v12025-04-21T07:50:51Z2025-04-21T07:50:51ZKernel-learning parameter prediction and evaluation in algebraic
multigrid method for several PDEs This paper explores the application of kernel learning methods for parameter
prediction and evaluation in the Algebraic Multigrid Method (AMG), focusing on
several Partial Differential Equation (PDE) problems. AMG is an efficient
iterative solver for large-scale sparse linear systems, particularly those
derived from elliptic and parabolic PDE discretizations. However, its
performance heavily relies on numerous parameters, which are often set
empirically and are highly sensitive to AMG's effectiveness. Traditional
parameter optimization methods are either computationally expensive or lack
theoretical support. To address this, we propose a Gaussian Process Regression
(GPR)-based strategy to optimize AMG parameters and introduce evaluation
metrics to assess their effectiveness. Trained on small-scale datasets, GPR
predicts nearly optimal parameters, bypassing the time-consuming parameter
sweeping process. We also use kernel learning techniques to build a kernel
function library and determine the optimal kernel function through linear
combination, enhancing prediction accuracy. In numerical experiments, we tested
typical PDEs such as the constant-coefficient Poisson equation,
variable-coefficient Poisson equation, diffusion equation, and Helmholtz
equation. Results show that GPR-predicted parameters match grid search results
in iteration counts while significantly reducing computational time. A
comprehensive analysis using metrics like mean squared error, prediction
interval coverage, and Bayesian information criterion confirms GPR's efficiency
and reliability. These findings validate GPR's effectiveness in AMG parameter
optimization and provide theoretical support for AMG's practical application.
Juan ZhangJunyue LuoFangfang ZhangXiaoqiang Yuehttp://arxiv.org/abs/2403.02832v42025-04-21T06:27:49Z2024-03-05T10:06:09ZQuasi-Monte Carlo with Domain Transformation for Efficient Fourier
Pricing of Multi-Asset Options Efficiently pricing multi-asset options poses a significant challenge in
quantitative finance. Fourier methods leverage the regularity properties of the
integrand in the Fourier domain to accurately and rapidly value options that
typically lack regularity in the physical domain. However, most of the existing
Fourier approaches face hurdles in high-dimensional settings due to the tensor
product (TP) structure of the commonly employed numerical quadrature
techniques. To overcome this difficulty, this work advocates using the
randomized quasi-MC (RQMC) quadrature to improve the scalability of Fourier
methods with high dimensions. The RQMC technique benefits from the smoothness
of the integrand and alleviates the curse of dimensionality while providing
practical error estimates. Nonetheless, the applicability of RQMC on the
unbounded domain, $\mathbb{R}^d$, requires a domain transformation to
$[0,1]^d$, which may result in singularities of the transformed integrand at
the corners of the hypercube, and hence deteriorate the performance of RQMC. To
circumvent this difficulty, we design an efficient domain transformation
procedure based on boundary growth conditions on the transformed integrand. The
proposed transformation preserves sufficient regularity of the original
integrand for fast convergence of the RQMC method. To validate our analysis, we
demonstrate the efficiency of employing RQMC with an appropriate transformation
to evaluate options in the Fourier space for various pricing models, payoffs,
and dimensions. Finally, we highlight the computational advantage of applying
RQMC over MC or TP in the Fourier domain, and over MC in the physical domain
for options with up to 15 assets.
Christian BayerChiheb Ben HammoudaAntonis PapapantoleonMichael SametRaúl Temponehttp://arxiv.org/abs/2409.18757v22025-04-21T05:07:47Z2024-09-27T13:49:16Z$L_2$-approximation using randomized lattice algorithms We propose a randomized lattice algorithm for approximating multivariate
periodic functions over the $d$-dimensional unit cube from the weighted Korobov
space with mixed smoothness $\alpha > 1/2$ and product weights
$\gamma_1,\gamma_2,\ldots\in [0,1]$. Building upon the deterministic lattice
algorithm by Kuo, Sloan, and Wo\'{z}niakowski (2006), we incorporate a
randomized quadrature rule by Dick, Goda, and Suzuki (2022) to accelerate the
convergence rate. This randomization involves drawing the number of points for
function evaluations randomly, and selecting a good generating vector for
rank-1 lattice points using the randomized component-by-component algorithm. We
prove that our randomized algorithm achieves a worst-case root mean squared
$L_2$-approximation error of order $M^{-\alpha/2 - 1/8 + \varepsilon}$ for an
arbitrarily small $\varepsilon > 0$, where $M$ denotes the maximum number of
function evaluations, and that the error bound is independent of the dimension
$d$ if the weights satisfy $\sum_{j=1}^\infty \gamma_j^{1/\alpha} < \infty$.
Our upper bound converges faster than a lower bound on the worst-case
$L_2$-approximation error for deterministic rank-1 lattice-based approximation
proved by Byrenheid, K\"{a}mmerer, Ullrich, and Volkmer (2017). We also show a
lower error bound of order $M^{-\alpha/2-1/2}$ for our randomized algorithm,
leaving a slight gap between the upper and lower bounds open for future
research.
Mou CaiTakashi GodaYoshihito Kazashimajor revision, 24 pageshttp://arxiv.org/abs/2407.00359v62025-04-21T04:24:58Z2024-06-29T08:12:36ZMulticriteria Optimization and Decision Making: Principles, Algorithms
and Case Studies Real-world decision and optimization problems, often involve constraints and
conflicting criteria. For example, choosing a travel method must balance speed,
cost, environmental footprint, and convenience. Similarly, designing an
industrial process must consider safety, environmental impact, and cost
efficiency. Ideal solutions where all objectives are optimally met are rare;
instead, we seek good compromises and aim to avoid lose-lose scenarios.
Multicriteria optimization offers computational techniques to compute Pareto
optimal solutions, aiding decision analysis and decision making. This reader
offers an introduction to this topic and has been developed on the basis of the
revised edition of the reader for the MSc computer science course
"Multicriteria Optimization and Decision Analysis" at the Leiden Institute of
Advanced Computer Science, Leiden University, The Netherlands. This course was
taught annually by the first author from 2007 to 2023 as a single semester
course with lectures and practicals. Our aim was to make the material
accessible to MSc students who do not study mathematics as their core
discipline by introducing basic numerical analysis concepts when necessary and
providing numerical examples for interesting cases. The introduction is
organized in a unique didactic manner developed by the authors, starting from
more simple concepts such as linear programming and single-point methods, and
advancing from these to more difficult concepts such as optimality conditions
for nonlinear optimization and set-oriented solution algorithms. Besides, we
focus on the mathematical modeling and foundations rather than on specific
algorithms, though not excluding the discussion of some representative examples
of solution algorithms.
Michael EmmerichAndré Deutz102 pages, Lecture noteshttp://arxiv.org/abs/2504.14790v12025-04-21T01:33:56Z2025-04-21T01:33:56ZEnhanced Data-driven Topology Design Methodology with Multi-level Mesh
and Correlation-based Mutation for Stress-related Multi-objective
Optimization Topology optimization (TO) serves as a widely applied structural design
approach to tackle various engineering problems. Nevertheless,
sensitivity-based TO methods usually struggle with solving strongly nonlinear
optimization problems. By leveraging high capacity of deep generative model,
which is an influential machine learning technique, the sensitivity-free
data-driven topology design (DDTD) methodology is regarded as an effective
means of overcoming these issues. The DDTD methodology depends on initial
dataset with a certain regularity, making its results highly sensitive to
initial dataset quality. This limits its effectiveness and generalizability,
especially for optimization problems without priori information. In this
research, we proposed a multi-level mesh DDTD-based method with
correlation-based mutation module to escape from the limitation of the quality
of the initial dataset on the results and enhance computational efficiency. The
core is to employ a correlation-based mutation module to assign new geometric
features with physical meaning to the generated data, while utilizing a
multi-level mesh strategy to progressively enhance the refinement of the
structural representation, thus avoiding the maintenance of a high
degree-of-freedom (DOF) representation throughout the iterative process. The
proposed multi-level mesh DDTD-based method can be driven by a low quality
initial dataset without the need for time-consuming construction of a specific
dataset, thus significantly increasing generality and reducing application
difficulty, while further lowering computational cost of DDTD methodology.
Various comparison experiments with the traditional sensitivity-based TO
methods on stress-related strongly nonlinear problems demonstrate the
generality and effectiveness of the proposed method.
Jun YangShintaro Yamasaki23 pages, 22 figures