https://arxiv.org/api/WtyoJoXkjQYUQpfFSX09dnOCwfM 2026-06-22T15:07:13Z 2664 465 15 http://arxiv.org/abs/2409.01272v1 A prony method variant which surpasses the Adaptive LMS filter in the output signal's representation of input 2024-09-02T14:10:26Z

The Prony method for approximating signals comprising sinusoidal/exponential components is known through the pioneering work of Prony in his seminal dissertation in the year 1795. However, the Prony method saw the light of real world application only upon the advent of the computational era, which made feasible the extensive numerical intricacies and labor which the method demands inherently. The Adaptive LMS Filter which has been the most pervasive method for signal filtration and approximation since its inception in 1965 does not provide a consistently assured level of highly precise results as the extended experiment in this work proves. As a remedy this study improvises upon the Prony method by observing that a better (more precise) computational approximation can be obtained under the premise that adjustment can be made for computational error , in the autoregressive model setup in the initial step of the Prony computation itself. This adjustment is in proportion to the deviation of the coefficients in the same autoregressive model. The results obtained by this improvisation live up to the expectations of obtaining consistency and higher value in the precision of the output (recovered signal) approximations as shown in this current work and as compared with the results obtained using the Adaptive LMS Filter.

2024-09-02T14:10:26Z Parthasarathy Srinivasan 10.5121/sipij.2024.15401 http://arxiv.org/abs/2408.11074v3 cpp11armadillo: An R Package to Use the Armadillo C++ Library 2024-09-01T00:17:42Z

This article introduces 'cpp11armadillo', a new R package that integrates the powerful Armadillo C++ library for linear algebra into the R programming environment. Targeted primarily at social scientists and other non-programmers, this article explains the computational benefits of moving code to C++ in terms of speed and syntax. We provide a comprehensive overview of Armadillo's capabilities, highlighting its user-friendly syntax akin to MATLAB and its efficiency for computationally intensive tasks. The 'cpp11armadillo' package simplifies a part of the process of using C++ within R by offering additional ease of integration for those who require high-performance linear algebra operations in their R workflows. This work aims to bridge the gap between computational efficiency and accessibility, making advanced linear algebra operations more approachable for R users without extensive programming backgrounds.

2024-08-19T00:07:39Z 23 pages, 0 figures Mauricio Vargas Sepúlveda Jonathan Schneider Malamud 10.1016/j.softx.2025.102087. http://arxiv.org/abs/2409.00244v1 TorchDA: A Python package for performing data assimilation with deep learning forward and transformation functions 2024-08-30T20:30:34Z

Data assimilation techniques are often confronted with challenges handling complex high dimensional physical systems, because high precision simulation in complex high dimensional physical systems is computationally expensive and the exact observation functions that can be applied in these systems are difficult to obtain. It prompts growing interest in integrating deep learning models within data assimilation workflows, but current software packages for data assimilation cannot handle deep learning models inside. This study presents a novel Python package seamlessly combining data assimilation with deep neural networks to serve as models for state transition and observation functions. The package, named TorchDA, implements Kalman Filter, Ensemble Kalman Filter (EnKF), 3D Variational (3DVar), and 4D Variational (4DVar) algorithms, allowing flexible algorithm selection based on application requirements. Comprehensive experiments conducted on the Lorenz 63 and a two-dimensional shallow water system demonstrate significantly enhanced performance over standalone model predictions without assimilation. The shallow water analysis validates data assimilation capabilities mapping between different physical quantity spaces in either full space or reduced order space. Overall, this innovative software package enables flexible integration of deep learning representations within data assimilation, conferring a versatile tool to tackle complex high dimensional dynamical systems across scientific domains.

2024-08-30T20:30:34Z Sibo Cheng Jinyang Min Che Liu Rossella Arcucci http://arxiv.org/abs/2408.13434v1 The applicability of equal area partitions of the unit sphere 2024-08-24T02:12:31Z

This paper addresses the idea of the applicability of mathematics, using, as a case study, a construction and software package that partition the unit sphere into regions of equal area. The paper assesses the applicability of this construction and software by examining citing works, including papers, dissertations and software.

2024-08-24T02:12:31Z 29 pages, 24 figures, accepted by Journal of Approximation Software Paul Leopardi http://arxiv.org/abs/2408.13420v1 PySLSQP: A transparent Python package for the SLSQP optimization algorithm modernized with utilities for visualization and post-processing 2024-08-24T01:24:11Z

PySLSQP is a seamless interface for using the SLSQP algorithm from Python. It wraps the original SLSQP Fortran code sourced from the SciPy repository and provides a host of new features to improve the research utility of the original algorithm. Some of the additional features offered by PySLSQP include auto-generation of unavailable derivatives using finite differences, independent scaling of the problem variables and functions, access to internal optimization data, live-visualization, saving optimization data from each iteration, warm/hot restarting of optimization, and various other utilities for post-processing.

2024-08-24T01:24:11Z 9 pages with 2 figures. For associated code, see https://github.com/anugrahjo/PySLSQP Anugrah Jo Joshy University of California San Diego John T. Hwang University of California San Diego http://arxiv.org/abs/2312.12967v2 Implementation of the Emulator-based Component Analysis 2024-08-23T02:45:24Z

We present a PyTorch-powered implementation of the emulator-based component analysis used for ill-posed numerical non-linear inverse problems, where an approximate emulator for the forward problem is known. This emulator may be a numerical model, an interpolating function, or a fitting function such as a neural network. With the help of the emulator and a data set, the method seeks dimensionality reduction by projection in the variable space so that maximal variance of the target (response) values of the data is covered. The obtained basis set for projection in the variable space defines a subspace of the greatest response for the outcome of the forward problem. The method allows for the reconstruction of the coordinates in this subspace for an approximate solution to the inverse problem. We present an example of using the code provided as a Python class.

2023-12-20T12:14:25Z Anton Vladyka Eemeli A. Eronen Johannes Niskanen 10.1016/j.jocs.2024.102437 http://arxiv.org/abs/2408.11880v1 RAO-SS: A Prototype of Run-time Auto-tuning Facility for Sparse Direct Solvers 2024-08-21T03:05:27Z

In this paper, a run-time auto-tuning method for performance parameters according to input matrices is proposed. RAO-SS (Run-time Auto-tuning Optimizer for Sparse Solvers), which is a prototype of auto-tuning software using the proposed method, is also evaluated. The RAO-SS is implemented with the Autopilot, which is middle-ware to support run-time auto-tuning with fuzzy logic function. The target numerical library is the SuperLU, which is a sparse direct solver for linear equations. The result indicated that: (1) the speedup factors of 1.2 for average and 3.6 for maximum to default executions were obtained; (2) the software overhead of the Autopilot can be ignored in RAO-SS.

2024-08-21T03:05:27Z Takahiro Katagiri Yoshinori Ishii Hiroki Honda http://arxiv.org/abs/2408.11040v1 Solving the Convex Flow Problem 2024-08-20T17:48:38Z

In this paper, we introduce the solver ConvexFlows for the convex flow problem first defined in the authors' previous work. In this problem, we aim to optimize a concave utility function depending on the flows over a graph. However, unlike the classic network flows literature, we also allow for a concave relationship between the input and output flows of edges. This nonlinear gain describes many physical phenomena, including losses in power network transmission lines. We outline an efficient algorithm for solving this problem which parallelizes over the graph edges. We provide an open source implementation of this algorithm in the Julia programming language package ConvexFlows.jl. This package includes an interface to easily specify these flow problems. We conclude by walking through an example of solving for an optimal power flow using ConvexFlows.

2024-08-20T17:48:38Z Theo Diamandis Guillermo Angeris http://arxiv.org/abs/2408.08840v1 ideal.II: a Galerkin Space-Time Extension to the Finite Element Library deal.II 2024-08-16T16:46:42Z

The C++ library deal.II provides classes and functions to solve stationary problems with finite elements on one- to threedimensional domains. It also supports the typical way to solve time-dependent problems using time-stepping schemes, either with an implementation by hand or through the use of external libraries like SUNDIALS. A different approach is the usage of finite elements in time as well, which results in space-time finite element schemes. The library ideal.II (short for instationary deal.II) aims to extend deal.II to simplify implementations of the second approach.

2024-08-16T16:46:42Z Jan Philipp Thiele http://arxiv.org/abs/2409.00013v1 CEopt: A MATLAB Package for Non-convex Optimization with the Cross-Entropy Method 2024-08-15T23:53:50Z

This paper introduces CEopt (https://ceopt.org), a MATLAB tool leveraging the Cross-Entropy method for non-convex optimization. Due to the relative simplicity of the algorithm, it provides a kind of transparent ``gray-box'' optimization solver, with intuitive control parameters. Unique in its approach, CEopt effectively handles both equality and inequality constraints using an augmented Lagrangian method, offering robustness and scalability for moderately sized complex problems. Through select case studies, the package's applicability and effectiveness in various optimization scenarios are showcased, marking CEopt as a practical addition to optimization research and application toolsets.

2024-08-15T23:53:50Z Americo Cunha Marcos Vinicius Issa Julio Cesar Basilio José Geraldo Telles Ribeiro http://arxiv.org/abs/2101.11003v2 FDApy: a Python package for functional data 2024-08-12T08:43:35Z

We introduce FDApy, an open-source Python package for the analysis of functional data. The package provides tools for the representation of (multivariate) functional data defined on different dimensional domains and for functional data that is irregularly sampled. Additionally, dimension reduction techniques are implemented for multivariate and/or multidimensional functional data that are regularly or irregularly sampled. A toolbox for generating functional datasets is also provided. The documentation includes installation and usage instructions, examples on simulated and real datasets and a complete description of the API. FDApy is released under the MIT license. The code and documentation are available at https://github.com/StevenGolovkine/FDApy.

2021-01-26T10:07:33Z 18 pages, 11 figures Steven Golovkine 10.21105/joss.07526 http://arxiv.org/abs/2408.04846v1 UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEs 2024-08-09T03:46:35Z

Numerical solvers of Partial Differential Equations (PDEs) are of fundamental significance to science and engineering. To date, the historical reliance on legacy techniques has circumscribed possible integration of big data knowledge and exhibits sub-optimal efficiency for certain PDE formulations, while data-driven neural methods typically lack mathematical guarantee of convergence and correctness. This paper articulates a mathematically rigorous neural solver for linear PDEs. The proposed UGrid solver, built upon the principled integration of U-Net and MultiGrid, manifests a mathematically rigorous proof of both convergence and correctness, and showcases high numerical accuracy, as well as strong generalization power to various input geometry/values and multiple PDE formulations. In addition, we devise a new residual loss metric, which enables unsupervised training and affords more stability and a larger solution space over the legacy losses.

2024-08-09T03:46:35Z Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024 Xi Han Fei Hou Hong Qin http://arxiv.org/abs/2405.06056v2 Hybrid parallel discrete adjoints in SU2 2024-08-07T22:21:57Z

The open-source multiphysics suite SU2 features discrete adjoints by means of operator overloading automatic differentiation (AD). While both primal and discrete adjoint solvers support MPI parallelism, hybrid parallelism using both MPI and OpenMP has only been introduced for the primal solvers so far. In this work, we enable hybrid parallel discrete adjoint solvers. Coupling SU2 with OpDiLib, an add-on for operator overloading AD tools that extends AD to OpenMP parallelism, marks a key step in this endeavour. We identify the affected parts of SU2's advanced AD workflow and discuss the required changes and their tradeoffs. Detailed performance studies compare MPI parallel and hybrid parallel discrete adjoints in terms of memory and runtime and unveil key performance characteristics. We showcase the effectiveness of performance optimizations and highlight perspectives for future improvements. At the same time, this study demonstrates the applicability of OpDiLib in a large code base and its scalability on large test cases, providing valuable insights for future applications both within and beyond SU2.

2024-05-09T19:00:48Z 28 pages, 9 figures, 2 listings; new layout, revised section structure, polishing and small updates Johannes Blühdorn Pedro Gomes Max Aehle Nicolas R. Gauger 10.1016/j.compfluid.2024.106528 http://arxiv.org/abs/2307.11752v2 OpenLB User Guide: Associated with Release 1.6 of the Code 2024-08-07T20:25:42Z

OpenLB is an object-oriented implementation of LBM. It is the first implementation of a generic platform for LBM programming, which is shared with the open source community (GPLv2). Since the first release in 2007, the code has been continuously improved and extended which is documented by thirteen releases as well as the corresponding release notes which are available on the OpenLB website (https://www.openlb.net). The OpenLB code is written in C++ and is used by application programmers as well as developers, with the ability to implement custom models OpenLB supports complex data structures that allow simulations in complex geometries and parallel execution using MPI, OpenMP and CUDA on high-performance computers. The source code uses the concepts of interfaces and templates, so that efficient, direct and intuitive implementations of the LBM become possible. The efficiency and scalability has been checked and proved by code reviews. This user manual and a source code documentation by DoxyGen are available on the OpenLB project website.

2023-05-17T22:47:34Z Adrian Kummerländer Samuel J. Avis Halim Kusumaatmaja Fedor Bukreev Michael Crocoll Davide Dapelo Simon Großmann Nicolas Hafen Shota Ito Julius Jeßberger Eliane Kummer Jan E. Marquardt Johanna Mödl Tim Pertzel František Prinz Florian Raichle Martin Sadric Maximilian Schecher Dennis Teutscher Stephan Simonis Mathias J. Krause http://arxiv.org/abs/2408.03452v1 Matrix-Free Finite Volume Kernels on a Dataflow Architecture 2024-08-06T21:18:51Z

Fast and accurate numerical simulations are crucial for designing large-scale geological carbon storage projects ensuring safe long-term CO2 containment as a climate change mitigation strategy. These simulations involve solving numerous large and complex linear systems arising from the implicit Finite Volume (FV) discretization of PDEs governing subsurface fluid flow. Compounded with highly detailed geomodels, solving linear systems is computationally and memory expensive, and accounts for the majority of the simulation time. Modern memory hierarchies are insufficient to meet the latency and bandwidth needs of large-scale numerical simulations. Therefore, exploring algorithms that can leverage alternative and balanced paradigms, such as dataflow and in-memory computing is crucial. This work introduces a matrix-free algorithm to solve FV-based linear systems using a dataflow architecture to significantly minimize memory latency and bandwidth bottlenecks. Our implementation achieves two orders of magnitude speedup compared to a GPGPU-based reference implementation, and up to 1.2 PFlops on a single dataflow device.

2024-08-06T21:18:51Z arXiv admin note: substantial text overlap with arXiv:2304.11274 Ryuichi Sai Francois P. Hamon John Mellor-Crummey Mauricio Araya-Polo