http://arxiv.org/abs/2409.01272v1
A prony method variant which surpasses the Adaptive LMS filter in the output signal's representation of input
2024-09-02T14:10:26Z
The Prony method for approximating signals comprising sinusoidal/exponential components is known through the pioneering work of Prony in his seminal dissertation in the year 1795. However, the Prony method saw the light of real world application only upon the advent of the computational era, which made feasible the extensive numerical intricacies and labor which the method demands inherently. The Adaptive LMS Filter which has been the most pervasive method for signal filtration and approximation since its inception in 1965 does not provide a consistently assured level of highly precise results as the extended experiment in this work proves. As a remedy this study improvises upon the Prony method by observing that a better (more precise) computational approximation can be obtained under the premise that adjustment can be made for computational error, in the autoregressive model setup in the initial step of the Prony computation itself. This adjustment is in proportion to the deviation of the coefficients in the same autoregressive model. The results obtained by this improvisation live up to the expectations of obtaining consistency and higher value in the precision of the output (recovered signal) approximations as shown in this current work and as compared with the results obtained using the Adaptive LMS Filter.
2024-09-02T14:10:26Z
Parthasarathy Srinivasan
10.5121/sipij.2024.15401

http://arxiv.org/abs/2408.11074v3
cpp11armadillo: An R Package to Use the Armadillo C++ Library
2024-09-01T00:17:42Z
This article introduces 'cpp11armadillo', a new R package that integrates the powerful Armadillo C++ library for linear algebra into the R programming environment.
Targeted primarily at social scientists and other non-programmers, this article explains the computational benefits of moving code to C++ in terms of speed and syntax. We provide a comprehensive overview of Armadillo's capabilities, highlighting its user-friendly syntax akin to MATLAB and its efficiency for computationally intensive tasks. The 'cpp11armadillo' package simplifies a part of the process of using C++ within R by offering additional ease of integration for those who require high-performance linear algebra operations in their R workflows. This work aims to bridge the gap between computational efficiency and accessibility, making advanced linear algebra operations more approachable for R users without extensive programming backgrounds.
2024-08-19T00:07:39Z
23 pages, 0 figures
Mauricio Vargas Sepúlveda, Jonathan Schneider Malamud
10.1016/j.softx.2025.102087

http://arxiv.org/abs/2409.00244v1
TorchDA: A Python package for performing data assimilation with deep learning forward and transformation functions
2024-08-30T20:30:34Z
Data assimilation techniques are often confronted with challenges handling complex high dimensional physical systems, because high precision simulation in complex high dimensional physical systems is computationally expensive and the exact observation functions that can be applied in these systems are difficult to obtain. It prompts growing interest in integrating deep learning models within data assimilation workflows, but current software packages for data assimilation cannot handle deep learning models inside. This study presents a novel Python package seamlessly combining data assimilation with deep neural networks to serve as models for state transition and observation functions. The package, named TorchDA, implements Kalman Filter, Ensemble Kalman Filter (EnKF), 3D Variational (3DVar), and 4D Variational (4DVar) algorithms, allowing flexible algorithm selection based on application requirements.
Comprehensive experiments conducted on the Lorenz 63 and a two-dimensional shallow water system demonstrate significantly enhanced performance over standalone model predictions without assimilation. The shallow water analysis validates data assimilation capabilities mapping between different physical quantity spaces in either full space or reduced order space. Overall, this innovative software package enables flexible integration of deep learning representations within data assimilation, conferring a versatile tool to tackle complex high dimensional dynamical systems across scientific domains.
2024-08-30T20:30:34Z
Sibo Cheng, Jinyang Min, Che Liu, Rossella Arcucci

http://arxiv.org/abs/2408.13434v1
The applicability of equal area partitions of the unit sphere
2024-08-24T02:12:31Z
This paper addresses the idea of the applicability of mathematics, using, as a case study, a construction and software package that partition the unit sphere into regions of equal area. The paper assesses the applicability of this construction and software by examining citing works, including papers, dissertations and software.
2024-08-24T02:12:31Z
29 pages, 24 figures, accepted by Journal of Approximation Software
Paul Leopardi

http://arxiv.org/abs/2408.13420v1
PySLSQP: A transparent Python package for the SLSQP optimization algorithm modernized with utilities for visualization and post-processing
2024-08-24T01:24:11Z
PySLSQP is a seamless interface for using the SLSQP algorithm from Python. It wraps the original SLSQP Fortran code sourced from the SciPy repository and provides a host of new features to improve the research utility of the original algorithm.
Some of the additional features offered by PySLSQP include auto-generation of unavailable derivatives using finite differences, independent scaling of the problem variables and functions, access to internal optimization data, live-visualization, saving optimization data from each iteration, warm/hot restarting of optimization, and various other utilities for post-processing.
2024-08-24T01:24:11Z
9 pages with 2 figures. For associated code, see https://github.com/anugrahjo/PySLSQP
Anugrah Jo Joshy (University of California San Diego), John T. Hwang (University of California San Diego)

http://arxiv.org/abs/2312.12967v2
Implementation of the Emulator-based Component Analysis
2024-08-23T02:45:24Z
We present a PyTorch-powered implementation of the emulator-based component analysis used for ill-posed numerical non-linear inverse problems, where an approximate emulator for the forward problem is known. This emulator may be a numerical model, an interpolating function, or a fitting function such as a neural network. With the help of the emulator and a data set, the method seeks dimensionality reduction by projection in the variable space so that maximal variance of the target (response) values of the data is covered. The obtained basis set for projection in the variable space defines a subspace of the greatest response for the outcome of the forward problem. The method allows for the reconstruction of the coordinates in this subspace for an approximate solution to the inverse problem. We present an example of using the code provided as a Python class.
2023-12-20T12:14:25Z
Anton Vladyka, Eemeli A. Eronen, Johannes Niskanen
10.1016/j.jocs.2024.102437

http://arxiv.org/abs/2408.11880v1
RAO-SS: A Prototype of Run-time Auto-tuning Facility for Sparse Direct Solvers
2024-08-21T03:05:27Z
In this paper, a run-time auto-tuning method for performance parameters according to input matrices is proposed.
RAO-SS (Run-time Auto-tuning Optimizer for Sparse Solvers), which is a prototype of auto-tuning software using the proposed method, is also evaluated. The RAO-SS is implemented with the Autopilot, which is middle-ware to support run-time auto-tuning with fuzzy logic function. The target numerical library is the SuperLU, which is a sparse direct solver for linear equations. The result indicated that: (1) the speedup factors of 1.2 for average and 3.6 for maximum to default executions were obtained; (2) the software overhead of the Autopilot can be ignored in RAO-SS.
2024-08-21T03:05:27Z
Takahiro Katagiri, Yoshinori Ishii, Hiroki Honda

http://arxiv.org/abs/2408.11040v1
Solving the Convex Flow Problem
2024-08-20T17:48:38Z
In this paper, we introduce the solver ConvexFlows for the convex flow problem first defined in the authors' previous work. In this problem, we aim to optimize a concave utility function depending on the flows over a graph. However, unlike the classic network flows literature, we also allow for a concave relationship between the input and output flows of edges. This nonlinear gain describes many physical phenomena, including losses in power network transmission lines. We outline an efficient algorithm for solving this problem which parallelizes over the graph edges. We provide an open source implementation of this algorithm in the Julia programming language package ConvexFlows.jl. This package includes an interface to easily specify these flow problems. We conclude by walking through an example of solving for an optimal power flow using ConvexFlows.
2024-08-20T17:48:38Z
Theo Diamandis, Guillermo Angeris

http://arxiv.org/abs/2408.08840v1
ideal.II: a Galerkin Space-Time Extension to the Finite Element Library deal.II
2024-08-16T16:46:42Z
The C++ library deal.II provides classes and functions to solve stationary problems with finite elements on one- to three-dimensional domains.
It also supports the typical way to solve time-dependent problems using time-stepping schemes, either with an implementation by hand or through the use of external libraries like SUNDIALS. A different approach is the usage of finite elements in time as well, which results in space-time finite element schemes. The library ideal.II (short for instationary deal.II) aims to extend deal.II to simplify implementations of the second approach.
2024-08-16T16:46:42Z
Jan Philipp Thiele

http://arxiv.org/abs/2409.00013v1
CEopt: A MATLAB Package for Non-convex Optimization with the Cross-Entropy Method
2024-08-15T23:53:50Z
This paper introduces CEopt (https://ceopt.org), a MATLAB tool leveraging the Cross-Entropy method for non-convex optimization. Due to the relative simplicity of the algorithm, it provides a kind of transparent "gray-box" optimization solver, with intuitive control parameters. Unique in its approach, CEopt effectively handles both equality and inequality constraints using an augmented Lagrangian method, offering robustness and scalability for moderately sized complex problems. Through select case studies, the package's applicability and effectiveness in various optimization scenarios are showcased, marking CEopt as a practical addition to optimization research and application toolsets.
2024-08-15T23:53:50Z
Americo Cunha, Marcos Vinicius Issa, Julio Cesar Basilio, José Geraldo Telles Ribeiro

http://arxiv.org/abs/2101.11003v2
FDApy: a Python package for functional data
2024-08-12T08:43:35Z
We introduce FDApy, an open-source Python package for the analysis of functional data. The package provides tools for the representation of (multivariate) functional data defined on different dimensional domains and for functional data that is irregularly sampled. Additionally, dimension reduction techniques are implemented for multivariate and/or multidimensional functional data that are regularly or irregularly sampled. A toolbox for generating functional datasets is also provided.
The documentation includes installation and usage instructions, examples on simulated and real datasets and a complete description of the API. FDApy is released under the MIT license. The code and documentation are available at https://github.com/StevenGolovkine/FDApy.
2021-01-26T10:07:33Z
18 pages, 11 figures
Steven Golovkine
10.21105/joss.07526

http://arxiv.org/abs/2408.04846v1
UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEs
2024-08-09T03:46:35Z
Numerical solvers of Partial Differential Equations (PDEs) are of fundamental significance to science and engineering. To date, the historical reliance on legacy techniques has circumscribed possible integration of big data knowledge and exhibits sub-optimal efficiency for certain PDE formulations, while data-driven neural methods typically lack mathematical guarantee of convergence and correctness. This paper articulates a mathematically rigorous neural solver for linear PDEs. The proposed UGrid solver, built upon the principled integration of U-Net and MultiGrid, manifests a mathematically rigorous proof of both convergence and correctness, and showcases high numerical accuracy, as well as strong generalization power to various input geometry/values and multiple PDE formulations. In addition, we devise a new residual loss metric, which enables unsupervised training and affords more stability and a larger solution space over the legacy losses.
2024-08-09T03:46:35Z
Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024
Xi Han, Fei Hou, Hong Qin

http://arxiv.org/abs/2405.06056v2
Hybrid parallel discrete adjoints in SU2
2024-08-07T22:21:57Z
The open-source multiphysics suite SU2 features discrete adjoints by means of operator overloading automatic differentiation (AD). While both primal and discrete adjoint solvers support MPI parallelism, hybrid parallelism using both MPI and OpenMP has only been introduced for the primal solvers so far.
In this work, we enable hybrid parallel discrete adjoint solvers. Coupling SU2 with OpDiLib, an add-on for operator overloading AD tools that extends AD to OpenMP parallelism, marks a key step in this endeavour. We identify the affected parts of SU2's advanced AD workflow and discuss the required changes and their tradeoffs. Detailed performance studies compare MPI parallel and hybrid parallel discrete adjoints in terms of memory and runtime and unveil key performance characteristics. We showcase the effectiveness of performance optimizations and highlight perspectives for future improvements. At the same time, this study demonstrates the applicability of OpDiLib in a large code base and its scalability on large test cases, providing valuable insights for future applications both within and beyond SU2.
2024-05-09T19:00:48Z
28 pages, 9 figures, 2 listings; new layout, revised section structure, polishing and small updates
Johannes Blühdorn, Pedro Gomes, Max Aehle, Nicolas R. Gauger
10.1016/j.compfluid.2024.106528

http://arxiv.org/abs/2307.11752v2
OpenLB User Guide: Associated with Release 1.6 of the Code
2024-08-07T20:25:42Z
OpenLB is an object-oriented implementation of LBM. It is the first implementation of a generic platform for LBM programming, which is shared with the open source community (GPLv2). Since the first release in 2007, the code has been continuously improved and extended, which is documented by thirteen releases as well as the corresponding release notes, which are available on the OpenLB website (https://www.openlb.net). The OpenLB code is written in C++ and is used by application programmers as well as developers, with the ability to implement custom models. OpenLB supports complex data structures that allow simulations in complex geometries and parallel execution using MPI, OpenMP and CUDA on high-performance computers.
The source code uses the concepts of interfaces and templates, so that efficient, direct and intuitive implementations of the LBM become possible. The efficiency and scalability have been checked and proved by code reviews. This user manual and a source code documentation by DoxyGen are available on the OpenLB project website.
2023-05-17T22:47:34Z
Adrian Kummerländer, Samuel J. Avis, Halim Kusumaatmaja, Fedor Bukreev, Michael Crocoll, Davide Dapelo, Simon Großmann, Nicolas Hafen, Shota Ito, Julius Jeßberger, Eliane Kummer, Jan E. Marquardt, Johanna Mödl, Tim Pertzel, František Prinz, Florian Raichle, Martin Sadric, Maximilian Schecher, Dennis Teutscher, Stephan Simonis, Mathias J. Krause

http://arxiv.org/abs/2408.03452v1
Matrix-Free Finite Volume Kernels on a Dataflow Architecture
2024-08-06T21:18:51Z
Fast and accurate numerical simulations are crucial for designing large-scale geological carbon storage projects ensuring safe long-term CO2 containment as a climate change mitigation strategy. These simulations involve solving numerous large and complex linear systems arising from the implicit Finite Volume (FV) discretization of PDEs governing subsurface fluid flow. Compounded with highly detailed geomodels, solving linear systems is computationally and memory expensive, and accounts for the majority of the simulation time. Modern memory hierarchies are insufficient to meet the latency and bandwidth needs of large-scale numerical simulations. Therefore, exploring algorithms that can leverage alternative and balanced paradigms, such as dataflow and in-memory computing, is crucial. This work introduces a matrix-free algorithm to solve FV-based linear systems using a dataflow architecture to significantly minimize memory latency and bandwidth bottlenecks.
Our implementation achieves two orders of magnitude speedup compared to a GPGPU-based reference implementation, and up to 1.2 PFlops on a single dataflow device.
2024-08-06T21:18:51Z
arXiv admin note: substantial text overlap with arXiv:2304.11274
Ryuichi Sai, Francois P. Hamon, John Mellor-Crummey, Mauricio Araya-Polo