https://arxiv.org/api/qMC5/2cw0z8M297ZGKqvi9cQj7c 2026-06-10T01:23:56Z 10652 30 15 http://arxiv.org/abs/2605.29213v2 Multifidelity Proper Orthogonal Decomposition 2026-06-07T22:18:21Z

This paper introduces a multifidelity formulation that reduces the computational cost of the proper orthogonal decomposition (POD) of a high-fidelity model by leveraging data from cheaper, lower-fidelity models. POD is a prevalent technique for extracting a low-dimensional basis from training data to achieve subsequent dimension reduction or reduced-order modeling. In scientific and engineering applications, the training data are typically numerical snapshot solutions of a high-fidelity model, and computation of a sufficiently rich snapshot set can be prohibitively expensive, especially when sampling over a high-dimensional parameter space. Insufficient snapshot training data risks overfitting and poor generalizability of the POD basis to outside the training regime. Our multifidelity POD (MFPOD) formulation reallocates computational budget to cheaper, low-fidelity models that can be sampled more extensively. MFPOD then weights high- and low-fidelity snapshot data via a control-variate formulation to guarantee an unbiased estimate of the expected high-fidelity least-squares projection error. The MFPOD subspace is chosen to minimize the estimate of this projection error, and converges in probability to the same subspace as single-fidelity POD in the limit of an arbitrarily large budget. For restrictive computational budgets, the MFPOD cost function has (under some assumptions) lower variance than the POD cost function, which makes the MFPOD subspace more robust against variations in the training data and thus less prone to overfitting. For a numerical example modeling the velocity of the Pine Island glacier, MFPOD achieves the same accuracy as single-fidelity POD with an order of magnitude reduction in the offline computational cost of snapshot generation.

2026-05-28T00:52:27Z Nicole Aretz Karen Willcox http://arxiv.org/abs/2606.08822v1 Unstructured Mesh Tools for Fusion Energy System Design 2026-06-07T20:26:30Z

The execution of accurate simulations of fusion energy systems requires the appropriate representation of critical component geometries as well as the coupling of complex fusion physics codes with one another and with engineering analysis tools. This paper examines the challenges of creating simulation workflows that fully leverage existing fusion research codes while integrating them with commercial computer-aided engineering (CAE) software. Key areas addressed include: (a) the construction and meshing of analysis geometries taking full advantage of available geometric modeling and meshing technologies; (b) the effective coupling of fusion physics and engineering analysis codes; and (c) the support for simulation workflows that couple particle and continuum modeling methods.

2026-06-07T20:26:30Z Mark S. Shephard Jacob S. Merson Onkar Sahni Cameron W. Smith Usman Riaz Fuad Hasan Aditya Y. Joshi Dhyanjyoti D. Nath Abhiyan Paudel http://arxiv.org/abs/2606.08796v1 A Non-Overlapping Schwarz Hybrid Finite Element-Neural Operator Framework for Solid Mechanics on Irregular Domains 2026-06-07T19:26:53Z

Finite element (FE) methods are the benchmark for solid mechanics simulations, yet their computational cost becomes prohibitive for problems with localised nonlinearities, fine-scale features, or long-time dynamic evolution. In our earlier FE-neural operator (FE-NO) hybrid framework [1], physics-informed deep operator networks were coupled with FE solvers through overlapping domain decomposition with Dirichlet-Dirichlet interface exchange, accelerating intensive subdomains while preserving FE fidelity elsewhere. Two limitations remained: the overlapping formulation required redundant interface computations that increased inner Schwarz iteration counts, and the convolutional feature extractor restricted the NO subdomain to structured grids, precluding irregular geometries. A non-overlapping Schwarz alternating method with Neumann-Dirichlet interface exchange replaces it, transmitting traction from the NO to FE rather than displacement. This eliminates the overlap layer and reduces inner Schwarz iterations while maintaining bounded error accumulation across all tested time horizons. For arbitrarily shaped subdomains, a Point-DeepONet operates on unstructured FE point clouds without interpolation, extending it to non-convex and irregular geometries. Strain and stress operators are derived analytically from the displacement operators via kinematic equations, rather than as independent networks, reducing trainable parameter sets while enforcing mechanical consistency by construction. The framework is validated on three benchmarks: static linear elasticity, quasi-static hyperelasticity, and elastodynamics with regular and irregular geometries. These results establish a non-overlapping FE-NO coupling paradigm that is geometry-flexible, parameter-efficient, and convergence-stable, providing a pathway for hybrid physics-based and operator-learning solvers in large-scale dynamic solid mechanics.

2026-06-07T19:26:53Z 40 Pages,19 figures Wei Wang Abhinav Gupta Haihui Ruan Somdatta Goswami http://arxiv.org/abs/2606.08711v1 Evaluating Operators for Acoustic Wave Simulation Correction 2026-06-07T16:07:42Z

Correcting numerical dispersion artifacts from Finite Difference solvers is a well-identified challenge in computational wave physics, but existing approaches evaluate only a restricted family of CNN-based architectures and have been applied exclusively to the elastic wave equation. We instantiate the Deep Finite Difference framework on two-dimensional anisotropic acoustic wave propagation, pairing a fourth-order Finite Difference proxy with a Pseudo-Spectral reference over 27,000 heterogeneous velocity fields. We benchmark twelve correction architectures, from linear regression to Fourier Neural Operators, under a unified 10-fold cross-validation protocol.

2026-06-07T16:07:42Z Pascal Tribel Gianluca Bontempi http://arxiv.org/abs/2604.14618v2 A Stable SBP-SAT FDTD Subgridding Method Without Region Split 2026-06-07T15:54:32Z

A provably stable summation-by-parts simultaneous approximation term (SBP-SAT) finite-difference time-domain (FDTD) subgridding method without region split is proposed. By designing projection SBP operators tailored for embedded topological features and deriving the corresponding SAT boundary conditions, this approach guarantees long-time stability through discrete energy analysis. Unlike conventional SBP-SAT FDTD subgridding techniques that rely on aligned or multi-block configurations, the proposed method enables a direct coupling between an internal refined region and a single surrounding coarse-grid domain without introducing auxiliary blocks or causing domain fragmentation. Numerical results validate the efficiency, accuracy, and topological flexibility of the proposed method. Compared with existing multi-block SBP-SAT methods, this method effectively reduces computational complexity by minimizing SAT boundary conditions and improves calculation accuracy near grid interfaces.

2026-04-16T04:57:27Z 13 pages, 14 figures Yuhui Wang Langran Deng Weibo Wu Hanhong Liu Xinyue Zhang Xingqi Zhang Jian Wang Wei-Jie Wang Zhizhang Chen Shunchuan Yang http://arxiv.org/abs/2606.08379v1 TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution 2026-06-07T00:20:29Z

This study addresses the optimal execution of large stock sell programs by introducing TT-DAC-PS (Twin-Target Deterministic Actor-Critic with Policy Smoothing), a deterministic actor-critic architecture that combines twin exponential-moving-average critic targets with pessimistic min backup, TD3-style target policy smoothing noise, delayed actor updates, and conservative Q regularisation to curb overestimation. Exploration uses Ornstein-Uhlenbeck (OU) noise with a hybrid schedule: deterministic episode-wise decay, variance-guided adjustment based on recent reward dispersion, and a Soft Actor-Critic (SAC)-style temperature that is learned and mapped to the noise scale. The environment integrates Almgren-Chriss (AC) trade impact with Limit Order Book (LOB) prices and volumes, normalised state features, per-step volume participation caps, and a utility-based reward. The trade execution algorithm is applied to LOB data for ten U.S. stocks. Performance is assessed against reinforcement-learning baseline algorithms, including Proximal Policy Optimisation (PPO), Soft Actor-Critic (SAC), and Advantage Actor-Critic (A2C), as well as alternative trade execution algorithms, including Time-Weighted Average Price (TWAP), Volume-Weighted Average Price (VWAP), and AC. The proposed model consistently reduces mean implementation shortfall percentage with competitive variance, outperforming classical baselines and standard reinforcement-learning benchmark models.

2026-06-07T00:20:29Z 21 pages, 1 figure, 3 tables Ilia Zaznov Atta Badii Julian Kunkel Alfonso Dufour http://arxiv.org/abs/2606.08345v1 AuditFraudBench: Benchmarking Audit Judgment in Detecting Fraudulent Misstatements 2026-06-06T21:27:01Z

Large language models (LLMs) have shown strong performance in financial analysis and surface-level factual error detection, yet their ability to identify fraudulent financial misinformation in audited corporate reporting remains underexplored. Existing financial and audit benchmarks mainly focus on factual verification, numerical reasoning, rule compliance, or audit workflows, but rarely evaluate misleading disclosure narratives or management explanations that obscure the true drivers of reported performance. We introduce AuditFraudBench, an enforcement-grounded benchmark constructed from authentic company filings and regulatory materials, including original and restated 10-K and 10-Q filings, structured financial statements, MD&A disclosures, and SEC Accounting and Auditing Enforcement Releases (AAERs). AuditFraudBench contains three tasks: Profit Source Attribution, Misleading Narrative Detection, and Fraud Pattern Classification, which evaluate whether models can identify the true source of reported performance, detect misleading disclosure framing, and classify misconduct mechanisms into known manipulation patterns. We evaluate GPT, DeepSeek, and Qwen series LLMs on the benchmark. Results show that both proprietary and open models still struggle to jointly reason over financial figures, disclosure framing, restatement evidence, and enforcement-grounded fraud mechanisms. AuditFraudBench provides a challenging testbed for audit-relevant, evidence-grounded evaluation of LLMs in financial reporting.

2026-06-06T21:27:01Z Work in progress Zhiwei Liu Yueru He Qing Ou Tianlei Zhu Xiaorui Guo Xueqing Peng Sophia Ananiadou http://arxiv.org/abs/2606.08287v1 Mesh Graph Neural Network Framework for Accelerating Finite Element Simulation for Arbitrary Geometries 2026-06-06T18:17:08Z

Finite element analysis (FEA) is essential for structural design but remains computationally expensive, particularly when evaluating multiple design iterations or load scenarios. Machine learning surrogate models offer a promising alternative, yet most approaches struggle with a critical limitation: generalizing across varying geometries. This work presents a mesh graph network (MGN) for predicting von Mises stress fields in 2D structural components with arbitrary hole geometries. Unlike traditional machine learning approaches that use absolute node coordinates as features, the proposed model builds on existing MGN frameworks that encode node types (e.g., fixed boundary, free surface, hole edge), relative edge features (distance between neighbors), and global features (applied load). This architecture is inherently translation- and rotation-invariant, enabling generalization to unseen geometries without retraining. The MGN was trained on 11 plate geometries under 20 load conditions and evaluated on 7 unseen geometries and 3 unseen loads. In the most favorable case, the model achieves $R^2 \geq 0.97$ on an unseen geometry and unseen load, compared to $R^2 \approx 0.01$--$0.86$ for conventional models (Random Forest, Gradient Boosting , K-Nearest Neighbors) trained on identical data. However, even in less favorable cases, the MGN model still outperforms conventional models. This work extends the mesh-based simulation framework of Pfaff et al. (arXiv:2010.03409) to structural mechanics, demonstrating that graph neural networks can serve as efficient surrogates for finite element analysis across varying geometries.

2026-06-06T18:17:08Z 10 pages, 6 figures, to be published. Code available at https://github.com/Josiah-Kunz/MGN-Public Josiah D. Kunz Kamal Choudhary http://arxiv.org/abs/2606.08285v1 Beyond Agent Architecture: Execution Assumptions and Reproducibility in LLM-Based Trading Systems 2026-06-06T18:14:29Z

Large language models (LLMs) and agentic systems are increasingly proposed for financial trading, yet their reported performance remains difficult to compare because studies vary in data provenance, temporal split discipline, execution timing, turnover treatment, and transaction-cost modeling. This article presents a targeted topical review and reproducibility audit of execution realism in LLM-based trading research. A coded evidence matrix covering 30 trade-relevant primary studies is used to assess point-in-time controls, split transparency, held-out evaluation, cost and turnover treatment, execution semantics, universe definition, and artifact release. Across the audited sample, architecture reporting is generally clearer than the evaluation assumptions needed to judge whether a trading result is economically interpretable or reproducible. A 10-equity worked example is included only as a methodological scaffold to illustrate how explicit friction and timing choices can materially compress active-strategy results. The main conclusion is that the next useful step for LLM trading research is not only better agent design, but also clearer reporting standards for execution realism, reproducibility, and evaluation comparability.

2026-06-06T18:14:29Z Junyi Yao Zihao Zheng http://arxiv.org/abs/2601.01279v3 Supracompetitive Pricing Under AI Monoculture 2026-06-06T05:43:36Z

When competing sellers delegate pricing to a shared AI model, such as a large language model, correlated recommendations combined with performance-driven updates aggregating seller feedback raise a key question: can standard AI deployment practices inadvertently produce supracompetitive pricing? We develop a stylized duopoly model in which two sellers receive pricing recommendations from a shared AI characterized by two parameters: a propensity parameter capturing the model's tendency to set high prices and an output-fidelity parameter measuring alignment between this tendency and actual outputs, with propensity updated via periodic retraining on observed outcomes. We find that configuring AI models for robustness and reproducibility can lead to supracompetitive pricing via a phase transition. Below a critical output-fidelity threshold, competitive pricing is the unique stable outcome. Above it, the model exhibits bistability: both competitive and supracompetitive pricing are locally stable, with the realized outcome determined by the model's initial propensity. Supracompetitive pricing raises average prices, but occasional low-price recommendations complicate detection. With perfect output fidelity, full price coordination emerges from any interior initial propensity. For finite training batches of size $b$, when the initial propensity lies in the supracompetitive basin, the probability of supracompetitive pricing approaches 1 as $b$ increases, with the region of indeterminate outcomes shrinking at rate $O(1/\sqrt{b})$. Any factor reducing alignment between the model's propensity and sellers' actual pricing, whether through diversifying AI providers, introducing recommendation noise, or reducing seller adherence, pushes the market toward competitive outcomes.

2026-01-03T20:38:21Z 46 pages Shengyu Cao Ming Hu http://arxiv.org/abs/2506.19094v5 Accurate identification of communication between multiple interacting neural populations 2026-06-06T00:11:40Z

Neural recording technologies now enable simultaneous recording of population activity across many brain regions, motivating the development of data-driven models of inter-regional communication. However, existing models can struggle to disentangle the influences that drive recorded population activity, leading to inaccurate portraits of communication. Here, we introduce Multi-Region Latent Factor Analysis via Dynamical Systems (MR-LFADS), a sequential variational autoencoder designed to disentangle inter-regional communication, inputs from unobserved regions, and local neural population dynamics. We show that MR-LFADS outperforms existing approaches at identifying communication across dozens of simulations of task-trained multi-region networks. When applied to large-scale electrophysiology, MR-LFADS predicts brain-wide effects of circuit perturbations that were held out during model fitting. These validations on synthetic and real neural data position MR-LFADS as a promising tool for discovering principles of brain-wide information processing.

2025-06-23T20:15:29Z Forty-second International Conference on Machine Learning (2025) Belle Liu Jacob Sacks Matthew D. Golub http://arxiv.org/abs/2606.07898v1 Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling 2026-06-05T23:16:38Z

High-resolution regional climate simulations provide critical information for climate impacts assessments but remain computationally expensive, motivating the development of machine-learning downscalers and emulators. A key challenge is determining how limited high-resolution simulations should be distributed across a changing climate trajectory to capture both forced climate response and internal variability. Using the CESM2 Large Ensemble over the western United States, we compare three training-year selection strategies under fixed data budgets: a contiguous block of historical years, years drawn from both the beginning and end of the simulation period, and years distributed throughout the full climate trajectory. Including both historical and future years consistently outperforms training on historical years alone, demonstrating the importance of exposing downscaling models to climate states outside the historical record and highlighting limitations of stationarity assumptions common in statistical downscaling. Training on years distributed throughout the full climate trajectory performs best overall, indicating that broad sampling of internal variability provides additional information beyond exposure to the forced climate response alone. Models trained on temporally distributed subsets more successfully reproduce variability in unseen ensemble members while retaining strong performance across a wide range of climate diagnostics. Even when trained on only one-tenth of the available high-resolution years, temporally distributed models remain highly competitive with full-data training. These results suggest that, under fixed computational budgets, broad sampling of climate states is more valuable than temporal continuity when allocating scarce high-resolution simulations. The findings provide practical guidance for regional climate downscaling and large-ensemble projection workflows.

2026-06-05T23:16:38Z 22 pages, 8 figures Karandeep Singh Stefan Rahimi Chad W. Thackeray Stephen Cropper Alex Hall http://arxiv.org/abs/2606.07876v1 Optimal Wiener-Filter Solutions for Denoising of Graph Signals on Directed Graphs 2026-06-05T22:19:06Z

Graph signal processing has opened new avenues to the canonical denoising problem in interesting settings. Specifically, here we propose a Wiener-filter solution for graph signals on directed graphs. Under various stationarity assumptions combining uncorrelated and correlated noise conditions, we show optimal solutions, including a successful proof-of-concept for temperature graph.

2026-06-05T22:19:06Z This work was accepted to be presented at the Graph Signal Processing Workshop 2026 Chun Hei Michael Chan Alexandre Cionca Dimitri Van De Ville http://arxiv.org/abs/2606.07463v1 Amortized Neural Optimization for Pre-Layout Signal Integrity Design Space Exploration using Differentiable Surrogates 2026-06-05T17:13:44Z

Pre-layout design space exploration (DSE) for high-speed signal integrity (SI) analysis is often limited by the computational cost of simulations and iterative optimization algorithms within modern electronic design automation (EDA) workflows. While machine learning surrogate models accelerate the simulation step, optimizing designs still requires utilizing iterative black-box search methods. This iterative nature scales poorly, making multi-corner sweeps computationally expensive. As a solution, this paper proposes amortized neural optimization (ANO) for pre-layout SI design. ANO entirely eliminates iterative black-box inference by utilizing fully differentiable neural network surrogate models. ANO extracts analytical gradients from the surrogate to train a global optimization policy. Instead of solving the optimization problem repeatedly at inference, the optimization process is learned offline and therefore amortized. Once the ANO policy is trained, it maps different channel contexts directly to near-optimal design parameters in a single deterministic forward pass. The efficiency and accuracy of the ANO framework are demonstrated based on three complex SI design scenarios, including DDR5 decision feedback equalization (DFE), 9-dimensional SerDes Tx/Rx co-equalization, and DDR3 DQS differential pair routing to optimize eye diagram metrics under intra-pair skew constraints. By trading roughly 10% in optimality compared to instance-specific black-box algorithms, it realizes speedups of three to four orders of magnitude. For a large-scale 320,000-instance multi-corner SerDes sweep optimization, ANO collapses what would have taken days of computation using iterative search algorithms into a single batched forward pass that completes in milliseconds. This transforms computationally expensive SI optimization into real-time and interactive pre-layout DSE.

2026-06-05T17:13:44Z 16 pages, 20 figures, 8 tables Julian Withöft Werner John Emre Ecik Ralf Brüning Jürgen Götze http://arxiv.org/abs/2606.07442v1 Tracing Stablecoin Contagion during the USDC Depeg after the Silicon Valley Bank Collapse 2026-06-05T16:41:06Z

The March 2023 collapse of Silicon Valley Bank (SVB) disrupted the core premise of stablecoins, which are digital tokens designed to maintain a fixed value against the U.S. dollar and serve as on-chain substitutes for dollar liquidity. The event triggered a sharp depeg of USDC, creating a rare exogenous shock to the stablecoin ecosystem. While price deviations during this crisis are well documented, the underlying behavioral reorganization of on-chain activity remains less understood. Here, we analyze high-granularity transaction data to measure the shock's effects on network activities, volumes, and prices, reconstructing the contagion pathway from market-wide synchronization down to account-level reallocation. By extracting phase dynamics, we first show that transaction activity across major stablecoins became strongly synchronized during the crisis window, indicating a collective market-level response. We then uncover a bifurcated contagion pathway. While USDT, WBTC, and WETH reacted primarily as liquidity absorption channels with larger trade volumes, only USDC-related assets exhibited immediate price responses alongside surging transaction counts. This reflects the dominant role of USDC-related assets in this incident and their immediate behavioral connection to user panic, driving a mass reallocation from single-coin to multi-coin portfolios. Finally, governed by persistent intraday time-zone rhythms and balance-size heterogeneity, these findings provide a comprehensive empirical framework for understanding systemic risk and flight-to-quality mechanisms in fractional-reserve digital asset networks.

2026-06-05T16:41:06Z Krongtum Sankaewtong Stefan Kitzler Bernhard Haslhofer Yuichi Ikeda