https://arxiv.org/api/2kCDKoJV0kIUN5CwfLMvcxlO8FE 2026-06-10T18:23:09Z 10652 315 15 http://arxiv.org/abs/2605.09284v1 Semi-Supervised Neural Super-Resolution for Mesh-Based Simulations 2026-05-10T03:17:17Z

Mesh-based simulations provide high-fidelity solutions to partial differential equations (PDEs), but achieving such accuracy typically requires fine meshes, leading to substantial computational overhead. Super-resolution techniques aim to mitigate this cost by reconstructing high-resolution (HR), high-fidelity solutions from low-cost, low-resolution (LR) counterparts. However, training neural networks for super-resolution often demands large amounts of expensive HR supervision data. To address this challenge, we propose SuperMeshNet, an HR data-efficient super-resolution framework for mesh-based simulations aided by message passing neural networks (MPNNs). At its core, SuperMeshNet introduces complementary learning, a semi-supervised approach that effectively leverages both 1) a small amount of paired LR-HR data and 2) abundant unpaired LR data via two jointly trained, complementary MPNN-based models. Additionally, our model is enriched by inductive biases, which are empirically shown to further improve super-resolution performance. Extensive experiments demonstrate that SuperMeshNet requires 90% less HR data to achieve even lower root mean square error (RMSE) than that of the fully supervised benchmark without the inductive biases. The source code and datasets are available at https://github.com/jykim-git/SuperMeshNet.git.

2026-05-10T03:17:17Z International Conference on Machine Learning (ICML 2026) (to appear) (Please cite our conference version.) Jiyeon Kim Youngjoon Hong Won-Yong Shin http://arxiv.org/abs/2605.09265v1 Agentic AI for Particle-Based Simulation: Automating SPH Workflows for Debris Flow Modeling 2026-05-10T02:23:04Z

Physics-based simulation underpins engineering analysis but remains difficult to deploy in practice due to complex setup, parameterization, and interpretation. While Large Language Model-based agentic systems have shown promise in automating engineering computing workflows, they have primarily targeted structured, mesh-based problems. We present the first agentic AI workflow for meshless simulation in computational mechanics, demonstrated on debris flow modeling using Smoothed Particle Hydrodynamics (SPH) with the software DualSPHysics. By integrating tool orchestration, multimodal inputs (text and sketches), and human-in-the-loop interaction, the framework enables end-to-end simulation workflows for a class of problems that are inherently less structured and more challenging to automate. Results show that multimodal inputs not only enhance user experience but also reduces failure modes over text-only descriptions. Human-in-the-loop is critical for resolving ambiguities and handling SPH-specific configurations. We further introduce a cognitive-task-based evaluation of post-processing, showing strong performance in visualization and data extraction, with remaining gaps in higher-level SPH-specific physical reasoning that are amenable to improvement through domain-aware modeling. These results establish the viability of agentic AI for particle-based simulation and underscore its potential to transform the accessibility and efficiency of computational mechanics workflows.

2026-05-10T02:23:04Z 24 pages, 10 figures, 5 tables Danrong Zhang Ruijia Wang Chenying Liu Yumeng Zhao http://arxiv.org/abs/2605.09185v1 AutoRedTrader: Autonomous Red Teaming of Trading Agents through Synthetic Misinformation Injection 2026-05-09T21:51:08Z

LLM-based financial agents increasingly rely on both numerical market data and textual signals for sequential trading and stock prediction. However, financial misinformation often appears as subtle textual perturbations rather than explicit falsehoods, making it difficult to detect while still capable of significantly altering agent reasoning and decisions. To study this risk, we propose AutoRedTrader, an autonomous red-teaming framework that generates finance-specific misinformation through behavioral bias manipulation, minor textual perturbations, and rewriting strategies, with agent feedback used to strengthen attacks over time. We evaluate AutoRedTrader in a POMDP-based financial agent simulation environment, and further examine a time-series-informed grounding setting for robustness analysis. The framework enables systematic evaluation of how subtle misinformation affects financial agents and whether historical market evidence can stabilize decisions under misleading textual signals. We evaluate the framework on Bitcoin transaction data. The results show that AutoRedTrader achieves the strongest attack performance with 69.00% misinformation exposure rate and 26.67% attack success rate, outperforming general-purpose misinformation and red-teaming baselines. Ablation studies further show that all modules contribute to generating retrievable and decision-effective financial misinformation.

2026-05-09T21:51:08Z Work in progress Zhiwei Liu Yangyang Yu Yupeng Cao Yuechen Jiang Haohang Li Zhuoran Lu Yuyan Wang Yixiang Zheng Xiaorui Guo Calvin Yixiang Cheng Sophia Ananiadou http://arxiv.org/abs/2605.09117v1 Rao-Blackwellized Markov chain Monte Carlo Light Transport 2026-05-09T19:04:28Z

In light transport simulation, Markov chain Monte Carlo methods are particularly effective at exploring regions with complex lighting characteristics. However, estimator variance is a central concern across Monte Carlo methods in general. In light transport, high variance directly manifests as increased noise or, equivalently, longer rendering times at fixed image quality. Variance reduction techniques based on Rao-Blackwellization have proven particularly effective. In practice, however, the RB approach traditionally used in light transport, waste-recycling, can yield little to no measurable variance reduction, a fact we empirically confirm in this work. Motivated by this lack of effective variance reduction, we introduce a novel RB technique for the general-purpose Metropolis-Hastings algorithm that is computationally efficient and achieves substantial variance reduction. We show that this method consistently outperforms waste-recycling in terms of both variance reduction and convergence speed. Building on this result, we adapt the proposed RB approach to the recently introduced general-purpose Jump Restore algorithm, where it similarly achieves substantial variance reduction and accelerated convergence. Through extensive experiments in light transport simulation, we demonstrate that our \gls{rb} technique significantly outperforms the traditional approaches for both MH-based light transport algorithms and Jump Restore Light Transport, under both equal-time and equal-sample-count comparisons.

2026-05-09T19:04:28Z Sascha Holl Gurprit Singh Hans-Peter Seidel http://arxiv.org/abs/2605.09099v1 GraphNetz: Statistical Benchmarking of Graph Neural Networks with Paired Tests and Rank Aggregation 2026-05-09T18:15:00Z

Graph Neural Networks (GNNs) benchmarks often report single point estimates, even when performance differences are small relative to variation across random seeds, train/test splits, and datasets. Confidence intervals, paired comparisons, multiple-comparison correction, and rank-based aggregation are standard statistical tools, but they are rarely the default output of graph-learning benchmark suites. We introduce GraphNetz, a benchmarking framework whose default output is a structured statistical report rather than a raw accuracy table. GraphNetz currently includes 63 dataset loaders, four task types, and five canonical GNN architectures, while also supporting custom datasets and models. The framework standardizes multi-seed evaluation and automatically returns per-cell confidence intervals, Holm-corrected paired tests, and Friedman-Nemenyi critical-difference diagrams across tasks. In a cross-category benchmark over ten heterogeneous tasks, apparent rank differences among four canonical node-level encoders fall within a single Nemenyi clique, indicating that none is significantly better than the others at $α= 0.05$. GraphNetz therefore provides researchers with a reproducible computational and statistical pipeline to benchmark new graph-learning methods against standard architectures, over different tasks and a wide set of applications, while reporting principled statistical evidence for benchmarking which accounts for seed uncertainty. This framework is set to serve the graph-learning community with a reproducible and honest model comparison ready to be added to papers.

2026-05-09T18:15:00Z Kleyton da Costa Bernardo Modenesi http://arxiv.org/abs/2605.09097v1 An Overlapping Schwarz Space-Time Refinement Framework for Material Point Method 2026-05-09T18:05:34Z

We propose an overlapping Schwarz space-time refinement framework for the material point method (OS-MPM) to improve computational efficiency in problems with strongly localized deformation, contact, and large geometric nonlinearity. The method decomposes the domain into overlapping coarse and fine subdomains with heterogeneous spatial and temporal resolutions, while retaining standard MPM discretizations within each subdomain. Coarse-fine coupling is achieved through an MPM-specific Schwarz iteration combining mass-weighted spatial transmission and temporal interpolation for sub-cycling. In contrast to refinement strategies based on modified basis functions, transition kernels, or strongly enforced interface constraints, the proposed approach preserves the modular structure of standard MPM and shifts the coupling complexity to nonmatching-grid interface operators within the Schwarz alternating procedure. Numerical examples, including a gravity-driven cantilever beam, Hertzian contact, and an elastic inclusion problem, show that the method reproduces analytical or fine-resolution reference solutions with good accuracy and convergence behavior. In the inclusion benchmark, the proposed framework achieves comparable or slightly lower error than single-domain fine simulations at the finest tested resolutions, while reducing computational cost by up to 9.15 times. A three-dimensional folding example further demonstrates the generality of the framework. These results indicate that the proposed method provides an accurate, modular, and efficient route for local space-time refinement in MPM.

2026-05-09T18:05:34Z Zhaofeng Luo Minchen Li Yupeng Jiang http://arxiv.org/abs/2605.21507v1 Visibility nowcasting in South Korea: a machine learning approach to class imbalance and distribution shift 2026-05-09T16:58:22Z

Atmospheric visibility is a critical variable for transportation safety and air quality management, however, accurate prediction remains challenging due to the complex interactions between meteorological conditions and air pollutants, as well as the rarity of low-visibility events. This study introduces a machine learning framework to nowcast visibility in six major South Korean cities. To handle the imbalance in the 2018-2020 training data, we applied the Synthetic Minority Over-sampling Technique with Nominal and Continuous (SMOTENC) and Conditional Tabular Generative Adversarial Network (CTGAN). An ensemble approach combining machine learning and deep learning models was then used and evaluated on a 2021 test dataset. The results revealed a marked decline in predictive performance in the test set compared to the cross-validation phase. This degradation was attributed to a distributional shift between training and testing periods, which was quantitatively confirmed by measuring the Wasserstein distance of the most influential feature identified by SHAP analysis. In general, this study presents a methodology that aims to simultaneously address the dual challenges of data imbalance and temporal distributional shifts, and emphasizes the necessity of accounting for evolving external environmental factors when implementing nowcasting models on time-series data.

2026-05-09T16:58:22Z Published in Theoretical and Applied Climatology Theoretical and Applied Climatology, vol. 157, art. no. 283, 2026 Bong Gyun Shin Chan Sik Lee Hyesun Suh 10.1007/s00704-026-06219-6 http://arxiv.org/abs/2605.08988v1 Benchmarking Compositional Generalisation for Machine Learning Interatomic Potentials 2026-05-09T15:12:59Z

Machine Learning Interatomic Potentials play a fundamental role in computational chemistry and materials science, enabling applications from molecular dynamics simulations to drug design and materials discovery. While recent approaches can estimate inter-atomic forces with high precision, it remains unclear to what extent they can generalise to previously unseen molecules. Do they learn the compositional structure of chemistry, capturing how molecular fragments and their combinations determine properties, or do they primarily learn to interpolate patterns that are specific to the training examples? To address this question, we propose a benchmark consisting of four tasks that require some form of compositional generalisation. In each task, models are tested on molecules that were unseen during training, but the training data is chosen such that generalisation to the test examples should be feasible for models that learn the underlying physical principles. Our empirical analysis shows that the considered tasks are highly challenging for state-of-the-art models, with errors on out-of-distribution examples often an order of magnitude higher than on in-distribution examples, even when using foundation models that have been pre-trained on millions of molecules.

2026-05-09T15:12:59Z Amir Masoud Nourollah Irtaza Khalid Stefano Leoni Steven Schockaert http://arxiv.org/abs/2604.23548v2 Unsupervised Learning for AC Optimal Power Flow with Fast Physics-Aware Layer 2026-05-09T14:39:36Z

Learning to solve the Alternating Current Optimal Power Flow (AC-OPF) problem by neural networks (NNs) is a promising approach in real-time applications. Existing methods to ensure the physical feasibility of NN outputs embed a power flow (PF) solver within networks. However, the gradient through the PF solver, namely, implicit differentiation, needs manual Jacobian derivation and the solution of linear systems, which is computationally prohibitive and hinders integration with modern automatic differentiation (AD) frameworks. To address these challenges, we propose FPL-OPF, a novel unsupervised learning framework that incorporates a Fast Physics-aware Layer for AC-OPF problems. FPL-OPF embeds a fast PF iterative solver within the NN and takes solely the last few or even the final iterations into the AD graph. This design ensures high computational efficiency for both the forward and backward passes, circumventing complex custom backward implementations. Theoretically, we rigorously prove that the gradient from this design serves as a high-fidelity surrogate of the true implicit gradient under mild conditions. Extensive experiments demonstrate that FPL-OPF achieves significant speedups over state-of-the-art unsupervised learning approaches, while maintaining near-zero constraint violations and competitive optimality. Our code is available at https://github.com/wowotou1998/fpl-opf

2026-04-26T06:01:02Z Accepted at ACM e-Energy 2026 Jiebao Zhang Haoyu Yan Zhichao Sheng Hongwen Yu Shuang Ye Haoyu Wang Ye Shi 10.1145/3744255.3811718 http://arxiv.org/abs/2605.08976v1 Score-Based Generative Modeling through Anisotropic Stochastic Partial Differential Equations 2026-05-09T14:36:05Z

Score-based generative modeling (SBGM) has achieved state-of-the-art performance in image generation, with the quality of generated images being highly dependent on the design of the forward (diffusion) process. Among these, models based on stochastic differential equations (SDEs) have proven particularly effective. While traditional methods aim to progressively destroy all image information to enable reconstruction from pure noise, we propose a class of anisotropic stochastic partial differential equations (SPDEs) that preserve the geometric structure of the data over longer time scales throughout the transformation. These SPDEs consist of a drift term that enforces deterministic destruction via structured smoothing, and a diffusion coefficient that enables random destruction through noise injection. Both components are governed by anisotropy coefficients, enabling controlled, direction-dependent information degradation. This framework provides the theoretical foundation for a novel anisotropic score-based generative model. By retaining geometric structure for longer time scales, the backward generative process can exploit residual geometric cues, leading to improved reconstruction fidelity. We empirically validate this improvement in a proof-of-concept implementation on unconditional image generation, showing that anisotropic diffusion can achieve superior image quality metrics. We demonstrate consistent improvements in both pixel and latent space experiments over the SDE-driven baseline as well as over the state-of-the-art Flow Matching approach. Finally, we demonstrate the effectiveness of the introduced anisotropy in a conditional stroke-to-image generation task.

2026-05-09T14:36:05Z Sascha Holl Jente Vandersanden Gurprit Singh Hans-Peter Seidel http://arxiv.org/abs/2605.08924v1 PPI2Text: Captioning Protein-Protein Interactions with Coordinate-Aligned Pair-Map Decoding 2026-05-09T12:49:30Z

Protein-protein interaction (PPI) modeling has been widely studied as a binary or multi-label classification task. While emerging multimodal large language models (LLMs) can now describe single proteins, they remain unable to generate free-form descriptions of interactions between protein pairs. Moving beyond controlled vocabulary annotations, we propose to model PPI using free-text description, enabling richer expressiveness, improved interpretability, and better integration with literature knowledge base. We present PPI2Text, a multimodal LLM for free-form PPI captioning from amino acid sequences, that encodes each protein using ESM3 encoder, constructs a pair map from the two representations to capture interactions across all residue pairs, and autoregressively generates descriptions using a Qwen3 language decoder. We further introduce PaCo-RoPE, a coordinate-aligned positional encoding that aligns each axis of the pair grid with the residue positions of the corresponding protein. In addition, we release PPI2Text-Dataset, a 351k-pair corpus of free-form PPI descriptions aggregated from ten curated biological databases and further synthesized with Gemini under evidence-tiered prompting. PPI2Text consistently outperforms strong baselines across multiple ablation settings and evaluation protocols. It not only achieves higher scores on linguistic metrics against synthesized references, but also excels on factuality metrics, where an LLM-based judge evaluates outputs against raw biological evidence.

2026-05-09T12:49:30Z Xiao Fei Sarah Almeida Carneiro Yang Zhang Lawrence P. Petalidis Achilleas Tsortos Costas Bouyioukos Michalis Vazirgiannis http://arxiv.org/abs/2507.07107v2 Machine Learning Enhanced Multi-Factor Quantitative Trading: A Cross-Sectional Portfolio Optimization Approach with Bias Correction 2026-05-09T10:17:34Z

Rolling-window factor pipelines for Chinese A-share markets contain a subtle but costly flaw: daily price-move limits (+/-10% main-board, +/-20% STAR/ChiNext) render a fraction of closing prices non-executable, yet standard implementations ingest these values before any row-filtering runs. The contaminated aggregates propagate silently through moving averages, correlations, and ranks--a failure mode we term "upstream contamination". On real A-share data it inflates apparent information coefficient by 18% while reducing realised Sharpe by 0.44 points, because the model learns to predict returns it cannot trade. We resolve this with a mask-first design: a Boolean tradability mask is constructed at data load time and threaded through every operator, so that no window ever reads a non-tradable price. Built on this foundation, the system adds (i) a GPU-vectorised 213-factor engine via PyTorch unfold primitives (51x over pandas); (ii) an Adjusted-MSE loss penalising wrong-sign predictions 11x more heavily than magnitude errors; (iii) block-bootstrap GBM augmentation; and (iv) Markowitz-Ledoit-Wolf portfolio optimisation with cvxpy warm-start caching. On a calibrated 3,000-stock synthetic panel the system achieves annualised Sharpe 2.05; on proprietary real A-share data (2022-2024) it achieves Sharpe 1.63. Ablation shows the mask contract is the single largest contributor (+0.44), exceeding any model or loss choice. The full implementation is released under MIT licence at https://github.com/initial-d/ml-quant-trading.

2025-06-02T03:04:19Z 18 pages, 12 tables, 6 figures. Code at https://github.com/initial-d/ml-quant-trading. v2: rewritten abstract, added deflated Sharpe analysis (DSR=0.978), expanded limitations section, corrected factor count from 500-1000 to 213 Yimin Du http://arxiv.org/abs/2502.06238v2 XNet-Enhanced Deep BSDE Method and Numerical Analysis 2026-05-09T05:23:25Z

Semilinear parabolic partial differential equations (PDEs) are fundamental to modeling complex dynamical systems across scientific domains. The Deep Backward Stochastic Differential Equation (BSDE) method is a promising approach for high-dimensional PDEs; however, existing convergence results apply only to globally Lipschitz generators, excluding important cases such as Allen--Cahn and Hamilton--Jacobi--Bellman (HJB) equations. This paper presents both a theoretical and a computational advance for Deep BSDE methods. Theoretically, we establish the convergence theory for non--Lipschitz generators--covering Allen--Cahn equations with cubic nonlinearity and HJB equations with quadratic gradient growth--based on a bounded double--well lemma and a truncated-BSDE analysis within the Bouchard--Touzi--Zhang theory. Computationally, we instantiate the framework with XNet, a shallow architecture with $\mathcal O(L)$ parameters that preserves strong approximation while substantially reducing optimization and computational cost. Numerical experiments on 100--dimensional PDEs corroborate the predicted convergence behavior and demonstrate significant efficiency gains over standard feedforward implementations.

2025-02-10T08:18:04Z Xiaotao Zheng Xingye Yue Zhihong Xia Xin Li 10.1007/s10915-026-03237-4 http://arxiv.org/abs/2508.08441v3 SpectraLLM: Uncovering the Ability of LLMs for Molecular Structure Elucidation from Multi-Spectral Data 2026-05-09T03:36:43Z

Automated molecular structure elucidation remains challenging, as existing approaches often depend on pre-compiled databases or restrict themselves to single spectroscopic modalities. Here we introduce SpectraLLM, a large language model that performs end-to-end structure prediction by reasoning over one or multiple spectra. Unlike conventional spectrum-to-structure pipelines, SpectraLLM represents both continuous (IR, Raman, UV-Vis, NMR) and discrete (MS) modalities in a shared language space, enabling it to capture substructural patterns that are complementary across different spectral types. We pretrain and fine-tune the model on small-molecule domains and evaluate it on four public benchmark datasets. SpectraLLM achieves state-of-the-art performance, substantially surpassing single-modality baselines. Moreover, it demonstrates strong robustness in unimodal settings and further improves prediction accuracy when jointly reasoning over diverse spectra, establishing a scalable paradigm for language-based spectroscopic analysis. Code is available at https://github.com/OPilgrim/SpectraLLM.

2025-08-04T13:33:38Z 42 pages, 6 figures, 30 tables; Accepted to ICLR 2026 Proceedings of the 14th International Conference on Learning Representations (ICLR), 2026 Yunyue Su Jiahui Chen Zao Jiang Zhenyi Zhong Liang Wang Qiang Liu Zhaoxiang Zhang http://arxiv.org/abs/2508.04875v4 PriceFM: Foundation Model for Probabilistic Electricity Price Forecasting 2026-05-08T21:25:19Z

Electricity price forecasting in Europe presents unique challenges due to increasing renewable generation variability, market integration, and the continent's physically interconnected power system. While recent advances in foundation models have led to substantial improvements in general time series forecasting, most existing approaches do not incorporate prior graph knowledge from the transmission topology, which can limit their ability to exploit meaningful cross-region dependencies in interconnected power systems, motivating a domain-specific foundation model. In this paper, we address this gap by first introducing a comprehensive and up-to-date dataset across 24 European countries (38 regions), spanning from 2022-01-01 to 2026-01-01. Building on this groundwork, we propose PriceFM, a probabilistic foundation model pretrained on this large dataset. Specifically, PriceFM maps each region's price and exogenous features, including load, solar, and wind generation forecasts, into a comparable latent embedding via a shared Mixture-of-Experts (MoE) projection layer, then injects prior graph knowledge by constructing a sparse graph mask derived from transmission topology. Across a large-scale European benchmark, PriceFM achieves strong performance and demonstrates superior generalization compared with multiple competitive baselines. The results highlight the value of topology-guided forecasting with increasing renewable generation and strong cross-border interconnections. The methodology is available at: https://runyao-yu.github.io/PriceFM/.

2025-08-06T20:55:56Z 10 pages, 5 figures, 6 tables Runyao Yu Chenhui Gu Jochen Stiasny Qingsong Wen Wasim Sarwar Dilov Lianlian Qi Jochen L. Cremer