https://arxiv.org/api/74AjW2+I0bAJa3KxmPwe+KqhEq8 2026-06-10T09:53:15Z 10652 180 15 http://arxiv.org/abs/2512.10439v2 HypeR Adaptivity: Joint $hr$-Adaptive Meshing via Hypergraph Multi-Agent Deep Reinforcement Learning 2026-05-23T20:00:44Z

Adaptive mesh refinement is central to the efficient solution of partial differential equations (PDEs) via the finite element method (FEM). Classical $r$-adaptivity optimizes vertex positions but requires solving expensive auxiliary PDEs such as the Monge-Ampère equation, while classical $h$-adaptivity modifies topology through element subdivision but suffers from expensive error indicator computation and is constrained by isotropic refinement patterns that impose accuracy ceilings. Combined $hr$-adaptive techniques naturally outperform single-modality approaches, yet inherit both computational bottlenecks and the restricted cost-accuracy trade-off. Emerging machine learning methods for adaptive mesh refinement seek to overcome these limitations, but existing approaches address $h$-adaptivity or $r$-adaptivity in isolation. We present HypeR, a deep reinforcement learning framework that jointly optimizes mesh relocation and refinement. HypeR casts the joint adaptation problem using tools from hypergraph neural networks and multi-agent reinforcement learning. Refinement is formulated as a heterogeneous multi-agent Markov decision process (MDP) where element agents decide discrete refinement actions, while relocation follows an anisotropic diffusion-based policy on vertex agents with provable prevention of mesh tangling. The reward function combines local and global error reduction to promote general accuracy. Across benchmark PDEs, HypeR reduces approximation error by up to 6--10$\times$ versus state-of-art $h$-adaptive baselines at comparable element counts, breaking through the uniform refinement accuracy ceiling that constrains subdivision-only methods. The framework produces meshes with improved shape metrics and alignment to solution anisotropy, demonstrating that jointly learned $hr$-adaptivity strategies can substantially enhance the capabilities of automated mesh generation.

2025-12-11T09:02:33Z Niccolò Grillo James Rowbottom Pietro Liò Carola Bibiane Schönlieb Stefania Fresca http://arxiv.org/abs/2605.24682v1 Scalable High-Dimensional Bayesian Field Reconstruction with Finite Elements: Application to 3D Porous Media Flow 2026-05-23T17:38:51Z

We present a unified, finite-element-native variational inference framework for very high-dimensional Bayesian spatial field reconstruction in physics-based problems governed by partial differential equations (PDEs) that are nonlinear in the inferred parameters. The framework delivers a full-covariance Gaussian variational posterior, with a probabilistic treatment of all prior and likelihood hyperparameters, on a three-dimensional curved finite-element discretization at a stochastic field dimension exceeding 400000. To our knowledge, this is the first full-covariance variational reconstruction at this scale, complementing the low-rank Hessian-Laplace approaches that dominate extreme-scale Bayesian inversion. The spatial prior is derived from the stochastic PDE (SPDE) connection and formulated natively in terms of finite-element (FE) operators. The sparse Gaussian variational distribution is parameterized via its precision Cholesky factor, with the sparsity pattern inherited from the domain's Laplacian. Unlike covariance-based sparse parameterizations, which encode only short-range correlations, the sparse precision implicitly represents dense posterior covariances through its sparse inverse, yielding smooth, physically plausible samples at O(n) memory cost and enabling direct evidence-lower-bound (ELBO) gradients via the path-derivative (sticking-the-landing) estimator. Natural gradient strategies stabilize convergence, while a variational Bayes expectation-maximization (VB-EM) loop marginalizes all hyperparameters analytically and induces an automatic coarse-to-fine continuation. The framework is demonstrated on Bayesian permeability field reconstruction for a porous-media flow problem, recovering all major spatial features with high fidelity. Algorithmic ablation and comparison with alternative inference methods quantify the improvements over state-of-the-art baselines.

2026-05-23T17:38:51Z 36 pages, 11 figures Jonas Nitzler Maximilian Bergbauer Phaedon-Stelios Koutsourelakis Wolfgang A. Wall http://arxiv.org/abs/2605.24564v1 Summoning the Oracle to Slay It: Mitigating Look-Ahead Bias in Financial Backtesting with Large Language Models 2026-05-23T12:57:18Z

Backtesting large language models (LLMs) on historical financial data is unreliable because pre-training cuts off after the events happened. An LLM trained in 2024 already "knows" which way 2018-2020 stocks moved. We name this failure parametric look-ahead bias and propose FinCAD, an inference-time adaptation of Context-Aware Decoding that suppresses an LLM's memory of historical outcomes without retraining. FinCAD pairs an adversarial bias-discovery pipeline that learns a model-specific memory-activating prior prompt with an entity- and date-adaptive rule that scales the CAD strength to per-(entity, date) memorisation, so the penalty fires on memorised in-sample dates and decays to zero out-of-sample. Across five 7-14B LLMs and five mega-cap equities, FinCAD cuts in-sample backtest returns by up to -67.1% on memorised dates while leaving 2025 out-of-sample returns within $8K and Sharpe within 0.10 of baseline, and preserves general-purpose reasoning within 1.7 pts. On an eleven-model leaderboard, it raises the in-sample / out-of-sample Spearman correlation from +0.779 to +0.846, recovering rankings that genuinely predict out-of-sample performance.

2026-05-23T12:57:18Z Weixian Waylon Li Mengyu Wang Tiejun Ma http://arxiv.org/abs/2605.24464v1 Toward Secure Operation and Management (O&M) of Satellite Constellations: Efficiency, Resilience, and Reliability in a Network Perspective 2026-05-23T08:30:12Z

Satellite constellations equipped with Inter-Satellite Links and onboard packet switching enable real-time Operation and Management across globally distributed satellites, but also broaden the attack surface and introduce unprecedented cybersecurity threats. Existing efforts mainly focus on cryptography for single-satellite point-to-point links, without considering constellation-level security. To address this gap, this article extends security research in two directions: from individual satellites to constellation-wide architectures, and from isolated cryptography to system-level security incorporating efficiency, resilience, and reliability. These extensions raise three key questions: how to design efficient security mechanisms for dynamic constellation topologies with adaptive onboard routing; how a constellation O&M system can recover resiliently under worst-case failures of onboard security functions; and how to improve the reliability of onboard security functions under stringent resource constraints. To address these challenges, we first construct a constellation-wide hybrid security framework that protects semantically sensitive content fields using End-to-End encryption, while safeguarding routing-related fields through Moving Target Defense. Next, we introduce a ciphered-mode and safe-mode management mechanism with an M-delayed fallback that balances recovery timeliness and exploitability. Finally, we propose security-aware routers that manage plaintext/ciphered modes and coordinate access to a shared pool of onboard cipher modules, enabling redundancy sharing across multiple endpoints and extending secure operation duration in ciphered mode. These solutions comply with existing standards defined by organizations including DVB and the CCSDS, while translating conceptual security principles into practical system-level mechanisms.

2026-05-23T08:30:12Z Linan Huang Peilong Liu Xi Chen Zhiyuan Lin Jian Yan Linling Kuang http://arxiv.org/abs/2605.24462v1 No Certificate, No Execution: Certified Traces as a Foundation for Trustworthy AI Agents 2026-05-23T08:24:42Z

We argue that trustworthy AI agents, especially in high-stakes and policy-governed domains, should make execution conditional on certified traces rather than rely only on stronger generative models, output-level guardrails, or post-hoc audits. A generative agent may propose recommendations, tool calls, reports, or actions, but generation is not permission: an action may be computable yet impermissible, and individually permissible actions may compose into an impermissible trace. We formalize trustworthy agency through a \textbf{Proposal--Certification--Execution (PCE)} architecture: a probabilistic generating machine $M_G$ proposes candidate execution traces, a \textbf{Permissibility Machine} $M_Π$ certifies proposed traces under a policy system $Π$, and execution proceeds only for certified traces. The executable trace language is $L_{\mathrm{exec}} = L_G \cap L_{\mathrm{cert}}(M_Π)$. Before execution, a trace is a structured pre-execution record submitted for certification: it specifies intended steps, evidence, proposed tool calls, approvals, replayable computations, credentials, and execution conditions. This perspective complements chain-of-thought monitorability: visible reasoning may help detect misbehavior, but monitorability is not certifiability, and reasoning is only one component of a broader execution trace. The formal principle is simple: an agent-generated trace should execute only when it carries a checkable certificate witnessing permissibility under $Π$: \textbf{no certificate, no execution}. We develop certified traces and Permissibility Machines as foundations for trustworthy AI agents, connect trace certification to proof-carrying execution, proof memory, privacy, and zero-knowledge certificates, and propose evaluating agents by what generated traces can be safely certified for execution, not by output accuracy alone.

2026-05-23T08:24:42Z Xiao-Yang Liu Yanglet Xiaodong Wang Agostino Capponi http://arxiv.org/abs/2512.17064v2 Flux-Preserving Adaptive Finite State Projection for Multiscale Stochastic Reaction Networks 2026-05-23T05:17:57Z

The Finite State Projection (FSP) method approximates the Chemical Master Equation (CME) by restricting the dynamics to a finite subset of the (typically infinite) state space, enabling direct numerical solution with computable error bounds. Adaptive variants update this subset in time, but multiscale systems with widely separated reaction rates remain challenging, as low-probability bottleneck states can carry essential probability flux and the dynamics alternate between fast transients and slowly evolving stiff regimes. We propose a flux-based adaptive FSP method that uses probability flux to drive both state-space pruning and time-step selection. The pruning rule protects low-probability states with large outgoing flux, preserving connectivity in bottleneck systems, while the time-step rule adapts to the instantaneous total flux to handle rate constants spanning several orders of magnitude. Numerical experiments on stiff, oscillatory, and bottleneck reaction networks show that the method maintains accuracy while using substantially smaller state spaces.

2025-12-18T21:04:12Z Aditya Dendukuri Shivkumar Chandrasekaran Linda Petzold http://arxiv.org/abs/2604.04453v2 Generative modeling of granular flow on inclined planes using conditional flow matching 2026-05-23T04:55:22Z

Granular flows govern many natural and industrial processes, yet their interior kinematics and mechanics remain largely unobservable, as experiments access only boundaries or free surfaces. Conventional numerical simulations are computationally expensive for fast inverse reconstruction, and deterministic models tend to collapse to over-smoothed mean predictions in ill-posed settings. This study, to the best of the authors' knowledge, presents the first conditional flow matching (CFM) framework for granular-flow reconstruction from sparse boundary observations. Trained on high-fidelity particle-resolved discrete element simulations, the generative model is guided at inference by a differentiable forward operator and a novel sparsity-aware gradient guidance mechanism. This mechanism avoids the gradient dilution inherent to standard mean-squared-error approaches, preserves the absolute physical scale of observation errors, enforces measurement consistency without hyperparameter tuning, and prevents unphysical velocity predictions in non-material regions. A physics decoder maps the reconstructed velocity fields to stress states and energy fluctuation quantities, including mean stress, deviatoric stress, and granular temperature. The framework accurately recovers interior flow fields from full observation to only 16\% of the informative window, and it remains effective under strongly diluted spatial resolution with only 11% of data. It also outperforms a deterministic CNN baseline in the most ill-posed reconstruction regime and provides spatially resolved uncertainty estimates through ensemble generation. These results demonstrate that conditional generative modeling offers a practical route for non-invasive inference of hidden bulk mechanics in granular media, and it suggests potential applicability for inverse problems in particulate and multiphase systems.

2026-04-06T05:59:54Z Xuyang Li Rui Li Teng Man Yimin Lu http://arxiv.org/abs/2511.04556v2 Optimizing Sensor Placement for Flow Reconstruction in Urban Drainage Networks: A Digital Twin-Based Sparse Sensing Approach 2026-05-22T23:28:17Z

Urban flooding triggered by intense rainfall is becoming increasingly frequent and widespread. While flood prediction and monitoring in high spatio-temporal resolution are desired, practical constraints in time, budget, and technology hinder its full implementation. How to monitor urban drainage networks and predict flow conditions under constrained resources is a major challenge. To address this, we introduced a data-driven sparse sensing (DSS) approach, demonstrated via a digital-twin of the Woodland catchment in Duluth, Minnesota. Specifically, we coupled EPA-SWMM with singular value decomposition and QR factorization-based sensor selection to optimize monitoring locations for system-level flow reconstruction. An ensemble of SWMM simulations, driven by diverse scenarios, provided the necessary hydraulic data to extract the reduced basis and identify informative sensor locations. Cross-event validation showed that three strategically placed sensors among 77 candidate nodes achieved a mean system-level Nash-Sutcliffe efficiency (NSE) of 0.949 across observed storm events. The QR-selected sensor sets were benchmarked against reference sensor configurations obtained from exhaustive searches and Monte Carlo random-placements. This comparison further showed that flow reconstruction based on QR-selected sensors closely tracked the exhaustive optimum while substantially outperforming random placements. We further evaluated the framework's robustness by introducing multiplicative Gaussian noise and simulating individual sensor failures. While the model is relatively resilient to noise, the impact of sensor dropouts depends heavily on the number of sensors allocated and their specific locations.

2025-11-06T17:08:19Z 32 pages (including supplementary information), 11 figures. Submitted to Water Research. Partially presented at HydroML 2025 Symposium, Minnesota Water Resources Conference 2025, and AGU Fall Meeting 2025 Zihang Ding Amit Kumar Imran Md. Azizul Islam Mila Avellar Montezuma Ruihang Zhang Kun Zhang http://arxiv.org/abs/2605.23712v1 Operator Learning for Reconstructing Flow Fields from Sparse Measurements: a Language Model Approach 2026-05-22T14:56:05Z

Reconstructing flow fields from sparse measurements is a fundamental problem in fluid mechanics with broad implications for modeling, control, and design. In this work, we propose a novel operator learning framework that leverages the architecture of language models to perform flow reconstruction in a mesh-free manner. We reformulate flow field reconstruction as a sequence-to-sequence learning task, where sparse measurements are treated as context and unobserved locations as queries. Our model learns to reconstruct the full flow field from sparse inputs, effectively capturing spatial correlations and long-range dependencies. We evaluate the proposed approach on four benchmark datasets: (1) two-dimensional vortex street simulations, (2) daily average temperature data across the contiguous United States, (3) three-dimensional blood flow simulations based on dissipative particle dynamics, and (4) three-dimensional turbulent jet flow measurements obtained via particle tracking velocimetry. Across all cases, our method demonstrates competitive reconstruction accuracy, even with highly incomplete data (less than 10\% observed), and achieves efficient performance. The results highlight the potential of language models as robust and scalable tools for scientific data reconstruction, and suggest a promising direction toward the development of foundation models for scientific and engineering applications.

2026-05-22T14:56:05Z Qian Zhang George Em Karniadakis http://arxiv.org/abs/2605.23527v1 LiveFigure: Generating Editable Scientific Illustration with VLM Agents 2026-05-22T11:42:55Z

Scientific illustrations are essential for depicting conceptual designs, methodologies, and experimental workflows in research, playing a pivotal role in communicating complex academic insights. However, creating high-quality scientific illustrations remains a labor-intensive task for human scientists. While recent generative image models have advanced prompt-based editing, the synthesis of fully editable figures remains a fundamental challenge. Valid editability involves structured transformations of graphical elements, scales, attributes, and text, rather than simple pixel-level changes. Existing models generate raster outputs that do not support manual correction or layout adjustment, limiting their utility in scientific publishing, where editable vector figures are typically required for submission. To address this challenge, we introduce LiveFigure, an agentic framework driven by VLM agents that imitates the multi-step drawing workflow of human researchers. It first plans figure blueprints by drawing inspiration from high-quality references in previous works, then generates executable scripts that produce figures via the PowerPoint interface based on skills and experience, and finally refines the outputs with targeted visual diagnostics, producing fully vectorized, editable figures that meet publication standards. Extensive experiments demonstrate that LiveFigure generates inherently editable figures, achieving 80% publication-readiness in only 17 manual edits, far surpassing the 24% rate of the strongest baseline, NanoBanana. Human preference studies further validate this advantage, with LiveFigure securing a 60% win rate against NanoBanana. Our code is available at https://github.com/tsinghua-fib-lab/LiveFigure.git.

2026-05-22T11:42:55Z Chenyang Shao Jiahe Liu Fengli Xu Yong Li http://arxiv.org/abs/2605.23470v1 Learning Individual Dynamics from Sparse Cross-Sectional Snapshots 2026-05-22T10:29:47Z

Predicting how a dynamical unit evolves over time - how an individual ages, an epidemic spreads, or a physical system degrades - typically requires dense longitudinal tracking. When only extremely sparse or entirely cross-sectional data is available, inferring individualized, continuous-time trajectories is fundamentally ill-posed. Existing methods force a strict compromise: sequence models (e.g. latent ODEs) require dense longitudinal data, while cross-sectional methods (e.g. optimal transport, flow matching-based) map aggregate populations, losing individual dynamics. In this paper, we demonstrate that this dichotomy can be broken. We introduce CADENCE, a principled probabilistic framework that recovers continuous individual trajectories from isolated snapshots by anchoring latent dynamics to static, individual-level contexts. We provide novel identifiability guarantees for single-timepoint trajectory inference. By combining a score-based spatial encoder (bijective Probability Flow ODE) to eliminate diffeomorphic ambiguities with a Soft Mixture-of-Experts (SMoE) router, we show that individual dynamical parameters and routing function are jointly identifiable. Across a suite of benchmarks spanning physical systems to real-world biological data, CADENCE, trained strictly on extremely sparse snapshots with context structure, matches or exceeds the performance of state-of-the-art sequential models trained on dense, full-trajectory data.

2026-05-22T10:29:47Z Christian Lagemann Kai Lagemann Steven L. Brunton Sach Mukherjee http://arxiv.org/abs/2601.03260v2 SciNet: Evaluating AI Agents in Relation-Aware Scientific Literature Retrieval 2026-05-22T09:22:40Z

AI agents have seen widespread adoption in information retrieval for scientific research, giving rise to tools such as Deep Research. However, existing retrieval agents mainly rely on keyword- or embedding-based methods. While effective at capturing content-level similarities, they struggle to understand complex relational networks among scientific papers, such as identifying corroborating or conflicting studies and tracing technological lineages. This fundamental limitation often results in fragmented knowledge structures, misinterpreted research sentiment, and ineffective modeling of collective scientific progress. To address this limitation, we introduce SciNet, the first Scientific Network relation-aware dataset for information retrieval agents. Built on a meta-database of 269 million papers across 7 disciplines and containing 8,940 carefully designed tasks, SciNet systematically captures three levels of relational understanding: ego-centric retrieval of papers with novel knowledge structures, pairwise identification of scholarly relationships, and path-wise reconstruction of scientific evolution. Extensive evaluation of three categories of retrieval agents shows that their accuracy on relation-aware tasks often falls below 20%, highlighting a fundamental shortcoming of current retrieval paradigms. Importantly, in a downstream literature review application, agents empowered with SciNet achieve a 25.3% improvement in review quality, highlighting the critical value of relation-aware retrieval for deepening scientific insights. We publicly release SciNet at https://github.com/tsinghua-fib-lab/SciNet to support future research.

2025-12-16T02:53:02Z Chenyang Shao Fengli Xu Yong Li http://arxiv.org/abs/2605.23407v1 GeoCycler: Reward-Aligned 3D Diffusion for Constraint-Conditioned Cyclic Peptide Design 2026-05-22T09:17:11Z

Cyclic peptides are attractive therapeutic modalities because their closed-ring topology can improve stability and target specificity. However, de novo cyclic peptide design remains challenging for diffusion generators, as macrocyclization requires satisfying sparse, non-smooth, and compositional geometric constraints. Existing constraint-conditioned methods largely rely on inference-time guidance, which can steer samples toward desired closures but does not directly change the learned generative distribution. We propose GeoCycler, a reward-weighted diffusion alignment framework for training conditional latent diffusion models toward macrocyclization feasibility. GeoCycler introduces a type-gated stair reward that activates distance-based shaping only when prerequisite residue or linker types are satisfied, providing dense geometric feedback while avoiding misleading signals from chemically incompatible anchors. Together with positive-only reward weighting and replay-based stabilization, GeoCycler aligns a single generator across multiple cyclization topologies. On the LNR benchmark, GeoCycler improves pass@5 closure success over strong guidance-based baselines across stapled, head-to-tail, disulfide, and bicyclic settings. In particular, it improves head-to-tail success by 20.8 percentage points over CP-Composer while maintaining comparable amino-acid and backbone-dihedral statistics. These results suggest that training-time alignment to sparse geometric constraints is a promising alternative to relying solely on post hoc sampling-time correction for cyclic peptide generation.

2026-05-22T09:17:11Z Jingjie Zhang Hanqun Cao Haosen Shi He Mutian Yu Wang Zijun Gao Fang Wu Xiaojun Yao Chang-Yu Hsieh Sinno Jialin Pan Pranam Chatterjee Chunbin Gu Pheng-Ann Heng http://arxiv.org/abs/2605.23353v1 Bayesian Extreme Value Theory with Hawkes-AR-Gumbel Dependence for Extreme CVaR Estimation in Operational Risk 2026-05-22T08:18:23Z

Operational risk capital estimation under Basel II/III requires quantifying aggregate losses at extreme confidence levels of 99.9% and beyond, yet the standard Loss Distribution Approach (LDA) assumes independence between loss frequency and severity, an assumption frequently violated during stress episodes. Furthermore, MLE of tail parameters ignores parameter uncertainty, leading to overconfident risk estimates at extreme quantiles. We propose a Bayesian framework that combines Extreme Value Theory with a dynamic dependence architecture, the Hawkes-AR-Gumbel model, for operational risk Conditional Value-at-Risk (CVaR) estimation at confidence levels up to 99.995%. The model integrates three mechanisms that capture empirically documented features of operational losses: an autoregressive latent stress process that captures persistence of crisis regimes, a Hawkes selfexcitation component for frequency that generates event clustering and overdispersion, and a Gumbel copula for upper-tail dependence that links frequency and severity innovations through an asymmetric copula concentrating dependence in the extreme tail. Inference is performed via Hamiltonian Monte Carlo using PyMC, yielding full posterior distributions for all parameters, and CVaR at arbitrary confidence levels is estimated through posterior predictive Monte Carlo simulation. We compare three models on simulated operational risk data: the independent model (standard LDA), a shared latent factor model with symmetric dependence, and the proposed Hawkes-AR-Gumbel model. The independent model underestimates CVaR at 99.995% by approximately 40%, while the shared factor model fails to capture temporal persistence, event clustering, and upper-tail asymmetry. The HawkesAR-Gumbel model recovers the true dependence structure and correctly estimates CVaR at extreme levels.

2026-05-22T08:18:23Z Juan Ballesteros Gómez Eduardo C. Garrido-Merchán Pedro Pablo Pérez-Velasco http://arxiv.org/abs/2605.23298v1 DeFi Yield Aggregators: Analysing Investment Strategies and Structural Dependencies 2026-05-22T07:15:23Z

Yield aggregators are financial services in Decentralised Finance (DeFi) providing automated investment management and return optimisation for users. In this study, we investigate the operational mechanisms and monetary flows of two major yield aggregators, Yearn Finance and Cian, over the period from May 4, 2024 to May 3, 2025. Our supporting conceptual framework decomposes yield aggregator operations into user investment and strategy management cycles. Using a network approach for 2,459 Yearn and 921 Cian transactions, we trace protocol interactions and capital flows across the ecosystem. Users invested 15.7M USD into Yearn's USDC vault, which generated yield through liquidity provision and dynamic allocation across DeFi protocols. Cian, deployed later, attracted 54.0M USD into its staked-ETH (stETH) vault and implemented sophisticated leverage through flashloan-enabled recursive staking. Yearn's USDC vault achieves an annual yield of 5.41%, while Cian's stETH vault produces 4.22% with higher risk exposure. We use the operational insights from our analysis to extend the existing DeFi Stack Reference Model (DSR) with new financial primitives to highlight structural risk dependencies. Overall, our findings show that strategic complexity in yield aggregation does not necessarily translate into higher returns but materially expands risk exposure.

2026-05-22T07:15:23Z Stefan Kitzler Kasra Zarinehbaf Asadi Svetlana Kremer Bernhard Haslhofer