https://arxiv.org/api/paGIu/nqehYqhGYYCgWTp0/38I0 2026-06-10T14:35:03Z 10652 255 15 http://arxiv.org/abs/2605.22848v1 From Simulation to Discovery: AI Enabled Probabilistic Emulation of Mechanistic Crop Systems 2026-05-15T23:56:01Z

Global food security depends on predicting crop responses to climate variability, yet process based crop models remain too computationally expensive for large scale exploration of genotype and environment interactions. Here we develop a probabilistic neural emulator of APSIM that reproduces key maize growth processes across 13 outputs with high fidelity (with R^2 of 0.93) while reducing simulation time by several orders of magnitude. Trained on two million simulations spanning diverse genetic, soil, and management conditions, and augmented with a convolutional synthetic weather generator that produces physically consistent climate sequences, the framework enables scalable exploration of crop responses under realistic and diverse environmental inputs while providing calibrated predictive uncertainty without costly Bayesian inference. Applying this framework across 100,000 trait configurations, six soil environments in Iowa and Illinois, and climate projections through the year 2100 under two emissions scenarios, we identify 181 maize trait combinations that consistently maintain high yield across all tested conditionsan analysis infeasible with the mechanistic model alone. We further show that radiation use efficiency and temperature driven root dynamics are dominant drivers of yield resilience. Notably, projected yield distributions vary substantially across locations, with some lower productivity sites exhibiting yield increases under future climate scenarios, indicating that climate change may reshape regional yield potential in nonintuitive ways. These results demonstrate how uncertainty aware emulation transforms mechanistic crop simulation from a computational bottleneck into an on demand discovery engine, one capable of interrogating the full genotype, environment and management space at a scale no process-based model can match.

2026-05-15T23:56:01Z Mojdeh Saadati Juan Panelo Gustavo Visentini Soumik Sarkar Carlos Messina Baskar Ganapathysubramanian http://arxiv.org/abs/2605.16549v1 Post-Quantum Discovery as a Governance Capability: Evidence-Based Cryptographic Visibility and Exposure Prioritisation in a Critical Service Provider 2026-05-15T18:46:15Z

Post Quantum Cryptography (PQC) readiness is increasingly constrained not by algorithm availability, but by cryptographic visibility, dependency complexity, and fragmented governance. This paper presents an anonymised case study of a large European critical service provider that initiated PQC readiness through a discovery first strategy, utilizing tool supported cryptographic inventorying to establish an evidence based baseline prior to migration planning. The discovery phase revealed systemic challenges, including distributed cryptographic ownership, uneven evidence quality across legacy and modern environments, and high dependency on third party cryptographic roadmaps. To operationalise these findings, the organisation introduced a structured exposure register that enabled prioritisation based on asset criticality, confidentiality longevity, and migration feasibility. We argue that PQC discovery should be understood as a governance capability that stabilises organisational knowledge and converts cryptographic uncertainty into measurable accountability, supporting risk based decision making and ecosystem coordination. The results contribute actionable lessons for institutions pursuing crypto-agility and resilience under post quantum harvest now, decrypt later threat models.

2026-05-15T18:46:15Z Eurocrypt 2026, Sapienza University, Poster presentation Jelena Zelenovic Leila Taghizadeh Edoardo Pena-Gonzalez Jaime Gomez Garcia Bart Preneel http://arxiv.org/abs/2603.25099v2 Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization 2026-05-15T16:52:38Z

We present a framework in which a large language model (LLM) acts as an online adaptive controller for SIMP topology optimization, replacing conventional fixed-schedule continuation with real-time, state-conditioned parameter decisions. At every $k$-th iteration, the LLM receives a structured observation$-$current compliance, grayness index, stagnation counter, checkerboard measure, volume fraction, and budget consumption$-$and outputs numerical values for the penalization exponent $p$, projection sharpness $β$, filter radius $r_{\min}$, and move limit $δ$ via a Direct Numeric Control interface. A hard grayness gate prevents premature binarization, and a meta-optimization loop uses a second LLM pass to tune the agent's call frequency and gate threshold across runs. We benchmark the agent against four baselines$-$fixed (no-continuation), standard three-field continuation, an expert heuristic, and a schedule-only ablation$-$on three 2-D problems (cantilever, MBB beam, L-bracket) at $120\!\times\!60$ resolution and two 3-D problems (cantilever, MBB beam) at $40\!\times\!20\!\times\!10$ resolution, all run for 300 iterations. A standardized 40-iteration sharpening tail is applied from the best valid snapshot so that compliance differences reflect only the exploration phase. The LLM agent achieves the lowest final compliance on every benchmark: $-5.7\%$ to $-18.1\%$ relative to the fixed baseline, with all solutions fully binary. The schedule-only ablation underperforms the fixed baseline on two of three problems, confirming that the LLM's real-time intervention$-$not the schedule geometry$-$drives the gain. Code and reproduction scripts will be released upon publication.

2026-03-26T07:14:31Z 32 pages, 11 figures Shaoliang Yang Jun Wang Yunsheng Wang http://arxiv.org/abs/2605.12509v2 Representing Higher-Order Networks: A Survey of Graph-Based Frameworks 2026-05-15T13:29:23Z

Many real-world phenomena are naturally modeled by graphs and networks. However, classical graph models are often limited to pairwise interactions and may not adequately capture the richer structures that arise in practice. Higher-order graph formalisms extend this framework by incorporating multiway, hierarchical, temporal, multilayer, recursive, and tensor-based interactions, thereby providing more expressive representations of complex systems. This book presents a comprehensive overview of mathematical notions that can be used to model higher-order networks. It surveys foundational concepts, extensional frameworks, and newly introduced formalisms, with an emphasis on their structural principles, relationships, and modeling roles. The aim is to provide a unified perspective that helps readers compare diverse higher-order network models and identify appropriate tools for theoretical study and practical applications. This book is Edition 2.0. It mainly includes the addition of several concepts, as well as corrections and improvements of typographical errors and explanations.

2026-03-24T03:28:40Z 170 pages. Peer-Reviewed Book. Publisher: Neutrosophic Science International Association (NSIA) Publishing House. ISBN: 978-1-59973-881-9 Takaaki Fujita Florentin Smarandache 10.6084/m9.figshare.31827613 http://arxiv.org/abs/2605.15766v1 BioXArena: Benchmarking LLM Agents on Multi-Modal Biomedical Machine Learning Tasks 2026-05-15T09:24:55Z

Large language model (LLM) agents are increasingly capable of automating components of machine learning development, yet existing biomedical benchmarks mainly focus on question answering, reasoning, and tool usage, or evaluate only narrow aspects of biomedical ML coding. We present BioXArena, a biomedical machine learning benchmark designed to evaluate whether agents can generate task-specific model training pipelines for heterogeneous and multi-modal biomedical datasets. BioXArena contains 76 end-to-end tasks across 9 domains, including sequence modeling, single-cell analysis, structural biology, network biology, chemical biology, perturbation dynamics, phenotype-disease modeling, biomedical imaging, and text-integrated learning. Each task is curated from primary biomedical sources into a unified evaluation framework with hidden labels, held-out graders, and biology-aware metrics normalized to a 0 to 1 scale. Agents are required to write executable code, train predictive models, and generate submissions for private test samples. Most tasks involve multiple input modalities, including tabular data, images, natural language, molecular sequences, omics matrices, and protein structures. We evaluate 11 agent configurations in a standardized 2-hour single-GPU environment. MLEvolve with Gemini-3.1-Pro achieves the highest average score of 0.666, followed by GPT-5.4 with 0.636, while no single agent consistently dominates across all domains. We additionally perform extensive ablation studies, robustness evaluations, scaling analyses, cost analyses, and failure-mode investigations to better understand how model backbones, agent scaffolds, inference budgets, and biomedical domains influence BioML coding performance. We will publicly release all benchmark tasks, graders, execution runners, leaderboard results, and agent trajectories.

2026-05-15T09:24:55Z 69 pages, 13 figures, 34 tables Loka Li Duzhen Zhang Xingbo Du Leonard Song Zixiao Wang Assanali Aukenov Noel Thomas Shakhnazar Sailaukan Yonghan Yang Feilong Chen Jiahua Dong Kun Zhang Bin Zhang Le Song http://arxiv.org/abs/2605.15754v1 Spatiotemporal decoupled physics-informed Stone-Weierstrass neural operator for long-time prediction of time-dependent parametric PDEs 2026-05-15T09:15:01Z

Driven by rapid advances in artificial intelligence and modern GPU computing capabilities, deep learning methods based on the optimization paradigm have provided new pathways to solve spatiotemporal physical problems, whose mathematical core lies in solving partial differential equations (PDEs). As an emerging class of function-space learning methods, neural operators (NOs) have exhibited great potential in efficient PDE solving. However, existing mainstream neural operator frameworks suffer from critical bottlenecks when modeling time-dependent PDEs over long time horizons, including accuracy degradation, insufficient stability, high training costs, and excessive memory consumption, which severely limit their practical deployment. To address these challenges in long-time prediction with neural operators, we propose a novel spatiotemporally decoupled physics-informed neural operator architecture, termed the physics-informed Stone-Weierstrass neural operator (PI-SWNO). The design is theoretically grounded in the decoupling paradigm combining time-invariant spatial basis functions with time-varying evolution coefficients, as well as the Stone-Weierstrass approximation theorem. By encoding spatial and temporal information via two separate subnetworks, the framework structurally mitigates the accumulation of errors over extended time intervals. Furthermore, we introduce a time-marching batch-wise sampling strategy to resolve the memory bottleneck of full-range modeling over extended time spans, ensuring continuity and convergence of full-time-domain solutions.

2026-05-15T09:15:01Z 66 pages, 25 figures, 5 tables Shan Ding Yongfu Tian Lang Qin Hongxiang Ma Guofeng Su Rui Yang http://arxiv.org/abs/2605.15564v1 CrystalBoltz: End-to-End Protein Structure Determination via Experiment-Guided Diffusion for X-Ray Crystallography 2026-05-15T03:11:34Z

Generative models trained on public databases of protein structures, most of which have been determined by X-ray crystallography, now provide powerful priors for structure prediction. However, they are not readily conditioned on the measurements from a new crystallographic experiment, limiting their use for X-ray structure determination. In crystallography, the measured structure-factor amplitudes do not by themselves determine an electron density map or atomic structure because the associated phases are unobserved and must be inferred. Structure determination therefore remains an inverse problem in which candidate models must be both structurally plausible and consistent with measured diffraction data, often requiring substantial manual refinement by human experts. Emerging methods aim to incorporate experimental information more directly into predictive and refinement workflows. We present CrystalBoltz, a generative framework that casts crystallographic refinement as Bayesian inference over atomic structures and operates directly on structure-factor amplitudes. CrystalBoltz moves from unguided generation with a pre-trained prior over protein structures to experiment-guided posterior sampling, followed by atomic coordinate and B-factor refinement. Across multiple protein crystallography datasets, CrystalBoltz attains lower coordinate RMSD and lower R-factors than the strongest baselines considered, while reducing runtime by a factor of 33 relative to existing experimentally guided refinement.

2026-05-15T03:11:34Z Project page: https://soniaminseokim.github.io/crystalboltz-website/ Minseo Kim Huanghao Mai Jay Shenoy Alec Follmer Gordon Wetzstein Frederic Poitevin http://arxiv.org/abs/2605.15549v1 CTF4Nuclear: Common Task Framework for Nuclear Fission and Fusion Models 2026-05-15T02:42:26Z

The demand for clean energy is ever increasing, with new nuclear technologies presenting a complementary solution to renewable energies. However, designing and operating these systems is exceptionally difficult, given the complexity of the physical phenomena that interact to form the system dynamics. While high-fidelity simulations help to understand the non-linear, multi-physics interactions within a reactor, they are computationally expensive and rarely suitable for real-time applications. Furthermore, model-based approaches are inherently sensitive to simplifying assumptions required to derive their governing equations and parameters, leading to inevitable discrepancies with real-world measurements. In contrast, Machine Learning (ML) methods have the potential to generate reliable surrogate models which may be able to quickly predict the system's behaviour. However, the number of data-driven methods that can potentially be used for this task is large and diverse. In a safety-critical setting such as nuclear engineering, a fair comparison of different ML methods, and a clear understanding of their advantages and limitations, is of paramount importance. To address this, we introduce a Common Task Framework (CTF) for ML in nuclear engineering, building upon previous efforts in dynamical systems and seismology. This CTF considers a curated set of datasets from different nuclear and nuclear-adjacent systems. The CTF evaluates the performance of a method on 12 established metrics, alongside a new paradigm focused on system monitoring from sparse measurements only. We illustrate the framework by benchmarking standard ML baselines against these datasets, revealing current method limitations. Our vision is to replace ad hoc comparisons with standardized evaluations on hidden test sets, raising the bar for rigour and reproducibility in scientific ML for the nuclear industry.

2026-05-15T02:42:26Z Stefano Riva Carolina Introini Antonio Cammi Dean Price Alexey Yermakov Yue Zhao Philippe M. Wyder Judah Goldfeder Jan Williams Amy Sara Rude Matteo Tomasetto Joe Germany Joseph Bakarji Georg Maierhofer Miles Cranmer J. Nathan Kutz http://arxiv.org/abs/2605.15412v1 From Feedback Loops to Policy Updates: Reinforcement Fine-Tuning for LLM-Based Alpha Factor Discovery 2026-05-14T20:54:40Z

Modern quantitative trading increasingly relies on systematic models to extract predictive signals from large-scale financial data, where alpha factor discovery plays a central role in transforming market observations into tradable signals. Recent LLM-based methods have shown promise in automating factor generation, but most of them still rely on prompt-level generation--evaluation--feedback loops for iterative optimization. As the loop becomes longer, repeatedly appended historical candidates and feedback can cause context explosion, increase inference cost, dilute useful information, and introduce feedback drift. Moreover, these methods often depend on very large LLMs whose stable generation preferences may lead to structurally similar expressions, redundant candidates, and search stagnation. To address these limitations, we propose \textsc{QuantEvolver}, a self-evolving alpha factor discovery framework based on reinforcement fine-tuning. Instead of accumulating feedback in the prompt, \textsc{QuantEvolver} converts executable quantitative evaluation into policy updates, enabling a Miner LLM to internalize historical optimization experience through parameter learning. Specifically, \textsc{QuantEvolver} constructs high-quality seed factors, builds diverse seed--time-window training tasks, generates executable Factor DSL expressions, evaluates them through Regime Backtest, and optimizes the Miner LLM with Diversity-Complementarity Reward. During training, high-quality factors are continuously accumulated in a Mined Factor Database, which serves as the final discovered factor library. Extensive experiments on three realistic market benchmarks demonstrate the effectiveness of \textsc{QuantEvolver}, which consistently improves the primary evaluation metric of each task over existing LLM-based alpha factor discovery baselines, produces higher-quality and more complementary factor pools.

2026-05-14T20:54:40Z Lingzhe Zhang Tong Jia Yunpeng Zhai Zixuan Xie Chiming Duan Minghua He Philip S. Yu Ying Li http://arxiv.org/abs/2605.13767v2 Chrono::Ray: A Distributed Framework for High-Throughput Simulation-Based Analysis of Multibody Systems 2026-05-14T18:56:31Z

Large-scale simulation studies can provide invaluable insights across computational engineering efforts, but they are often computationally demanding, requiring the use of distributed computing, which is itself not a simple task. Chrono::Ray addresses this challenge by integrating the high-fidelity multibody dynamics simulation engine Chrono with the open-source distributed computing platform Ray. The result is a modular workflow framework providing user-friendly abstractions for large-scale engineering simulation studies, supporting scalable orchestration of large ensembles of simulation trials without requiring users to directly manage distributed infrastructure. The current capabilities of the framework are demonstrated through two representative examples: parameter recovery for a multibody lunar lander model, and design of experiments for parameters of a continuum terramechanics model. Chrono::Ray is a part of the larger Project Chrono ecosystem and is released as an open-source software package, with source code available at https://github.com/uwsbel/chrono-ray.git.

2026-05-13T16:47:09Z Khailanii Slaton Dan Negrut http://arxiv.org/abs/2410.01990v3 Deep Learning Alternatives of the Kolmogorov Superposition Theorem 2026-05-14T18:26:20Z

This paper explores alternative formulations of the Kolmogorov Superposition Theorem (KST) as a foundation for neural network design. The original KST formulation, while mathematically elegant, presents practical challenges due to its limited insight into the structure of inner and outer functions and the large number of unknown variables it introduces. Kolmogorov-Arnold Networks (KANs) leverage KST for function approximation, but they have faced scrutiny due to mixed results compared to traditional multilayer perceptrons (MLPs) and practical limitations imposed by the original KST formulation. To address these issues, we introduce ActNet, a scalable deep learning model that builds on the KST and overcomes many of the drawbacks of Kolmogorov's original formulation. We evaluate ActNet in the context of Physics-Informed Neural Networks (PINNs), a framework well-suited for leveraging KST's strengths in low-dimensional function approximation, particularly for simulating partial differential equations (PDEs). In this challenging setting, where models must learn latent functions without direct measurements, ActNet consistently outperforms KANs across multiple benchmarks and is competitive against the current best MLP-based approaches. These results present ActNet as a promising new direction for KST-based deep learning applications, particularly in scientific computing and PDE simulation tasks.

2024-10-02T19:53:14Z Guilhoto, Leonardo Ferreira, and Paris Perdikaris. "Deep Learning Alternatives Of The Kolmogorov Superposition Theorem." The Thirteenth International Conference on Learning Representations (ICLR 2025) Leonardo Ferreira Guilhoto Paris Perdikaris http://arxiv.org/abs/2504.09544v3 Integrating chemical structures as treatments improves representations of microscopy images for morphological profiling 2026-05-14T18:16:27Z

Recent advances in self-supervised deep learning have improved our ability to quantify cellular morphological changes in high-throughput microscopy screens, a process known as morphological profiling. However, most current methods only learn from images, despite many screens being inherently multimodal, as they involve both a chemical or genetic perturbation as well as an image-based readout. We hypothesized that incorporating chemical compound structures during self-supervised pre-training could improve learned representations of images from high-throughput microscopy screens. We introduce a representation learning framework, MICON (Molecular-Image Contrastive Learning), that models chemical compounds as treatments that induce transformations of cell phenotypes. MICON significantly outperforms classical hand-crafted features such as CellProfiler and existing deep-learning-based representation learning methods in challenging evaluation settings where models must identify reproducible effects of drugs across independent replicates and data-generating centers. We demonstrate that incorporating chemical compound information into the learning process provides small, but consistent improvements in performance and that modeling compounds specifically as treatments outperforms approaches that directly align images and compounds in a single representation space. Our findings point to a new direction for representation learning in morphological profiling, suggesting that methods should explicitly account for the multimodal nature of microscopy screening data.

2025-04-13T12:27:21Z 24 pages Yemin Yu Emre Hayir Neil Tenenholtz Lester Mackey Ying Wei David Alvarez-Melis Ava P. Amini Alex X. Lu http://arxiv.org/abs/2605.22845v1 Cross-attention-based bipartite graph neural network for coupled nodal and elemental field prediction in large-deformation sheet material forming 2026-05-14T17:12:44Z

Finite element simulations of large-deformation sheet material forming involve node-element coupling between nodal kinematics and element-level deformation measures. Machine-learning surrogates can accelerate such simulations, but most graph-based models use node-centred representations. This representation is indirect for element-level quantities, which are often recovered from nodal predictions by interpolation or post-processing. It may also obscure the node-element coupling structure that underlies the finite element update. This work proposes a cross-attention-based bipartite graph neural network (CAtt-BiGNN) for coupled prediction of nodal displacement increments and elemental thinning. The graph represents mesh nodes and elements as distinct but connected entities, linked by directed node-element edges, so that nodal and elemental fields are predicted on their native discretisation domains. An edge-aware cross-attention processor conditions adaptive node-element coupling weights on geometric edge features, enabling bidirectional message passing between nodal kinematic states and elemental deformation states. A hierarchical extension, CAtt-BiUGNN, combines the CAtt-BiGNN with graph downsampling-upsampling to improve information propagation on larger meshes. Adaptive Gaussian noise is further evaluated as an optional rollout-stabilisation strategy. The models are tested on two representative forming cases with different graph sizes. CAtt-BiGNN improves the balance between displacement and thinning prediction relative to node-centred baselines and bipartite ablation variants, while CAtt-BiUGNN gives the strongest overall performance in the larger-graph setting. The results indicate that the proposed model provides an effective surrogate framework for large-deformation sheet material forming.

2026-05-14T17:12:44Z Yingxue Zhao Haoran Li Haosu Zhou Tobias Pfaff Nan Li http://arxiv.org/abs/2605.14917v1 A Mutual Information Lower Bound for Multimodal Regression Active Learning 2026-05-14T14:50:47Z

Active learning for continuous regression has lacked an acquisition function that targets epistemic uncertainty when the predictive distribution is multimodal: variance misses modal disagreement, and information-theoretic targets like BALD are designed for discrete outputs. We introduce a Two-Index framework that makes this separation explicit: one stochastic index selects among competing model hypotheses (epistemic source), while a second governs within-hypothesis randomness (aleatoric source). An entropy decomposition within the framework identifies the mutual information between the output and the epistemic index as a principled acquisition objective, and we prove this quantity vanishes as the model is trained on growing datasets, confirming that it captures exactly the uncertainty data can resolve. Because this mutual information is intractable for continuous outputs, we derive the Mutual Information Lower Bound (MI-LB) acquisition function, a closed-form approximation for Mixture Density Network ensembles. On benchmarks featuring multimodal systems, MI-LB matches or beats every baseline evaluated and is the only method to do so consistently -- geometric and Fisher-based baselines compete only when the input space already encodes the multimodality, and collapse otherwise.

2026-05-14T14:50:47Z Leonardo Ferreira Guilhoto Akshat Kaushal Paris Perdikaris http://arxiv.org/abs/2511.21449v2 Numerical Optimization of Planar Nozzle Shapes for Fused Deposition Modeling 2026-05-14T12:36:30Z

Purpose: In fused deposition modeling (FDM), the nozzle plays a critical role in enabling high printing speeds while maintaining precision. Despite its importance, most applications still rely on standard nozzle designs. This work investigates the influence of nozzle geometry on pressure loss inside the nozzle, a key factor in high-speed printing performance. Design/methodology/approach: We focus on optimizing the nozzle shape to minimize the pressure loss and establish a framework that allows both simple angle-based optimization and more advanced spline-based parametrization. To model the polymer melt flow, we use a Giesekus model to account for viscoelastic effects. Findings: For angle-based optimization, the pressure-loss objective exhibits two local minima: one associated with smooth flow and another with pronounced recirculation regions inside the nozzle. While the latter yields a lower pressure drop, such flow patterns are generally undesirable due to increased residence times and the associated risk of material degradation and nozzle clogging. The splinebased parametrization results in only marginal additional reductions in pressure loss compared to angle optimization, while decreasing the manufacturability of the nozzle considerably. Originality/value: This paper presents a comparative study of FDM nozzle shape optimization using a Giesekus model. We introduce a flexible optimization framework that accommodates both simple and advanced geometric parametrizations. The main contribution is the systematic comparison between angle- and spline-based parametrizations across materials and extrusion velocities, showing that most of the achievable pressure-loss reduction is already captured by the simpler and more manufacture-ready angle optimization.

2025-11-26T14:40:23Z Steffen Tillmann Felipe A. González Stefanie Elgeti 10.1108/HFF-02-2026-0141