http://arxiv.org/abs/2605.14917v1
A Mutual Information Lower Bound for Multimodal Regression Active Learning
Updated: 2026-05-14T14:50:47Z
Active learning for continuous regression has lacked an acquisition function that targets epistemic uncertainty when the predictive distribution is multimodal: variance misses modal disagreement, and information-theoretic targets like BALD are designed for discrete outputs. We introduce a Two-Index framework that makes this separation explicit: one stochastic index selects among competing model hypotheses (epistemic source), while a second governs within-hypothesis randomness (aleatoric source). An entropy decomposition within the framework identifies the mutual information between the output and the epistemic index as a principled acquisition objective, and we prove this quantity vanishes as the model is trained on growing datasets, confirming that it captures exactly the uncertainty data can resolve. Because this mutual information is intractable for continuous outputs, we derive the Mutual Information Lower Bound (MI-LB) acquisition function, a closed-form approximation for Mixture Density Network ensembles. On benchmarks featuring multimodal systems, MI-LB matches or beats every baseline evaluated and is the only method to do so consistently -- geometric and Fisher-based baselines compete only when the input space already encodes the multimodality, and collapse otherwise.
Published: 2026-05-14T14:50:47Z
Authors: Leonardo Ferreira Guilhoto, Akshat Kaushal, Paris Perdikaris

http://arxiv.org/abs/2511.21449v2
Numerical Optimization of Planar Nozzle Shapes for Fused Deposition Modeling
Updated: 2026-05-14T12:36:30Z
Purpose: In fused deposition modeling (FDM), the nozzle plays a critical role in enabling high printing speeds while maintaining precision. Despite its importance, most applications still rely on standard nozzle designs.
This work investigates the influence of nozzle geometry on pressure loss inside the nozzle, a key factor in high-speed printing performance. Design/methodology/approach: We focus on optimizing the nozzle shape to minimize the pressure loss and establish a framework that allows both simple angle-based optimization and more advanced spline-based parametrization. To model the polymer melt flow, we use a Giesekus model to account for viscoelastic effects. Findings: For angle-based optimization, the pressure-loss objective exhibits two local minima: one associated with smooth flow and another with pronounced recirculation regions inside the nozzle. While the latter yields a lower pressure drop, such flow patterns are generally undesirable due to increased residence times and the associated risk of material degradation and nozzle clogging. The spline-based parametrization results in only marginal additional reductions in pressure loss compared to angle optimization, while decreasing the manufacturability of the nozzle considerably. Originality/value: This paper presents a comparative study of FDM nozzle shape optimization using a Giesekus model. We introduce a flexible optimization framework that accommodates both simple and advanced geometric parametrizations. The main contribution is the systematic comparison between angle- and spline-based parametrizations across materials and extrusion velocities, showing that most of the achievable pressure-loss reduction is already captured by the simpler and more manufacture-ready angle optimization.
Published: 2025-11-26T14:40:23Z
Authors: Steffen Tillmann, Felipe A. González, Stefanie Elgeti
DOI: 10.1108/HFF-02-2026-0141

http://arxiv.org/abs/2605.14620v1
Landscape-Aware Bandit Hyper-Heuristics for Online Operator Selection in UAV Inspection Routing
Updated: 2026-05-14T09:37:45Z
UAV multi-site inspection often reduces to choosing a high-quality visiting order after target sites have been extracted from a map.
This paper develops LA-BHH, a landscape-aware bandit hyper-heuristic that learns an operator-selection policy online for this routing layer. LA-BHH treats 2-opt, swap, relocate, and Or-opt moves as low-level arms, builds context from static landscape descriptors and online search-state features, and updates a LinUCB controller from improvement rewards during the same run. Experimental results on 45 generated Euclidean TSP instances show that LA-BHH achieves the best mean final gap and convergence AUC, with 0.0223 and 0.0389 respectively. It reduces final gap by 17.6% over UCB-HH, 22.6% over Random-HH, and 68.2% over nearest-neighbor construction. Ablation results further show that contextual credit assignment, 2-opt repair, and stagnation-aware state use are the main contributors.
Published: 2026-05-14T09:37:45Z
Authors: Junhao Wei, Yanxiao Li, Yifu Zhao, Qibin He, Haochen Li, Dexing Yao, Baili Lu, Zhenhong Peng, Yapeng Wang, Sio-Kei Im, Xu Yang

http://arxiv.org/abs/2605.14597v1
VMU-Diff: A Coarse-to-fine Multi-source Data Fusion Framework for Precipitation Nowcasting
Updated: 2026-05-14T09:05:30Z
Precipitation nowcasting is a vital spatio-temporal prediction task for meteorological applications but faces challenges due to the chaotic property of precipitation systems. Existing methods predominantly rely on single-source radar data to build either deterministic or probabilistic models for extrapolation. However, the single deterministic model suffers from blurring due to MSE convergence. The single probabilistic model, typically represented by diffusion models, can generate fine details but suffers from computational inefficiency and from spurious artifacts that compromise accuracy. To address these challenges, this paper proposes a novel coarse-to-fine Vision Mamba Unet and residual Diffusion (VMU-Diff) based precipitation nowcasting framework.
It realizes precipitation nowcasting through a two-stage process, i.e., a deterministic model-based coarse stage to predict global motion trends and a probabilistic model-based fine stage to generate fine prediction details. In the coarse prediction stage, rather than single-source radar data, both radar and multi-band satellite data are taken as input. A spatial-temporal attention block and several Vision Mamba state-space blocks realize multi-source data fusion, and predict the future echo global dynamics. The fine-grained stage is realized by a spatio-temporal refine generator based on residual conditional diffusion models. It first obtains spatio-temporal residual features based on coarse prediction and ground truth, and further reconstructs the residual via a conditional Mamba state-space module. Experiments on Jiangsu SWAN datasets demonstrate the improvements of our method over state-of-the-art methods, particularly in short-term forecasts.
Published: 2026-05-14T09:05:30Z
Comment: 5 pages, 2 figures
Authors: Chunlei Shi, Hao Li, Yufeng Zhu, Boyu Liu, Yongchao Feng, Zengliang Zang, Hongbin Wang, Yanlan Yang, Dan Niu

http://arxiv.org/abs/2605.14137v1
Flow Field Reconstruction with Sensor Placement Policy Learning
Updated: 2026-05-13T21:41:29Z
Flow-field reconstruction from sparse sensor measurements remains a central challenge in modern fluid dynamics, as the need for high-fidelity data often conflicts with practical limits on sensor deployment. Existing deep learning-based methods have demonstrated promising results, but they typically depend on simplifying assumptions such as two-dimensional domains, predefined governing equations, synthetic datasets derived from idealized flow physics, and unconstrained sensor placement. In this work, we address these limitations by studying flow reconstruction under realistic conditions and introducing a directional transport-aware Graph Neural Network (GNN) that explicitly encodes both flow directionality and information transport.
We further show that conventional sensor placement strategies frequently yield suboptimal configurations. To overcome this, we propose a novel Two-Step Constrained PPO procedure for Proximal Policy Optimization (PPO), which jointly optimizes sensor layouts by incorporating flow variability and accounts for the reconstruction model's performance disparity with respect to sensor placement. We conduct comprehensive experiments under realistic assumptions to benchmark the performance of our reconstruction model and sensor placement policy. Together, they achieve significant improvements over existing methods.
Published: 2026-05-13T21:41:29Z
Comment: NeurIPS 2025
Authors: Ruoyan Li, Guancheng Wan, Zijie Huang, Zixiao Liu, Haixin Wang, Xiao Luo, Wei Wang, Yizhou Sun

http://arxiv.org/abs/2605.13814v1
Emergency Vehicle Preemption Strategies using Machine Learning to Optimize Traffic Operations
Updated: 2026-05-13T17:41:27Z
Emergency response vehicles (ERVs), such as fire trucks, operate to save lives and mitigate property damage. Emergency vehicle preemption (EVP) is typically implemented to provide the right-of-way to ERVs by giving green signals as they approach signalized intersections along their routes. EVP operations are usually optimized to minimize ERV delay. This study seeks to reduce delay experienced by other vehicles in the network while keeping ERV travel time near its optimum. A machine learning-based EVP strategy, termed MLEVP, is developed to determine EVP trigger times at multiple downstream intersections using real-time sensor data, including vehicle detections, signal indications, and ERV location. MLEVP proactively clears downstream traffic queues to reduce ERV response time while limiting delay on conflicting traffic movements. In the case study, MLEVP is developed using a calibrated microscopic simulation of a signalized corridor testbed in PTV Vissim. The EVP problem is formulated as a regression problem and solved using machine learning models trained on data generated from the simulation.
Results demonstrate that the proposed algorithm can produce near-optimal ERV travel times while minimizing impacts on conflicting traffic.
Published: 2026-05-13T17:41:27Z
Authors: Somdut Roy, Michael Hunter, Abhilasha Saroj, Angshuman Guin

http://arxiv.org/abs/2605.13767v1
Chrono::Ray: A Distributed Framework for High-Throughput Simulation-Based Analysis of Multibody Systems
Updated: 2026-05-13T16:47:09Z
Large-scale simulation studies can provide invaluable insights across computational engineering efforts, but they are often computationally demanding, requiring the use of distributed computing, which is itself not a simple task. Chrono::Ray addresses this challenge by integrating the high-fidelity multibody dynamics simulation engine Chrono with the open-source distributed computing platform Ray. The result is a modular workflow framework providing user-friendly abstractions for large-scale engineering simulation studies, supporting scalable orchestration of large ensembles of simulation trials without requiring users to directly manage distributed infrastructure. The current capabilities of the framework are demonstrated through two representative examples: parameter recovery for a multibody lunar lander model, and design of experiments for parameters of a continuum terramechanics model. Chrono::Ray is a part of the larger Project Chrono ecosystem and is released as an open-source software package, with source code available at https://github.com/uwsbel/chrono-ray.git.
Published: 2026-05-13T16:47:09Z
Authors: Khailanii Slaton, Dan Negrut

http://arxiv.org/abs/2605.13766v1
Elastica++: A high-performance, multiphysics framework for large interacting assemblies of Cosserat rods
Updated: 2026-05-13T16:47:05Z
Soft, slender structures are ubiquitous in natural and engineered systems, with broad application potential from biomimetic materials to soft robotics.
However, there is a notable lack of computational tools that simultaneously preserve high-fidelity continuum rod mechanics, scale to large interacting ensembles, and remain flexible across diverse biophysical settings. Here we introduce Elastica++, an open-source, high-performance implementation of the Cosserat-rod model for large-scale simulations of slender-body dynamics. Elastica++ combines performance-oriented kernels with shared-memory parallelism to sustain teraflop-scale throughput despite complex discretization domains and physical interactions. The framework further interoperates with external numerical solvers, supporting efficient multiphysics workflows. We demonstrate robustness and breadth through case studies spanning passive nest-like metamaterials, collective active-matter dynamics, cilia carpets, soft magnetic microrobots, and schooling swimmers. Elastica++ thus provides a missing foundation for high-throughput studies of emergent behavior in interacting assemblies of elastic slender structures.
Published: 2026-05-13T16:47:05Z
Authors: Tejaswin Parthasarathy, Seung Hyun Kim, Songyuan Cui, Mattia Gazzola

http://arxiv.org/abs/2605.13761v1
Toward AI-Driven Digital Twins for Metropolitan Floods: A Conditional Latent Dynamics Network Surrogate of the Shallow Water Equations
Updated: 2026-05-13T16:41:14Z
AI-driven flood digital twins demand fast hydrodynamic surrogates for ensemble forecasting and observation assimilation. Yet even GPU-accelerated two-dimensional shallow water equation (SWE) solvers still require $\sim 55$ minutes per $96$-hour run on a $\sim 4.2$-million-active-cell metropolitan basin (the Des Plaines River basin at $30\,\mathrm{m}$ resolution), making such workloads prohibitive at native resolution.
We present the Conditional Latent Dynamics Network (CLDNet): a low-dimensional latent neural ODE driven by rainfall, paired with a coordinate-based decoder conditioned on static terrain (elevation, slope, Manning roughness) that reconstructs depth and discharge at arbitrary query points. Pointwise decoding decouples memory from grid size and handles irregular watersheds natively, enabling metropolitan-scale training on a single compute node and direct queries at exact gauge coordinates without raster snapping. We evaluate CLDNet on a synthetic $250{,}000$-cell Texas benchmark and on a new Des Plaines case study of $114$ real-rainfall Stage IV storms whose reference simulator we validate against United States Geological Survey (USGS) gauges at the April 2013 flood-of-record (Nash--Sutcliffe efficiency $0.57$--$0.94$ on mean-recentered water-surface elevation). CLDNet roughly halves the relative root-mean-squared error of an unconditional baseline, outperforms regular-grid VAE--ConvLSTM and FNO baselines on the Texas benchmark (both presuppose a Cartesian grid and do not apply to the irregular Des Plaines watershed), reaches a critical success index of $\approx 86\%$ at the $0.5\,\mathrm{m}$ inundation threshold, and produces a full $96$-hour basin-wide forecast in $\sim 29$ seconds -- a $\sim 115\times$ speedup.
Published: 2026-05-13T16:41:14Z
Authors: Phillip Si, Yuan Qiu, Omar Sallam, Jeremy Feinstein, Ziang He, Eugene Yan, Peng Chen

http://arxiv.org/abs/2605.13633v1
Effects of Thermal Boundary Conditions on Natural Convection and Entropy Generation in Non-Newtonian Power-Law Fluids
Updated: 2026-05-13T15:00:15Z
This study investigates the role of thermal boundary conditions on natural convection and entropy generation in non-Newtonian power-law fluids confined within a square cavity and a concentric cylindrical annulus. Steady, two-dimensional governing equations based on the incompressible power-law model and the Boussinesq approximation are solved using the Gridap.jl finite element framework.
The numerical methodology is validated against benchmark solutions for both Newtonian and non-Newtonian convection, showing good agreement in terms of isotherm fields, streamlines, local Nusselt number distributions, and entropy generation. The effects of fluid rheology and heating mode are examined for shear-thinning, Newtonian, and shear-thickening fluids under uniform and non-uniform thermal boundary conditions. The results show that shear-thinning behavior enhances buoyancy-driven circulation, steepens thermal gradients, and increases heat transfer, whereas shear-thickening behavior suppresses convection and promotes conduction-dominated transport. Thermal boundary conditions are found to play an important role in controlling the intensity and spatial distribution of flow, heat transfer, and irreversibility. In both geometries, uniform heating produces stronger and more distributed convective structures, while non-uniform sinusoidal heating localizes thermal forcing and consistently reduces total entropy generation. An entropy analysis further reveals that viscous dissipation dominates irreversibility in shear-thinning fluids, whereas heat-transfer irreversibility becomes dominant as the power-law index increases. The study demonstrates that appropriate thermal boundary design, together with fluid rheology, provides an effective route for controlling heat transfer and minimizing thermodynamic losses in non-Newtonian convection systems. The source code and metadata are publicly available.
Published: 2026-05-13T15:00:15Z
Comment: 21 figures, 4 tables
Authors: Lambert Theisen, Satyvir Singh

http://arxiv.org/abs/2605.13607v1
Ergodicity Library: A Python Toolkit for Stochastic-Process Simulation, Time-Average Diagnostics, and Agent-Based Experiments
Updated: 2026-05-13T14:41:28Z
ergodicity is an open-source Python library for computational work on stochastic dynamics, with particular emphasis on non-ergodicity, time-average behavior, heavy-tailed processes, and decision making under uncertainty.
The package brings together three layers that are often split across ad hoc scripts: process definitions and simulators, analysis and fitting tools, and agent-based experimentation. This article documents the implemented software rather than presenting new stochastic theory. We describe the package architecture, the supported process families, the analysis workflow, and the practical boundaries of the current implementation. We also provide fully reproducible examples covering heavy-tailed ensemble spread, multiplicative Levy growth diagnostics, adaptive memory mean reversion, preasymptotic fluctuation analysis, and partial stochastic differential equation simulation. The package is positioned as an integration layer on top of the scientific Python stack, reducing the amount of glue code required to move from process specification to diagnostics and comparative experiments.
Published: 2026-05-13T14:41:28Z
Authors: Ihor Kendiukhov

http://arxiv.org/abs/2605.13407v1
Vector-Quantized Discrete Latent Factors Meet Financial Priors: Dynamic Cross-Sectional Stock Ranking Prediction for Portfolio Construction
Updated: 2026-05-13T12:02:53Z
Predicting cross-sectional stock returns is challenging due to low signal-to-noise ratios and evolving market regimes. Classical factor models offer interpretability but limited flexibility, while deep learning models achieve strong performance yet often underutilize financial priors. We address this gap with PRISM-VQ (PRior-Informed Stock Model with Vector Quantization), a dynamic factor framework that integrates expert prior factors, vector-quantized discrete latent factors learned from cross-sectional structure, and a structure-conditioned Mixture-of-Experts to generate time-varying factor loadings. Vector quantization acts as an information bottleneck that suppresses noise while capturing robust market structure, with discrete codes serving both as latent factors and as routing signals for temporal expert specialization.
Experiments on CSI 300 and S&P 500 show consistent improvements in cross-sectional return prediction and portfolio performance over strong baselines while preserving interpretability. Our code is available at https://github.com/finxlab/PRISM-VQ.
Published: 2026-05-13T12:02:53Z
Comment: IJCAI 2026 Accepted Paper including Technical Appendix
Authors: Namhyoung Kim, Jae Wook Song

http://arxiv.org/abs/2605.13378v1
Robust Matrix-Free Newton-Krylov Solvers via Automatic Differentiation
Updated: 2026-05-13T11:34:32Z
Jacobian-Free Newton-Krylov (JFNK) methods avoid forming the full Jacobian, but still require Jacobian-vector products, i.e., Gateaux derivatives of the nonlinear residual along Krylov directions. In standard Finite Differences (FD) formulations, these products are obtained by perturbing the Newton state and differencing residuals, making the linearization sensitive to round-off error and floating-point precision. This work evaluates the global impact of forward-mode Automatic Differentiation (AD) as a replacement for FD Jacobian-vector products in finite-precision JFNK solvers. The comparison keeps the discretization, Newton iteration, line search, Krylov methods, tolerances, and CPU/GPU backend fixed, varying only the linearization strategy. Benchmarks include Burgers dynamics, Su-Olson radiation diffusion, reaction-diffusion, and nonlinear time-harmonic Maxwell equations, each evaluated in different nonlinear regimes. By preventing degradation of the Krylov operator, AD accelerates computation by 2-3 orders of magnitude across both CPU and GPU architectures. More importantly, it drastically improves global solver robustness, achieving a minimum completion rate of 95%, compared to just 42% for FD.
Ultimately, accurate Gateaux derivatives unify performance and accuracy in JFNK methods, making AD the optimal choice for stiff nonlinear and reduced-precision environments.
Published: 2026-05-13T11:34:32Z
Authors: Marco Pasquale, Stefano Markidis

http://arxiv.org/abs/2605.13024v1
ReCoG: Relational and Compact Context Graph Learning for Few-shot Molecular Property Prediction
Updated: 2026-05-13T05:28:21Z
Few-shot molecular property prediction (FSMPP) is essential in drug discovery and materials design, where high-quality labeled data are often scarce and expensive to obtain. Despite the promising performance of existing methods, especially context-aware methods, they still face two-fold severe challenges with insufficient structural context modeling and redundant auxiliary context learning, leading to inadequate context graph exploration and ineffective information utilization for effective molecule representation learning. To address these, in this paper, we propose a novel framework by learning on Relational and Compact context Graph, named ReCoG, to comprehensively exploit the context graph for expressive molecular property prediction.
Specifically, the proposed ReCoG contains two core modules: (1) a cross-property relational learning module to better model the structural and relational context information, and (2) a context graph information bottleneck module to adaptively suppress irrelevant auxiliary signals for compact context information utilization, followed by a detailed theoretical demonstration regarding the importance of joint relational and compact knowledge extraction in context graphs.
Published: 2026-05-13T05:28:21Z
Authors: Zeyu Wang, Xin Zheng, Yao Lu, Shanqing Yu, Qi Xuan, Shirui Pan

http://arxiv.org/abs/2605.03751v2
Carbon-Aware Compute--Power Scheduling for AI Data Centers with Microgrid Prosumer Operations
Updated: 2026-05-12T20:41:16Z
AI data centers are increasingly becoming tightly coupled compute--energy systems, where workload placement, cooling demand, electricity procurement, storage operation, and carbon emissions interact over time. This paper studies carbon-aware compute--power scheduling for geographically distributed AI data centers with microgrid prosumer capabilities. We propose a mixed-integer linear programming (MILP) framework that jointly schedules rigid training jobs, routes elastic inference workloads, dispatches local generation and battery storage, and manages bidirectional grid interaction under latency, continuity, power-balance, and carbon-budget constraints. The model captures two key features of emerging AI infrastructure: heterogeneous workload flexibility and site-level energy prosumer operation. Experiments on synthetic yet practically motivated instances show that the proposed joint MILP substantially improves total operational benefit over compute-only and energy-only baselines while reducing emissions. The results further indicate that inference-routing flexibility is a major source of value, battery storage provides useful temporal flexibility, and local-generation-rich settings are particularly favorable.
The framework provides a tractable optimization abstraction for sustainable and grid-interactive AI data centers.
Published: 2026-05-05T13:35:26Z
Comment: 3 pages, 2 figures
Authors: Johnny R. Zhang, Gaoyuan Du, Qianyi Sun, Shiqi Wang, Jiaxuan Li, Xian Sun