https://arxiv.org/api/DPOBeZ9vqI50i1urcN19MyRZ6To
2026-06-10T05:55:32Z

http://arxiv.org/abs/2409.00876v2
Rapid GPU-Based Pangenome Graph Layout
Updated: 2026-05-27T20:07:32Z
Computational Pangenomics is an emerging field that studies genetic variation using a graph structure encompassing multiple genomes. Visualizing pangenome graphs is vital for understanding genome diversity. Yet, handling large graphs can be challenging due to the high computational demands of the graph layout process.
In this work, we conduct a thorough performance characterization of a state-of-the-art pangenome graph layout algorithm, revealing significant data-level parallelism, which makes GPUs a promising option for compute acceleration. However, irregular data access and the algorithm's memory-bound nature present significant hurdles. To overcome these challenges, we develop a solution implementing three key optimizations: a cache-friendly data layout, coalesced random states, and warp merging. Additionally, we propose a quantitative metric for scalable evaluation of pangenome layout quality.
Evaluated on 24 human whole-chromosome pangenomes, our GPU-based solution achieves a 57.3x speedup over the state-of-the-art multithreaded CPU baseline without layout quality loss, reducing execution time from hours to minutes.
Published: 2024-09-02T00:05:20Z
Comments: Accepted and presented at SC 2024: https://dl.acm.org/doi/10.1109/SC41406.2024.00035
Authors: Jiajie Li, Jan-Niklas Schmelzle, Yixiao Du, Simon Heumos, Andrea Guarracino, Giulia Guidi, Pjotr Prins, Erik Garrison, Zhiru Zhang
DOI: 10.1109/SC41406.2024.00035

http://arxiv.org/abs/2605.24670v2
Fractional-gradient Sparsity with Autoencoding Sequential Deep Image Prior for 3D CT Reconstruction
Updated: 2026-05-27T18:54:49Z
3D volumetric reconstruction from incomplete or noisy measurements is a fundamental problem in medical imaging and computational tomography. Deep image prior (DIP)-based methods have recently shown strong capability for solving inverse problems without requiring large training datasets. However, directly extending DIP to 3D reconstruction with fully 3D networks can incur high computational cost, while slice-by-slice 2D DIP approaches may lead to inter-slice inconsistencies due to the lack of explicit regularization along the third direction. In this paper, we propose a novel volumetric reconstruction framework, Fractional-gradient Autoencoding Sequential Tomography DIP (FAST-DIP), which integrates input-adaptive sequential deep image prior modeling of slices with fractional sparsity regularization to capture inter-slice dependencies. Specifically, we introduce a fractional l1/l2-based sparsity prior on the gradients along the slice (z) direction to explicitly enforce inter-slice structural consistency. We further provide theoretical analysis of the proposed alternating minimization algorithm under the majorization-minimization (MM) framework, establishing monotonic descent of the objective function and convergence to a critical point under the Kurdyka-Lojasiewicz (KL) property.
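To make the l1/l2 prior on slice-direction gradients concrete, here is a minimal sketch. It assumes the prior is the plain ratio of the l1 norm to the l2 norm of forward differences between consecutive slices; the abstract does not give the exact formulation, and the function names and the `eps` guard are illustrative, not taken from the paper.

```python
import math


def z_differences(volume):
    """Forward differences between consecutive slices of volume[z][y][x]."""
    diffs = []
    for lower, upper in zip(volume, volume[1:]):
        for row_lo, row_up in zip(lower, upper):
            for a, b in zip(row_lo, row_up):
                diffs.append(b - a)
    return diffs


def l1_over_l2(values, eps=1e-12):
    """Ratio ||v||_1 / ||v||_2; eps (an assumption) guards against division by zero."""
    l1 = sum(abs(v) for v in values)
    l2 = math.sqrt(sum(v * v for v in values))
    return l1 / (l2 + eps)


def slice_sparsity_penalty(volume):
    """Assumed form of the inter-slice prior: l1/l2 ratio of the z-differences."""
    return l1_over_l2(z_differences(volume))


# Two toy volumes with 3 slices of 1x2 pixels each.
one_jump = [[[0.0, 0.0]], [[0.0, 0.0]], [[1.0, 0.0]]]    # a single nonzero z-difference
many_jumps = [[[0.0, 0.0]], [[1.0, 1.0]], [[2.0, 2.0]]]  # four equal z-differences

print(slice_sparsity_penalty(one_jump))
print(slice_sparsity_penalty(many_jumps))
```

The ratio is scale-invariant and grows with the number of comparably sized nonzero differences, so the volume whose slices change at a single location scores lower than the one that changes everywhere.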
Experimental results for 3D X-ray computed tomography (CT) reconstruction demonstrate that the proposed method improves reconstruction quality and structural consistency compared with existing DIP-based approaches.
Published: 2026-05-23T17:11:35Z
Comments: 16 pages, 3 figures, 4 tables
Authors: Haijie Yuan, Chaoyan Huang, Srijita Bandopadhyay, Liyue Shen, Saiprasad Ravishankar

http://arxiv.org/abs/2605.28978v1
VFEAgent: A Multimodal Agent Framework for End-to-End Automated Finite Element Analysis
Updated: 2026-05-27T18:34:04Z
Finite Element Analysis (FEA) serves as the cornerstone of modern engineering design. However, its workflow is inherently complex and relies heavily on domain expertise. Although recent efforts have integrated Large Language Models (LLMs) into FEA, existing approaches face limitations in handling multimodal inputs and executing complex tasks. To address these limitations, we propose VFEAgent, an end-to-end multi-agent system designed to automate FEA modeling and simulation directly from input images and problem descriptions. Our methodology integrates two core components: (1) a multimodal vision-language multi-agent pipeline that employs ReAct-driven reasoning to extract structured FEA specifications from heterogeneous inputs and (2) a verification-first code synthesis framework, incorporating robust self-debugging and fallback mechanisms to ensure executability and physical validity. We systematically evaluated the system across various engineering mechanics scenarios. The results demonstrate that VFEAgent achieves a high success rate in generating complete and physically valid simulations, outperforming LLM-based baseline methods in reliability and correctness. These findings validate the feasibility of automating the complete FEA workflow, highlighting the framework's potential to liberate engineers from tedious manual analysis.
Published: 2026-05-27T18:34:04Z
Comments: 9 pages, 3 figures, 2 tables. Equal contribution: Jiachen Zhang and Junyi Lao. Corresponding author: Songfang Huang.
Preprint
Authors: Jiachen Zhang (Peking University; China Agricultural University), Junyi Lao (Peking University), Chenghao Liu (Peking University), Siyuan Liu (Peking University), Shixin Wu (Peking University), Linsen Zhang (Peking University), Boyu Wang (Peking University), Songfang Huang (Peking University)

http://arxiv.org/abs/2606.02618v1
Closed-Loop Molecular Design with Calibrated Deference
Updated: 2026-05-27T16:44:34Z
We present Cognitive Loop via In-Situ Optimization (CLIO), an agent that couples a continuously-updated belief-state graph with a recursive plan-then-act loop. The result is a reasoning agent that can contribute something qualitatively different, which we term "calibrated deference": the capacity to recognize when its own tools or assumptions are failing, to adapt its strategy in response, and to generate mechanistic hypotheses that guide experimental revision. We tested CLIO in a closed-loop human-AI campaign to design an aqueous organic redox flow battery (AORFB) negolyte, with CLIO leading proposal and interpretation in close partnership with chemists who synthesized, characterized, and weighed in on design choices. Across 17 candidates over three rounds, CLIO converged on a top phosphonate candidate; characterization confirmed a 130 mV improvement in redox potential over the literature baseline. Characterization then revealed unexpectedly poor electrochemical reversibility -- a regression no property predictor had flagged. CLIO generated competing mechanistic hypotheses, prioritized discriminating diagnostics, traced the failure to phosphonate-potassium ion pairing, and prescribed a sulfonate replacement. The resulting compound showed substantially improved electrochemical reversibility and maintained a 90 mV improvement in redox potential, closing the design-make-test-redesign loop.
Published: 2026-05-27T16:44:34Z
Authors: Newman Cheng, Gordon Broadbent, Jason Dong, Syed Mohammed Ali Hussaini, Farman Ullah, Morris Sharp, Gabrielle Barnes, Nanlin Guo, Deyu Zou, Karin Strauss, William Chappell, David G. Kwabi, Bichlien H. Nguyen, Jake A. Smith

http://arxiv.org/abs/2605.28684v1
History-aware adaptive reduced-order models via incremental singular value decomposition
Updated: 2026-05-27T16:15:11Z
Reduced-order models (ROMs) can accelerate high-dimensional dynamical simulations, but their accuracy often deteriorates when online dynamics leave the regime represented by offline training data. We develop a projection-based adaptive ROM framework based on incremental singular value decomposition (iSVD), in which occasional full-order operator evaluations provide correction snapshots for online basis updates. The intrusive ROMs considered here are fully parameterized by the basis, so each update naturally propagates to reduced operators and hyper-reduction machinery. Through its evolving singular structure, iSVD retains an encoded history of the observed dynamics and is history-aware in this sense. We study the method on three nonlinear problems of increasing complexity: the one-dimensional viscous Burgers equation, the Sod shock tube, and a stiff one-dimensional ten-species rotating detonation engine (RDE). The Burgers problem is used to analyze the method and compare iSVD with alternative basis adaptation rules, showing that history-aware updates outperform instantaneous updates and that iSVD gives the strongest overall performance. The Sod and RDE cases demonstrate that these advantages persist in more challenging compressible-flow settings. For the RDE problem, the iSVD adaptive ROM improves upon the current state-of-the-art Direct adaptive ROM baseline in both predictive accuracy and computational efficiency. A cost analysis shows that the dominant online cost comes from interacting with the full-order model to obtain correction snapshots, while the iSVD update itself is negligible.
These results identify iSVD as an effective mechanism for online learning of reduced subspaces and suggest a path toward ROMs that remain predictive over horizons several orders of magnitude longer than their initial training window.
Published: 2026-05-27T16:15:11Z
Comments: 50 pages, 27 figures, Preprint submitted to Elsevier
Authors: Amirpasha Hedayat, Ali Mohaghegh, Laura Balzano, Cheng Huang, Karthik Duraisamy

http://arxiv.org/abs/2605.28601v1
Local Information Operators for Spatial Identifiability in Distributed-Parameter Inverse Problems in Computational Mechanics
Updated: 2026-05-27T15:17:51Z
In distributed-parameter inverse problems in computational mechanics, spatially varying fields are inferred from noisy, indirect, and heterogeneous observations. The relevant identifiability question concerns which spatial perturbation patterns of the field are distinguishable under a specified sensing and excitation programme. This paper develops a local information-operator framework for this purpose. Around a nominal parameter field, the parameter-to-observation map is linearized and the likelihood contribution to posterior precision is interpreted as an operator on parameter-field perturbations. For locally linearized Gaussian models with parameter-independent covariance, this operator is equivalently Fisher information, Gauss-Newton data-misfit curvature, and a noise-weighted sensitivity Gramian.
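The equivalence stated above can be written out in the standard form for a linearized Gaussian observation model. The symbols below (G for the parameter-to-observation map, J for its Jacobian at the nominal field, Sigma for the noise covariance) are notation assumed here for illustration; the abstract does not fix any notation.

```latex
% Linearization about the nominal field \theta_0, with Gaussian noise of
% parameter-independent covariance \Sigma (assumed notation)
y = G(\theta_0) + J\,\delta\theta + \varepsilon,
\qquad J = \left.\frac{\partial G}{\partial \theta}\right|_{\theta_0},
\qquad \varepsilon \sim \mathcal{N}(0, \Sigma)

% Data-misfit functional and its Gauss-Newton curvature
\Phi(\theta) = \tfrac{1}{2}\,\bigl(y - G(\theta)\bigr)^{\top} \Sigma^{-1} \bigl(y - G(\theta)\bigr),
\qquad
\nabla^{2}_{\mathrm{GN}} \Phi(\theta_0) = J^{\top} \Sigma^{-1} J

% The same operator is the Fisher information and a noise-weighted sensitivity Gramian
\mathcal{I}(\theta_0) = J^{\top} \Sigma^{-1} J
```

In this form the three readings coincide term by term: the columns of J are sensitivities, the inverse covariance supplies the noise weighting, and the product is both the Fisher information of the Gaussian likelihood and the Gauss-Newton approximation to the misfit Hessian.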
The framework separates pointwise visibility from spatial identifiability. The diagonal gives a coordinate-dependent local information density, while the full kernel and metric- or prior-preconditioned spectra rank spatial patterns that are strongly visible, weakly visible, or locally invisible. Heterogeneous observation blocks are assembled in a common parameter space; information is additive only under conditional independence, whereas correlated errors require the full joint covariance. Model discrepancy, nuisance parameters, and prior information modify the same geometry through covariance inflation, Schur-complement information loss, and prior-preconditioned modes. Examples cover analytic beam kernels, two-span support coupling, static-dynamic fusion for flexural-rigidity identification, and two-dimensional damage-field reconstruction in a leading information subspace. The operator view supports interpretation of identifiability, sensor complementarity, and reduced reconstruction in distributed-parameter inverse problems.
Published: 2026-05-27T15:17:51Z
Comments: 39 pages, 12 figures
Authors: Tammam Bakeer

http://arxiv.org/abs/2605.28574v1
Surface Contact Approximation for Magneto-Thermal Finite Element Analysis of No-Insulation HTS Coils
Updated: 2026-05-27T14:58:27Z
High-temperature superconducting (HTS) coated conductors (CCs) can be wound into no-insulation (NI) coils, in which electrical current can partially bypass local normal zones via turn-to-turn contact layers (T2TCLs). Accurate magneto-thermal simulation of such coils, therefore, requires an efficient representation of the electrical and thermal behavior of the T2TCLs. This paper introduces a magneto-thermal surface contact approximation (SCA) for finite element analysis of NI HTS coils. The formulation is derived as a special case of the more general thin shell approximation (TSA) by introducing suitable approximations such as negligible tangential surface currents and eddy-current effects inside the T2TCL.
The resulting SCA formulation replaces the thin volumetric contact layer with a dedicated surface weak formulation based on the electric contact resistance and thermal contact conductance. In contrast, the TSA formulation requires the definition of electric resistivities and thermal conductivities as well as the thickness of the T2TCL. The SCA is implemented in the Pancake3D module of the free and open-source Finite Element Quench Simulator. It is verified through transient magneto-thermal simulations of a model NI pancake coil. Numerical results are compared against the established TSA formulation. The results show that the SCA accurately reproduces the relevant electromagnetic and thermal behavior. For the TSA, there is a trade-off between choosing large (potentially unphysical) thicknesses with low resistivities leading to inaccurate results, or small thicknesses with large resistivities making the linear system harder to solve, increasing the computational effort. In contrast, the SCA, thanks to using contact resistances and conductances directly without the necessity to define a thickness, is easy to use and robust.
Published: 2026-05-27T14:58:27Z
Comments: This work has been submitted to the IEEE for possible publication
Authors: Erik Schnaubelt, Louis Denis, Mariusz Wozniak, Julien Dular, Arjan Verweij

http://arxiv.org/abs/2605.28446v1
Modelling the effect of fiber distribution on the transverse mechanical characteristics of unidirectionally reinforced continuous-fiber composite
Updated: 2026-05-27T13:14:01Z
This study investigates the influence of fiber spatial distribution on the transverse mechanical properties of unidirectionally reinforced continuous-fiber composites. A Swelling & Random Migration algorithm was employed to generate representative volume elements with controlled fiber arrangements, ranging from clustered to equilibrium configurations. Finite element homogenization with periodic boundary conditions was used to estimate effective elastic properties.
To characterize fiber randomness and assess statistical equivalence with experimental microstructures, several descriptors are employed, including nearest neighbor distance, Ripley's K-function, pair distribution function, and local fiber volume fraction. Results reveal that, at constant fiber volume fraction, clustered fiber distributions yield significantly higher transverse stiffness but lower transverse tensile strength compared to the equilibrium distributions. For glass/epoxy composites, transverse stiffness varies by up to 20% depending on the degree of fiber clustering. A single scalar descriptor, the mean nearest neighbor distance, was shown to efficiently characterize sufficiently random fiber distributions: effective stiffness decreases, whereas transverse tensile strength increases linearly with mean nearest neighbor distance. The findings highlight the critical role of microstructural characteristics in tailoring composite performance and provide a robust framework for predictive modeling of fiber reinforced materials.
Published: 2026-05-27T13:14:01Z
Authors: Sergejs Tarasovs, Janis Modniks, Andrea Bercini Martins, Christina Scheffler, Janis Andersons

http://arxiv.org/abs/2603.23290v2
Traveling Salesman Problem with a preprocessing method for classical and quantum optimization
Updated: 2026-05-27T13:00:22Z
The Traveling Salesman Problem is a fundamental combinatorial optimization problem widely studied in operations research. Despite its simple formulation, it remains computationally challenging due to the exponential growth of the search space and the large number of constraints required to eliminate subtours. This paper introduces a preprocessing strategy that significantly reduces the size of the optimization model by restricting the set of candidate arcs and retaining only the lowest-cost neighbors for each vertex. Computational experiments on TSPLIB benchmark instances demonstrate that the proposed approach substantially reduces the number of decision variables.
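The arc-restriction idea described above (keep only the lowest-cost neighbors of each vertex) can be sketched as follows. This is an illustrative k-nearest-neighbor filter, not the authors' procedure: the parameter `k`, the tie-breaking by vertex index, and the choice to keep directed arcs without symmetrizing are all assumptions.

```python
def restrict_arcs(cost, k):
    """Keep, for every vertex i, only the arcs (i, j) to its k cheapest neighbors.

    cost is a square matrix (list of lists); ties are broken by vertex index.
    """
    n = len(cost)
    arcs = set()
    for i in range(n):
        neighbors = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: (cost[i][j], j),
        )
        for j in neighbors[:k]:
            arcs.add((i, j))
    return arcs


# Toy symmetric instance with 4 vertices (values are made up for illustration).
cost = [
    [0, 1, 4, 9],
    [1, 0, 2, 7],
    [4, 2, 0, 3],
    [9, 7, 3, 0],
]

full_arc_count = len(cost) * (len(cost) - 1)  # 12 directed arcs in the full model
kept = restrict_arcs(cost, k=2)
print(full_arc_count, len(kept))  # 12 8
```

In an arc-based formulation, each discarded arc removes one binary decision variable. A practical implementation would also need to check that the restricted arc set still admits a Hamiltonian tour, which this sketch does not do.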
The method is evaluated using both classical and quantum optimization techniques, showing improvements in computational time and reductions in optimality gaps. Overall, the results indicate that the proposed preprocessing enhances the scalability of the formulations and makes them more suitable for both classical solvers and emerging quantum optimization frameworks.
Published: 2026-03-24T14:54:14Z
Comments: 6 pages, 1 figure, 4 tables
Authors: Alessia Ciacco, Luigi Di Puglia Pugliese, Francesca Guerriero

http://arxiv.org/abs/2604.19355v2
LASER: Learning Active Sensing for Continuum Field Reconstruction
Updated: 2026-05-27T09:37:48Z
High-fidelity measurements of continuum physical fields are essential for scientific discovery and engineering design but remain challenging under sparse and constrained sensing. Conventional reconstruction methods typically rely on fixed sensor layouts, which cannot adapt to evolving physical states. We propose LASER, a unified, closed-loop framework that formulates active sensing as a Partially Observable Markov Decision Process (POMDP). At its core, LASER employs a continuum field latent world model that captures the underlying physical dynamics and provides intrinsic reward feedback. This enables a reinforcement learning policy to simulate "what-if" sensing scenarios within a latent imagination space. By conditioning sensor movements on predicted latent states, LASER navigates toward potentially high-information regions beyond current observations.
Our experiments demonstrate that LASER consistently outperforms static and offline-optimized strategies, achieving high-fidelity reconstruction under sparsity across diverse continuum fields.
Published: 2026-04-21T11:36:09Z
Comments: Accepted by ICML 2026 (Oral)
Authors: Huayu Deng, Jinghui Zhong, Xiangming Zhu, Yunbo Wang, Xiaokang Yang

http://arxiv.org/abs/2505.19358v3
RoofNet: A Global Multimodal Dataset for Roof Material Identification from Earth Observation
Updated: 2026-05-27T06:30:19Z
Building-level exposure data are critical to natural hazard risk modeling, yet most global inventories describe where buildings are located rather than what they are made of. Roof material is a critical but poorly documented attribute for assessing vulnerability to wildfires, wind hazards, urban heat, floods, and earthquakes. To address this gap, we introduce RoofNet, a global dataset that maps 49,662 georeferenced building instances from 101 countries to 14 key roofing material classes using Earth observation (EO) imagery (redistributed where permitted) and associated geospatial metadata. RoofNet contributes (1) climatographically and architecturally diverse coverage of roof material labels, (2) a scalable annotation pipeline combining SME-guided manual labeling with vision-language model (VLM)-assisted classification, rule-based validation, and human-in-the-loop verification, and (3) a resource for evaluating subtle, geographically variable material-level identification in EO imagery and its implications for material-aware hazard risk modeling. Evaluation on a manually labeled hold-out set shows that zero-shot Remote Contrastive Language-Image Pre-Training (RemoteCLIP) struggles with roof material classification, while fine-tuning with RoofNet improves top-1 accuracy from 4.9% to 47.7%. We use RoofNet in an illustrative hazard case study to demonstrate how material-aware exposure data can change vulnerability estimates relative to material-naive inventories.
RoofNet provides a missing material layer for global building attribute mapping and scalable hazard risk assessment.
Published: 2025-05-25T23:14:24Z
Comments: v3: Restructured manuscript with updated framing, added hazard case study, clarified data release/licensing, and refined main text and appendix for clarity
Authors: Benjamin Tarver, Noelle Law, Sasha Getz, Yuki Miura

http://arxiv.org/abs/2605.27968v1
Adapting Automotive Aerodynamics Surrogates to New Vehicle Families via Transfer Learning
Updated: 2026-05-27T05:03:25Z
Deploying Scientific Machine Learning surrogates in industrial CFD workflows requires adapting pretrained models to new vehicle families without large datasets; yet whether geometric representations learned by a geometry encoder transfer to topologically distinct shapes remains unvalidated.
We address this through leave-one-family-out experiments on a 61.47M-parameter Transformer surrogate (AB-UPT) pretrained on four vehicle families (411 external aerodynamics cases) and adapted to the held-out fifth with only 20 samples. Three strategies are compared: Full Fine-Tuning (FFT), Lightweight Fine-Tuning (LFT), and Low-Rank Adaptation (LoRA). The central finding is that pretrained geometry encoders learn transferable representations, but the adaptation mechanism determines whether they can be exploited. FFT destabilizes as 61.47M unconstrained parameters overfit to 20 samples (R^2=0.40); LFT fails because the frozen encoder cannot represent unseen shapes (R^2<0). LoRA resolves both: rank-constrained adapters injected into all layers regularize the loss landscape while preserving pretrained features, achieving R^2=0.85+/-0.02 across all five families with 50% lower force RMSE than FFT and 28% lower pointwise field errors. LoRA also outperforms from-scratch training using 3x more target-family data, eliminating the need for large per-family datasets. These results recast LoRA from a memory-saving convenience into a convergence enabler for geometry transfer: a shared backbone paired with lightweight per-family adapters trainable in hours from minimal data.
Published: 2026-05-27T05:03:25Z
Comments: 23 pages, 12 figures
Authors: Seunghwan Keum, Alok Warey

http://arxiv.org/abs/2605.27896v1
FinBoardBench: Benchmarking Dynamic Wealth Management and Strategic Financial Reasoning of LLMs via Board Game Simulations
Updated: 2026-05-27T03:20:25Z
Recently, large language models (LLMs) have achieved superior performance in static financial reasoning and simple dynamic trading tasks. However, existing static financial benchmarks are insufficient to assess the dynamic wealth management and financial decision-making capabilities of LLMs in real-world environments. To bridge this gap, we present FinBoardBench, an evaluation suite based on three classic financial board games: Cashflow, Acquire, and Monopoly.
FinBoardBench assesses a comprehensive set of financial skills, including personal cash flow management with debt balancing, corporate investment and acquisition forecasting, and competitive trade negotiations with asset auctions. Our experiments with 9 advanced LLMs reveal that while exhibiting basic long-term planning and investment logic, they fail to effectively leverage complex interactions for profit, and their strong static reasoning performance does not translate into successful dynamic decision-making. Notably, they tend to prioritize immediate asset acquisition over maintaining sufficient liquidity, making them vulnerable to financial crises triggered by random events. We hope that FinBoardBench can provide a valuable reference for more intelligent LLM-based decision-making systems in the future.
Published: 2026-05-27T03:20:25Z
Comments: Preprint
Authors: Xuesi Hu, Peng Wang, Jinpeng Miao, Xilin Tao, Caiwei Li, Yue Ma, Jie He, Qiancheng Zhang, Yuntao Zou, Dagang Li

http://arxiv.org/abs/2602.21362v2
Signed network models for dimensionality reduction of portfolio optimization
Updated: 2026-05-26T19:02:56Z
In this paper, we develop a time-series-based signed network model for dimensionality reduction in portfolio optimization, grounded in Markowitz's portfolio theory and extended to incorporate higher-order moments of asset return distributions. Unlike traditional correlation-based approaches, we construct a complete signed graph for each trading day within a specified time window, where the sign of an edge between a pair of assets is determined by the relative behavior of their log returns with respect to their mean returns. Within this framework, we introduce a combinatorial interpretation of higher-order moments, showing that maximizing skewness and minimizing kurtosis correspond to maximizing balanced triangles and balanced 4-cliques with specific signed edge configurations respectively.
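As background for the balanced-triangle terminology above, the sketch below counts balanced triangles in a small complete signed graph. It uses the standard structural-balance definition, under which a triangle is balanced when the product of its three edge signs is positive. The edge signs are made up for illustration; the paper's rule for deriving signs from log returns and its "specific signed edge configurations" are not reproduced here.

```python
from itertools import combinations


def edge_sign(signs, u, v):
    """Look up the sign (+1 or -1) of the undirected edge {u, v}."""
    return signs[(u, v)] if (u, v) in signs else signs[(v, u)]


def count_balanced_triangles(nodes, signs):
    """Count triangles whose three edge signs multiply to +1."""
    balanced = 0
    for a, b, c in combinations(nodes, 3):
        product = (
            edge_sign(signs, a, b)
            * edge_sign(signs, a, c)
            * edge_sign(signs, b, c)
        )
        if product > 0:
            balanced += 1
    return balanced


# Hypothetical complete signed graph on four assets (signs chosen by hand).
nodes = [0, 1, 2, 3]
signs = {
    (0, 1): +1, (0, 2): -1, (0, 3): +1,
    (1, 2): -1, (1, 3): +1, (2, 3): +1,
}

print(count_balanced_triangles(nodes, signs))  # 2
```

Here triangles (0, 1, 2) and (0, 1, 3) are balanced, while (0, 2, 3) and (1, 2, 3) each contain exactly one negative edge and are unbalanced.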
We establish that the latter leads to an NP-hard combinatorial optimization problem, while the former is naturally guaranteed by the structural properties of the signed graph model. Based on this interpretation, we propose a dimensionality reduction method using a combinatorial formulation of the mean-variance optimization problem through a combinatorial hedge score metric for assets. The proposed framework is validated through extensive backtesting on 199 S&P 500 assets over a 16-year period (2006-2021), demonstrating the effectiveness of reduced asset universes for portfolio construction using both Markowitz optimization and equally weighted strategy.
Published: 2026-02-24T20:46:46Z
Comments: extension of arXiv:2510.05377
Authors: Bibhas Adhikari

http://arxiv.org/abs/2605.27578v1
From Centerlines to Hemodynamics: Anisotropic RBF Decoders for Coronary Arteries
Updated: 2026-05-26T18:48:37Z
Accurate and rapid estimation of hemodynamic metrics, such as pressure and wall shear stress (WSS), is important for assessing the severity of Coronary Artery Disease (CAD). Existing approaches, including invasive Fractional Flow Reserve (FFR) measurements and computationally expensive Computational Fluid Dynamics (CFD) simulations, face challenges in invasiveness, cost, and speed. We present a framework for fast, non-invasive coronary hemodynamics prediction. The model encodes 1D vessel centerlines together with inlet flow rate using a transformer-based encoder, and predicts continuous wall-based fields via an anisotropic Radial Basis Function (RBF) decoder aligned with vessel morphology. To support training and evaluation, we introduce two datasets with paired steady-state OpenFOAM simulations: (i) a synthetic benchmark of 4,200 single-vessel geometries with controlled anatomical variations, and (ii) a multi-vessel dataset derived from ImageCAS including 4,800 cases spanning both right and left coronary arteries, generated by randomly introducing stenoses and varying physiologically plausible flow rates.
Across both datasets, our method achieves lower pressure and WSS errors than strong neural-operator baselines (GNOT, Transolver, and ONO) at a fraction of the computational cost of CFD. On the multi-vessel dataset, using 1,024 anisotropic RBF centers our model reduces the mean relative L2 error by 52% compared to the best neural-operator baseline, while at 128 centers it requires 13.8x fewer FLOPs than GNOT and still outperforms all baselines. The single-vessel dataset is publicly available at https://huggingface.co/datasets/angioinsight/single-vessel-flow.
Published: 2026-05-26T18:48:37Z
Authors: Reza Akbarian Bafghi, Sukirt Thakur, Maziar Raissi