https://arxiv.org/api/FCWxW/3yHbzOE509meFw2pPcBlI 2026-03-18T10:08:53Z 10056 0 15 http://arxiv.org/abs/2603.16732v1 Confusion-Aware Spectral Regularizer for Long-Tailed Recognition 2026-03-17T16:15:07Z Long-tailed image classification remains a long-standing challenge, as real-world data typically follow highly imbalanced distributions where a few head classes dominate and many tail classes contain only limited samples. This imbalance biases feature learning toward head categories and leads to significant degradation on rare classes. Although recent studies have proposed re-sampling, re-weighting, and decoupled learning strategies, the improvement on the most underrepresented classes still remains marginal compared with overall accuracy. In this work, we present a confusion-centric perspective for long-tailed recognition that explicitly focuses on worst-class generalization. We first establish a new theoretical framework of class-specific error analysis, which shows that the worst-class error can be tightly upper-bounded by the spectral norm of the frequency-weighted confusion matrix and a model-dependent complexity term. Guided by this insight, we propose the Confusion-Aware Spectral Regularizer (CAR) that minimizes the spectral norm of the confusion matrix during training to reduce inter-class confusion and enhance tail-class generalization. To enable stable and efficient optimization, CAR integrates a Differentiable Confusion Matrix Surrogate and an EMA-based Confusion Estimator to maintain smooth and low-variance estimates across mini-batches. Extensive experiments across multiple long-tailed benchmarks demonstrate that CAR substantially improves both worst-class accuracy and overall performance. 
When combined with ConCutMix augmentation, CAR consistently surpasses existing state-of-the-art long-tailed learning methods under both the training-from-scratch setting (by 2.37% ~ 4.83%) and the fine-tuning-from-pretrained setting (by 2.42% ~ 4.17%) across ImageNet-LT, CIFAR100-LT, and iNaturalist datasets. 2026-03-17T16:15:07Z Ziquan Zhu Gaojie Jin Hanruo Zhu Si-Yuan Lu Yunxiao Zhang Zeyu Fu Ronghui Mu Guoqiang Zhang Zhao Sun Xia Yuhang Jiaxing Shang Xiang Li Lu Liu Tianjin Huang http://arxiv.org/abs/2603.16729v1 GeMA: Learning Latent Manifold Frontiers for Benchmarking Complex Systems 2026-03-17T16:12:30Z Benchmarking the performance of complex systems such as rail networks, renewable generation assets and national economies is central to transport planning, regulation and macroeconomic analysis. Classical frontier methods, notably Data Envelopment Analysis (DEA) and Stochastic Frontier Analysis (SFA), estimate an efficient frontier in the observed input-output space and define efficiency as distance to this frontier, but rely on restrictive assumptions on the production set and only indirectly address heterogeneity and scale effects. We propose Geometric Manifold Analysis (GeMA), a latent manifold frontier framework implemented via a productivity-manifold variational autoencoder (ProMan-VAE). Instead of specifying a frontier function in the observed space, GeMA represents the production set as the boundary of a low-dimensional manifold embedded in the joint input-output space. A split-head encoder learns latent variables that capture technological structure and operational inefficiency. Efficiency is evaluated with respect to the learned manifold, endogenous peer groups arise as clusters in latent technology space, a quotient construction supports scale-invariant benchmarking, and a local certification radius, derived from the decoder Jacobian and a Lipschitz bound, quantifies the geometric robustness of efficiency scores. 
We validate GeMA on synthetic data with non-convex frontiers, heterogeneous technologies and scale bias, and on four real-world case studies: global urban rail systems (COMET), British rail operators (ORR), national economies (Penn World Table) and a high-frequency wind-farm dataset. Across these domains GeMA behaves comparably to established methods when classical assumptions hold, and provides additional insight in settings with pronounced heterogeneity, non-convexity or size-related bias. 2026-03-17T16:12:30Z Latent manifold frontiers for benchmarking complex production systems, and applications to national rail operators, wind farms, and macroeconomic productivity are presented Jia Ming Li Anupriya Daniel J. Graham http://arxiv.org/abs/2603.11868v2 Towards heterogeneous parallelism for SPHinXsys 2026-03-17T15:09:28Z Simulations based on particle methods, such as Smoothed Particle Hydrodynamics (SPH), are known to be computationally demanding. While such methods have long been executed in parallel on multi-core CPUs, recent years have seen the increasing adoption of many-core accelerators, such as GPUs. However, hardware fragmentation and vendor-specific programming interfaces still characterize their market. Hence, support for various hardware configurations may easily lead to non-trivial and less maintainable implementations. Higher-level specifications, such as the SYCL programming standard, have recently become available; to leverage them, this work highlights the initial effort in adopting the SYCL standard for the execution of SPHinXsys, an open-source multi-physics library. The result is an execution model able to run the same implementation on variable (heterogeneous) hardware, with considerable speed-up compared to the current multi-core CPU parallelization. Among other topics, the representation of data structures for parallel access, communication strategies, and parallel methods for data sorting are discussed in depth. 
Benchmarks are also presented, showcasing performance comparisons between the current multi-core CPU implementation and the newly introduced SYCL parallelization with a GPU back-end. 2026-03-12T12:39:21Z 15 pages and 5 figures Presented in 2025 SPHERIC World Conference Xiangyu Hu Alberto Guarnieri http://arxiv.org/abs/2603.16599v1 Data-driven generalized perimeter control: Zürich case study 2026-03-17T14:45:20Z Urban traffic congestion is a key challenge for the development of modern cities, requiring advanced control techniques to optimize the usage of existing infrastructure. Despite the extensive availability of data, modeling such complex systems remains an expensive and time-consuming step when designing model-based control approaches. On the other hand, machine learning approaches require simulations to bootstrap models, or are unable to deal with the sparse nature of traffic data and enforce hard constraints. We propose a novel formulation of traffic dynamics based on behavioral systems theory and apply data-enabled predictive control to steer traffic dynamics via dynamic traffic light control. A high-fidelity simulation of the city of Zürich, the largest closed-loop microscopic simulation of urban traffic in the literature to the best of our knowledge, is used to validate the performance of the proposed method in terms of total travel time and CO2 emissions. 2026-03-17T14:45:20Z 33 pages, 16 figures Alessio Rimoldi Carlo Cenedese Alberto Padoan Florian Dörfler John Lygeros http://arxiv.org/abs/2306.12272v2 From structure mining to unsupervised exploration of atomic octahedral networks 2026-03-17T13:05:12Z Understanding the spatial arrangements of atom-centered coordination octahedra is crucial for relating structures to properties for many materials families. Traditional case-by-case inspection becomes a prohibitive task for discovering trends and similarities in large datasets. 
Here, we operationalize chemical intuition to automate the geometric parsing, quantification, and classification of coordination octahedral networks using unsupervised machine learning. We apply the workflow to analyze two datasets to demonstrate its effectiveness. For computationally generated single oxide perovskite (ABO$_{3}$) polymorphs, we uncover axis-dependent tilting trends which assist in detecting oxidation state changes. For hybrid iodoplumbates (A$_x$Pb$_y$I$_z$) from measured structures, we taxonomize their octahedral networks, revealing a Pauling-like connectivity rule for the coordination environment and the design principles underpinning their structural diversity. Our results offer a glimpse into the vast design space of atomic octahedral networks in materials chemistry and inform high-throughput, targeted screening of specific structure types. 2023-06-21T13:49:35Z updated version, incl. three supporting information files R. Patrick Xian Ryan J. Morelock Ido Hadar Charles B. Musgrave Christopher Sutton http://arxiv.org/abs/2603.15101v2 Gaussian mixture models for model improvement 2026-03-17T11:11:50Z Modeling complex physical systems, such as those arising in civil engineering applications, requires finding a trade-off between physical fidelity and practicality. Consequently, deviations of simulation from measurements are ubiquitous even after model calibration due to the model discrepancy, which may result from deliberate modeling decisions, ignorance, or lack of knowledge. If the mismatch between simulation and measurements is deemed unacceptable, the model has to be improved. Targeted model improvement is challenging due to a non-local impact of model discrepancies on measurements and the dependence on sensor configurations. 
Many approaches to model improvement, such as Bayesian calibration with additive mismatch terms, gray-box models, symbolic regression, or stochastic model updating, often lack interpretability, generalizability, physical consistency, or practical applicability. This paper introduces a non-intrusive approach to model discrepancy analysis using mixture models. Instead of directly modifying the model structure, the method maps sensor readings to clusters of physically meaningful parameters, automatically assigning sensor readings to parameter vector clusters. This mapping can reveal systematic discrepancies and model biases, guiding targeted, physics-based refinements by the modeler. The approach is formulated within a Bayesian framework, enabling the identification of parameter clusters and their assignments via the Expectation-Maximization (EM) algorithm. The methodology is demonstrated through numerical experiments, including an illustrative example and a real-world case study of heat transfer in a concrete bridge. 2026-03-16T10:52:09Z 19 pages, 10 figures, submitted for review Paolo Villani Daniel Andrés Arcones Jörg F. Unger Martin Weiser http://arxiv.org/abs/2603.16216v1 Generative AI for Quantum Circuits and Quantum Code: A Technical Review and Taxonomy 2026-03-17T07:45:40Z We review thirteen generative systems and five supporting datasets for quantum circuit and quantum code generation, identified through a structured scoping review of Hugging Face, arXiv, and provenance tracing (January-February 2026). We organize the field along two axes, artifact type (Qiskit code, OpenQASM programs, circuit graphs) crossed with training regime (supervised fine-tuning, verifier-in-the-loop RL, diffusion/graph generation, agentic optimization), and systematically apply a three-layer evaluation framework covering syntactic validity, semantic correctness, and hardware executability. 
The central finding is that while all reviewed systems address syntax and most address semantics to some degree, none reports end-to-end evaluation on quantum hardware (Layer 3b), leaving a significant gap between generated circuits and practical deployment. Scope note: quantum code refers throughout to quantum program artifacts (QASM, Qiskit); we do not cover generation of quantum error-correcting codes (QEC). 2026-03-17T07:45:40Z 20 pages, 4 tables Juhani Merilehto http://arxiv.org/abs/2603.16212v1 Rapid Worst-Case Gust Identification for Very Flexible Aircraft Using Reduced-Order Models 2026-03-17T07:41:08Z Identification of worst-case gust loads is a critical step in the certification of very flexible aircraft, yet the computational cost of nonlinear full-order simulations renders exhaustive parametric searches impractical. This paper presents a reduced-order model (ROM) based methodology for rapid worst-case gust identification that achieves computational speedups of up to 600 times relative to full-order nonlinear simulations. The approach employs nonlinear model order reduction via Taylor series expansion and eigenvector projection of the coupled fluid-structure-flight dynamic system. Three test cases of increasing complexity are considered: a three-degree-of-freedom aerofoil (14 states, worst-case identified from 1,000 design sites), a Global Hawk-like UAV (540 states, 80 parametric calculations with 30 times speedup), and a very flexible flying-wing (1,616 states, 37 parametric calculations reduced from 222 hours to 22 minutes). The linear ROM is shown to be accurate for deformations below 10% of the wingspan, while the nonlinear ROM with second-order Taylor expansion accurately captures the large-deformation regime. The methodology provides a practical tool for integrating worst-case gust search into aircraft certification workflows. 2026-03-17T07:41:08Z Nikolaos D. Tantaroudas Andrea Da Ronch Ilias Karachalios Kenneth J. 
Badcock http://arxiv.org/abs/2603.16209v1 Physics-guided diffusion models for inverse design of disordered metamaterials 2026-03-17T07:40:11Z Disordered metamaterials are promising for programming physical properties across diverse applications, yet their inverse design remains challenging due to the non-intuitive structure-property relationships and large design spaces. Recent generative approaches, particularly diffusion models, have shown potential in high-dimensional inverse design tasks. However, existing methods typically rely on carefully crafted training objectives, such as conditional data-driven or physics-informed loss functions. Because these strategies are inherently task-specific, the model must be retrained from scratch whenever the design problem changes (e.g., different governing equations, boundary conditions, or design objectives), severely limiting their flexibility and generalization ability. In this work, we propose physics-guided diffusion models that leverage differentiable physics-based solvers to instantly guide the generative process for inverse design. Drawing inspiration from classifier guidance, we develop a sampling strategy that directly incorporates physics guidance into the reverse stochastic differential equations. Our approach enables task-adaptive generation using gradients from differentiable solvers, while the diffusion model itself needs to be trained only once on unlabeled data. Focusing on disordered foam metamaterials, we present three representative design tasks: (1) achieving target effective thermal conductivity, (2) matching desired load-displacement response, and (3) maximizing energy absorption involving fractures. In each scenario, the proposed method successfully generates foam-like geometries that fulfill the prescribed physical objectives. 
These results demonstrate the versatility, efficiency, and practicality of physics-guided diffusion models for tackling complex inverse design problems in disordered metamaterials and beyond. 2026-03-17T07:40:11Z 30 pages, 13 figures Ziyuan Xie Weipeng Xu Dazhi Zhao Wenchang Zhang Daoyang Dong Bingbing Xu Ning Liu Sheng Mao Tianju Xue http://arxiv.org/abs/2603.16112v1 ASDA: Automated Skill Distillation and Adaptation for Financial Reasoning 2026-03-17T04:25:54Z Adapting large language models (LLMs) to specialized financial reasoning typically requires expensive fine-tuning that produces model-locked expertise. Training-free alternatives have emerged, yet our experiments show that leading methods (GEPA and ACE) achieve only marginal gains on the FAMMA financial reasoning benchmark, exposing the limits of unstructured text optimization for complex, multi-step domain reasoning. We introduce Automated Skill Distillation and Adaptation (ASDA), a framework that automatically generates structured skill artifacts through iterative error-corrective learning without modifying model weights. A teacher model analyzes a student model's failures on financial reasoning tasks, clusters errors by subfield and error type, and synthesizes skill files containing reasoning procedures, code templates, and worked examples, which are dynamically injected during inference. Evaluated on FAMMA, ASDA achieves up to +17.33% improvement on arithmetic reasoning and +5.95% on non-arithmetic reasoning, substantially outperforming all training-free baselines. The resulting skill artifacts are human-readable, version-controlled, and compatible with the Agent Skills open standard, offering any organization with a labeled domain dataset a practical and auditable path to domain adaptation without weight access or retraining. 
2026-03-17T04:25:54Z Tik Yu Yim Wenting Tan Sum Yee Chan Tak-Wah Lam Siu Ming Yiu http://arxiv.org/abs/2510.10402v2 Controllable Graph Generation with Diffusion Models via Inference-Time Tree Search Guidance 2026-03-17T04:12:55Z Graph generation is a fundamental problem in graph learning with broad applications across Web-scale systems, knowledge graphs, and scientific domains such as drug and material discovery. Recent approaches leverage diffusion models for step-by-step generation, yet unconditional diffusion offers little control over desired properties, often leading to unstable quality and difficulty in incorporating new objectives. Inference-time guidance methods mitigate these issues by adjusting the sampling process without retraining, but they remain inherently local, heuristic, and limited in controllability. To overcome these limitations, we propose TreeDiff, a Monte Carlo Tree Search (MCTS) guided dual-space diffusion framework for controllable graph generation. TreeDiff is a plug-and-play inference-time method that expands the search space while keeping computation tractable. Specifically, TreeDiff introduces three key designs to make it practical and scalable: (1) a macro-step expansion strategy that groups multiple denoising updates into a single transition, reducing tree depth and enabling long-horizon exploration; (2) a dual-space denoising mechanism that couples efficient latent-space denoising with lightweight discrete correction in graph space, ensuring both scalability and structural fidelity; and (3) a dual-space verifier that predicts long-term rewards from partially denoised graphs, enabling early value estimation and removing the need for full rollouts. Extensive experiments on 2D and 3D molecular generation benchmarks, under both unconditional and conditional settings, demonstrate that TreeDiff achieves state-of-the-art performance. 
Notably, TreeDiff exhibits favorable inference-time scaling: it continues to improve with additional computation, while existing inference-time methods plateau early under limited resources. 2025-10-12T01:40:33Z Accepted by WWW 2026 Jiachi Zhao Zehong Wang Yamei Liao Chuxu Zhang Yanfang Ye http://arxiv.org/abs/2603.08108v2 Tau-BNO: Brain Neural Operator for Tau Transport Model 2026-03-17T01:49:20Z Mechanistic modeling provides a biophysically grounded framework for studying the spread of pathological tau protein in tauopathies like Alzheimer's disease. Existing approaches typically model tau propagation as a diffusive process on the brain's structural connectome, reproducing macroscopic patterns but neglecting microscale cellular transport and reaction mechanisms. The Network Transport Model (NTM) was introduced to fill this gap, explaining how region-level progression of tau emerges from microscale biophysical processes. However, the NTM faces a common challenge for complex models defined by large systems of partial differential equations: the inability to perform parameter inference and mechanistic discovery due to high computational burden and slow model simulations. To overcome this barrier, we propose Tau-BNO, a Brain Neural Operator surrogate framework for rapidly approximating NTM dynamics that captures both intra-regional reaction kinetics and inter-regional network transport. Tau-BNO combines a function operator that encodes kinetic parameters with a query operator that preserves initial state information, while approximating anisotropic transport through a spectral kernel that retains directionality. Empirical evaluations demonstrate high predictive accuracy ($R^2\approx$ 0.98) across diverse biophysical regimes and an 89\% performance improvement over state-of-the-art sequence models like Transformers and Mamba, which lack inherent structural priors. 
By reducing simulation time from hours to seconds, we show that the surrogate model is capable of producing new insights and generating new hypotheses. This framework is readily extensible to a broader class of connectome-based biophysical models, showcasing the transformative value of deep learning surrogates to accelerate analysis of large-scale, computationally intensive dynamical systems. 2026-03-09T08:52:02Z Nuutti Barron Heng Rao Urmi Saha Yu Gu Zhenghao Liu Ge Yu Defu Yang Ashish Raj Minghan Chen http://arxiv.org/abs/2603.15943v1 Scientific Machine Learning-assisted Model Discovery from Telemetry Data 2026-03-16T21:47:43Z Calibration of dynamic models to data is an important step in building digital twins of HVAC equipment, thermal loads and control systems. Sometimes, when a model fails to calibrate to data, a possible cause is that the model has made too many simplifying assumptions and is missing physics. In this paper we propose a semi-automated approach, called Dyad Model Discovery, that can augment the physical equations of the model with symbolic expressions discovered from the data. We demonstrate this method on a digital twin of a transportation refrigeration unit to improve its predictive performance, trained using telemetry data. An engineer-in-the-loop workflow is proposed, which provides suggestions to the user which can then be accepted or rejected. This is the first AI-assisted engineering design workflow to our knowledge. 2026-03-16T21:47:43Z Sebastian Micluta-Campeanu Avinash Subramanian Anas Abdelrehim Ranjan Anantharaman Rohit Dhumane Brad Carman Chris Rackauckas http://arxiv.org/abs/2603.15917v1 Bayesian-guided inverse design of hyperelastic microstructures: Application to stochastic metamaterials 2026-03-16T21:09:57Z From a given pool of all feasible design variants, our aim is to identify a structure that achieves a target macroscopic stress response. 
For each candidate design, the response is obtained from a high-fidelity oracle, in particular, time- and resource-intensive computational homogenization or experiments. We consider the case where (i) the geometry cannot be conveniently parameterized, rendering gradient-based optimization inapplicable, and (ii) brute-force evaluation of all candidates is infeasible due to the cost of oracle queries. To tackle this challenge, we propose a Bayesian-guided inverse design framework that proceeds as follows. First, the dimensionality of the design variants is reduced through statistical feature engineering, and the resulting low-dimensional descriptors are mapped to effective constitutive parameters describing the macroscopic hyperelastic response. This mapping is modeled using a multi-output Gaussian process surrogate that accounts for correlations between the parameters. The surrogate is trained using uncertainty-driven active learning under severe budget constraints, allowing only a very limited number of high-fidelity oracle evaluations. Based on surrogate predictions, a finite number of promising candidates are shortlisted. Since the surrogate accuracy is inherently limited, the final selection of the optimal design is performed through high-fidelity oracle evaluations within the shortlist. In numerical test cases, we consider a dataset of 50,000 candidate structures. Active learning requires labeling less than half a percent of the full dataset. Bayesian-guided inverse design under unseen loading conditions reaches a prescribed error threshold with only a handful of oracle evaluations in the majority of cases. 
2026-03-16T21:09:57Z Hooman Danesh Henning Wessels http://arxiv.org/abs/2603.15722v1 A Framework and Prototype for a Navigable Map of Datasets in Engineering Design and Systems Engineering 2026-03-16T17:08:20Z The proliferation of data across the system lifecycle presents both a significant opportunity and a challenge for Engineering Design and Systems Engineering (EDSE). While this ``digital thread'' has the potential to drive innovation, the fragmented and inaccessible nature of existing datasets hinders method validation, limits reproducibility, and slows research progress. Unlike fields such as computer vision and natural language processing, which benefit from established benchmark ecosystems, engineering design research often relies on small, proprietary, or ad-hoc datasets. This paper addresses this challenge by proposing a systematic framework for a ``Map of Datasets in EDSE.'' The framework is built upon a multi-dimensional taxonomy designed to classify engineering datasets by domain, lifecycle stage, data type, and format, enabling faceted discovery. An architecture for an interactive discovery tool is detailed and demonstrated through a working prototype, employing a knowledge graph data model to capture rich semantic relationships between datasets, tools, and publications. An analysis of the current data landscape reveals underrepresented areas (``data deserts'') in early-stage design and system architecture, as well as relatively well-represented areas (``data oases'') in predictive maintenance and autonomous systems. The paper identifies key challenges in curation and sustainability and proposes mitigation strategies, laying the groundwork for a dynamic, community-driven resource to accelerate data-centric engineering research. 2026-03-16T17:08:20Z 10 pages, 3 figures, Submitted to ASME IDETC 2026-DAC22 H. Sinan Bank Daniel R. Herber
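
The faceted discovery described in the last abstract above (classifying datasets by domain, lifecycle stage, data type, and format) can be pictured with a minimal sketch. This is an illustration only, not the authors' prototype: the abstract describes a knowledge graph data model, whereas this sketch swaps in flat records, and every name here (`DatasetRecord`, `facet_search`, `CATALOGUE`, and the catalogue entries themselves) is hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetRecord:
    """One catalogue entry; the four facet fields mirror the taxonomy axes
    named in the abstract. The class itself is a hypothetical illustration."""
    name: str
    domain: str
    lifecycle_stage: str
    data_type: str
    data_format: str


def facet_search(records, **facets):
    """Return the records whose fields equal every requested facet value.

    With no facets given, all records are returned.
    """
    return [
        record for record in records
        if all(getattr(record, key) == value for key, value in facets.items())
    ]


# Invented entries for illustration only; they are not real datasets.
CATALOGUE = [
    DatasetRecord("bearing-vibration-demo", "predictive maintenance",
                  "operation", "time series", "csv"),
    DatasetRecord("concept-sketch-demo", "early-stage design",
                  "concept", "image", "png"),
    DatasetRecord("fleet-telemetry-demo", "predictive maintenance",
                  "operation", "time series", "parquet"),
]

# Combine two facets to narrow the result set.
hits = facet_search(CATALOGUE, domain="predictive maintenance", data_format="csv")
print([record.name for record in hits])
```

Exact-match filtering over flat records is the simplest possible stand-in; a graph model like the one the abstract mentions would additionally capture relationships between datasets, tools, and publications, which this sketch does not attempt.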