https://arxiv.org/api/QUKEeFIisCTE//riQsp/ecFcoLg
2026-03-22T10:29:32Z
599784515

http://arxiv.org/abs/2512.00348v2
Exposed extreme rays of the SONC cone
2026-03-19T00:33:45Z
We provide a complete and explicit characterization of the exposed extreme rays of the cone of sums of nonnegative circuit (SONC) polynomials. The criterion we derive is purely combinatorial and depends only on the existence of certain circuits within the ground set and on the nature of the corresponding extreme ray. Our constructive proofs also yield explicit exposing functionals, offering a basis for algorithmic detection of exposed rays in SONC-based optimization.
2025-11-29T06:39:30Z
11 pages
Mareike Dressler, Hongzhi Liao, Vera Roshchina

http://arxiv.org/abs/2603.18367v1
Stabilization of highly nonlinear hybrid stochastic differential delay equations by periodically intermittent feedback controls based on discrete-time observations with asynchronous switching
2026-03-19T00:04:13Z
In this paper, we will investigate the moment exponential stabilization of highly nonlinear hybrid stochastic differential delay equations. A periodically intermittent controller based on discrete time state observations with asynchronous switching is designed. The upper bound of observation period as well as the lower bound of the control width are all obtained. Firstly, the finiteness and boundedness of the $p$-th moment of the solution are established under a generalized Khasminskii-type condition. Then reasonable conditions of control function, drift and diffusion coefficients are presented. Then exponential stability as well as the convergence rate of controlled system are proved.
Finally, an example is presented to interpret the conclusion, which also indicates that the proportion of control interval has positive relation to the convergence rate.
2026-03-19T00:04:13Z
22 pages
Guangqiang Lan, Fansai Meng

http://arxiv.org/abs/2509.08759v3
Fourier Learning Machines: Nonharmonic Fourier-Based Neural Networks for Scientific Machine Learning
2026-03-18T23:01:08Z
We introduce the Fourier Learning Machine (FLM), a neural network (NN) architecture designed to represent a multidimensional nonharmonic Fourier series. The FLM uses a simple feedforward structure with cosine activation functions to learn the frequencies, amplitudes, and phase shifts of the series as trainable parameters. This design allows the model to create a problem-specific spectral basis adaptable to both periodic and nonperiodic functions. Unlike previous Fourier-inspired NN models, the FLM is the first architecture able to represent a multidimensional Fourier series with a complete set of basis functions in separable form, doing so by using a standard Multilayer Perceptron-like architecture. A one-to-one correspondence between the Fourier coefficients and amplitudes and phase-shifts is demonstrated, allowing for the translation between a full, separable basis form and the cosine phase-shifted one. Additionally, we evaluate the performance of FLMs on several scientific computing problems, including benchmark Partial Differential Equations (PDEs) and a family of Optimal Control Problems (OCPs).
Computational experiments show that the performance of FLMs is comparable, and often superior, to that of established architectures like SIREN and vanilla feedforward NNs.
2025-09-10T16:49:20Z
Please cite the peer-reviewed, published version available on Transactions on Machine Learning Research at https://openreview.net/forum?id=LPKt5vd7yz
Transactions on Machine Learning Research, December 2025
Mominul Rubel, Adam Meyers, Gabriel Nicolosi

http://arxiv.org/abs/2603.18307v1
Adversarial Robustness for Matrix Control Barrier Functions in Sampled-Data Systems
2026-03-18T21:47:43Z
This paper presents novel theoretical results to guarantee multi-agent set invariance using Matrix Control Barrier Functions in sampled-data systems. More specifically, the paper presents conditions under which heterogeneous control-affine agents applying zero-order-hold control inputs can compute control inputs to render safe sets defined by matrix inequalities forward invariant. It then introduces methods to guarantee set invariance while accounting for the presence of adversarial agents seeking to drive the system state to unsafe sets. Finally, the paper presents theoretical extensions of these set invariance results to systems having high relative degree with respect to the matrix-valued safe set function.
2026-03-18T21:47:43Z
James Usevitch

http://arxiv.org/abs/2603.18304v1
Forward-Backward Dynamic Programming for LQG Dynamic Games with Partial and Asymmetric Information
2026-03-18T21:41:19Z
We formulate and study a class of two-player zero-sum stochastic dynamic games with partial and asymmetric information. Information asymmetry introduces fundamental challenges involving \emph{belief representation} and \emph{theory of mind} issues, where agents must impute belief states and estimates of other agents to inform their own strategy.
To avoid an infinite regress of higher-order beliefs amongst agents and obtain computationally implementable results, we focus on a linear quadratic Gaussian (LQG) model and consider strategies with limited internal state dimension. We present a novel iterative forward-backward algorithm to jointly compute belief states and equilibrium strategies and value functions for a finite-horizon problem. We also present a value iteration-like algorithm to jointly compute stationary belief states and equilibrium strategies for an average-cost infinite-horizon problem. An open-source implementation of the algorithms is provided, and we demonstrate the effectiveness of the proposed algorithms in numerical experiments.
2026-03-18T21:41:19Z
Yuxiang Guan, Iman Shames, Tyler Summers

http://arxiv.org/abs/2603.18283v1
Turnpike with Uncertain Measurements: Triangle-Equality ILP with a Deterministic Recovery Guarantee
2026-03-18T20:59:45Z
We study Turnpike with uncertain measurements: reconstructing a one-dimensional point set from an unlabeled multiset of pairwise distances under bounded noise and rounding. We give a combinatorial characterization of realizability via a multi-matching that labels interval indices by distinct distance values while satisfying all triangle equalities. This yields an ILP based on the triangle equality whose constraint structure depends only on the two-partition set $\mathcal{P}_y=\{(r,s,t): y_r+y_s=y_t\}$ and a natural LP relaxation with $\{0,1\}$-coefficient constraints. Integral solutions certify realizability and output an explicit assignment matrix, enabling an assignment-first, regression-second pipeline for downstream coordinate estimation. Under bounded noise followed by rounding, we prove a deterministic separation condition under which $\mathcal{P}_y$ is recovered exactly, so the ILP/LP receives the same combinatorial input as in the noiseless case.
Experiments illustrate integrality behavior and degradation outside the provable regime.
2026-03-18T20:59:45Z
16 pages, 4 figures
C. S. Elder, Guillaume Marçais, Carl Kingsford

http://arxiv.org/abs/2603.12140v2
Forecasting and Manipulating the Forecasts of Others
2026-03-18T20:40:18Z
In strategic environments with private information, evaluating a change in policy requires predicting how the equilibrium responds -- but when actions reshape opponents' signals, each agent's optimal response depends on an infinite hierarchy of beliefs about beliefs that has resisted exact analysis for four decades. We provide the first exact equilibrium characterization of finite-player continuous-time LQG games with endogenous signals. Conditioning on primitive Brownian shocks rather than the physical state -- a dynamic analogue of Harsanyi's common-prior construction -- collapses the belief hierarchy onto deterministic two-time kernels, reducing Nash equilibrium to a deterministic fixed point with no truncation and no large-population limit. The characterization yields an explicit information wedge that prices the marginal value of shifting opponents' posteriors. The wedge vanishes precisely when signals are exogenous to controls, formally delineating the boundary where strategic belief manipulation matters, and provides a closed-form mapping from information primitives to equilibrium outcomes.
2026-03-12T16:43:21Z
53 pages, 7 figures
Sam Babichenko

http://arxiv.org/abs/2603.18249v1
RAFT-UP: Robust Alignment for Spatial Transcriptomics with Explicit Control of Spatial Distortion
2026-03-18T20:15:41Z
Spatial transcriptomics (ST) profiles gene expression across a tissue section while preserving the spatial coordinates.
Because current ST technologies typically profile two-dimensional tissue slices, integrating and aligning slices from different regions of the same three-dimensional tissue or from samples under different conditions enables analyses that reveal 3D organization and condition-associated spatial patterns. Two major challenges remain. First, interpretable and flexible control over spatial distortion is needed because rigid transformations can be overly restrictive, whereas highly deformable mappings may arbitrarily distort spatial proximity. Second, biologically plausible matching is also needed, especially when the slices overlap partially. Here, we introduce RAFT-UP, a tool for robust ST alignment that provides explicit control over spatial distance preservation through a fused supervised Gromov-Wasserstein (FsGW) optimal transport framework. FsGW combines expression and spatial information, incorporates spot-wise constraints to discourage biologically implausible matches, and enforces a pairwise distance-consistency constraint that prevents mapping two pairs of spots when their spatial distances differ beyond a specified tolerance. We demonstrate that RAFT-UP accurately aligns slices from different regions of the same tissue and slices from different samples. Benchmarking shows that RAFT-UP improves spatial distance preservation while achieving spot label matching accuracy comparable to state-of-the-art methods. Finally, we demonstrate RAFT-UP on two spatially constrained downstream applications, including spatiotemporal mapping of developing mouse midbrain and comparative cross-slice analysis of cell-cell communication. 
RAFT-UP is available as open-source software.
2026-03-18T20:15:41Z
Yaqi Wu, Jingfeng Wang, Xin Maizie Zhou, Yanxiang Zhao, Zixuan Cang

http://arxiv.org/abs/2503.20510v3
Extended mean field control: a global numerical solution via finite-dimensional approximation
2026-03-18T19:09:02Z
We investigate the global numerical approximation of a class of extended mean field control problems (MFC), where the dynamics and costs depend on the joint distribution of the state and the control. We propose a framework to approximate the value function globally over the Wasserstein space, moving beyond the restriction of fixed initial conditions. Our approach exploits the propagation of chaos by approximating the infinite-dimensional MFC problem by an $N$-player cooperative game, together with the usage of finite-dimensional solvers. This method avoids the need to parametrise functions on an infinite-dimensional space, offering a balance between probabilistic rigor and computational efficiency.
2025-03-26T12:50:56Z
Athena Picarelli, Marco Scaratti, Jonathan Tam

http://arxiv.org/abs/2603.18215v1
Solving Sparsity Constrained PCA, Regression, and QCQP via the Spartrahedron
2026-03-18T19:01:13Z
Sparsity is a fundamental modeling principle in statistics, signal processing, and data science. However, optimization with sparsity constraints is notoriously difficult. We introduce a new convex relaxation framework for {sparse quadratically constrained quadratic programs} (QCQPs), a class that subsumes sparse regression, sparse principal component analysis (PCA), and related problems. Our approach is based on a novel convex cone, the spartrahedron, which exactly characterizes sparsity at the matrix level. This leads to a semidefinite programming (SDP) relaxation that is tight whenever its solution is rank-one, providing a simple certificate of global optimality.
We establish theoretical guarantees, including approximation bounds and exactness regions for sparse PCA and sparse ridge regression, as well as a general stability result under perturbations. Numerical experiments on sparse PCA, sparse regression, RIP constant estimation, and sparse canonical correlation analysis (CCA) demonstrate the practical success of our methods.
2026-03-18T19:01:13Z
Diego Cifuentes, Zhuorui Li

http://arxiv.org/abs/2512.22124v2
The Solution of Potential-Driven, Steady-State Nonlinear Network Flow Equations via Graph Partitioning
2026-03-18T18:36:37Z
The solution of potential-driven steady-state flow in large networks is required in various engineering applications, such as transport of natural gas or water through pipeline networks. The resultant system of nonlinear equations depends on the network topology, and its solution grows more challenging as the network size increases. We present an algorithm that utilizes a given partition of a network into tractable sizes to compute a global solution for the full nonlinear system through local solution of smaller subsystems induced by the partitions. When the partitions are induced by interconnects or transfer points corresponding to networks owned by different operators, the method ensures data is shared solely at the interconnects, leaving network operators free to solve the network flow system corresponding to their own domain in any manner of their choosing. The proposed method is shown to be connected to the Schur complement and the method's viability demonstrated on some challenging test cases.
2025-11-21T02:00:00Z
6 pages, 2 figures
Shriram Srinivasan, Kaarthik Sundar
10.1109/LCSYS.2026.3674163

http://arxiv.org/abs/2603.18177v1
A Hybrid Decomposition Approach for Stochastic Unit Commitment with Combined-Cycle Generators
2026-03-18T18:21:32Z
The U.S. power grid is undergoing a major paradigm shift with the increased development of renewable generators, electric vehicles, and data centers.
In response to this growing need, the U.S. has ramped up the construction of combined-cycle generators (CCs). CCs are fast-ramping generators that utilize variable configurations of combustion turbines (CTs) and steam turbines (STs) to achieve much higher efficiency than traditional CTs alone. For schedule optimization, this requires the addition of a large number of binary constraints and variables in Unit Commitment (UC) problem formulations. This paper presents a novel hybrid Benders' (BD) and Dantzig-Wolfe (DW) decomposition algorithm for stochastic UC problems with CCs. The algorithm exploits the separability of the linear constraints in UC through BD and the integer CC constraints through DW. Results are presented for the 935-generator FERC test data set, modified to include mode data for CCs. The algorithm demonstrates a significant speed-up over traditional BD across all cases. It also demonstrates better convergence rates on cases with 25 or more scenarios than both BD and Gurobi's branch-and-bound solver. These cases show that the proposed algorithm is a scalable approach for solving stochastic UC.
2026-03-18T18:21:32Z
12 pages
Rosemary Barrass, Harsha Nagarajan, Mathieu Tanneau, Russell Bent, Pascal Van Hentenryck

http://arxiv.org/abs/2603.17981v1
A Convex Formulation of the Multi-Commodity Dynamic Traffic Assignment
2026-03-18T17:43:14Z
We consider a multi-commodity Dynamic Traffic Assignment (DTA) problem formulated as a network flow control problem on the Cell Transmission Model (CTM). The objective is to design optimal control policies using variable speed limits, ramp metering, and dynamic routing to regulate traffic evolution over time on a given limited-capacity transportation network. Even simple instances of DTA problems on the CTM are known to give rise to non-convex optimal control formulations.
Nevertheless, a single-commodity DTA formulation has recently been proposed that admits a tight convex relaxation, thereby enabling tractable optimal control synthesis. The single-commodity formulation, however, is structurally restrictive, as it effectively allows only a single destination. To address this limitation, we develop a multi-commodity CTM model in which each commodity is associated with potentially distinct sets of off-ramps. By extending the convexification approach developed for the single-commodity case, we establish a tight convex relaxation of the multi-commodity DTA problem on the CTM model. This relaxation relies on concave, commodity-specific demand functions and concave aggregate supply functions for every cell, which ensure convexity of the resulting optimal control problem. Our proposed formulation requires commodity-dependent implementation of variable speed limits and dynamic routing policies.
2026-03-18T17:43:14Z
Davide Sipione, Giacomo Como, Gustav Nilsson

http://arxiv.org/abs/2603.17970v1
Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training
2026-03-18T17:37:31Z
Orthogonalized-momentum optimizers such as Muon improve transformer training by approximately whitening/orthogonalizing matrix-valued momentum updates via a short polar-decomposition iteration. However, polar-factor approximations typically require multiple large matrix multiplications, and the resulting overhead can be substantial and hardware-dependent. We introduce MUD (MomentUm Decorrelation), a complementary whitening approach that replaces Muon's polar update with a triangular (Cholesky-like) whitening surrogate inspired by classical Gram--Schmidt and Gauss-Seidel ideas. We show that row-orthonormal matrices are fixed points of the MUD map, relate the inner step to symmetric Gauss-Seidel preconditioning of the Gram matrix, and prove quadratic local convergence near the fixed point.
MUD yields consistent 10-50\% wall-clock improvements over tuned AdamW and Muon in time-to-perplexity, typically converging slightly slower per step than Muon but with substantially lower optimizer overhead -- relative to Muon, MUD improves peak tokens/s by roughly $1.3-2.6\times$ across most settings and up to nearly $3\times$ on GPT-2 large on an A100. We also demonstrate training an ESM-2 150M protein language model, where MUD matches Muon-level validation perplexity in significantly less wall-clock time.
2026-03-18T17:37:31Z
Ben S. Southworth, Stephen Thomas

http://arxiv.org/abs/2603.17934v1
State-dependent temperature control in Langevin diffusions using numerical exploratory Hamiltonian-Jacobi-Bellman equations
2026-03-18T17:09:04Z
Choosing how much noise to add in Langevin dynamics is essential for making these algorithms effective in challenging optimization problems. One promising approach is to determine this noise by solving Hamilton-Jacobi-Bellman (HJB) equations and their exploratory variants. Though these ideas have been demonstrated to work well in one dimension, extension to high-dimensional minimization has been limited by two unresolved numerical challenges: setting reliable control bounds and stably computing the second-order information (Hessians) required by the equations. These issues and the broader impact of HJB parameters have not been systematically examined. This work provides the first such investigation. We introduce principled control bounds and develop a physics-informed neural network framework that embeds the structure of exploratory HJB equations directly into training, stabilizing computation, and enabling accurate estimation of state-dependent noise in high-dimensional problems. Numerical experiments demonstrate that the resulting method remains robust and effective well beyond low-dimensional test cases.
2026-03-18T17:09:04Z
Taorui Wang, Xun Li, Gu Wang, Zhongqiang Zhang