https://arxiv.org/api/kTPna3Wm9KhmzXAqXBd025eF3Zs2026-05-16T00:59:50Z118563015http://arxiv.org/abs/2605.03817v1Data-driven Initial Gap Identification of Piecewise-linear Systems using Sparse Regression and Universal Approximation Theorem2026-05-05T14:45:46ZThis paper proposes a method for identifying an initial gap in piecewise-linear systems from data. Piecewise-linear systems appear in many engineered systems such as degraded mechanical systems and infrastructures, and are known to show strong nonlinearities. To analyze the behavior of such piecewise-linear systems, it is necessary to identify the initial gap, at which the system behavior switches. The proposed method identifies the initial gap by discovering the governing equations using sparse regression and calculating the gap based on the universal approximation theorem. A key step to achieve this is to approximate a piecewise-linear function by a finite sum of piecewise-linear functions in sparse regression. The equivalent gap is then calculated from the coefficients of the multiple piecewise-linear functions and their respective switching points in the obtained equation. The proposed method is first applied to a numerical model to confirm its applicability to piecewise-linear systems. Experimental validation of the proposed method has then been conducted with a simple mass-spring-hopping system, where the method successfully identifies the initial gap in the system with high accuracy.2026-05-05T14:45:46ZJournal of Computational and Nonlinear Dynamics, 19(6), 061003 (2024)Ryosuke KankiAkira Saito10.1115/1.4065440http://arxiv.org/abs/2601.16226v2D-MODD: A Diffusion Model of Opinion Dynamics Derived from Online Data2026-05-05T09:10:05ZWe present the first empirical derivation of a continuous-time stochastic model for real-world opinion dynamics. 
Using longitudinal social-media data to infer users' opinions on a binary climate-change topic, we reconstruct the underlying drift and diffusion functions governing individual opinion updates. We show that the observed dynamics are well described by a Langevin-type stochastic differential equation, with persistent attractor basins and spatially sensitive drift and diffusion terms. The empirically inferred one-step transition probabilities closely reproduce the transition kernel generated from the D-MODD model we introduce. Our results provide the first direct evidence that online opinion dynamics on a polarized topic admit a Markovian description at the operator level, with empirically reconstructed transition kernels accurately reproduced by a data-driven Langevin model, bridging sociophysics, behavioral data, and complex-systems modeling.2026-01-16T16:17:44ZIxandra AchitouvDavid ChavalariasRaphael Fournier-S'niehottahttp://arxiv.org/abs/2605.03267v1Partial Effective Information Decomposition for Synergistic Causality2026-05-05T01:39:17ZCausality is a central topic in scientific inquiry, yet for complex systems, the identification and analysis of synergistic causation remain a challenging and fundamental problem. In the context of causal relations among multivariate variables, a decomposition framework grounded in interventionist causation is still lacking. To address this gap, this paper proposes Partial Effective Information Decomposition (PEID), a framework that decomposes the influence of multiple source variables on a target variable under maximum-entropy interventions into unique and synergistic information, thereby providing a unified and computable characterization of synergistic causal relations. Theoretically, in the three-variable case, the proposed framework is compatible with the major axioms of Partial Information Decomposition (PID).
Empirically, under maximum-entropy interventions, correlations among input variables are removed, causing redundancy to vanish and thereby enabling PEID to compute synergistic relations. Furthermore, based on this framework, it is possible to define causal graphs containing hyperedges as well as downward causation, thus offering a unified toolkit for analyzing cross-scale and multivariate causal mechanisms in complex systems. Finally, applying the framework to a machine-learning-based air quality forecasting task on KnowAir-V2, we demonstrate that PEID can extract interpretable inter-station causal structures from a learned dynamical model. These results suggest that PEID provides a general interventionist information-theoretic tool for analyzing multivariate and synergistic causal mechanisms in complex systems.2026-05-05T01:39:17ZMingzhe YangShuo WangJiang Zhanghttp://arxiv.org/abs/2605.02873v1Fixed-detector tilt--defocus sensing by upstream source coding in a time-reversed Young interferometer2026-05-04T17:46:39ZWe propose a physically explicit sensing application of a time-reversed Young (TRY) interferometer: simultaneous monitoring of beam tilt and focus drift with a fixed detector. The task is relevant to compact optical relays, free-space links, fiber-coupling stages, and micro-optical alignment modules, where continuous tracking of pointing and focus is needed but downstream wavefront cameras or multiport analyzers are undesirable. Using a finite-width double-slit Fresnel model, we derive the exact local TRY response functions for tilt-like and defocus-like phase perturbations and compute the corresponding optimal upstream source codes numerically. The physical optimal codes are fringe-locked and differ qualitatively from the simple odd/even modes suggested by Gaussian toy models. Two source-coded scalar channels recover essentially all local Fisher information in the full source-resolved TRY record for the physical model considered here. 
Compared with downstream direct intensity sensing, TRY provides first-order access to the mixed tilt--defocus task with fixed detection; compared with ideal downstream matched-mode sorting, its advantage is architectural rather than fundamental.2026-05-04T17:46:39Zthis is two-parameter estimation workJianming Wenhttp://arxiv.org/abs/2508.10266v2Dynamic mode decomposition for detecting oscillatory transient activity via sparsity and smoothness regularization2026-05-04T17:09:55ZDynamic Mode Decomposition (DMD) is a data-driven modal decomposition technique that extracts coherent spatio-temporal structures from high-dimensional time-series data. By decomposing the dynamics into a set of modes, each associated with a single frequency and a growth rate, DMD enables a natural modal decomposition and dimensionality reduction of complex dynamical systems. However, when DMD is applied to transient dynamics, even if a large number of modes are used, it remains difficult to interpret how these modes contribute to the transient behavior. In this study, we propose a simple extension of DMD that facilitates extraction of oscillatory transient activity by introducing time-varying amplitudes for the DMD modes based on sparsity and smoothness regularization. This approach enables identification of dynamically significant modes and extraction of their transient activities, providing a more interpretable representation of non-steady dynamics. We illustrate the validity of the proposed method using a simple example and then apply it to fluid flow data of a laminar airfoil wake exhibiting transient behavior. 
We demonstrate that it can capture the temporal structure of mode activations that are not accessible with the standard DMD method.2025-08-14T01:22:25Z14 pages, 9 figuresYutaro TanakaHiroya Nakaohttp://arxiv.org/abs/2601.10791v2OmniMol: Transferring Particle Physics Knowledge to Molecular Dynamics with Point-Edge Transformers2026-05-04T15:47:57ZWe present OmniMol, a state-of-the-art all-to-all transformer-based small molecule machine-learned interatomic potential (MLIP). OmniMol is built by adapting Omnilearned, a foundation model for particle jets found in high-energy physics (HEP) experiments such as at the Large Hadron Collider (LHC). Omnilearned is built with a Point-Edge-Transformer (PET) and pre-trained using a diverse set of one billion particle jets. It includes an interaction-matrix attention bias that injects pairwise sub-nuclear (HEP) or atomic (molecular-dynamics) physics directly into the transformer's attention logits, steering the network toward physically meaningful neighborhoods without sacrificing expressivity. We demonstrate OmniMol using the oMol dataset and find excellent performance even with relatively few examples for fine-tuning. Further, due to architectural transfer from Omnilearned, we demonstrate uniquely fast inference. This study lays the foundation for building interdisciplinary connections given datasets represented as collections of point clouds.2026-01-15T19:00:04Z9 pages, 10 figuresIbrahim ElsharkawyVinicius MikuniWahid BhimjiBenjamin Nachmanhttp://arxiv.org/abs/2605.02644v1Polymer Knots in Thin Films: Thickness Dependence, Local Effects, and Stiffness2026-05-04T14:29:30ZWe study how confinement affects topology and conformations in polymer films of varying thickness $h$. The knotting probability exhibits a maximum at intermediate thicknesses near the bulk radius of gyration $h \approx R_\mathrm{g,bulk}$, vanishes at small $h$ and approaches bulk values for large $h$. 
Close to walls, the entanglement length increases monotonically and conformations become flatter. A layer-resolved analysis of structural and topological properties allows us to reconstruct the explicit thickness dependencies by integrating layer-resolved properties of a thick film.2026-05-04T14:29:30ZMaurice P. SchmittHendrik MeyerPeter Virnauhttp://arxiv.org/abs/2604.08373v2Stochastic problems in pulsar timing2026-05-04T11:29:37ZLangevin stochastic differential equations provide a dynamical description of pulsar timing noise and gravitational wave background (GWB) signals. They are also central to state space algorithms that have gained traction in pulsar timing array analysis due to their linear computational scaling with the number of observations. In this work, we utilize established methods in diffusion theory to derive analytical time-domain solutions (means, covariances, and probability density functions) to Langevin equations relevant to red noise and the GWB signal in pulsars. The solutions give direct physical insight on the dynamics of pulsar timing signals. As a canonical example, we show that the pulsar spin frequency modeled as an Ornstein-Uhlenbeck process is mathematically inconsistent with a stationary GWB signal when the timing residual is the direct observable. The nonstationarity can be partially dealt with by marginalizing over long time deterministic trends in the data. Then, we show that a random process based on an overdamped harmonic oscillator supports both a stationary spin frequency and phase residuals, consistent with a stationary GWB signal. We also turn our attention to a phenomenological model of a neutron star -- a two-component model with spin wandering -- that has been motivated to explain observed timing noise in radio pulsars. We derive analytical expressions for the means, covariances, and cross-covariances of the crust and superfluid rotational states driven by white noise. 
The associated constant deterministic torques are linked to the quadratic spin-down of pulsars. The solutions reveal the physical origin of nonstationarity in the residual model: the coexistence of damped and diffusive eigenmodes of the system.2026-04-09T15:35:35Z27 pages + refs, 2 figures, discussion improvedReginald Christian Bernardohttp://arxiv.org/abs/2605.02453v1Testing General Relativity Through Gravitational Wave Classification: A Convolutional Neural Network Framework2026-05-04T10:57:10ZWe present a machine learning framework for testing general relativity (GR) with gravitational wave signals from binary black hole mergers. Using the source parameters of 173 BBH events from the GWTC catalog as a realistic astrophysical population, we generate simulated GR waveforms and construct beyond GR (BGR) waveforms by applying controlled phase deformations. We introduce a response function formalism that provides a systematic framework for quantifying how any observable responds to modifications of GR. We train convolutional neural networks (CNNs) on two input representations: whitened waveforms and a response function type observable derived from the waveform mismatch, which isolates the effect of phase deviations from the bulk signal. Using response functions as the CNN input improves the classification sensitivity by a factor of approximately 33 compared to whitened waveforms, demonstrating that the choice of observable representation is as important as the classifier architecture. We study the fundamental limits of this classification through Bayes optimal error analysis, averaging methods that reveal coherent patterns hidden in noise, and a comparison between CNN accuracy and a single feature classifier as a proxy for human performance. At all deformation scales, the CNN outperforms the best single feature approach. 
We extend the framework to physically motivated theories using the parameterized post Einsteinian (ppE) formalism and apply it to massive gravity, where the classifier detects deviations for graviton masses of order $m_g \sim 10^{-23}\;\mathrm{eV}/c^2$ with aLIGO design sensitivity.2026-05-04T10:57:10Z36 pages, 20 figures, 4 tables. Comments welcome!Lavinia HeisenbergShayan HemmatyarHector Villarrubia-Rojohttp://arxiv.org/abs/2601.20805v4Plotting correlated data2026-05-04T09:00:48ZA very common task in data visualization is to plot many data points with some measured y-value as a function of fixed x-values. Uncertainties on the y-values are typically presented as vertical error bars that represent either a Frequentist confidence interval or Bayesian credible interval for each data point. Most of the time, these error bars represent a 68\% confidence/credibility level, which leads to the intuition that a model fits the data reasonably well if its prediction lies within the error bars of roughly two thirds of the data points. Unfortunately, this and other intuitions no longer work when the uncertainties of the data points are correlated. If the error bars only show the square root of diagonal elements of some covariance matrix with non-negligible off-diagonal elements, we simply do not have enough information in the plot to judge whether a drawn model line agrees well with the data or not. In this paper we will demonstrate this problem and discuss ways to add more information to the plots to make it easier to judge the agreement between the data and some model prediction in the plot, as well as glean some insight where the model might be deficient. 
This is done by explicitly showing the contribution of the first principal component of the uncertainties, and by displaying the conditional uncertainties of all data points.2026-01-28T17:50:06Z13 pages, 10 figures, added notebook as supplementary materialLukas Koch10.52933/jdssv.v6i2.170http://arxiv.org/abs/2605.08155v1Structural and Lagrangian properties of analogue ensembles to characterize multifractality of stochastic processes2026-05-04T08:50:24ZWe present a framework for the scale-invariance characterization of stochastic processes in reconstructed finite-dimensional phase spaces. This framework analyses the structural and dynamical properties of the phase space and is based on a Takens embedding reconstruction followed by the definition of ensembles of analogue states. We define the analogues of a target state as its nearest neighbors. Then, we specify a collection of target states densely sampling the full phase space. For each target state, we search for the ensemble of its k-best analogues and we analyze its volume and dynamics. First, we study the probability distribution of the volumes and relate its mean and variance to the scale-invariance properties of the stochastic process. Second, we study the Lagrangian properties of the analogues by characterizing how they disperse in time. More particularly, we study the volume occupied by the analogues' successors as a function of time and of their initial volume. We link these dynamical properties to the scale-invariance properties of the process. We analyze two types of stationary and dissipative 1-dimensional scale-invariant processes: regularized fractional Brownian motion and regularized multifractal random walk.
For both processes, the structure and dynamics of the phase space are determined by their scale-invariant properties.2026-05-04T08:50:24ZCarlos Granero-BelinchonODYSSEY, IMT Atlantique - MEE, Lab-STICC\_OSEhttp://arxiv.org/abs/2605.02323v1When Attention Collapses: Residual Evidence Modeling for Compositional Inference2026-05-04T08:18:02ZCompositional inference - the decomposition of observations into an unknown number of latent components - is central to perception and scientific data analysis. Attention-based models perform well when components are approximately separable, as in object-centric vision. Under additive superposition, however - where multiple components contribute to every observation - we identify a structural failure mode we term slot collapse: multiple slots converge to the same dominant component while weaker ones remain unrepresented. We trace this to a general limitation: attention is memoryless with respect to explained evidence. All slots repeatedly operate on the same input without accounting for what has already been explained, so gradients are dominated by the strongest component, inducing shared fixed points across slots. As a result, attention fails to enforce non-redundant allocation under additive superposition. We address this by introducing residual evidence modeling, instantiated via evidence depletion - a minimal modification combining multiplicative depletion with an attention bias. Controlled ablations show that parallel attention, sequential processing alone, and loss-based regularization fail to resolve collapse; evidence depletion, which adds residual state to sequential attention, consistently succeeds. Across synthetic benchmarks and real-world audio mixtures (FUSS), evidence depletion reduces slot collapse by up to an order of magnitude, generalizing beyond synthetic settings. 
On gravitational-wave source inference for the ESA/NASA LISA mission, under identical architectures, data, and losses, standard attention fails while evidence depletion prevents collapse and enables multi-source posterior estimation. These results show that under additive superposition, residual evidence tracking is the operative ingredient for preventing collapse and enabling compositional inference.2026-05-04T08:18:02ZNiklas Houbahttp://arxiv.org/abs/2604.06704v2Biases in the Determination of Correlations Between Underground Muon Flux and Atmospheric Temperature2026-05-04T03:50:25ZThe underground rates of cosmic-ray muons exhibit seasonal variations correlated with effective atmospheric temperature, quantified via a single coefficient. We compare two analysis methods for studying the correlation: the standard Unbinned Method, where all rate-temperature data points are fit simultaneously via linear regression, and the Binned Method, where data points with similar temperatures are first grouped into bins before fitting. We find that while both methods are unbiased in the limit of negligible temperature uncertainties, the Binned Method develops significant bias when temperature uncertainties are present, due to binning-induced distortions. In contrast, the Unbinned Method remains robust if the uncertainties are accurately known. To address the widely encountered issue of imprecise uncertainty estimation, we propose a novel procedure that assesses correlation stability by varying the time intervals and their assigned uncertainties. 
This approach resolves methodological tensions in studies of seasonal modulation of the muon rate and provides a practical framework for robust correlation estimation under real-world conditions.2026-04-08T05:44:26Z15 pages, 13 FiguresBangzheng MaKatherine DugasKam-Biu LukJuan Pedro Ochoa-RicouxBedřich RoskovecQun Wuhttp://arxiv.org/abs/2605.02190v1KANs need curvature: penalties for compositional smoothness2026-05-04T03:37:43ZKolmogorov-Arnold networks (KANs) offer a potent combination of accuracy and interpretability, thanks to their compositions of learnable univariate activation functions. However, the activations of well-fitting KANs tend to exhibit pathologically high-curvature oscillations, making them difficult to interpret, and standard regularization penalties do not prevent this. Here we derive a basis-agnostic curvature penalty and show that penalized models can maintain accuracy while achieving substantially smoother activations. Accounting for how function composition shapes curvature, we prove an upper bound on the full model's curvature relative to the curvature penalty, and use this to motivate richer forms of penalties. Scientific machine learning is increasingly bottlenecked by the trade-off between accuracy and interpretability. Results such as ours that improve interpretability without sacrificing accuracy will further strengthen KANs as a practical tool for both prediction and insight.2026-05-04T03:37:43Z14 pages, 6 figures, 1 tableJames Bagrowhttp://arxiv.org/abs/2605.01629v1Brain criticality through nonadditive entropic analysis of electroencephalograms2026-05-02T22:40:43ZOn the grounds of nonadditive entropies -- appropriate for complex systems -- we investigate the electroencephalogram amplitudes of typical and ADHD children. 
The corresponding probability distributions are $q$-Gaussians, i.e., $ρ(x) \propto e_q^{-βx^2} \equiv [1+(q-1) βx^2]^{1/(1-q)}$, where $(q,β)$ are, respectively, the entropic index characterizing complexity and the inverse width. We show that $q$ tends to monotonically vary with $β$ for both typical and ADHD subjects, thus revealing critical behavior of the brain. Moreover, we verify that ADHD subjects have a higher complexity than the typical ones. Consistently, biomarkers for objective psychiatric diagnosis could emerge along this path.2026-05-02T22:40:43Z7 pages and 6 figuresHenrique Santos LimaConstantino TsallisDimitri M. Abramov