https://arxiv.org/api/oWb1DvBUphlKl+MgJjCZrF5QTqc 2026-06-21T10:16:22Z 23582 615 15 http://arxiv.org/abs/2603.05612v2 Behavior-dLDS: A decomposed linear dynamical systems model for neural activity partially constrained by behavior 2026-05-04T19:39:32Z

Brain-wide recordings of large-scale networks of neurons now provide an unprecedented view into how the brain drives behavior. However, brain activity contains both information directly related to behavior as well as the potential for many internal computations. Moreover, observable behavior is executed not only by the brain, but also by the spinal cord and peripheral nervous system. Behavior is a coarse-grained product of neural activity, and we thus take the view that it can be best represented by lower-dimensional latent neural dynamics. Capturing this indirect relationship while disambiguating behavior-generating networks from internal computations running in parallel requires new modeling approaches that can embody the parallel and distributed nature of large-scale neural populations. We thus present behavior-decomposed linear dynamical systems (b-dLDS) to disentangle simultaneously recorded subsystems and identify how the latent neural subsystems relate to behavior. We demonstrate the ability of b-dLDS to decouple behavioral vs. internal computations on controlled, simulated data, showing improvements over a state-of-the-art model that uses behavior to supervise all dynamics based on behavior. We also demonstrate b-dLDS's interpretability benefits on a task-driven RNN dataset featuring a nonlinear relationship between behavior and activations. We then show that b-dLDS can further scale up to tens of thousands of neurons by applying our model to a large-scale recording of a zebrafish hindbrain during the complex positional homeostasis behavior, wherein b-dLDS highlights asymmetry in behavior-related dynamic connectivity networks.

2026-03-05T19:11:42Z Eva Yezerets En Yang Misha B. Ahrens Adam S. Charles http://arxiv.org/abs/2603.10169v2 Novel g-computation algorithms for time-varying actions with recurrent and semi-competing events 2026-05-04T18:40:40Z

Background: A core aspect of epidemiology is determining the impacts of potential public health interventions over time. With long follow-up periods, epidemiologists may need to consider semi-competing events, in which a terminal event, like death, precludes a non-terminal event, like hypertension. Time-varying confounding poses an additional challenge when studying time-varying interventions or actions. Existing methods do not simultaneously address semi- competing events and time-varying confounding. Methods: We propose two novel g-computation algorithms for causal effects with semi- competing events and time-varying actions. To explore performance of our novel g-computation estimators, we conducted a Monte Carlo simulation study. We then applied our estimator to investigate how cigarette smoking prevention throughout young and middle adulthood might impact prevalent hypertension using data from Waves III (aged 18-26 years) - VI (aged 39-51 years) of the National Longitudinal Study of Adolescent to Adult Health. Results: Our simulations show that the novel g-computation estimators had little bias and appropriate confidence interval coverage. They outperformed existing alternative estimators across sample sizes. In the illustrative application, the novel estimator identified a small reduction in prevalence of hypertension and risk of death in midlife had all cigarette smoking been prevented across follow-up compared to the observed smoking patterns. Conclusion: As long-running cohorts progress in age, death within the study sample will become an increasing concern for studies of aging-related outcomes, life course analyses, and investigations into chronic disease development. Our novel g-computation estimators provide a simultaneous solution.

2026-03-10T19:00:46Z 23 pages, 2 figures, 5 tables Alena Sorensen D'Alessio Lucas M. Neuroth Jessie K Edwards Chantel L. Martin Paul N Zivich http://arxiv.org/abs/2605.03044v1 Tweedie-based nonparametric estimation for semicontinuous mixed densities 2026-05-04T18:12:08Z

Semicontinuous outcomes occur frequently in health services, insurance, and cost studies. Standard nonparametric density estimators are not well suited to such data because they do not naturally accommodate the mixed structure, the nonnegative support, or the pronounced boundary effects near zero. To address these limitations, we introduce an asymmetric kernel estimator for mixed densities on $[0,\infty)$ based on the Tweedie distribution. For a power parameter $p\in(1,2)$, the Tweedie kernel itself has a point mass at zero and an absolutely continuous component on $(0,\infty)$, yielding a unified smoothing construction that preserves the atom at zero and smooths the positive component using the full semicontinuous sample. We establish pointwise bias and variance expansions, derive asymptotic formulae for the mean squared error and mean integrated squared error, obtain optimal bandwidth rates, and prove asymptotic normality. We propose a profile least-squares cross-validation procedure to jointly select the bandwidth and the power parameter. Simulation results show competitive performance, particularly in challenging boundary-spike and heavy-tailed settings, and an application to emergency department length-of-stay data illustrates the practical value of the method.

2026-05-04T18:12:08Z 30 pages, 6 figures, 3 tables Guanjie Lyu Frédéric Ouimet Cindy Feng http://arxiv.org/abs/2605.03041v1 Synergy Area with FDR-controlled Evaluation (SAFE) to robustly assess safety profile in clinical trials 2026-05-04T18:09:56Z

Safety assessment plays a fundamental role in developing a new drug via clinical trials for ethical considerations. Due to complexity, manual review is typically conducted on the totality of data to draw safety conclusions. There are some existing quantitative methods to facilitate or tailor further medical review, with a controlled error rate and integration of clinical knowledge. In addition to those two key aspects, we emphasize the importance of relying on substantial evidence to draw robust conclusions on safety. Motivated by these three important properties, we propose a two-layer Synergy Area with FDR-controlled Evaluation (SAFE) structural framework to robustly assess the safety profile in clinical trials. In the first layer of SAFE, we investigate each clinically meaningful Synergy Area (SA) based on compelling evidence. In the next layer, the false discovery rate (FDR) is controlled for potential findings across all SAs. Simulation studies show that SAFE properly controls error rates within and across SAs at the nominal level. We further apply the proposed approach to two case studies based on real data from the Historical Trial Data (HTD) Sharing Initiative of the DataCelerate platform. As compared to some direct methods, SAFE demonstrates an appealing feature of screening out extreme data and reaching solid safety conclusions. It can act as either a building block in another framework, or a platform to incorporate additional components.

2026-05-04T18:09:56Z Tianyu Zhan Yabing Mai Yihua Gu Thao Doan Xun Chen http://arxiv.org/abs/2605.02873v1 Fixed-detector tilt--defocus sensing by upstream source coding in a time-reversed Young interferometer 2026-05-04T17:46:39Z

We propose a physically explicit sensing application of a time-reversed Young (TRY) interferometer: simultaneous monitoring of beam tilt and focus drift with a fixed detector. The task is relevant to compact optical relays, free-space links, fiber-coupling stages, and micro-optical alignment modules, where continuous tracking of pointing and focus is needed but downstream wavefront cameras or multiport analyzers are undesirable. Using a finite-width double-slit Fresnel model, we derive the exact local TRY response functions for tilt-like and defocus-like phase perturbations and compute the corresponding optimal upstream source codes numerically. The physical optimal codes are fringe-locked and differ qualitatively from the simple odd/even modes suggested by Gaussian toy models. Two source-coded scalar channels recover essentially all local Fisher information in the full source-resolved TRY record for the physical model considered here. Compared with downstream direct intensity sensing, TRY provides first-order access to the mixed tilt--defocus task with fixed detection; compared with ideal downstream matched-mode sorting, its advantage is architectural rather than fundamental.

2026-05-04T17:46:39Z this is two-parameter estimation work Jianming Wen http://arxiv.org/abs/2205.00098v3 Dynamic Inference in Term Structure Models with Unspanned Latent Risks 2026-05-04T15:46:16Z

We propose a parsimonious class of arbitrage-free, yields-only dynamic term structure models (DTSMs) with unspanned latent risks. To enable sequential estimation and forecasting, we develop a Sequential Monte Carlo framework that combines particle learning for static parameters with Kalman filter updates for latent states, yielding joint posterior inference and predictive distributions that account for both parameter and state uncertainty. We use this framework to assess the out-of-sample statistical and economic value of bond return predictability from the perspective of a Bayesian investor. Empirically, we find that unspanned latent factors contain predictive information beyond that embedded in the yield curve, improving out-of-sample forecasting performance relative to standard benchmark models. These gains translate into economically meaningful utility improvements across a range of portfolio settings. Finally, we show that the hidden component of the slope-related risk factor is countercyclical and associated with real economic activity, suggesting that the latent factors capture economically relevant variation not directly reflected in yields.

2022-04-29T22:52:09Z arXiv admin note: text overlap with arXiv:2204.10658 Tomasz Dubiel-Teleszynski Konstantinos Kalogeropoulos Nikolaos Karouzakis http://arxiv.org/abs/2605.02706v1 Semi-Markov Models with Particle-Based Bayesian Inference for Epidemics 2026-05-04T15:13:44Z

The COVID-19 pandemic has been characterised by multiple waves of transmission driven by interventions and emerging variants, challenging epidemic models that assume gradually evolving transmission dynamics. We propose a class of state-space models in which the transmission rate evolves through persistent regimes of random duration, governed by a semi-Markov process. This formulation yields an interpretable representation of sustained transmission phases and retains a parsimonious parameterisation. Particle-based Bayesian methods are well established for standard state-space models, but their use in semi-Markov settings has received comparatively limited attention. In epidemic applications, inference is further complicated by differential equation-driven latent dynamics and observation models defined through functionals of the latent process. We develop an inferential framework that accommodates these features, combining particle-based state updates with gradient-based parameter updates and enabling batch and sequential inference via particle and sequential Monte Carlo. We apply the proposed methodology to COVID-19 data from the United Kingdom and show that combining reported cases and deaths leads to more precise and stable inference compared to using deaths alone. These results illustrate the practical value of semi-Markov transmission models for epidemic analysis under complex observation schemes.

2026-05-04T15:13:44Z Patrick Aschermayr Konstantinos Kalogeropoulos Nikolaos Demiris http://arxiv.org/abs/2511.22430v2 Spatial constraints improve filtering of measurement noise from animal tracks 2026-05-04T14:46:46Z

Advances in tracking technologies for animal movement require new statistical tools to better exploit the increasing amount of data. Animal positions are usually calculated using the GPS or Argos satellite system and include potentially non-Gaussian and heavy-tailed measurement error patterns. Errors are usually handled through a Kalman filter algorithm, which can be sensitive to non-Gaussian error distributions. We introduce a latent movement model through an underdamped Langevin stochastic differential equation (SDE) that includes an additional drift term to ensure that the animal remains in a known spatial domain of interest. This can be applied to aquatic animals moving in water or terrestrial animals moving in a restricted zone delimited by fences or natural barriers. We demonstrate that the incorporation of these spatial constraints into the latent movement model can improve the accuracy of filtering for noisy observations of the positions. We implement an Extended Kalman Filter as well as a particle filter adapted to non-Gaussian error distributions. Our filters are based on solving the SDE through splitting schemes to approximate the latent dynamic. We illustrate the approach on a real Argos telemetry track of a bowhead whale in Foxe Basin, Canada.

2025-11-27T13:11:37Z Alexandre Delporte Susanne Ditlevsen Adeline Samson http://arxiv.org/abs/2506.13687v2 Enforcing tail calibration when training probabilistic forecast models 2026-05-04T14:30:58Z

Probabilistic forecasts are typically obtained using state-of-the-art statistical and machine learning models, with model parameters estimated by optimizing a proper scoring rule over a set of training data. If the model class is not correctly specified, then the learned model will not necessarily issue forecasts that are calibrated. Calibrated forecasts allow users to appropriately balance risks in decision making, and it is particularly important that forecast models issue calibrated predictions for extreme events, since such outcomes often generate large socio-economic impacts. In this work, we study how the loss function used to train probabilistic forecast models can be adapted to improve the reliability of forecasts made for extreme events. We investigate loss functions based on weighted scoring rules, and additionally propose regularizing loss functions using a measure of tail miscalibration. We apply these approaches to a hierarchy of increasingly flexible forecast models for UK wind speeds, including simple parametric models, distributional regression networks, and conditional generative models. We demonstrate that state-of-the-art models do not issue calibrated forecasts for extreme wind speeds, and that the calibration of forecasts for extreme events can be improved by suitable adaptations to the loss function during model training. This introduces a trade-off between calibrated forecasts for extreme events and calibrated forecasts for more common outcomes.

2025-06-16T16:51:06Z Jakob Benjamin Wessel Maybritt Schillinger Frank Kwasniok Sam Allen http://arxiv.org/abs/2510.09276v2 The bixplot: A variation on the boxplot suited for bimodal data 2026-05-04T12:31:33Z

Boxplots and related visualization methods are widely used exploratory tools for taking a first look at collections of univariate variables. In this note an extension is provided that is specifically designed to detect and display bimodality and multimodality when the data warrant it. For this purpose a univariate clustering method is constructed that ensures contiguous clusters, meaning that no cluster has members inside another cluster, and such that each cluster contains at least a given number of unique members. The resulting bixplot display facilitates the identification and interpretation of potentially meaningful subgroups underlying the data. The bixplot also displays the individual data values, which can draw attention to isolated points. Implementations of the bixplot are available in both Python and R, and their many options are illustrated on several real datasets. For instance, an external variable can be visualized by color gradations inside the display.

2025-10-10T11:19:47Z Camille M. Montalcini Peter J. Rousseeuw http://arxiv.org/abs/2605.02527v1 Research trends in music-based interventions in neonatal intensive care units: a text mining and topic modeling study 2026-05-04T12:27:31Z

Background: Music-based interventions are increasingly used in neonatal intensive care units (NICUs), but the literature remains heterogeneous in intervention type, provider role, and research focus. This study examined research trends in NICU music-based intervention studies using text mining. Methods: We analyzed 83 abstracts from peer-reviewed studies published between 1998 and 2025. Methods included preprocessing, RAKE-based keyphrase extraction, keyword frequency analysis, temporal trend analysis, intervention-type comparison, and latent Dirichlet allocation topic modeling. The optimal number of topics was determined using the CaoJuan2009, Arun2010, and Deveaud2014 metrics. Results: Study volume increased steadily over time, with nearly half (38/83) published from 2020 onward. Early studies focused on passive music listening and short-term physiological outcomes, whereas recent studies increasingly examined singing, live music, and parent-involved interventions. Keyword analysis showed a shift from physiological stability and behavioral responses toward neurodevelopmental outcomes, parental emotional well-being, and parent-infant interaction. Music medicine studies emphasized passive auditory stimulation and immediate physiological outcomes, whereas music therapy studies addressed broader developmental, relational, and psychosocial topics. Topic modeling identified four major themes, with parent-involved physiological regulation and stress reduction the most frequent dominant topic. Conclusions: NICU music-based intervention research is becoming more interdisciplinary. The field has expanded from immediate physiological stabilization to broader developmental, relational, and psychosocial goals. Future work should clarify the distinction between music therapy and music medicine and promote interdisciplinary collaboration in NICU care.

2026-05-04T12:27:31Z Min young Choun Mijeong Kim Soo Ji Kim http://arxiv.org/abs/2605.02441v1 A Behavioral Micro-foundation for Cross-sectional Network Models 2026-05-04T10:39:07Z

Models for cross-sectional network data have become increasingly well-developed in recent decades, and are widely used. This has led to a growing interest in the connection between such cross-sectional models and the behavioral processes from which the corresponding networks were presumably generated. Here, we build on prior work in this area to present a behavioral micro-foundation for cross-sectional network models, based on a continuous time stochastic choice mechanism, that can accommodate highly general classes of cases (including agents who are not themselves in the network, and multilateral edge control). As we show, the equilibrium behavior of this process under appropriate conditions can be expressed in exponential family form, allowing estimation of individual preferences using existing methods; the graph potential separates naturally into a preference-based term reflecting agent utilities, and an entropic term reflecting the rules of tie formation. We illustrate our approach via an analysis of friendship in a professional organization, and modeling of phase transitions in the structure of small groups.

2026-05-04T10:39:07Z Carter T. Butts Alexander Murray-Watters http://arxiv.org/abs/2605.08155v1 Structural and Lagrangian properties of analogue ensembles to characterize multifractality of stochastic processes 2026-05-04T08:50:24Z

We present a framework for the scale-invariance characterization of stochastic processes in reconstructed finite-dimensional phase spaces. This framework analyses the structural and dynamical properties of the phase space and is based on a Takens embedding reconstruction followed by the definition of ensembles of analogue states. We define the analogues of a target state as its nearest neighbors. Then, we specify a collection of target states densely sampling the full phase space. For each target state, we search for the ensemble of its k-best analogues and we analyze its volume and dynamics. First, we study the probability distribution of the volumes and relate its mean and variance to the scale-invariance properties of the stochastic process. Second, we study the Lagrangian properties of the analogues by characterizing how they disperse in time. More particularly, we study the volume occupied by the analogue's successors in function of time and of their initial volume. We link these dynamical properties to the scale-invariance properties of the process. We analyze two types of stationary and dissipative 1-dimensional scale-invariant processes: regularized fractional Brownian motion and regularized multifractal random walk. For both processes, the structure and dynamics of the phase space are determined by their scale-invariant properties.

2026-05-04T08:50:24Z Carlos Granero-Belinchon ODYSSEY, IMT Atlantique - MEE, Lab-STICC\_OSE http://arxiv.org/abs/2605.02309v1 The AECM Algorithm for Deterministic Maximum Likelihood Direction Finding in the Presence of Gaussian Mixture Noise 2026-05-04T08:01:42Z

Gaussian mixture noise can model non-Gaussian noise and also be used when outliers are present. For deterministic maximum likelihood direction finding in Gaussian mixture noise, the Space-Alternating Generalized Expectation-maximization (SAGE) algorithm, an extension of the expectation-maximization algorithm, was applied and designed by Kozick and Sadler twenty odd years ago, which simultaneously updates direction of arrival (DOA) estimates at each iteration and cannot properly converge under unequal signal powers. In this article, the Alternating Expectation-Conditional Maximization (AECM) algorithm, an extension of the SAGE algorithm, is applied and designed, which utilizes multiple less informative versions of the complete data and the golden section search method to update DOA estimates at each iteration sequentially (one by one). Theoretical analysis shows that the AECM algorithm has almost the same computational complexity of each iteration as the SAGE algorithm. However, numerical results show that the AECM algorithm yields faster stable convergence and is computationally more efficient.

2026-05-04T08:01:42Z Mingyan Gong Bin Lyu http://arxiv.org/abs/2205.00093v2 Bayesian Benefit-Risk Assessment with Dependent Outcomes via Latent Factor Models 2026-05-04T07:15:17Z

Approving and assessing new drugs is complex because multiple criteria must be considered simultaneously. A common approach is benefit-risk analysis, often conducted within a Bayesian framework to account for uncertainty and combine data with expert judgement, typically through multi-criteria decision analysis (MCDA) scores. This requires models that accommodate mixed and potentially correlated outcomes; latent factor models provide a natural framework. We develop a coherent Bayesian framework for benefit-risk analysis that addresses these challenges and supports sequential decision-making. We extend structured factor models to mixed outcomes and introduce a principled approach for selecting among competing specifications that combines model fit with out-of-sample predictive performance. We then develop a sequential estimation framework that updates MCDA scores as new data become available, allowing treatment comparisons to evolve over time. This supports early stopping when conclusions are clear and permits dynamic treatment allocation aligned with study objectives. To make this feasible, we develop tailored sequential Monte Carlo methods adapted to the model structure. The methodology is illustrated using data on patients with type II diabetes treated with Metformin, Rosiglitazone, and their combination.

2022-04-29T22:24:44Z Konstantinos Vamvourellis Konstantinos Kalogeropoulos Lawrence Phillips