http://arxiv.org/abs/2605.17764v1Stationary birth-death processes generating inflation-deflation distributions: Avoiding the issue of dominance2026-05-18T02:31:47ZA mixture of two or more count distributions has become deeply embedded in the analysis of excess counts, often relative to the stationary (equilibrium) distributions of birth-death processes such as the geometric, Poisson, Poisson-Lindley (PL), negative binomial (NB), hyper-Poisson (HP), and Conway-Maxwell-Poisson (CMP) distributions. However, the mechanism by which excess counts arise--namely, through modifications of the birth and death rates in the base distributions--has not yet been directly examined in the research literature. All well-known inflation mixture distributions are, in fact, parameterizations of the stationary distributions of birth-death processes. Thus, although the resulting distributions share the same shapes, they arise from distinct mechanisms and are not equivalent in regression analyses. This paper focuses on inflation-deflation stationary distributions arising from modified birth-death processes that form an exponential family and introduces two types of such distributions.2026-05-18T02:31:47ZWanrudee SkulpakdeeMongkol Hunkrajokhttp://arxiv.org/abs/2605.17763v1Comparing Two Categorical Gini Correlations with Applications to Classification Problems2026-05-18T02:29:57ZThis article proposes an inferential framework for comparing predictor importance in classification problems with categorical response variables. The approach is based on the categorical Gini correlation (CGC) proposed by Dang et al. (2020), a measure of dependence between numerical predictors and categorical outcomes. Predictor importance is evaluated by testing differences in CGCs across competing predictor groups. 
The proposed methodology accommodates predictors of arbitrary and unequal dimensions and allows for dependence between predictor groups. Asymptotic normality of the test statistic is established under both the null and alternative hypotheses, and the resulting test is shown to be consistent. In addition to deriving the asymptotic distribution, a nonparametric bootstrap procedure is developed as an alternative approach to inference. Simulation studies, along with applications to breast cancer and human activity recognition datasets, demonstrate the effectiveness of the proposed framework.2026-05-18T02:29:57ZSameera HewageYongli Sanghttp://arxiv.org/abs/2601.10100v3Prediction Suboptimality of the Lasso in Sparse Linear Regression2026-05-18T00:50:17ZThe choice of the tuning parameter in the Lasso is central to its statistical performance in high-dimensional linear regression. In this work, we study tuning regimes under which the Lasso exhibits suboptimal prediction performance, in the sense that a simple refinement improves upon it both on high-probability events and in mean squared prediction error. Our analysis shows that the relevant stochastic scale is governed by Gaussian maxima on the selected or localized support, which may be more informative than the universal rate in Lasso theory. We further illustrate how structural factors in the design matrix can influence the suboptimality phenomenon and discuss extensions to other estimators and more general noise structures.2026-01-15T06:11:44Z26 pages; revised versionGuo LiuWaseda Universityhttp://arxiv.org/abs/2605.17705v1Online Conformal Prediction for Non-Exchangeable Panel Data2026-05-18T00:02:11ZPanel data, in which multiple units are repeatedly observed over time, arise throughout science and engineering. 
Quantifying predictive uncertainty in such settings is challenging because conformal prediction, while distribution-free and model-agnostic, classically relies on exchangeability assumptions that fail under temporal dependence and unit heterogeneity. We propose a simple online conformal framework for non-exchangeable panel data. The method exploits a key feature of online panel prediction: when a forecast is required for one unit, contemporaneous outcomes from related units may already be observed and can serve as a calibration panel. At each round, prediction sets are formed using currently observed calibration units together with two adaptive quantities: history-based similarity weights that emphasize calibration units resembling the target, and an adaptive miscoverage level that is updated whenever target feedback is revealed. This two-state design yields a stepwise coverage bound and a long-run coverage guarantee. Empirically, across synthetic and real panel data sets, the method improves coverage on the worst-covered target units through adaptive interval-width allocation rather than uniform inflation. The two states are complementary: similarity weights protect coverage when target feedback is sparse, while the adaptive level further improves coverage as feedback accumulates.2026-05-18T00:02:11Z34 pages, 5 figuresDaohong TuKay Gieseckehttp://arxiv.org/abs/2602.02830v3SC3D: Dynamic and Differentiable Causal Discovery for Temporal and Instantaneous Graphs2026-05-17T23:46:30ZDiscovering causal structures from multivariate time series is a key problem because interactions span across multiple lags and possibly involve instantaneous dependencies. Additionally, the search space of the dynamic graphs is combinatorial in nature. In this study, we propose Stable Causal Dynamic Differentiable Discovery (SC3D), a two-stage differentiable framework that jointly learns lag-specific adjacency matrices and, if present, an instantaneous directed acyclic graph (DAG). 
In Stage 1, SC3D performs edge preselection through node-wise prediction to obtain masks for lagged and instantaneous edges, whereas Stage 2 refines these masks by optimizing a likelihood with sparsity along with enforcing acyclicity on the instantaneous block. Numerical results across synthetic SVAR systems, nonlinear and chaotic benchmarks, nonstationary dynamics and real-world datasets demonstrate that SC3D achieves improved stability and more accurate recovery of both lagged and instantaneous causal structures compared to existing baselines.2026-02-02T21:32:18Z12 pagesSourajit DasDibyajyoti ChakrabortyRomit Maulikhttp://arxiv.org/abs/2605.17689v1Do Stationarity Transformations Actually Improve Time Series Forecasts? A Controlled Experimental Evaluation2026-05-17T23:02:53ZStationarity transformations are standard preprocessing in time series forecasting, yet their actual impact on accuracy across different non-stationarity types and model families has received little controlled evaluation. We construct synthetic datasets with known properties - trend, seasonality, heteroscedasticity, and combinations - and apply fourteen transformation configurations across seven models and three forecast horizons (3,528 experiments). Stationarity is quantified via consensus ratios from ten statistical tests, and each transform-dataset pair is classified as matched or mismatched based on whether the transform targets the dataset's known non-stationarity. For matched pairs, transforms improve forecasts only 18% of the time. The primary exception is variance stabilization: log and Box-Cox on heteroscedastic data improve accuracy in 60-65% of cases. Differencing a linear-trend series - a textbook use case - worsens forecasts in all cases tested. Mediation analysis confirms that while transforms achieve trend stationarity, this does not translate into lower forecast error; the mechanism is signal attenuation. 
Real-world validation on TSA airport passenger data corroborates these findings. Our results suggest transformation selection should be guided by empirical out-of-sample evaluation rather than theoretical stationarity assumptions.2026-05-17T23:02:53ZBhanu Suraj MallaYuqing Huhttp://arxiv.org/abs/2501.11181v5Sample size and power calculations for causal inference of observational studies2026-05-17T22:14:07ZThis paper investigates the theoretical foundation and develops analytical formulas for sample size and power calculations for causal inference with observational data. By analyzing the variance of an inverse probability weighting estimator of the average treatment effect, we decompose the power calculation into three components: propensity score distribution, potential outcome distribution, and their correlation. We show that to determine the minimal sample size of an observational study, in addition to the standard inputs in the power calculation of randomized trials, it is sufficient to have two parameters, which quantify the strength of the confounder-treatment and the confounder-outcome association, respectively. For the former, we propose using the Bhattacharyya coefficient, which measures the covariate overlap and, together with the treatment proportion, leads to a uniquely identifiable and easily computable propensity score distribution. For the latter, we propose a sensitivity parameter bounded by the R-squared statistic of the regression of the outcome on covariates. Our procedure relies on a parametric propensity score model and a semiparametric restricted mean outcome model, but does not require distributional assumptions on the multivariate covariates. 
We develop an associated R package PSpower and an online calculator.2025-01-19T21:44:27ZBo LiuChengxin YangFan Lihttp://arxiv.org/abs/2605.17646v1Starshaped Mean Residual Life Models for Non-Monotonic Survival Data: A Bayesian PMRL Regression Framework with Applications to Teacher Retention2026-05-17T20:49:08ZWe develop a Starshaped Mean Residual Life (SMEL) framework for survival data with non-monotonic hazard patterns, where early-stage attrition is followed by mid-career stabilization. Unlike Cox proportional hazards models or standard mean residual life models requiring monotonicity, SMEL accommodates complex temporal dynamics by requiring only that $m(t)/t$ be nondecreasing, formalizing the transition from vulnerability to equilibrium. We extend SMEL to regression settings via proportional mean residual life (PMRL) models, $m(t\mid Z)=m_0(t)\exp(Z^\topγ)$, with adaptive Bayesian estimation using three-parameter Weibull--resilience distributions and the No-U-Turn Sampler. Monte Carlo simulations across 48,000 datasets show SMEL-PMRL maintains bias $\leq 0.02$ under 40\% right-censoring, reduces integrated Brier score by 19\% over Cox models ($2.34$ vs.\ $2.88\times10^{-2}$), and achieves 5.4\% AIC improvement. Joint longitudinal-survival extensions via shared frailty enable simultaneous modeling of correlated time-to-event and continuous outcomes. Application to 169 rural STEM teachers (2018--2023, NSF Noyce) confirms starshaped equilibrium ($Λ=12.47$, $p=0.002$), with 38\% early-career tenure decline (years 1--3). The joint model ($\hatθ=0.41$, 95\% CI: $[0.35,\,0.47]$) shows persistence beyond year~3 yields 31-point cumulative achievement gains (0.56~SD) over four years. 
SMEL-PMRL offers a flexible, theoretically grounded alternative to proportional hazards for workforce dynamics and high-attrition settings where equilibrium processes govern long-term stability.2026-05-17T20:49:08ZMohammad Sepehrifarhttp://arxiv.org/abs/2412.13731v2Reliability analysis for non-deterministic limit-states using stochastic emulators2026-05-17T19:42:18ZReliability analysis is a sub-field of uncertainty quantification that assesses the probability of a system performing as intended under various uncertainties. Traditionally, this analysis relies on deterministic models, where experiments are repeatable, i.e., they produce consistent outputs for a given set of inputs. However, real-world systems often exhibit stochastic behavior, leading to non-repeatable outcomes. These so-called stochastic simulators produce different outputs each time the model is run, even with fixed inputs. This paper formally introduces reliability analysis for stochastic models and addresses it by using suitable surrogate models to lower its typically high computational cost. Specifically, we focus on the recently introduced generalized lambda models and stochastic polynomial chaos expansions. These emulators are designed to learn the inherent randomness of the simulator's response and enable efficient uncertainty quantification at a much lower cost than traditional Monte Carlo simulation. We validate our methodology through three case studies. First, using an analytical function with a closed-form solution, we demonstrate that the emulators converge to the correct solution. Second, we present results obtained from the surrogates using a toy example of a simply supported beam. Finally, we apply the emulators to perform reliability analysis on a realistic wind turbine case study, where only a dataset of simulation results is available.2024-12-18T11:08:56ZStructural Safety, 117,102621,pp. 1-14, 2025Anderson V. 
PiresMaliki MoustaphaStefano MarelliBruno Sudret10.1016/j.strusafe.2025.102621http://arxiv.org/abs/2605.17592v1Ordered POVMs and Residual Collapse2026-05-17T18:39:47ZOrdered realizations of discrete POVMs are studied through a residual transform generated by sequential tests. One application of the transform replaces each coordinate by the effect obtained after all earlier tests have failed, and appends the remaining mass as a terminal outcome. Under natural hypotheses, iterating the transform produces a collapsed POVM whose non-escape coordinates are the parts of the original effects that survive all earlier tests. The resulting collapse map gives an equivalence relation on ordered POVM realizations. Its range and fibers are characterized. The range consists of collapsed POVMs, whose non-escape coordinates are mutually orthogonal and whose support projections strongly sum to the identity. The fiber over a collapsed POVM consists of all ordered realizations with the same residually visible compressions. In particular, different ordered realizations, including ones with different off-diagonal coupling data, can have the same collapsed image. After collapse, the non-escape coordinates are fixed under further residual iteration. The remaining dynamics takes place in the escape effect, which is fragmented by a universal scalar functional calculus.2026-05-17T18:39:47ZJames Tianhttp://arxiv.org/abs/2605.17585v1Modelling pairs of Poissons and binomials with negative correlation2026-05-17T18:29:05ZSuppose $f_1(x)$ and $f_2(y)$ are given marginals for pairs $(x,y)$. I consider the construction $f_1(x)f_2(y)\{ 1+αh_1(x)h_2(y) \}$, where $h_1$ and $h_2$ are seen as bounded adjustment functions, normalised to have means zero under $f_1$ and $f_2$. 
This defines a bivariate distribution for $(X,Y)$ with the specified marginal densities $f_1$ and $f_2$, with an interval of permissible values of $α$, both positive and negative; in particular, independence corresponds to an inner point in the adjustments parameter region. Applications to bivariate Poisson distributions, allowing both positive and negative correlation, are discussed. As illustration I provide a more accurate and extended analysis of a Poisson pairs dataset, pertaining to competing seeds and plants, for $n=958$ plots of soil, earlier analysed in the well-cited paper Lakshminarayana, Pandit, Rao, Srinivasa (1999). The general apparatus is also shown to work for negatively correlated binomials. Those methods are illustrated in a meta-analysis framework for two-by-two tables across different studies, pertaining to the Audit-C screening questionnaire for alcohol use disorders, where again negative correlation is demonstrated, between $X$, the number of correct `yes', and $Y$, the number of correct `no'.2026-05-17T18:29:05Z14 pages, 4 figures, 3 tables; Statistical Research Report, Department of Mathematics, University of Oslo, 17 May 2026; submitted for publicationNils Lid Hjorthttp://arxiv.org/abs/2406.19152v4Mixture priors for replication studies2026-05-17T18:24:35ZReplication of scientific studies is important for assessing the credibility of their results. However, there is no consensus on how to quantify the extent to which a replication study replicates an original result. We propose a novel Bayesian approach for replication studies based on mixture priors. The idea is to use a mixture of the posterior distribution based on the original study and a non-informative distribution as the prior for the analysis of the replication study. The mixture weight then determines the extent to which the original and replication data are pooled. 
Two distinct strategies are presented: one with fixed mixture weights, and one that introduces uncertainty by assigning a prior distribution to the mixture weight itself. Furthermore, it is shown how within this framework Bayes factors can be used for formal testing of relevant scientific hypotheses, such as tests on the presence or absence of an effect or whether the mixture weight equals zero (completely discounting the original data) or one (fully pooling with the original data). To showcase the practical application of the methodology, we analyze data from three replication studies. Our findings suggest that mixture priors are a valuable and intuitive alternative to other Bayesian methods for analyzing replication studies, such as hierarchical models and power priors. We provide the free and open source R package repmix that implements the proposed methodology.2024-06-27T13:11:15ZRoberto Macrì-DemartinoLeonardo EgidiLeonhard HeldSamuel Pawelhttp://arxiv.org/abs/2605.18910v1A Tutorial on Symbolic Structural Identifiability Analysis of ODE Models in Julia2026-05-17T18:15:32ZStructural identifiability analysis determines whether the parameters of a mechanistic ordinary differential equation (ODE) model can be uniquely recovered from ideal observations and is therefore a fundamental prerequisite for reliable parameter estimation. This tutorial presents a modern, reproducible computational framework for symbolic structural identifiability analysis using the Julia package StructuralIdentifiability.jl. We provide a rigorous yet accessible introduction to local and global identifiability, observability, parameter-to-output mappings, and identifiable parameter combinations, together with a unified workflow based on the core functions @ODEmodel, assess_local_identifiability, assess_identifiability, and find_identifiable_functions. 
The framework is demonstrated through seven case studies from epidemiology, pharmacokinetics, and systems biology, illustrating globally identifiable systems, local-only identifiability, structural non-identifiability, and recovery of identifiability through additional measurements and reparameterization. Beyond the theoretical foundations, the tutorial emphasizes practical model reformulation, experimental design, and reproducible scientific workflows within the Julia SciML ecosystem, providing a comprehensive reference for researchers and graduate students working with mechanistic ODE models.2026-05-17T18:15:32ZAbdallah Alsammanihttp://arxiv.org/abs/2605.17559v1Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels2026-05-17T17:42:56ZLarge-scale hypothesis testing is central to modern science, where controlling the False Discovery Rate (FDR) has become the standard approach to managing false positives across many simultaneous tests. Hypotheses rarely exist in isolation; they often exhibit structure through proximity, connectivity, or hierarchy. This structure represents both a challenge and an opportunity: while classical methods treat these dependencies as obstacles requiring conservative correction, leveraging them can substantially increase discovery power. Here, we reframe structured FDR control as a regularized learning problem. By optimizing within a suitable Reproducing Kernel Hilbert Space (RKHS), we introduce a framework that unifies continuous domains, graphs, and hierarchies under a single algorithm through kernel choice alone. This formulation enables smooth solutions in place of the piecewise-constant fits of prior methods, principled likelihood-based hyperparameter selection rather than heuristic tuning, and inference at unobserved locations which in turn supports sample-efficient experimental design. Building on this estimator, we provide two decision rules which we prove to control the FDR. 
We validate our method on two sources: spatial locations derived from high-dimensional real-world datasets, and a differential gene expression task utilizing protein-protein interaction graphs.2026-05-17T17:42:56Z9 pagesBinyamin PeretsShie Mannorhttp://arxiv.org/abs/2604.12288v2SMART Fine-tuning Factor Augmented Neural Lasso2026-05-17T16:04:01ZFine-tuning is a widely used strategy for adapting pre-trained models to new tasks, yet its methodology and theoretical properties in high-dimensional nonparametric settings with variable selection have not yet been developed. We propose a source-model-augmented residual tuning (SMART) framework, which incorporates the pre-trained source model as an augmented feature into the target learner and estimates only the residual target-specific component. The approach is widely applicable, from parametric and sparse models to neural networks and blackbox machine learning models. We focus on the development of fine-tuning factor-augmented neural Lasso, resulting in SMART-FAN-Lasso. This transfer-learning framework for high-dimensional nonparametric regression with variable selection simultaneously handles covariate and posterior shifts. We use a low-rank factor structure to manage high-dimensional dependent covariates and a residual tuning decomposition in which the target function is expressed as a function of source model and other target-specific variables, thereby reducing the effective complexity of the target task. We derive minimax-optimal excess risk bounds, characterizing the precise conditions, in terms of relative sample sizes and function complexities, under which fine-tuning yields statistical acceleration over single-task learning. 
Extensive numerical experiments across diverse covariate- and posterior-shift scenarios demonstrate that SMART-FAN-Lasso consistently outperforms standard baselines and achieves near-oracle performance even under severe target sample size constraints, empirically validating the derived rates.2026-04-14T05:01:18ZAuthors are listed in alphabetical orderJinhang ChaiJianqing FanCheng GaoQishuo Yin