https://arxiv.org/api/DKuHzaNsy/iyscfnm40N1f4ppDk2026-06-21T18:35:56Z2358272015http://arxiv.org/abs/2603.17281v2Improving causal inference in interrupted time series analysis: the triple difference design2026-04-25T17:11:40ZBackground: Interrupted time series analysis (ITSA) is widely used to evaluate health policy and intervention effects. While multiple-group ITSA (MG-ITSA) improves causal inference by incorporating a control group, residual confounding from unmeasured time-varying factors may remain. The triple-difference interrupted time series (DDD-ITSA) design extends this approach by adding a second control group to further isolate treatment effects, but it remains underutilized and lacks formal guidance.
Methods: We formalize the DDD-ITSA framework, specify the regression model, define key parameters for estimating level and trend effects, and clarify interpretation of the triple-difference estimand. We illustrate the approach using a worked example evaluating California's Proposition 99 cigarette tax and its impact on per-capita cigarette sales.
Results: In the example, all groups were balanced on pre-intervention level and trend. The triple-difference estimand indicated a statistically significant annual reduction of -1.76 per-capita cigarette packs in California relative to the secondary control (P = 0.020; 95 percent CI: -3.24, -0.28), consistent with results from the primary comparison. Differences between control groups were not significant.
Conclusions: DDD-ITSA strengthens causal inference when two-group comparisons may be confounded by leveraging an additional control group to remove remaining biases and assess heterogeneity. Implementation is facilitated by updates to the itsa Stata package. Careful attention to control selection, baseline balance, and autocorrelation remains essential.2026-03-18T02:19:30ZAriel Lindenhttp://arxiv.org/abs/2604.23381v1MCMC with Adaptive Principal-Component Transformation: Rotation-Invariant Universal Samplers for Bayesian Structural System Identification2026-04-25T17:06:49ZOver decades, Markov chain Monte Carlo (MCMC) methods have been widely studied, with a typical application being the quantification of posterior uncertainties in Bayesian system identification of structural dynamic models. To address the issue of excessively low sampling efficiency in generic MCMC methods when applied to specific problems, researchers developed several MCMC algorithms that integrate trainable neural networks to replace and enhance their critical components. Later, meta-learning MCMC methods emerged to reduce training time. However, they require considerable similarity between test and training tasks, while their sampling efficiency is constrained by trade-off-simplified network designs. This paper proposes the Adaptive Principal-Component (PC) Meta-learning Stochastic Gradient Hamiltonian Monte Carlo (APM-SGHMC) algorithm. It adaptively rotates coordinate axes in the parameter space to align with the PC directions of the current posterior samples, ensuring rotation-invariance of sampling performance with respect to the posterior distribution. By incorporating translation-invariance, scale-invariance, and rotation-invariance in a unified framework, APM-SGHMC enables universal samplers to acquire generalizable knowledge across diverse Bayesian system identification tasks using minimalistic tasks while eliminating the constraints imposed by network design trade-offs on sampling efficiency. Practical feasibility issues are also addressed. Two Bayesian system identification case studies demonstrate its effectiveness and universality: our method overcomes the case-by-case limitations of traditional data-driven approaches, achieving zero-shot generalization across structurally distinct models without retraining and maintaining consistent superior performance across all scenarios.2026-04-25T17:06:49ZAccepted by Advanced Engineering Informatics on Apr 25, 2026Xianghao MengYong HuangJames L. BeckKui JiangHui Lihttp://arxiv.org/abs/2604.23357v1Modelling spatial heterogeneity in the effects of area-level covariates on income distributions using Bayesian nonparametric methods2026-04-25T15:52:51ZUnderstanding the how the distribution of an economic outcome, such as income, changes with respect to space and covariates is a key concern for policy makers. To address this, we develop a Bayesian nonparametric model, the Normalised Latent Measure Factor Model with Covariates (NLMFM-C), which expresses a large collection of related densities as mixtures of latent factor densities and allows for spatial and covariate effects. We propose an adaptive Gibbs sampler to automatically infer the number of latent factor distributions, and a rotation method to make posterior inference on different data sets comparable. We apply the NLMFM-C model to Public Use Microdata Sample (PUMS) data, focusing on income distributions for sub-areas of four U.S. states over to different years, 2016 and 2020. We show that the latent factor distributions can be interpreted by income level (e.g., low, medium, and high) and investigate the spatially- and time-changing impact of three covariates: gender, race and educational attainment.2026-04-25T15:52:51ZZiyou WangJim GriffinMaria Kallihttp://arxiv.org/abs/2604.19391v3On the Practical Performance of Noise Modulation for Ultra-Low-Power IoT: Limitations, Capacity, and Energy Trade-offs2026-04-25T15:31:58ZUltra-low-power (ULP) Internet of Things (IoT) applications demand communication architectures with minimal energy consumption. Noise Modulation (NoiseMod) addresses this by encoding data through the statistical variance of a noise-like signal, eliminating the need for a coherent carrier. To bridge the gap between theoretical potential and practical deployment, this paper benchmarks NoiseMod against standard modulations like BPSK and NC-FSK. We analytically derive the optimal detection threshold and Bit Error Rate (BER) for AWGN and Rayleigh fading channels. Our results show that non-coherent NoiseMod suffers a catastrophic error floor in fading environments, making architectural additions like channel state information (CSI) estimation and 2-antenna selection diversity desirable. Using an ADC-aware energy model, we reveal that NoiseMod's oversampling severely bottlenecks capacity and imposes an 8 dB SNR penalty compared to NC-FSK for a $10^{-3}$ BER in AWGN. Despite its oscillator-free design drastically reducing baseline circuit power, these limitations establish a critical energy crossover distance, which decreases with frequency. Below this distance, NoiseMod offers superior energy efficiency; beyond it, the radiated power needed to overcome its SNR penalty makes coherent schemes like BPSK vastly superior.2026-04-21T12:18:11Z5 pages, 5 figures, conferenceFelipe A. P. de FigueiredoPedro M. R. PereiraEvandro C. Vilas BoasFernando D. A. GarciaHadi ZayyaniRausley A. A. de Souzahttp://arxiv.org/abs/2509.09758v4A Path Signature Framework for Detecting Creative Fatigue in Digital Advertising2026-04-25T14:32:04ZThis paper introduces a signature-based framework for detecting advertising creative fatigue using path signatures, a geometric representation from rough path theory. Creative fatigue -- the degradation of creative effectiveness under repeated exposure -- is operationally important in digital marketing because delayed detection can translate directly into avoidable opportunity cost. We reframe fatigue monitoring as a geometric change detection problem: advertising performance trajectories are embedded as paths and represented by truncated (log-)signatures, enabling detection of changes in trend, volatility, and non-linear dynamics beyond simple mean or variance shifts. We further connect statistical detection to managerial decision-making via an explicit quantification of performance loss relative to a benchmark period. Because proprietary production data cannot be released, we evaluate the proposed framework on a synthetic panel dataset designed to mimic realistic impression volumes and noisy day-to-day CTR dynamics. We define observed CTR as the realised binomial rate $CTR_t := C_t/I_t$ using daily clicks $C_t$ and impressions $I_t$. The accompanying CSV also contains a pre-computed CTR field (e.g., due to rounding or upstream derivation), but all modelling and evaluation in this paper use $C_t/I_t$. Crucially, the dataset does not include injected changepoints; we therefore define an operational ground truth for ``fatigue onset'' based on a noise-robust CTR estimate and a sustained deterioration relative to a recent-best baseline. We report lead-time (early warning) and alert-burden metrics under this operational definition, and provide a sensitivity analysis over the detector's primary tuning parameters. The methodology scales linearly in time-series length for fixed signature depth and is suitable for monitoring large creative portfolios.2025-09-11T17:46:08Zversion 3Charles Shawhttp://arxiv.org/abs/2604.21087v2Model quality in football: Quantifying the quality of an Expected Threat model2026-04-25T12:57:52ZThe recent growth in data availability in football has increased the risk of incorrect use of data-driven models, making guidelines on their validation and application necessary. The Expected Threat (xT) model is an accessible option for football organizations that start building in-house methods, yet little is known about how to assess its quality. The aim of this study is twofold: to examine how the model error depends on the number of game states and the number of training points, and to translate these results into guidelines for constructing and applying the model. Using the Markov chain underlying the model, we perform theoretical analyses and simulations to study the model error. These show that the model error is approximately log-normally distributed for a specified number of training points and game states. Additionally, we combine the simulations with expert consultation to establish the model error beyond which player evaluations based on the Expected Threat model become unreliable for scouting applications. From this, we derive rules of thumb to ensure the quality of an Expected Threat model before application, and we illustrate through an example how a validated model can be applied in practice. Because the approach generalizes to Expected Possession Value models, this paper illustrates a framework to systematically quantify model quality, despite the ground truth being unobservable in football analytics.2026-04-22T21:03:21ZKoen van AremJakob SöhlMirjam BruinsmaGeurt Jongbloedhttp://arxiv.org/abs/2604.23127v1A Dynamic Learning Observatory Reveals the Rapid Salinization of Satkhira, Bangladesh2026-04-25T03:49:04ZSoil salinity is a major environmental challenge in coastal Bangladesh, threatening agricultural productivity and local livelihoods. This study develops a machine-learning-based framework to predict and map soil salinity in Satkhira district by integrating field observations with Landsat-derived spectral indices. A total of 205 soil samples collected during 2024-2025 were used to train an Extreme Gradient Boosting (XGBoost) model, and predictions were further improved using a Generalized Additive Model (GAM). Spatial cross-validation was applied to reduce autocorrelation bias, and bootstrap resampling was used to quantify prediction uncertainty. The results show strong spatial variability of soil salinity, with higher concentrations in the southern and central coastal regions and lower levels in the northern inland areas. Vegetation indices, particularly NDVI, along with salinity-related spectral indicators, were identified as key predictors. 10-year-window peak-exposure maps generated for 2014-2023 reveal recurrent high-salinity zones and a persistent, expanding footprint of moderate-to-high salinity exposure across the central parts of the district. Uncertainty analysis indicates higher variability in coastal zones and improved prediction stability when multi-year datasets are combined. The proposed framework provides a robust and scalable approach for long-term monitoring of soil salinity. It supports climate-resilient agriculture, land-use planning, and evidence-based decision-making in coastal Bangladesh.2026-04-25T03:49:04ZShowmitra Kumar SarkarSai Ravelahttp://arxiv.org/abs/2511.21992v2Design-based nested instrumental variable analysis2026-04-25T00:58:24ZTwo binary instrumental variables (IVs) are nested if individuals who comply under one binary IV also comply under the other. This situation often arises when the two IVs represent different intensities of encouragement or discouragement to take the treatment, with one stronger than the other. In a nested IV structure, treatment effects can be identified for two latent subgroups: always-compliers and switchers. Always-compliers are individuals who comply even under the weaker IV, while switchers are those who do not comply under the weaker IV but do under the stronger IV. We introduce a novel pair-of-pairs nested IV design, where each matched stratum consists of four units organized in two pairs. We develop design-based inference for the always-complier sample average treatment effect and switcher sample average treatment effect. In a nested IV analysis, IV assignment is randomized within each IV pair; however, whether a study unit receives the weaker or stronger IV may not be randomized. To address this complication, we then propose a novel partly biased randomization scheme and study design-based inference under this new scheme. Using extensive simulation studies, we demonstrate the validity of the proposed method even in challenging scenarios with small sample sizes and a low proportion of switchers. Applying the nested IV framework, we estimated that 52.2% (95% CI: 50.4%-53.9%) of participants enrolled at the Henry Ford Health System in the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial were always-compliers, while 26.7% (95% CI: 24.5%-28.9%) were switchers. Among always-compliers, flexible sigmoidoscopy was associated with a trend toward a decreased colorectal cancer rate. No effect was detected among switchers. This offers a richer interpretation of why no increase in the intention-to-treat effect was observed after 1997, even though the compliance rate rose.2025-11-27T00:22:13ZZhe ChenXinran LiMichael O. HarhayBo Zhanghttp://arxiv.org/abs/2604.07011v2Recovering manifold structure in LLM responses through a joint Euclidean mirror2026-04-24T20:11:45ZUnderstanding the behavior of black-box large language models and determining effective means of comparing their performance is a key task in modern machine learning. We consider how large language models respond to a specific query by analyzing how the distributions of responses vary over different values of tuning parameters. We frame this problem in a general mathematical setting, treating the mapping from model parameters to response distributions as a structured family of probability measures, endowed with a geometry via a dissimilarity measure. We show how dissimilarities between response distributions can be represented in low-dimensional Euclidean space through a joint Euclidean mirror surface encoding the underlying geometry, which permits both qualitative and quantitative analysis of large language models and provides insight into predicting response distributions for different values of tuning parameters. We propose an estimation procedure for the underlying joint Euclidean mirror based on observed samples from the response distributions, and we prove its asymptotic properties. Additionally, we propose a statistically consistent procedure to infer the value of an unknown model parameter based on samples from the corresponding response distribution and the estimated joint Euclidean mirror. In an experimental setting with large language models, we find that changes in different tuning parameter values correspond to distinct directions in the embedding space, making it possible to estimate the tuning parameters that were used to generate a given response.2026-04-08T12:33:35Z13 pages, 9 figuresMaximilian BaumAranyak AcharyyaTianyi ChenAvanti AthreyaYoungser ParkFrancesco Sanna PassinoCarey E. PriebeZachary Lubbertshttp://arxiv.org/abs/2604.22925v1Come Together: Analyzing Popular Songs Through Statistical Embeddings2026-04-24T18:00:48ZStatistical modeling of popular music presents a unique challenge due to the complexity of song structures, which cannot be easily analyzed using conventional statistical tools. However, recent advances in data science have shown that converting non-standard data objects into real vector-valued embeddings enables meaningful statistical analysis. In this work, we demonstrate an approach based on logistic principal component analysis to construct embeddings from global song features, allowing for standard multivariate analysis. We apply this method to a corpus of Lennon and McCartney songs from 1962-1966, using embeddings derived from chords, melodic notes, chord and pitch transitions, and melodic contours. Our analysis explores how these song embeddings cluster by Beatles album, how songwriting styles evolved over time, and whether Lennon and McCartney's compositions exhibited convergence or divergence. This embedding-based approach offers a powerful framework for statistically examining musical structure and stylistic development in popular music.2026-04-24T18:00:48ZMatthew Esmaili MalloryMark GlickmanJason Brownhttp://arxiv.org/abs/2604.22692v1A Unified Framework for Multiple Exposure Distributed Lag Non-Linear Models for Air Pollution Epidemiology2026-04-24T16:27:08ZThis study quantifies the association between air pollution and mortality in Ontario, Canada. Exposure-response relationships in air pollution epidemiology are complex due to three features: time-lagged associations, non-linear associations, and multiple pollutants. To address the first two features, two distinct classes of distributed lag non-linear model (DLNM) have been proposed, but extending them to multiple exposures and selecting an appropriate model remain challenging. We propose a unified framework for multiple exposure DLNMs, integrating model specification, estimation, selection and stacking. The framework applies to four different model structures: two additive and two proposed single-index DLNMs, all applicable to general outcome types, including the mortality counts in the motivating application. We develop an estimation approach that applies to all four models. Choosing among the candidate DLNMs is challenging a priori, and we derive an AIC to select among them. As an alternative to selecting a single model, we also extend a model stacking approach to combine inferences across the four DLNMs and propose an implementation scalable to our dataset with 106,346 observations. In the motivating analysis, the four DLNMs yield different estimates, and the proposed stacking approach identifies significant associations between respiratory mortality and a mixture of PM2.5, O3 and NO2.2026-04-24T16:27:08ZTianyi PanHwashin Hyun ShinAlex StringerGlen McGeehttp://arxiv.org/abs/2604.18820v2Sparse Network Inference under Imperfect Detection and its Application to Ecological Networks2026-04-24T15:50:52ZRecovering latent structure from count data has received considerable attention in network inference, particularly when one seeks both cross-group interactions and within-group similarity patterns in bipartite networks, which is widely used in ecology research. Such networks are often sparse and inherently imperfect in their detection. Existing models mainly focus on interaction recovery, while the induced similarity graphs are much less studied. Moreover, sparsity is often not controlled, and scale is unbalanced, leading to oversparse or poorly rescaled estimates with degrading structural recovery. To address these issues, we propose a framework for structured sparse nonnegative low-rank factorization with detection probability estimation. We impose nonconvex $\ell_{1/2}$ regularization on the latent similarity and connectivity structures to promote sparsity within-group similarity and cross-group connectivity with better relative scale. The resulting optimization problem is nonconvex and nonsmooth. To solve it, we develop an ADMM-based algorithm with adaptive penalization and scale-aware initialization and establish its asymptotic feasibility and KKT stationarity of cluster points under mild regularity conditions. Experiments on synthetic and real-world ecological datasets demonstrate improved recovery of latent factors and similarity/connectivity structure relative to existing baselines.2026-04-20T20:41:23Z13 pages, 4 figuresAoran ZhangTianyao WeiMaria J. GuerreroCésar A. Uribehttp://arxiv.org/abs/2503.04491v4A Spatiotemporal, Quasi-experimental Causal Inference Approach to Characterize the Effects of Global Plastic Waste Export and Burning on Air Quality Using Remotely Sensed Data2026-04-24T15:50:00ZOpen burning of plastic waste may pose a significant threat to global health by degrading air quality, but quantitative research on this problem -- crucial for policy making -- has been stunted by lack of data. Many low- and middle-income countries, where open burning is most concerning, have little to no air quality monitoring. Here, we leverage remotely sensed data products combined with spatiotemporal causal analytic techniques to evaluate the impact of large-scale plastic waste policies on air quality. Throughout, we study Indonesia before and after 2018, when China halted its import of plastic waste, resulting in diversion of this massive waste stream to other countries. We tailor cutting-edge statistical methods to this setting, estimating effects of increased plastic waste imports on fine particulate matter (PM$_{2.5}$) near waste dump sites in Indonesia as a function of proximity to ports, an induced continuous exposure. We observe strong evidence that monthly PM$_{2.5}$increased after China's ban (2018-2019) relative to expected business-as-usual (2012-2017), with increases up to 1.68 $μ$g/m$^3$ (95% CI = [0.72, 2.48]) when exposed to medium-high port proximity. Effects were more modest for very high port proximity exposure, possibly reflecting smaller increases in dumping/burning where government oversight is greater.2025-03-06T14:39:22ZEllen M. ConsidineRachel C. Nethery10.1093/jrsssc/qlag031http://arxiv.org/abs/2604.22636v1CLVAE: A Variational Autoencoder for Long-Term Customer Revenue Forecasting2026-04-24T15:12:57ZPredicting customers' long-term revenue from sparse and irregular transaction data is central to marketing resource allocation in non-contractual settings, yet existing approaches face a trade-off. Traditional probabilistic customer base models deliver robust long-horizon forecasts by imposing strong structural assumptions, while flexible machine-learning models often require substantial training data and careful tuning. We propose a variational-autoencoder-based model that preserves the process-based likelihood of established attrition-transaction-spend models conditional on customer heterogeneity, but replaces the restrictive parametric mixing distribution with a flexible latent representation learned by encoder-decoder networks. The resulting approach (i) provides a single model for customer attrition, transactions and spending, (ii) remains reliable when contextual covariates are unavailable, and (iii) flexibly incorporates rich covariates and nonlinear effects when they are available. This design balances structural stability with the flexibility needed to capture complex purchase dynamics. Across multiple real-world datasets and prediction horizons, the proposed model improves upon the latest benchmarks. Businesses benefit directly, as a better assessment of customers' future revenues improves the efficiency of campaign targeting. For research, this work provides guidance on how to embed domain-specific models into the variational autoencoder framework, enabling flexible representation learning while retaining an econometrically meaningful process structure.2026-04-24T15:12:57ZJeffrey NäfRiana Valera MbelsonMarkus Meiererhttp://arxiv.org/abs/2504.02518v3Online Multivariate Regularized Distributional Regression for High-dimensional Probabilistic Electricity Price Forecasting2026-04-24T13:45:42ZProbabilistic electricity price forecasting (PEPF) is vital for short-term electricity markets, yet the multivariate nature of day-ahead prices - spanning 24 consecutive hours - remains underexplored. At the same time, real-time decision-making requires methods that are both accurate and fast. We introduce an online algorithm for multivariate distributional regression models, allowing efficient modeling of the conditional means, variances, and dependence structures of electricity prices. The approach combines multivariate distributional regression with online coordinate descent and LASSO-type regularization (absolute shrinkage and selection operator), enabling scalable estimation in high-dimensional covariate spaces. Additionally, we propose a regularized estimation path over increasingly complex dependence structures, allowing for early stopping and avoiding overfitting. In a case study using historical data from the German day-ahead market, the proposed method yields interpretable and well-calibrated joint prediction intervals for the 24-dimensional price distribution and provides robust performance across a range of proper scoring rules. The results underscore the importance of modeling the dependence structure of electricity prices. Furthermore, we analyze the trade-off between predictive accuracy and computational costs for batch and online estimation and provide a high-performing open-source Python implementation in the ondil package.2025-04-03T12:08:51ZRevised Version March 2026. 40 pages incl. appendix, 14 figures, 7 tablesSimon Hirsch