https://arxiv.org/api/fzuS7tRNp9GWqJKOAeaso8ELhzE2026-06-09T23:29:22Z16834515http://arxiv.org/abs/2604.20285v1Time-dependent structural equation modeling of fans' football fever using activity tracking data during the 2025 DFB Cup final2026-04-22T07:31:12ZFootball fans frequently exhibit pronounced emotional and physiological reactions during high-stakes matches. However, the temporal dynamics of this football fever are rarely modeled as a latent process. Using intensive longitudinal data from Arminia Bielefeld supporters who wore smartwatches during the 2025 German Football Association (DFB) Cup final, we investigate how football fever unfolds. The devices recorded heart rate, stress level, and related indicators in short intervals, allowing us to construct a latent variable for football fever and model its dynamics. We specify a time-dependent structural equation model with latent growth components and autoregressive effects to capture both overall trends and short-term carry-over effects in fans' physiological responses. Results are aggregated across multiple imputations of missing measurements. Model fit is evaluated using adjustments for the high data dimensionality. The results show that football fever follows a V-shaped trajectory: high at kick-off, followed by a steady decline until the renewed arousal in the second half, with substantial between-fan heterogeneity in both baseline level and temporal dynamics. Our findings demonstrate that football fever can be adequately represented as a latent variable using structural equation modeling and reflected by wearable technology data. This highlights the importance of accounting for temporal dependence when studying dynamic emotional phenomena, e. g., in sports spectatorship.2026-04-22T07:31:12ZJonas BauerChristiane FuchsTamara Schambergerhttp://arxiv.org/abs/2604.19279v1Early Prediction of Student Performance Using Bayesian Updating with Informative Priors Across Cohorts2026-04-21T09:48:54ZEarly identification of at risk students in higher education depends on predictive models that maintain accuracy across successive cohorts -- a requirement that single-cohort modeling approaches fail to meet. This study evaluates Bayesian updating with informative priors from a previous cohort to improve cross-cohort prediction robustness using digital trace data. We fit weekly Bayesian linear, logistic, and ordinal regression models with either uninformative default priors or informative priors derived from posterior distributions of a preceding cohort. Models were applied to six weekly self-regulated learning (SRL)-aligned engagement indicators from two consecutive cohorts of students in a blended first-year mathematics course (N1 = 307; N2 = 323). Outcomes were exam points, final grades, and a binary at risk indicator. The models were evaluated weekly based on accuracy, sensitivity, and RMSE. In the source cohort, performance was already substantial by week 6. In the target cohort, informative priors improved early classification: Logistic models with priors reduced misclassification by 22% and false negatives by 38% in week 3 relative to the uninformative default. Ordinal models with priors similarly showed the strongest improvements in early weeks, reducing misclassification by 42% in week 2 and reaching an accuracy of .77 by week 4. Linear models showed little benefit from prior information. These findings demonstrate that Bayesian updating is a viable method for improving early classification performance across cohorts, with gains concentrated in the early weeks of the semester when current-cohort data are scarce.2026-04-21T09:48:54ZJakob SchwerterAmer KrivosijaTim NovakKatja IckstadtAlexander Munteanuhttp://arxiv.org/abs/2604.17762v1A Parameter-Centric View on Regression2026-04-20T03:25:54ZDiscussion on ``Regression by Composition'' by Farewell, Daniel, Stensrud, and Huitfeldt2026-04-20T03:25:54ZJingxin YanLin LiuOliver DukesQizhai LiLinbo Wanghttp://arxiv.org/abs/2604.17760v1Toward Variation-Independent Regression by Composition2026-04-20T03:22:18ZDiscussion on "Regression by Composition" by Farewell, Daniel, Stensrud, and Huitfeldt.2026-04-20T03:22:18ZRuixuan ZhaoOliver DukesLinbo WangLin Liuhttp://arxiv.org/abs/2604.14086v1The Epidemiology of Artificial Intelligence2026-04-15T16:59:10ZArtificial intelligence (AI) systems increasingly shape how people access health information, make medical decisions, and receive care -- yet epidemiology lacks frameworks for measuring AI exposure or studying its health effects at the population level. Here we argue that AI now functions as a determinant of health and propose a conceptual framework, borrowed from environmental epidemiology, for studying it. We distinguish ambient AI exposure -- algorithmic curation and AI-mediated institutional decisions that affect populations regardless of individual choice -- from personal AI exposure -- direct, volitional use of AI tools. We characterize AI's possible causal roles in epidemiological models, show that existing experimental approaches are inadequate for capturing chronic, population-level effects, and illustrate these ideas with nationally representative US survey data. We discuss implications for study design, health equity, and AI governance.2026-04-15T16:59:10ZPerspective/Viewpoint of causal role of AIHarsh ParikhTyler McCormickEmily JohnsonLeo HickeyMegan RanneyBhramar Mukherjeehttp://arxiv.org/abs/2604.10555v1On Some Multivariate Extensions to Zenga Curve: Properties and Applications2026-04-12T09:56:02ZMeasures of inequality are often limited in their ability to capture multidimensional aspects that arise from the joint distribution of multiple socio-economic variables. In this paper, we develop bivariate extensions of the Zenga inequality measure using bivariate quantile functions. We propose new bivariate Zenga surfaces and study their theoretical properties. A vector-valued bivariate Zenga curve is also introduced to provide a more detailed characterization of inequality. A non-parametric estimator is proposed and methods are evaluated through simulation studies and applied to the analysis of digital inequality across countries using indicators such as broadband penetration and digital literacy. The results highlight the effectiveness of the proposed framework in capturing multidimensional inequality.2026-04-12T09:56:02ZShifna P RS. M. Sunojhttp://arxiv.org/abs/2604.10310v1Weak convergence from projected laws on a positive-measure set of directions2026-04-11T18:23:35ZThe Cramér-Wold device characterises weak convergence of probability measures on $\mathbb{R}^d$ through convergence of all one-dimensional projected laws. We prove that, if the target projected laws are moment-determinate for surface-almost every direction, then weak convergence already follows from projected convergence on a positive-measure set of directions. This yields a simple probabilistic interpretation: if one samples a direction at random from any distribution on the sphere that is absolutely continuous with respect to surface measure, then, with probability one, convergence of the projected law along the sampled direction already forces global weak convergence under the same moment-determinacy assumption.2026-04-11T18:23:35ZAlejandro CholaquidisManuel Hernandez Banadikhttp://arxiv.org/abs/2603.14273v3Using large language models for sensitivity analysis in causal inference: case studies on Cornfield inequality and E-value2026-04-09T03:17:35ZSensitivity analysis methods such as the Cornfield inequality and the E-value were developed to assess the robustness of observed associations against unmeasured confounding -- a major challenge in observational studies. However, the calculation and interpretation of these methods can be difficult for clinicians and interdisciplinary researchers. Recent advances in large language models (LLMs) offer accessible tools that could assist sensitivity analyses, but their reliability in this context has not been studied. We assess four widely used LLMs, ChatGPT, Claude, DeepSeek, and Gemini, on their ability to conduct sensitivity analyses using Cornfield inequalities and E-values. We first extract study-specific information (exposures, outcomes, measured confounders, and effect estimates) from four published observational studies in different fields. Using such information, we develop structured prompts to assess the performance of the LLMs in three aspects: (1) accuracy of E-value calculation, (2) qualitative interpretation of robustness to unmeasured confounding, and (3) suggestion of possible unmeasured confounders. To our knowledge, there has been little prior work on using LLMs for sensitivity analysis, and this study is an early investigation in this area. The results show that ChatGPT, Claude, and Gemini accurately reproduce the E-values, whereas DeepSeek shows small biases. Qualitative conclusions from all the LLMs align with the magnitude of the E-values and the reported effect sizes, and all models identify biologically and epidemiologically plausible unmeasured confounders. These findings suggest that, when guided by structured prompts, LLMs can effectively assist in evaluating unmeasured confounding, and thereby can support study design and decision-making in observational studies.2026-03-15T08:07:00ZQingyan XiangJiahao ZhangBojian Fenghttp://arxiv.org/abs/2511.05834v2Impacts of Data Splitting Strategies on Parameterized Link Prediction Algorithms2026-04-08T02:43:27ZLink prediction is a fundamental problem in network science, aiming to infer potential or missing links based on observed network structures. With the increasing adoption of parameterized models, the rigor of evaluation protocols has become critically important. However, a previously common practice of using the test set during hyperparameter tuning has led to human-induced information leakage, thereby inflating the reported model performance. To address this issue, this study introduces a novel evaluation metric, Loss Ratio, which quantitatively measures the extent of performance overestimation. We conduct large-scale experiments on 60 real-world networks across six domains. The results demonstrate that the information leakage leads to an average overestimation of about 3.6%, with the bias reaching over 15% for specific algorithms. Meanwhile, heuristic and random-walk-based methods exhibit greater robustness and stability. The analysis uncovers a pervasive information leakage issue in link prediction evaluation and underscores the necessity of adopting standardized data splitting strategies to enable fair and reproducible benchmarking of link prediction models.2025-11-08T03:52:22Z18 pages, 3 figures. Published in Physica A (2026)Physica A: Statistical Mechanics and its Applications, 692 (2026), 131545Xinshan JiaoYuxin LuoYilin BiTao Zhou10.1016/j.physa.2026.131545http://arxiv.org/abs/2302.08724v4Piecewise Deterministic Markov Processes for Bayesian Neural Networks2026-04-06T08:51:01ZInference on modern Bayesian Neural Networks (BNNs) often relies on a variational inference treatment, imposing violated assumptions of independence and the form of the posterior. Traditional MCMC approaches avoid these assumptions at the cost of increased computation due to its incompatibility to subsampling of the likelihood. New Piecewise Deterministic Markov Process (PDMP) samplers permit subsampling, though introduce a model specific inhomogenous Poisson Process (IPPs) which is difficult to sample from. This work introduces a new generic and adaptive thinning scheme for sampling from these IPPs, and demonstrates how this approach can accelerate the application of PDMPs for inference in BNNs. Experimentation illustrates how inference with these methods is computationally feasible, can improve predictive accuracy, MCMC mixing performance, and provide informative uncertainty measurements when compared against other approximate inference schemes.2023-02-17T06:38:16Ztypo fix, Includes correction to software and corrigendum note (fix supplementary references)Ethan GoanDimitri PerrinKerrie MengersenClinton Fookeshttp://arxiv.org/abs/2604.00848v2Debiased Estimators in High-Dimensional Regression: A Review and Replication of Javanmard and Montanari (2014)2026-04-06T03:25:32ZHigh-dimensional statistical settings ($p \gg n$) pose fundamental challenges for classical inference, largely due to bias introduced by regularized estimators such as the LASSO. To address this, Javanmard and Montanari (2014) propose a debiased estimator that enables valid hypothesis testing and confidence interval construction. This report examines their debiased LASSO framework, which yields asymptotically normal estimators in high-dimensional settings. The key theoretical results underlying this approach are presented. Specifically, the construction of an optimized debiased estimator that restores asymptotic normality, which enables the computation of valid confidence intervals and $p$-values. To evaluate the claims of Javanmard and Montanari, a subset of the original simulation study and the real-data analysis is presented. The original empirical analysis is extended to the desparsified LASSO, which is referenced but not implemented in the original study. The results demonstrate that while the debiased LASSO achieves reliable coverage and controls Type I error, the LASSO projection estimator can offer improved power in idealized low-signal settings without compromising error rates. The results reveal a trade-off: the LASSO projection estimator performs well in low-signal settings, while Javanmard and Montanari's method is more robust to complex correlations, improving precision and signal detection in real data.2026-04-01T13:01:08ZBenjamin Smithhttp://arxiv.org/abs/2604.02992v1Why is Regularization Underused? An Empirical Study on Trust and Adoption of Statistical Methods2026-04-03T12:12:10ZStatistical practice does not automatically follow methodological innovation. Regularization methods, widely advocated to reduce overfitting and stabilize inference, are readily available in modern software, but are not consistently used by data analysts. We investigate this implementation gap in a large-scale empirical study of trust in, and acceptance of, regularization techniques, based on $N = 606$ data analysts. Drawing on measurement frameworks from technology acceptance research, we survey practitioners and embed a randomized experiment to test whether written recommendation of regularization methods increases trust or intended use. We find no evidence of such an effect. Instead, adoption intentions are strongly associated with analysts' perceptions of ease of implementation and practical benefit, such as improved bias control or interpretability. Perceived social norms also emerge as a central driver. These results indicate that uptake of statistical methodology depends less on formal recommendations than on usability, perceived utility, and community practice.2026-04-03T12:12:10ZKonstantin Emil ThielMarléne BaumeisterNicole KrämerAndreas GrollMarkus PaulyMagdalena Wischnewskihttp://arxiv.org/abs/2604.01501v1Identifying and Estimating Causal Direct Effects Under Unmeasured Confounding2026-04-02T00:23:59ZCausal mediation analysis provides techniques for defining and estimating effects that may be endowed with mechanistic interpretations. With many scientific investigations seeking to address mechanistic questions, causal direct and indirect effects have garnered much attention. The natural direct and indirect effects, the most widely used among such causal mediation estimands, are limited in their practical utility due to stringent identification requirements. Accordingly, considerable effort has been invested in developing alternative direct and indirect effect decompositions with relaxed identification requirements. Such efforts often yield effect definitions with nuanced and challenging interpretations. By contrast, relatively limited attention has been paid to relaxing the identification assumptions of the natural direct and indirect effects. Motivated by a secondary aim of a recent non-randomized vaccine prospective cohort study (NCT05168813), we present a set of relaxed conditions under which the natural direct effect is identifiable in spite of unobserved baseline confounding of the exposure-mediator pathway; we use this result to investigate the effect mediated by putative immune correlates of protection. Relaxing the commonly used but restrictive cross-world counterfactual independence assumption, we discuss strategies for evaluating the natural direct effect in non-randomized settings that arise in the analysis of vaccine studies. We revisit prior studies of semi-parametric efficiency theory to demonstrate the construction of flexible, multiply robust estimators of the natural direct effect and discuss efficient estimation strategies that do not place restrictive modeling assumptions on nuisance functions.2026-04-02T00:23:59ZPhilippe BoileauNima S. HejaziIvana MalenicaPeter B. GilbertSandrine DudoitMark J. van der Laanhttp://arxiv.org/abs/2604.00424v1Distributional regression models for meta-analysis2026-04-01T03:09:15ZMeta-analyses are regarded as the highest level in the hierarchy of evidence, yet standard models traditionally concentrated on estimating the mean effect size, often under restrictive assumptions about the underlying distribution, such as homogeneous variance, symmetric shapes. We introduce a distributional regression framework for meta-analysis that generalizes these conventional models by allowing all parameters of the effect size distribution, such as location, scale, and shape, to be modelled as functions of explanatory variables. This unified framework accommodates a wide range of existing models, including random-effects, multilevel, multivariate, location-scale, and outlier-robust meta-analyses, as special cases. We provide an illustrative example, using 67,393 meta-analyses from the Cochrane Database of Systematic Reviews, employing location-scale models to investigate whether smaller studies tend to report larger effect sizes (i.e., small-study effects) and exhibit greater heterogeneity. We discuss implementation strategies using existing software, considerations for model selection and pre-registration, and the need for further methodological development. By moving beyond the mean effect size, distributional regression enables researchers to explore systematic variation in distributional structure, facilitating the joint test of new hypotheses corresponding to multiple distributional parameters.2026-04-01T03:09:15Z31 pagesYefeng YangShinichi Nakagawahttp://arxiv.org/abs/2603.21672v3Mislearning of Factor Risk Premia under Structural Breaks: A Misspecified Bayesian Learning Framework2026-03-31T23:16:16ZWhile asset-pricing models increasingly recognize that factor risk premia are subject to structural change, existing literature typically assumes that investors correctly account for such instability. This paper studies how investors instead learn under a misspecified model that underestimates structural breaks. We propose a minimal Bayesian framework in which this misspecification generates persistent prediction errors and pricing distortions, and we introduce an empirically tractable measure of mislearning intensity $(Δ_t)$ based on predictive likelihood ratios.
The empirical results yield three main findings. First, in benchmark factor systems, elevated mislearning does not forecast a deterministic short-run collapse in performance; instead, it is associated with stronger long-horizon returns and Sharpe ratios, consistent with an equilibrium premium for acute model uncertainty. Second, in a broader anomaly universe, this pricing relation does not generalize uniformly: mislearning is more strongly associated with future drawdowns, downside semivolatility, and other measures of instability, with substantial heterogeneity across anomaly families. Third, the cross-sectional relation between instability and mislearning is inherently conditional: while a monotonic link between break-proneness and average mislearning does not hold in the full cross-section, it re-emerges in low-friction (low-IVOL) environments where break-state severity is more comparable across assets.2026-03-23T07:54:15ZYimeng Qiu