https://arxiv.org/api/7yvnTQhNM5Lc+S6rVQLhvwyTnRQ 2026-03-22T10:03:26Z 1629 15 15 http://arxiv.org/abs/2603.06820v1 Hippocratic Utility 2026-03-06T19:29:57Z A utility function has been proposed that values more those lives that are saved by not imposing a harmful treatment and values less those lives that could be saved by treating people who would otherwise die. I do not dispute the ethical motivation behind this kind of asymmetry. However, as my example illustrates, the scope of applicability of such a decision criterion may be limited. 2026-03-06T19:29:57Z Tomasz Strzalecki http://arxiv.org/abs/2603.06328v1 Variable selection in linear mixed model meta-regression with suspected interaction effects -- How can tree-based methods help? 2026-03-06T14:40:23Z Detecting interaction effects (IEs) in meta-regression is challenging, especially when few studies are available and many plausible interactions are considered. In many meta-analyses, interpretability is essential, which limits the use of complex machine learning methods. Tree-based approaches offer a potentially useful compromise, but their role in meta-regression with random effects is not yet well understood. This paper examines how traditional linear and tree-based methods can support variable selection for IEs in random effects meta-regression. We compare test-based and information-criterion-based linear selection procedures with meta-CART approaches. These include fixed effect and random effects trees and their stability-selected ensemble variants. All methods are evaluated using a real-world meta-analytic dataset and a plasmode simulation study. The data-generating process assumes linear IEs and is complemented by settings with nonlinear interactions. Our results show that under strictly linear interactions, linear selection methods perform as expected and achieve superior performance for IE detection. Tree-based methods are more conservative when the number of studies is small, but become competitive as sample size increases, particularly the stability-selected variants. When IEs deviate from strict linearity, even in simple ways, the performance of linear methods deteriorates, whereas tree-based approaches, especially stability-selected fixed effect trees, provide a more robust alternative. Overall, stability-selected random effects trees are useful complementary tools for IE detection in applied meta-regression, particularly for metric covariates. They are well suited for pre-selection and sensitivity analyses, and selection frequency patterns in tree ensembles can help reveal structural patterns in the data. 2026-03-06T14:40:23Z 25 pages, 5 figures. Supplementary Materials at https://doi.org/10.17877/TUDODATA-2026-3CDZSS Jan-Bernd Igelmann Paula Lorenz Markus Pauly http://arxiv.org/abs/2603.06072v1 A Hierarchical Bayesian Dynamic Game for Competitive Inventory and Pricing under Incomplete Information: Learning, Credible Risk, and Equilibrium 2026-03-06T09:24:49Z We develop a hierarchical Bayesian dynamic game for competitive inventory and pricing under incomplete information. Two firms repeatedly choose order quantities and prices while facing two layers of uncertainty: unknown market demand and private rival characteristics. The framework combines Bayesian learning about demand and substitution with strategic belief updating about rival types. To make decisions robust to posterior uncertainty, we introduce a credible-risk criterion that rewards expected future profit while penalizing posterior predictive dispersion. This yields a conservative equilibrium concept in which firms learn, compete, and adapt simultaneously. The paper provides the model formulation, information structure, posterior updating mechanism, equilibrium definition, and a computational strategy based on belief-state dynamic programming. A simulation study shows that Bayesian learning is crucial for strong performance and that the credible-risk rule is especially effective as an operational regularizer under uncertainty. A real-data illustration on a high-dimensional protein-expression dataset demonstrates that the same uncertainty-aware Bayesian principle can produce biologically interpretable subgroup and latent-state findings. The proposed framework offers a unified bridge between Bayesian game theory and operations research, with practical relevance for competitive decision-making in uncertain and information-limited environments. 2026-03-06T09:24:49Z Debashis Chatterjee http://arxiv.org/abs/2603.05885v1 Bayesian Linear Programming under Learned Uncertainty: Posterior Feasibility Guarantees, Scenario Certification, and Applications 2026-03-06T04:07:44Z Linear programming is widely used for decision-making in science, engineering, and operations research, yet in many modern applications the coefficients entering the constraints and objective are not known exactly and must be learned from data. Classical stochastic and robust optimization offer two influential paradigms for handling such uncertainty, but they typically treat the underlying uncertainty description as given and do not directly integrate priors and updated to posteriors guarantees. This paper develops a Bayesian framework for linear programming in which uncertain quantities are modeled probabilistically, updated through observed data, and propagated into optimization through posterior feasibility requirements. We present two complementary computational strategies: a credible-region robustification that converts posterior uncertainty into deterministic protection, and a posterior-scenario approach that uses sampled posterior realizations to construct tractable optimization problems with finite-sample interpretability. We also propose a Monte Carlo certification procedure that provides conservative, data-conditioned assessments of residual infeasibility. Simulation experiments show that the proposed framework substantially improves safety relative to naive plug-in decisions, while a real-data study on single-cell transcriptomic data demonstrates that the approach can produce scientifically interpretable decisions together with explicit uncertainty-aware feasibility diagnostics. The proposed methodology offers a unified bridge between Bayesian learning, optimization under uncertainty, and practical decision certification. 2026-03-06T04:07:44Z Debashis Chatterjee http://arxiv.org/abs/2602.15581v2 Confidence as Forecast: A Decision-Theoretic Interpretation of Confidence Intervals 2026-03-05T15:01:56Z What, if anything, should a frequentist say about a single realized confidence interval (CI) and its chance of having covered the parameter? Jerzy Neyman's original answer was to refuse any nondegenerate probability for coverage ex post and, instead, to "state that the interval covers". In this paper I argue that the usual frequentist machinery already supports a different reading. I treat the coverage event as a Bernoulli random variable, with the nominal level 1-alpha as its design-based success probability, and view "confidence" as a probability forecast for that Bernoulli outcome. Using strictly proper scoring rules, I show that 1-alpha is the unique optimal constant forecast for coverage, both before and after observing the data, and that it remains optimal post-trial in common unbounded, translation-invariant models with pivot-based CIs. When the design yields a theta-free statistic--such as the relative width of the interval in a finite-window uniform model--the conditional coverage given that statistic provides a nonconstant, design-based refinement of 1-alpha that strictly improves predictive performance. Two thought experiments, a Monty Hall-style shell game and the "lost submarine" example of Morey et al. (2016), illustrate how this perspective resolves familiar interpretational puzzles about CIs without appealing to priors or single-case subjective degrees of belief. I conclude with simple "what to do when you see an interval" guidance for applied work and some implications for teaching confidence intervals as tools for forecasting long-run coverage. Keywords: Confidence intervals, coverage probability, proper scoring rules, probabilistic forecasting, frequentist inference Disclaimer: The findings and conclusions in this report are those of the author and do not necessarily represent the official position of the Centers for Disease Control and Prevention 2026-02-17T13:52:32Z Scott Lee http://arxiv.org/abs/2603.04541v1 Engaging students with statistics through choice of real data context on homework 2026-03-04T19:27:18Z Statistics educators recommend teaching with real data with relevant contexts, but defining relevancy is challenging and varies by student. We investigated whether providing student choice of data context increases engagement through a quasi-experiment in two sections of an introductory probability and statistics course at a large public university (n=65 consenting students). Sections alternated as treatment and control: during their treatment, students chose weekly homework from three similar instructor-provided options varying by data context; during control weeks, they received randomly assigned contexts. We found no significant difference in homework grades between treatment and control conditions. However, thematic analysis revealed students with choice reported enhanced engagement and motivation, greater appreciation for statistics' real-world value, and increased autonomy. Students overwhelmingly preferred contexts relevant to their interests, experiences, daily lives, and career paths-though preferences varied considerably across individuals. Based on these findings, we provide four recommendations for statistics educators: (1) use real data with authentic contexts, (2) select contexts students care about, (3) incorporate variety across data contexts, and (4) consider choice as a pedagogical tool. 2026-03-04T19:27:18Z 25 pages, 3 figures, 2 tables. Submitted to The American Statistician. Supplementary materials and code available at https://github.com/CatalinaMedina/data-context-choice-manuscript Catalina Medina Mine Dogucu http://arxiv.org/abs/2603.11060v1 LLY Ricci Reweighting in Stochastic Block Models: Uniform Curvature Concentration and Finite-Horizon Tracking 2026-03-04T18:32:09Z We study curvature-driven edge reweighting for community recovery in the balanced two-block stochastic block model. Given a graph G with initial weights equal to the adjacency matrix, we iteratively update edge weights using Lin-Lu-Yau (Ollivier-type) Ricci curvature, while all transportation costs are computed in the unweighted graph metric. In a moderate-density regime we prove uniform concentration of edge curvatures and show that a single Ricci reweighting step produces a two-level weighting that amplifies within-block connectivity relative to across-block connectivity. As a consequence, spectral clustering on the reweighted graph has a strictly larger population eigengap, and we obtain corresponding non-asymptotic perturbation bounds and Davis-Kahan misclustering guarantees. We further analyze a fixed finite horizon of iterated reweighting, where the random iterates track a deterministic two-weight recursion uniformly over the time horizon. This yields a principled finite-horizon curvature flow interpretation for community detection in a canonical random graph model. 2026-03-04T18:32:09Z Varun Kotharkar http://arxiv.org/abs/2511.01960v2 Towards a Unified Framework for Statistical and Mathematical Modeling 2026-03-04T15:42:36Z Within the biological, physical, and social sciences, there are two broad quantitative traditions: statistical and mathematical modeling. Both traditions have the common pursuit of advancing our scientific knowledge, but these traditions have developed largely independently using distinct languages and inferential frameworks. This paper uses the notion of identification from causal inference, a field originating from the statistical modeling tradition, to develop a shared language. I first review foundational identification results for statistical models and then extend these ideas to mathematical models. Central to this framework is the use of bounds, ranges of plausible numerical values, to analyze both statistical and mathematical models. I discuss the implications of this perspective for the interpretation, comparison, and integration of different modeling approaches, and illustrate the framework with a simple pharmacodynamic model for hypertension. To conclude, I describe areas where the approach taken here should be extended in the future. By formalizing connections between statistical and mathematical modeling, this work contributes to a shared framework for quantitative science. My hope is that this work will advance interactions between these two traditions. 2025-11-03T18:21:50Z Paul N Zivich http://arxiv.org/abs/2603.03828v1 Philosophical foundations of statistics 2026-03-04T08:26:53Z The philosophical foundations of statistics involve issues in theoretical statistics, such as goals and methods to meet these goals, and interpretation of the meaning of inference using statistics. They are related to the philosophy of science and to the philosophy of probability. We review the core and partly interrelated themes and place them in context. 2026-03-04T08:26:53Z 7 pages, no figures; Statistical Research Report, Department of Mathematics, University of Oslo, February 2023, but now arXiv'd March 2026. The article has appeared in International Encyclopedia of Statistical Science 2024, pages 1894-1899, Springer, at this url: https://link.springer.com/content/pdf/10.1007/978-3-662-69359-9_471.pdf Inge G. Helland Nils Lid Hjort Gunnar Taraldsen http://arxiv.org/abs/2602.21792v2 p-Hacking Inflates Type I Error Rates in the Error Statistical Approach but not in the Formal Inference Approach 2026-03-03T08:54:25Z p-hacking occurs when researchers conduct multiple significance tests (e.g., p1;H0,1 and p2;H0,2) and then selectively report tests that yield desirable (usually significant) results (e.g., p2 < 0.05;H0,2) without correcting for multiple testing (e.g., 0.05/2 = 0.025). In the present article, I consider p-hacking in the context of two philosophies of significance testing - the error statistical approach and the formal inference approach. I argue that although p-hacking inflates Type I error rates in the error statistical approach, it does not inflate them in the formal inference approach. Specifically, in the error statistical approach, the "actual" familywise error rate (e.g., 1 - [1 - 0.05]2 = 0.098 for two tests) is relevant because it covers both the selectively reported and unreported tests in the "actual" test procedure (i.e., p1;H0,1 and p2;H0,2). In this approach, Type I error rate inflation occurs because the "actual" error rate (0.098) is higher than the nominal error rate (0.05). In contrast, in the formal inference approach, the "actual" familywise error rate is irrelevant because (a) the researcher does not report a statistical inference about the corresponding intersection null hypothesis (i.e., H0,1 intersect H0,2), and (b) the "actual" familywise error rate does not license inferences about the reported individual hypotheses (i.e., H0,2). Instead, in the formal inference approach, only the nominal error rate is relevant, and a comparison with the "actual" error rate is inappropriate. Implications for conceptualizing, demonstrating, and reducing p-hacking are discussed. 2026-02-25T11:17:36Z Mark Rubin http://arxiv.org/abs/2603.02372v1 Implications of the Pessimistic Lower Limit on the Drake Equation 2026-03-02T20:21:40Z The observation of life on Earth is generally accepted to be uninformative concerning the probability of life on other Earth-like planets, a belief first formalized by Brandon Carter and based on the selection effect of our existence. In a similar way, the Drake equation is either presented as estimate of the total number of active, communicative, extraterrestrial civilizations in our Galaxy ($n^g_{\rm civ}$), i.e. excluding humanity, or humanity is included in the estimate but judged to be an uninformative data point. Daniel Whitmire has recently challenged the Carter abiogenesis argument, claiming the logic behind it is flawed, as the conditional likelihoods used by Carter in Bayes' theorem are not evaluated prior to the occurrence of the evidence of life on Earth, but posterior. Doing so correctly, the anthropic selection effect is removed and the observation of life on Earth is informative after all. Following this argument, we treat the Drake equation as estimate of all technological civilizations in a statistical counting experiment and include the data point of humanity as informative evidence. This allows one to set a pessimistic lower limit on $n^o_{\rm civ}$ for the observable universe, $n^o_{\rm civ} > 0.051$ at 95\% C.L., or $n^g_{\rm civ} > 8\times10^{-13}$ at 95\% C.L. for the Galaxy. In particular, this excludes models that predict $n^o_{\rm civ}\ll 1$ for the observable universe and refines the allowable parameter space for hypotheses like Rare Earth. Our analysis substantially reduces the portion of the Drake equation parameter space that predicts humanity is alone; when applying the lower limit this study finds $P(n^o_{\rm civ}>1 |\, {\rm humanity}) = 97.6\%$, making solitude in the observable universe a disfavored outcome. For the low-end estimate of $n^o_{\rm civ}\! =\! 1$ we calculate a probability of 42\% for the existence of other communicating civilizations. 2026-03-02T20:21:40Z 8 pages, 2 figures Max Baak Hella Snoek http://arxiv.org/abs/2603.02131v1 Socio-Spatial Patterns of Suicide Mortality in the United States 2026-03-02T17:47:27Z Suicides cause over 49000 deaths yearly in the United States, 55% involving firearms. Suicide mortality exhibits substantial geographical and sociodemographic heterogeneity; yet the role of social networks remains underexplored. To assess how suicide risk and firearm restriction policies propagate through social ties, we integrate county-level suicide mortality data (2010-2022) with the Facebook Social Connectedness Index (SCI). We also examine Extreme Risk Protection Orders (ERPO), state-level policies restricting firearm access for individuals at risk of self-harm. In two-way fixed effects regressions, a one-standard-deviation increase in the SCI-weighted average suicide mortality rate of connected counties was associated with +2.78 deaths per 100,000 in a focal county, while a one-standard-deviation increase in ERPO social exposure was associated with -0.214 deaths per 100,000. These associations persisted when adjusting for geographic proximity and including state-by-year fixed effects, and confirm the effect of social networks on diffusion of both harmful exposures and protective interventions. 2026-03-02T17:47:27Z Code and data: https://github.com/kut97/suicide-sci Kushagra Tiwari M. Amin Rahimian Marie-Laure Charpignon Philippe J. Giabbanelli Praveen Kumar http://arxiv.org/abs/2603.01800v1 Phase-Type Variational Autoencoders for Heavy-Tailed Data 2026-03-02T12:32:42Z Heavy-tailed distributions are ubiquitous in real-world data, where rare but extreme events dominate risk and variability. However, standard Variational Autoencoders (VAEs) employ simple decoder distributions (e.g., Gaussian) that fail to capture heavy-tailed behavior, while existing heavy-tail-aware extensions remain restricted to predefined parametric families whose tail behavior is fixed a priori. We propose the Phase-Type Variational Autoencoder (PH-VAE), whose decoder distribution is a latent-conditioned Phase-Type (PH) distribution defined as the absorption time of a continuous-time Markov chain (CTMC). This formulation composes multiple exponential time scales, yielding a flexible and analytically tractable decoder that adapts its tail behavior directly from the observed data. Experiments on synthetic and real-world benchmarks demonstrate that PH-VAE accurately recovers diverse heavy-tailed distributions, significantly outperforming Gaussian, Student-t, and extreme-value-based VAE decoders in modeling tail behavior and extreme quantiles. In multivariate settings, PH-VAE captures realistic cross-dimensional tail dependence through its shared latent representation. To our knowledge, this is the first work to integrate Phase-Type distributions into deep generative modeling, bridging applied probability and representation learning. 2026-03-02T12:32:42Z Abdelhakim Ziani András Horváth Paolo Ballarini http://arxiv.org/abs/2603.01033v1 Interpreting Net Survival: What We Estimate Versus What We Think We Estimate 2026-03-01T10:18:40Z Net survival is conventionally defined as ``survival if cancer were the only possible cause of death'', an estimand corresponding to cancer-specific mortality alone. The Pohar Perme estimator targets this by removing general population other-cause mortality from observed total mortality, but achieves it only when cancer patients experience the same other-cause mortality as the general population. However, cancer patients often experience elevated other-cause mortality due to baseline health differences and treatment-induced effects. Using recent theoretical work decomposing total mortality into four components (cancer deaths, baseline health differences, treatment-induced other-cause deaths, and general population other-cause mortality), we show that the Pohar Perme estimator delivers the sum of cancer deaths, baseline differences, and treatment-induced deaths, falling short of its intended estimand whenever either source of excess is present. From Botta \textit{et al}, we present empirical evidence showing relative risk of other-cause deaths ranging from 1.0 (colorectal cancer) to 4.0+ (head and neck cancers), and calculations demonstrating that net survival can substantially underestimate cancer-specific survival probability when relative risk exceeds 1.0. Critically, treatment-induced other-cause deaths represent irreducible causal pathways from cancer to death that cannot be eliminated through better stratification. We recommend interpreting net survival as ``survival where general population other-cause mortality is removed'' rather than as a causal counterfactual, and call for more precise language in cancer epidemiology. 2026-03-01T10:18:40Z 21 pages, 4 figures Matthew J. Smith http://arxiv.org/abs/2503.20852v2 Teachable normal approximations to binomial and related probabilities or confidence bounds 2026-02-25T18:52:03Z For the usual normal approximations to binomial, hypergeometric, or Poisson interval probabilities, we collect some simple but then reasonably sharp error bounds. For the Clopper-Pearson~(1934) binomial confidence bounds, we present, following Michael Short's~(2023) approach, bounds similar to, but necessarily more complicated than, Lagrange's (1776) success rate plus/minus normal quantile times estimated standard deviation. The bounds, as presented here in four theorems, should be teachable, to people ranging from sufficiently advanced high school pupils to university students in mathematics or statistics: For understanding most of the proposed approximation results, it should suffice to know binomial laws, their means and variances, and the standard normal distribution function, but not necessarily the concept of a corresponding normal random variable. Accompanying technical remarks, references, and proofs are meant for assuring teachers or for stimulating further research. Of the proposed approximations, some are essentially well-known at least to experts, and some are based on teaching experience and research at Trier University. 2025-03-26T17:57:15Z 13 pages. Contains now a complete proof of the proposed bounds for Clopper-Pearson bounds. Further various minor improvements Lutz Mattner