http://arxiv.org/abs/2603.05885v1 Bayesian Linear Programming under Learned Uncertainty: Posterior Feasibility Guarantees, Scenario Certification, and Applications 2026-03-06T04:07:44Z

Linear programming is widely used for decision-making in science, engineering, and operations research, yet in many modern applications the coefficients entering the constraints and objective are not known exactly and must be learned from data. Classical stochastic and robust optimization offer two influential paradigms for handling such uncertainty, but they typically treat the underlying uncertainty description as given and do not directly integrate prior-to-posterior updating with feasibility guarantees. This paper develops a Bayesian framework for linear programming in which uncertain quantities are modeled probabilistically, updated through observed data, and propagated into optimization through posterior feasibility requirements. We present two complementary computational strategies: a credible-region robustification that converts posterior uncertainty into deterministic protection, and a posterior-scenario approach that uses sampled posterior realizations to construct tractable optimization problems with finite-sample interpretability. We also propose a Monte Carlo certification procedure that provides conservative, data-conditioned assessments of residual infeasibility. Simulation experiments show that the proposed framework substantially improves safety relative to naive plug-in decisions, while a real-data study on single-cell transcriptomic data demonstrates that the approach can produce scientifically interpretable decisions together with explicit uncertainty-aware feasibility diagnostics. The proposed methodology offers a unified bridge between Bayesian learning, optimization under uncertainty, and practical decision certification.

2026-03-06T04:07:44Z Debashis Chatterjee http://arxiv.org/abs/2602.15581v2 Confidence as Forecast: A Decision-Theoretic Interpretation of Confidence Intervals 2026-03-05T15:01:56Z

What, if anything, should a frequentist say about a single realized confidence interval (CI) and its chance of having covered the parameter? Jerzy Neyman's original answer was to refuse any nondegenerate probability for coverage ex post and, instead, to "state that the interval covers". In this paper I argue that the usual frequentist machinery already supports a different reading. I treat the coverage event as a Bernoulli random variable, with the nominal level 1-alpha as its design-based success probability, and view "confidence" as a probability forecast for that Bernoulli outcome. Using strictly proper scoring rules, I show that 1-alpha is the unique optimal constant forecast for coverage, both before and after observing the data, and that it remains optimal post-trial in common unbounded, translation-invariant models with pivot-based CIs. When the design yields a theta-free statistic--such as the relative width of the interval in a finite-window uniform model--the conditional coverage given that statistic provides a nonconstant, design-based refinement of 1-alpha that strictly improves predictive performance. Two thought experiments, a Monty Hall-style shell game and the "lost submarine" example of Morey et al. (2016), illustrate how this perspective resolves familiar interpretational puzzles about CIs without appealing to priors or single-case subjective degrees of belief. I conclude with simple "what to do when you see an interval" guidance for applied work and some implications for teaching confidence intervals as tools for forecasting long-run coverage. Keywords: Confidence intervals, coverage probability, proper scoring rules, probabilistic forecasting, frequentist inference Disclaimer: The findings and conclusions in this report are those of the author and do not necessarily represent the official position of the Centers for Disease Control and Prevention
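The scoring-rule claim above can be illustrated with a minimal sketch. It assumes the Brier (quadratic) score as one example of a strictly proper scoring rule; the abstract does not say which rules the paper uses, and the function names below are hypothetical. For a Bernoulli coverage event with success probability 1-alpha, the expected Brier loss of a constant forecast p is (1-alpha)(p-1)^2 + alpha p^2, which is minimized at p = 1-alpha.

```python
def expected_brier(forecast: float, coverage_prob: float) -> float:
    """Expected Brier loss E[(forecast - Y)^2] for Y ~ Bernoulli(coverage_prob)."""
    return (coverage_prob * (forecast - 1.0) ** 2
            + (1.0 - coverage_prob) * forecast ** 2)


def best_constant_forecast(coverage_prob: float, grid_size: int = 1001) -> float:
    """Grid-search the constant forecast in [0, 1] with the smallest expected loss."""
    grid = [i / (grid_size - 1) for i in range(grid_size)]
    return min(grid, key=lambda p: expected_brier(p, coverage_prob))


# Illustrative choice: a nominal 95% interval (alpha = 0.05).
alpha = 0.05
best = best_constant_forecast(1.0 - alpha)
print(f"best constant forecast on the grid: {best:.3f}")
```

The grid search is only for illustration; setting the derivative of the expected loss to zero gives the same minimizer in closed form.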

2026-02-17T13:52:32Z Scott Lee http://arxiv.org/abs/2603.04541v1 Engaging students with statistics through choice of real data context on homework 2026-03-04T19:27:18Z

Statistics educators recommend teaching with real data with relevant contexts, but defining relevancy is challenging and varies by student. We investigated whether providing student choice of data context increases engagement through a quasi-experiment in two sections of an introductory probability and statistics course at a large public university (n=65 consenting students). Sections alternated as treatment and control: during their treatment, students chose weekly homework from three similar instructor-provided options varying by data context; during control weeks, they received randomly assigned contexts. We found no significant difference in homework grades between treatment and control conditions. However, thematic analysis revealed students with choice reported enhanced engagement and motivation, greater appreciation for statistics' real-world value, and increased autonomy. Students overwhelmingly preferred contexts relevant to their interests, experiences, daily lives, and career paths, though preferences varied considerably across individuals. Based on these findings, we provide four recommendations for statistics educators: (1) use real data with authentic contexts, (2) select contexts students care about, (3) incorporate variety across data contexts, and (4) consider choice as a pedagogical tool.

2026-03-04T19:27:18Z 25 pages, 3 figures, 2 tables. Submitted to The American Statistician. Supplementary materials and code available at https://github.com/CatalinaMedina/data-context-choice-manuscript Catalina Medina Mine Dogucu http://arxiv.org/abs/2603.11060v1 LLY Ricci Reweighting in Stochastic Block Models: Uniform Curvature Concentration and Finite-Horizon Tracking 2026-03-04T18:32:09Z

We study curvature-driven edge reweighting for community recovery in the balanced two-block stochastic block model. Given a graph G with initial weights equal to the adjacency matrix, we iteratively update edge weights using Lin-Lu-Yau (Ollivier-type) Ricci curvature, while all transportation costs are computed in the unweighted graph metric. In a moderate-density regime we prove uniform concentration of edge curvatures and show that a single Ricci reweighting step produces a two-level weighting that amplifies within-block connectivity relative to across-block connectivity. As a consequence, spectral clustering on the reweighted graph has a strictly larger population eigengap, and we obtain corresponding non-asymptotic perturbation bounds and Davis-Kahan misclustering guarantees. We further analyze a fixed finite horizon of iterated reweighting, where the random iterates track a deterministic two-weight recursion uniformly over the time horizon. This yields a principled finite-horizon curvature flow interpretation for community detection in a canonical random graph model.

2026-03-04T18:32:09Z Varun Kotharkar http://arxiv.org/abs/2511.01960v2 Towards a Unified Framework for Statistical and Mathematical Modeling 2026-03-04T15:42:36Z

Within the biological, physical, and social sciences, there are two broad quantitative traditions: statistical and mathematical modeling. Both traditions have the common pursuit of advancing our scientific knowledge, but these traditions have developed largely independently using distinct languages and inferential frameworks. This paper uses the notion of identification from causal inference, a field originating from the statistical modeling tradition, to develop a shared language. I first review foundational identification results for statistical models and then extend these ideas to mathematical models. Central to this framework is the use of bounds, ranges of plausible numerical values, to analyze both statistical and mathematical models. I discuss the implications of this perspective for the interpretation, comparison, and integration of different modeling approaches, and illustrate the framework with a simple pharmacodynamic model for hypertension. To conclude, I describe areas where the approach taken here should be extended in the future. By formalizing connections between statistical and mathematical modeling, this work contributes to a shared framework for quantitative science. My hope is that this work will advance interactions between these two traditions.

2025-11-03T18:21:50Z Paul N Zivich http://arxiv.org/abs/2603.03828v1 Philosophical foundations of statistics 2026-03-04T08:26:53Z

The philosophical foundations of statistics involve issues in theoretical statistics, such as goals and methods to meet these goals, and interpretation of the meaning of inference using statistics. They are related to the philosophy of science and to the philosophy of probability. We review the core and partly interrelated themes and place them in context.

2026-03-04T08:26:53Z 7 pages, no figures; Statistical Research Report, Department of Mathematics, University of Oslo, February 2023, but now arXiv'd March 2026. The article has appeared in International Encyclopedia of Statistical Science 2024, pages 1894-1899, Springer, at this url: https://link.springer.com/content/pdf/10.1007/978-3-662-69359-9_471.pdf Inge G. Helland Nils Lid Hjort Gunnar Taraldsen http://arxiv.org/abs/2603.02372v1 Implications of the Pessimistic Lower Limit on the Drake Equation 2026-03-02T20:21:40Z

The observation of life on Earth is generally accepted to be uninformative concerning the probability of life on other Earth-like planets, a belief first formalized by Brandon Carter and based on the selection effect of our existence. In a similar way, the Drake equation is either presented as an estimate of the total number of active, communicative, extraterrestrial civilizations in our Galaxy ($n^g_{\rm civ}$), i.e. excluding humanity, or humanity is included in the estimate but judged to be an uninformative data point. Daniel Whitmire has recently challenged the Carter abiogenesis argument, claiming the logic behind it is flawed, as the conditional likelihoods used by Carter in Bayes' theorem are not evaluated prior to the occurrence of the evidence of life on Earth, but posterior. Doing so correctly, the anthropic selection effect is removed and the observation of life on Earth is informative after all. Following this argument, we treat the Drake equation as an estimate of all technological civilizations in a statistical counting experiment and include the data point of humanity as informative evidence. This allows one to set a pessimistic lower limit on $n^o_{\rm civ}$ for the observable universe, $n^o_{\rm civ} > 0.051$ at 95\% C.L., or $n^g_{\rm civ} > 8\times10^{-13}$ at 95\% C.L. for the Galaxy. In particular, this excludes models that predict $n^o_{\rm civ}\ll 1$ for the observable universe and refines the allowable parameter space for hypotheses like Rare Earth. Our analysis substantially reduces the portion of the Drake equation parameter space that predicts humanity is alone; when applying the lower limit this study finds $P(n^o_{\rm civ}>1 |\, {\rm humanity}) = 97.6\%$, making solitude in the observable universe a disfavored outcome. For the low-end estimate of $n^o_{\rm civ}\! =\! 1$ we calculate a probability of 42\% for the existence of other communicating civilizations.

2026-03-02T20:21:40Z 8 pages, 2 figures Max Baak Hella Snoek http://arxiv.org/abs/2603.02131v1 Socio-Spatial Patterns of Suicide Mortality in the United States 2026-03-02T17:47:27Z

Suicides cause over 49,000 deaths yearly in the United States, 55% involving firearms. Suicide mortality exhibits substantial geographical and sociodemographic heterogeneity; yet the role of social networks remains underexplored. To assess how suicide risk and firearm restriction policies propagate through social ties, we integrate county-level suicide mortality data (2010-2022) with the Facebook Social Connectedness Index (SCI). We also examine Extreme Risk Protection Orders (ERPO), state-level policies restricting firearm access for individuals at risk of self-harm. In two-way fixed effects regressions, a one-standard-deviation increase in the SCI-weighted average suicide mortality rate of connected counties was associated with +2.78 deaths per 100,000 in a focal county, while a one-standard-deviation increase in ERPO social exposure was associated with -0.214 deaths per 100,000. These associations persisted when adjusting for geographic proximity and including state-by-year fixed effects, and confirm the effect of social networks on diffusion of both harmful exposures and protective interventions.

2026-03-02T17:47:27Z Code and data: https://github.com/kut97/suicide-sci Kushagra Tiwari M. Amin Rahimian Marie-Laure Charpignon Philippe J. Giabbanelli Praveen Kumar http://arxiv.org/abs/2603.01033v1 Interpreting Net Survival: What We Estimate Versus What We Think We Estimate 2026-03-01T10:18:40Z

Net survival is conventionally defined as ``survival if cancer were the only possible cause of death'', an estimand corresponding to cancer-specific mortality alone. The Pohar Perme estimator targets this by removing general population other-cause mortality from observed total mortality, but achieves it only when cancer patients experience the same other-cause mortality as the general population. However, cancer patients often experience elevated other-cause mortality due to baseline health differences and treatment-induced effects. Using recent theoretical work decomposing total mortality into four components (cancer deaths, baseline health differences, treatment-induced other-cause deaths, and general population other-cause mortality), we show that the Pohar Perme estimator delivers the sum of cancer deaths, baseline differences, and treatment-induced deaths, falling short of its intended estimand whenever either source of excess is present. Drawing on Botta \textit{et al.}, we present empirical evidence showing relative risk of other-cause deaths ranging from 1.0 (colorectal cancer) to 4.0+ (head and neck cancers), and calculations demonstrating that net survival can substantially underestimate cancer-specific survival probability when relative risk exceeds 1.0. Critically, treatment-induced other-cause deaths represent irreducible causal pathways from cancer to death that cannot be eliminated through better stratification. We recommend interpreting net survival as ``survival where general population other-cause mortality is removed'' rather than as a causal counterfactual, and call for more precise language in cancer epidemiology.

2026-03-01T10:18:40Z 21 pages, 4 figures Matthew J. Smith http://arxiv.org/abs/2503.20852v2 Teachable normal approximations to binomial and related probabilities or confidence bounds 2026-02-25T18:52:03Z

For the usual normal approximations to binomial, hypergeometric, or Poisson interval probabilities, we collect some simple but then reasonably sharp error bounds. For the Clopper-Pearson~(1934) binomial confidence bounds, we present, following Michael Short's~(2023) approach, bounds similar to, but necessarily more complicated than, Lagrange's (1776) success rate plus/minus normal quantile times estimated standard deviation. The bounds, as presented here in four theorems, should be teachable, to people ranging from sufficiently advanced high school pupils to university students in mathematics or statistics: For understanding most of the proposed approximation results, it should suffice to know binomial laws, their means and variances, and the standard normal distribution function, but not necessarily the concept of a corresponding normal random variable. Accompanying technical remarks, references, and proofs are meant for assuring teachers or for stimulating further research. Of the proposed approximations, some are essentially well-known at least to experts, and some are based on teaching experience and research at Trier University.
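As a rough illustration of the two objects named in the abstract above, the sketch below compares an exact binomial interval probability with a normal approximation, and computes the "success rate plus/minus normal quantile times estimated standard deviation" interval. It does not reproduce the paper's error bounds or its Clopper-Pearson results. The continuity correction of 0.5, the function names, and the example values (n = 100, p = 0.5) are assumptions made for illustration only.

```python
from math import comb, sqrt
from statistics import NormalDist


def binom_interval_prob(n: int, p: float, lo: int, hi: int) -> float:
    """Exact P(lo <= X <= hi) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(lo, hi + 1))


def normal_approx_interval_prob(n: int, p: float, lo: int, hi: int) -> float:
    """Normal approximation to P(lo <= X <= hi), with a 0.5 continuity correction."""
    mean = n * p
    sd = sqrt(n * p * (1 - p))
    std_normal = NormalDist()
    return (std_normal.cdf((hi + 0.5 - mean) / sd)
            - std_normal.cdf((lo - 0.5 - mean) / sd))


def rate_plus_minus_interval(successes: int, n: int, level: float = 0.95):
    """Success rate +/- normal quantile * estimated standard deviation."""
    rate = successes / n
    quantile = NormalDist().inv_cdf(0.5 + level / 2)
    half_width = quantile * sqrt(rate * (1 - rate) / n)
    return rate - half_width, rate + half_width


# Illustrative values only.
exact = binom_interval_prob(100, 0.5, 40, 60)
approx = normal_approx_interval_prob(100, 0.5, 40, 60)
print(f"exact={exact:.4f}  approx={approx:.4f}  abs error={abs(exact - approx):.4f}")
print(rate_plus_minus_interval(50, 100))
```

Only the standard library is used (`math.comb` and `statistics.NormalDist`), so the comparison can be rerun for other values of n and p to see how the approximation error behaves.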

2025-03-26T17:57:15Z 13 pages. Contains now a complete proof of the proposed bounds for Clopper-Pearson bounds. Further various minor improvements Lutz Mattner http://arxiv.org/abs/2602.20954v1 Hierarchical Aggregation Clustering Algorithms Derived from the Bi-partial Objective Function 2026-02-24T14:36:35Z

The paper outlines the principles of construction of a broad class of hierarchical aggregation algorithms of cluster analysis, essentially based on minimum distance mergers, which are derived from the general bi-partial objective function. It is shown how the algorithms arise from the bi-partial objective function, their affinity with the classical hierarchical aggregation algorithms is demonstrated, and examples of such algorithms for concrete forms of the bi-partial objective function are provided. This amounts to the first explicit and, at the same time, quite general connection between optimization in clustering and the hierarchical aggregation algorithms. Thereby, the respective hierarchical algorithms gain a deeper justification, and a means for evaluating the quality of clustering is provided, along with a criterion for stopping the cluster mergers.

2026-02-24T14:36:35Z An original paper, not yet submitted anywhere Jan W. Owsiński http://arxiv.org/abs/2405.09797v3 Extrapolating Single-Treatment Effects Out of Factorial Experiments 2026-02-23T15:47:06Z

Despite their cost, randomized controlled trials (RCTs) are widely regarded as gold-standard evidence in disciplines ranging from social science to medicine. In recent decades, researchers have increasingly sought to reduce the resource burden of repeated RCTs with factorial designs that simultaneously test multiple hypotheses, e.g. experiments that evaluate the effects of many medications or products simultaneously. Here I show that when multiple interventions are randomized in experiments, the effect any single intervention would have outside the experimental setting is not identified absent heroic assumptions, even if otherwise perfectly realistic conditions are achieved. This happens because single-treatment effects involve a counterfactual world with a single focal intervention, allowing other variables to take their natural values (which may be confounded or modified by the focal intervention). In contrast, observational studies and factorial experiments provide information about potential-outcome distributions with zero and multiple interventions, respectively. In this paper, I formalize sufficient conditions for the identifiability of those isolated quantities. I show that researchers who rely on this type of design have to justify either linearity of functional forms or -- in the nonparametric case -- specify with Directed Acyclic Graphs how variables are related in the real world. Finally, I develop nonparametric sharp bounds -- i.e., maximally informative best-/worst-case estimates consistent with limited RCT data -- that show when extrapolations about effect signs are empirically justified. These new results are illustrated with simulated data.

2024-05-16T04:01:53Z Guilherme Duarte http://arxiv.org/abs/2602.18242v1 Reflections on the Future of Statistics Education in a Technological Era 2026-02-20T14:26:15Z

Keeping pace with rapidly evolving technology is a key challenge in teaching statistics. To equip students with essential skills for the modern workplace, educators must integrate relevant technologies into the statistical curriculum where possible. University-level statistics education has experienced substantial technological change, particularly in the tools and practices that underpin teaching and learning. Statistical programming has become central to many courses, with R widely used and Python increasingly incorporated into statistics and data analytics programmes. Additionally, coding practices, database management, and machine learning now feature within some statistics curricula. Looking ahead, we anticipate a growing emphasis on artificial intelligence (AI), particularly the pedagogical implications of generative AI tools such as ChatGPT. In this article, we explore these technological developments and discuss strategies for their integration into contemporary statistics education.

2026-02-20T14:26:15Z Craig Alexander Jennifer Gaskell Vinny Davies http://arxiv.org/abs/2602.17896v1 Central limit theorem for the global clustering coefficient of random geometric graphs 2026-02-19T23:27:39Z

The global clustering coefficient serves as a powerful metric for the structural analysis and comparison of complex networks. Random geometric graphs offer a realistic framework for representing the spatial constraints and geometry often found in real-world network datasets. In this paper, we establish a central limit theorem for the global clustering coefficient of random geometric graphs. Our main result identifies the centering and scaling sequences required for convergence in law to the standard normal distribution. Our approach varies by regime: in the dense case, we employ the Lyapunov CLT; in the intermediate case, we utilize the asymptotic theory of $U$-statistics with sample-size-dependent kernels; and in the sparse regime, we use the method of moments to derive the asymptotic distribution. Notably, the convergence rates for non-uniform and uniform random geometric graphs diverge in the dense regime, yet they coincide in the sparse regime. In addition, we find that the global clustering coefficient for both uniform and non-uniform RGGs is asymptotically equal to $3/4$.
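For readers unfamiliar with the statistic, the following minimal sketch computes a global clustering coefficient under the common definition "closed connected triples divided by all connected triples" (equivalently, three times the number of triangles over the number of wedges). The abstract does not spell out its definition, so this choice, the function name, and the convention of returning 0.0 for a graph with no wedges are assumptions.

```python
from itertools import combinations


def global_clustering_coefficient(edges) -> float:
    """Closed wedges / all wedges for an undirected simple graph given as edge pairs."""
    adjacency = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops
        adjacency.setdefault(u, set()).add(v)
        adjacency.setdefault(v, set()).add(u)

    # A wedge is a pair of distinct neighbours of a common centre node.
    wedges = sum(len(nbrs) * (len(nbrs) - 1) // 2 for nbrs in adjacency.values())

    # A wedge is closed when its two end nodes are themselves adjacent;
    # each triangle is counted three times, once per centre node.
    closed = sum(
        1
        for nbrs in adjacency.values()
        for a, b in combinations(nbrs, 2)
        if b in adjacency[a]
    )
    return closed / wedges if wedges else 0.0


triangle_with_tail = [(0, 1), (1, 2), (0, 2), (2, 3)]
print(global_clustering_coefficient(triangle_with_tail))
```

In the small example, the graph has five wedges, three of which are closed by the single triangle.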

2026-02-19T23:27:39Z Mingao Yuan Md. Niamul Islam Sium http://arxiv.org/abs/2602.16283v1 Orthogonal parametrisations of Extreme-Value distributions 2026-02-18T09:06:26Z

Extreme value distributions are routinely employed to assess risks connected to extreme events in a large number of applications. They are typically two- or three-parameter distributions: the inference can be unstable, which is particularly problematic given that these distributions are often fitted to small samples. Furthermore, the distribution's parameters are generally not directly interpretable and not the key aim of the estimation. We present several orthogonal reparametrisations of the main extreme-value distributions, key in the modelling of rare events. In particular, we apply the theory developed in Cox and Reid (1987) to the Generalised Extreme-Value, Generalised Pareto, and Gumbel distributions. We illustrate the principal advantage of these reparametrisations in a simulation study.

2026-02-18T09:06:26Z Nathan Huet Ilaria Prosdocimi