http://arxiv.org/abs/2507.21022v1 A Generalized Cramér-Rao Bound Using Information Geometry 2025-07-28T17:43:06Z

In information geometry, statistical models are considered as differentiable manifolds, where each probability distribution represents a unique point on the manifold. A Riemannian metric can be systematically obtained from a divergence function using Eguchi's theory (1992); the well-known Fisher-Rao metric is obtained from the Kullback-Leibler (KL) divergence. The geometric derivation of the classical Cramér-Rao Lower Bound (CRLB) by Amari and Nagaoka (2000) is based on this metric. In this paper, we study a Riemannian metric obtained by applying Eguchi's theory to the Basu-Harris-Hjort-Jones (BHHJ) divergence (1998) and derive a generalized Cramér-Rao bound using Amari-Nagaoka's approach. There are potential applications for this bound in robust estimation.
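For orientation, the standard objects named in this abstract can be written as follows. The notation ($p_\theta$, $D$, $g^{(D)}$, $B_\alpha$) is assumed here for illustration and is not taken from the paper. The block states only the classical bound and textbook definitions, not the paper's generalized bound.

```latex
% Classical Cramér-Rao bound for an unbiased estimator \hat\theta (scalar case)
\operatorname{Var}_\theta(\hat\theta) \;\ge\; \frac{1}{I(\theta)},
\qquad
I(\theta) = \mathbb{E}_\theta\!\left[\left(\partial_\theta \log p_\theta(X)\right)^2\right]

% Eguchi's construction: a divergence D induces a Riemannian metric
g^{(D)}_{ij}(\theta) \;=\;
-\left.\frac{\partial}{\partial\theta_i}\frac{\partial}{\partial\theta'_j}
D\!\left(p_\theta, p_{\theta'}\right)\right|_{\theta'=\theta}

% BHHJ (density power) divergence for \alpha > 0; it tends to the KL divergence as \alpha \to 0
B_\alpha(p, q) \;=\;
\int q^{1+\alpha}\,dx \;-\; \left(1+\tfrac{1}{\alpha}\right)\int p\,q^{\alpha}\,dx
\;+\; \tfrac{1}{\alpha}\int p^{1+\alpha}\,dx
```

Taking $D$ to be the KL divergence in the second display recovers the Fisher-Rao metric, whose scalar form is $I(\theta)$ in the first display.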

2025-07-28T17:43:06Z Presented at the IEEE International Symposium on Information Theory (ISIT 2025) Satyajit Dhadumia M. Ashok Kumar http://arxiv.org/abs/2505.09619v5 Machine Learning Solutions Integrated in an IoT Healthcare Platform for Heart Failure Risk Stratification 2025-07-28T09:08:11Z

The management of chronic Heart Failure (HF) presents significant challenges in modern healthcare, requiring continuous monitoring, early detection of exacerbations, and personalized treatment strategies. In this paper, we present a predictive model founded on Machine Learning (ML) techniques to identify patients at HF risk. This model is an ensemble learning approach, a modified stacking technique, that uses two specialized models leveraging clinical and echocardiographic features and then a meta-model to combine the predictions of these two models. We initially assess the model on a real dataset, and the results suggest that it performs well in the stratification of patients at HF risk. Specifically, we obtained high sensitivity (95%), ensuring that nearly all high-risk patients are identified. As for accuracy, we obtained 84%, which can be considered moderate in some ML contexts. However, it is acceptable given our priority of identifying patients at risk of HF, because they will be asked to participate in the telemonitoring program of the PrediHealth research project, on which some of the authors of this paper are working. The initial findings also suggest that ML-based risk stratification models can serve as valuable decision-support tools not only in the PrediHealth project but also for healthcare professionals, aiding in early intervention and personalized patient management. To better understand the value and potential of our predictive model, we also compared its results with those obtained using three baseline models. The preliminary results indicate that our predictive model outperforms these baselines, which consider the features flatly, i.e., without grouping them into clinical and echocardiographic features.
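The two metrics quoted in this abstract can be read off a confusion matrix, as in the minimal sketch below. The counts are hypothetical and were chosen only so that the metrics land at the quoted values; they are not the study's data, and the function names are illustrative.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Share of truly high-risk patients that the model flags."""
    return tp / (tp + fn)


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Share of all patients that the model classifies correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


# Hypothetical confusion-matrix counts (NOT taken from the paper).
tp, fn = 19, 1    # high-risk patients: flagged / missed
tn, fp = 65, 15   # low-risk patients: cleared / falsely flagged

sens = sensitivity(tp, fn)
acc = accuracy(tp, tn, fp, fn)
print(f"sensitivity={sens:.2f}, accuracy={acc:.2f}")
```

With these counts, most of the errors are false positives, which is the trade-off the abstract describes: a screening model can accept a moderate accuracy if few high-risk patients are missed.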

2025-04-07T14:07:05Z Aiman Faiz Claudio Pascarelli Gianvito Mitrano Gianluca Fimiani Marina Garofano Mariangela Lazoi Claudio Passino Alessia Bramanti http://arxiv.org/abs/2503.15382v3 The information mismatch, and how to fix it 2025-07-28T03:49:48Z

We live in unprecedented times in terms of our ability to use evidence to inform medical care. For example, we can perform data-driven post-test probability calculations. However, there is work to do. As has been previously noted, sensitivity and specificity, which play a key role in post-test probability calculations, are defined as unadjusted for patient covariates. In light of this, there have been multiple recommendations that sensitivity and specificity be adjusted for covariates. However, there is less work on the downstream clinical impact of unadjusted sensitivity and specificity. We discuss this here. We argue that unadjusted sensitivity and specificity, when mixed with covariate-dependent pre-test probability scores (which are more easily available nowadays given the multitude of online calculators), can lead to a post-test probability that contains an "information mismatch." We write the equations behind such an information mismatch and discuss the steps that can be taken to fix it.
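The post-test probability calculation mentioned above is an application of Bayes' rule, sketched here. The function name and the numeric inputs are illustrative assumptions, not values from the paper.

```python
def post_test_probability(pre_test: float, sensitivity: float,
                          specificity: float, positive: bool = True) -> float:
    """Bayes' rule for a binary test result.

    pre_test    -- probability of disease before the test
    sensitivity -- P(test positive | disease)
    specificity -- P(test negative | no disease)
    """
    if positive:
        true_path = sensitivity * pre_test
        false_path = (1 - specificity) * (1 - pre_test)
    else:
        true_path = (1 - sensitivity) * pre_test
        false_path = specificity * (1 - pre_test)
    return true_path / (true_path + false_path)


# Illustrative inputs: 10% pre-test probability, 90% sensitivity, 80% specificity.
after_positive = post_test_probability(0.10, 0.90, 0.80, positive=True)
after_negative = post_test_probability(0.10, 0.90, 0.80, positive=False)
print(round(after_positive, 3), round(after_negative, 3))
```

The mismatch described in the abstract arises when `pre_test` comes from a covariate-dependent calculator while `sensitivity` and `specificity` are population-level figures that ignore those same covariates.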

2025-03-19T16:19:25Z Samuel J. Weisenthal Amit K. Chowdhry http://arxiv.org/abs/2407.18572v2 Bernoulli amputation 2025-07-25T07:56:50Z

An approach to amputation, the process of introducing missing values to a complete dataset, is presented. It allows missingness indicators to be constructed in a flexible and principled way via copulas and Bernoulli margins, and dependence to be incorporated in missingness patterns. Besides more classical missingness models such as missing completely at random, missing at random, and missing not at random, the approach is able to model structured missingness such as block missingness and, via mixtures, monotone missingness, which are patterns of missing data frequently found in real-life datasets. Properties such as joint missingness probabilities or missingness correlation are derived mathematically. The approach is demonstrated with mathematical examples and empirical illustrations in terms of a well-known dataset.
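A minimal sketch of the idea of dependent missingness indicators with Bernoulli margins is given below. It assumes a bivariate Gaussian copula, which is only one possible copula choice and not necessarily the one used by the authors; the function name and parameters are hypothetical.

```python
import math
import random
from statistics import NormalDist


def sample_missingness(n: int, p1: float, p2: float, rho: float, seed: int = 0):
    """Draw n pairs of 0/1 missingness indicators (1 = value is missing).

    Margins are Bernoulli(p1) and Bernoulli(p2); dependence comes from a
    Gaussian copula with correlation rho (an assumption of this sketch).
    """
    rng = random.Random(seed)
    std_normal = NormalDist()
    rows = []
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rho * z1 + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        # Map correlated normals to correlated uniforms, then threshold them.
        u1, u2 = std_normal.cdf(z1), std_normal.cdf(z2)
        rows.append((int(u1 <= p1), int(u2 <= p2)))
    return rows


indicators = sample_missingness(n=20000, p1=0.3, p2=0.2, rho=0.8)
rate1 = sum(m1 for m1, _ in indicators) / len(indicators)
rate2 = sum(m2 for _, m2 in indicators) / len(indicators)
joint = sum(m1 * m2 for m1, m2 in indicators) / len(indicators)
print(rate1, rate2, joint)
```

Amputing a complete dataset then amounts to replacing the entry in column `j` of a row with a missing-value marker wherever the corresponding indicator is 1. With `rho = 0`, the two columns go missing independently.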

2024-07-26T07:55:25Z Marius Hofert James Jackson Niels Hagenbuch http://arxiv.org/abs/2507.11833v2 R2 priors for Grouped Variance Decomposition in High-dimensional Regression 2025-07-24T21:55:23Z

We introduce the Group-R2 decomposition prior, a hierarchical shrinkage prior that extends R2-based priors to structured regression settings with known groups of predictors. By decomposing the prior distribution of the coefficient of determination R2 in two stages, first across groups, then within groups, the prior enables interpretable control over model complexity and sparsity. We derive theoretical properties of the prior, including marginal distributions of coefficients, tail behavior, and connections to effective model complexity. Through simulation studies, we evaluate the conditions under which grouping improves predictive performance and parameter recovery compared to priors that do not account for groups. Our results provide practical guidance for prior specification and highlight both the strengths and limitations of incorporating grouping into R2-based shrinkage priors.
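The two-stage decomposition can be pictured with the sketch below, which draws one set of prior variances for the coefficients. It is one plausible reading of the abstract and not the paper's specification: the Beta prior on R2, the Dirichlet allocations, the mapping from R2 to a total variance, and all names and default hyperparameters are assumptions made for illustration.

```python
import random


def dirichlet(alphas, rng):
    """Sample from a Dirichlet distribution via normalized Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [d / total for d in draws]


def sample_group_variances(group_sizes, a=1.0, b=1.0,
                           conc_groups=1.0, conc_within=1.0, seed=0):
    """Draw R2, split the implied total variance across groups, then within groups."""
    rng = random.Random(seed)
    r2 = rng.betavariate(a, b)        # assumed Beta prior on R2
    tau2 = r2 / (1.0 - r2)            # total explained variance (in noise-variance units)
    group_shares = dirichlet([conc_groups] * len(group_sizes), rng)    # stage 1
    variances = []
    for share, size in zip(group_shares, group_sizes):
        within_shares = dirichlet([conc_within] * size, rng)           # stage 2
        variances.append([tau2 * share * w for w in within_shares])
    return r2, tau2, variances


r2, tau2, variances = sample_group_variances(group_sizes=[3, 2, 5])
total_variance = sum(v for group in variances for v in group)
print(r2, tau2, total_variance)
```

In this sketch, smaller concentration values push most of the variance onto a few groups or a few coefficients within a group, which is one way such a prior can express sparsity.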

2025-07-16T01:40:56Z 43 pages, 16 figures Javier Enrique Aguilar David Kohns Aki Vehtari Paul-Christian Bürkner http://arxiv.org/abs/2501.12596v2 Adapting OpenAI's CLIP Model for Few-Shot Image Inspection in Manufacturing Quality Control: An Expository Case Study with Multiple Application Examples 2025-07-14T15:52:38Z

This expository paper introduces a simplified approach to image-based quality inspection in manufacturing using OpenAI's CLIP (Contrastive Language-Image Pretraining) model adapted for few-shot learning. While CLIP has demonstrated impressive capabilities in general computer vision tasks, its direct application to manufacturing inspection presents challenges due to the domain gap between its training data and industrial applications. We evaluate CLIP's effectiveness through five case studies: metallic pan surface inspection, 3D printing extrusion profile analysis, stochastic textured surface evaluation, automotive assembly inspection, and microstructure image classification. Our results show that CLIP can achieve high classification accuracy with relatively small learning sets (50-100 examples per class) for single-component and texture-based applications. However, the performance degrades with complex multi-component scenes. We provide a practical implementation framework that enables quality engineers to quickly assess CLIP's suitability for their specific applications before pursuing more complex solutions. This work establishes CLIP-based few-shot learning as an effective baseline approach that balances implementation simplicity with robust performance, demonstrated in several manufacturing quality control applications.

2025-01-22T02:45:30Z 36 pages, 13 figures Fadel M. Megahed Ying-Ju Chen Bianca Maria Colosimo Marco Luigi Giuseppe Grasso L. Allison Jones-Farmer Sven Knoth Hongyue Sun Inez Zwetsloot http://arxiv.org/abs/2507.08921v1 Are Betting Markets Better than Polling in Predicting Political Elections? 2025-07-11T17:03:39Z

Political elections are one of the most significant aspects of what constitutes the fabric of the United States. In recent history, typical polling estimates have largely lacked precision in predicting election outcomes, which has not only caused uncertainty for American voters, but has also impacted campaign strategies, spending, and fundraising efforts. One intriguing aspect of traditional polling is the types of questions that are asked -- the questions largely focus on asking individuals who they intend to vote for. However, they don't always probe who voters think will win -- regardless of who they want to win. In contrast, online betting markets allow individuals to wager money on who they expect to win, which may capture who individuals think will win in an especially salient manner. The current study used both descriptive and predictive analytics to determine whether data from Polymarket, the world's largest online betting market, provided insights that differed from traditional presidential polling. Overall, findings suggest that Polymarket was superior to polling in predicting the outcome of the 2024 presidential election, particularly in swing states. Results are in alignment with research on "Wisdom of Crowds" theory, which suggests that large groups of people are often accurate in predicting outcomes, even if they are not necessarily experts or closely aligned with the issue at hand. Overall, our results suggest that betting markets, such as Polymarket, could be employed to predict presidential elections and/or other real-world events. However, future investigations are needed to fully unpack and understand the current study's intriguing results, including alignment with Wisdom of Crowds theory and portability to other events.

2025-07-11T17:03:39Z 30 pages, 4 figures Laurie E. Cutting Sarah S. Hughes-Berheim Paul M. Johnson Hiba Baroud Brett Goldstein http://arxiv.org/abs/2411.08547v4 Frequentist Statistics as Internalist Reliabilism 2025-07-11T10:58:51Z

There has long been an impression that reliabilism implies externalism and that frequentist statistics, due to its reliabilist nature, is inherently externalist. I argue, however, that frequentist statistics can plausibly be understood as a form of internalist reliabilism -- internalist in the conventional sense, yet reliabilist in certain unconventional and intriguing ways. Crucially, in developing the thesis that reliabilism does not imply externalism, my aim is not to stretch the meaning of 'reliabilism' merely to sever the implication. Instead, it is to gain a deeper understanding of frequentist statistics, which stands as one of the most sustained attempts by scientists to develop an epistemology for their own use.

2024-11-13T11:52:16Z Hanti Lin http://arxiv.org/abs/2506.04677v2 The cost of ensembling: is it always worth combining? 2025-07-09T12:32:09Z

Given the continuous increase in dataset sizes and the complexity of forecasting models, the trade-off between forecast accuracy and computational cost is emerging as an extremely relevant topic, especially in the context of ensemble learning for time series forecasting. To assess it, we evaluated ten base models and eight ensemble configurations across two large-scale retail datasets (M5 and VN1), considering both point and probabilistic accuracy under varying retraining frequencies. We showed that ensembles consistently improve forecasting performance, particularly in probabilistic settings. However, these gains come at a substantial computational cost, especially for larger, accuracy-driven ensembles. We found that reducing retraining frequency significantly lowers costs, with minimal impact on accuracy, particularly for point forecasts. Moreover, efficiency-driven ensembles offer a strong balance, achieving competitive accuracy with considerably lower costs compared to accuracy-optimized combinations. Most importantly, small ensembles of two or three models are often sufficient to achieve near-optimal results. These findings provide practical guidelines for deploying scalable and cost-efficient forecasting systems, supporting the broader goals of sustainable AI in forecasting. Overall, this work shows that careful ensemble design and retraining strategy selection can yield accurate, robust, and cost-effective forecasts suitable for real-world applications.
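The accuracy-versus-cost trade-off can be illustrated with a toy two-model ensemble, sketched below. All forecasts and cost figures are hypothetical and deliberately constructed so that the two models' errors partly cancel; the simple mean is only one of many possible combination rules, and nothing here reproduces the study's models or datasets.

```python
from statistics import mean


def mae(actual, forecast):
    """Mean absolute error of a point forecast."""
    return mean(abs(a - f) for a, f in zip(actual, forecast))


def mean_ensemble(forecasts):
    """Combine several forecasts by averaging them period by period."""
    return [mean(values) for values in zip(*forecasts)]


actual = [100, 110, 120, 130]

# Hypothetical base models: (forecast, training cost in arbitrary units).
base_models = {
    "model_a": ([90, 100, 110, 120], 1.0),   # under-forecasts by 10
    "model_b": ([108, 118, 128, 138], 4.0),  # over-forecasts by 8
}

combined = mean_ensemble([forecast for forecast, _ in base_models.values()])
ensemble_cost = sum(cost for _, cost in base_models.values())

for name, (forecast, cost) in base_models.items():
    print(name, mae(actual, forecast), cost)
print("ensemble", mae(actual, combined), ensemble_cost)
```

The ensemble is more accurate than either member here, but its cost is the sum of both training costs, and that cost is paid again at every retraining, which is why retraining frequency matters in the abstract's analysis.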

2025-06-05T06:54:19Z Marco Zanotti http://arxiv.org/abs/2504.07704v2 Measures of non-simplifyingness for conditional copulas and vines 2025-07-07T16:32:06Z

In copula modeling, the simplifying assumption has recently been the object of much interest. Although it is very useful to reduce the computational burden, it remains far from obvious whether it is actually satisfied in practice. We propose a theoretical framework which aims at giving a precise meaning to the following question: how non-simplified, or how close to being simplified, is a given conditional copula? For this, we propose a new framework centered on the notion of measure of non-constantness. Then we discuss generalizations of the simplifying assumption to the case where the conditional marginal distributions may not be continuous, and corresponding measures of non-simplifyingness in this case. The simplifying assumption is of particular importance for vine copula models, and we therefore propose a notion of measure of non-simplifyingness of a given copula for a particular vine structure, as well as different scores measuring how non-simplified such a vine decomposition would be for a general vine. Finally, we propose estimators for these measures of non-simplifyingness given an observed dataset. A small simulation study shows the performance of a few estimators of these measures of non-simplifyingness.

2025-04-10T12:46:39Z 22 pages, 1 figure Alexis Derumigny http://arxiv.org/abs/2410.07569v3 Grammatical structures in mathematics: a personal view 2025-07-06T08:47:26Z

The ability to read, write, and speak mathematics is critical to students becoming comfortable with statistical models and skills. Faster development of those skills may act as encouragement to further engage with the discipline. Vocabulary has been the focus of scholarship in existing literature on the linguistics of mathematics and statistics but there are structures such as grammar that go beyond the content of words and symbols. Here I introduce ideas for grammar structures through a sequence of examples.

2024-10-10T03:10:38Z Tess O'Brien http://arxiv.org/abs/2507.03628v1 When Numbers Mislead Us 2025-07-04T14:52:35Z

The belief that numbers offer a single, objective description of reality overlooks a crucial truth: data does not speak for itself. Every dataset results from choices -- what to measure, how, when, and with whom -- which inevitably reflect implicit, and sometimes ideological, assumptions about what is worth quantifying. Moreover, in any analysis, what remains unmeasured can be just as significant as what is captured. When a key variable is omitted -- whether by neglect, design, or ignorance -- it can distort the observed relationships between other variables. This phenomenon, known as omitted variable bias, may produce misleading correlations or conceal genuine effects. In some cases, accounting for this hidden factor can completely overturn the conclusions drawn from a superficial analysis. This is precisely the mechanism behind Simpson's paradox.
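The reversal described above can be checked numerically. The sketch below uses counts in the style of the kidney-stone example often quoted in textbook treatments of Simpson's paradox; treat them as illustrative rather than as data from this paper. The omitted variable is stone size.

```python
# (treatment, stone size) -> (successes, patients); illustrative counts.
counts = {
    ("A", "small"): (81, 87),
    ("A", "large"): (192, 263),
    ("B", "small"): (234, 270),
    ("B", "large"): (55, 80),
}


def stratum_rate(treatment: str, size: str) -> float:
    """Success rate within one stone-size stratum."""
    successes, patients = counts[(treatment, size)]
    return successes / patients


def pooled_rate(treatment: str) -> float:
    """Success rate when stone size is ignored (the omitted variable)."""
    successes = sum(s for (t, _), (s, _n) in counts.items() if t == treatment)
    patients = sum(n for (t, _), (_s, n) in counts.items() if t == treatment)
    return successes / patients


for size in ("small", "large"):
    print(size, round(stratum_rate("A", size), 3), round(stratum_rate("B", size), 3))
print("pooled", round(pooled_rate("A"), 3), round(pooled_rate("B"), 3))
```

Treatment A has the higher success rate within each stratum, yet B looks better once the strata are pooled, because A was given mostly to the harder large-stone cases.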

2025-07-04T14:52:35Z Arthur Charpentier http://arxiv.org/abs/2507.02130v1 BACTA-GPT: An AI-Based Bayesian Adaptive Clinical Trial Architect 2025-07-02T20:29:54Z

Bayesian adaptive clinical trials offer a flexible and efficient alternative to traditional fixed-design trials, but their implementation is often hindered by the complexity of Bayesian computations and the need for advanced statistical programming expertise. The authors introduce a custom fine-tuned LLM designed to assist with this and lower barriers to adoption of Bayesian methods for adaptive clinical trials. This paper describes the development and fine-tuning of BACTA-GPT, a Large Language Model (LLM)-based tool designed to assist in the implementation of Bayesian Adaptive Clinical Trials. This engine uses GPT-3.5 as the underlying model and takes in Natural Language input from the Statistician or the Trialist. The fine-tuned model demonstrates a viable proof-of-concept in its objectives. Test case evaluations show that the model is capable of generating a fit-for-purpose Bayesian model for an adaptive trial and evaluating its operating characteristics via simulations using R and JAGS. The integration of AI code generation has significant potential to lower technical barriers for the design and implementation of Bayesian Adaptive trials. However, it also requires attention to important considerations regarding validation and quality control.
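To make "evaluating operating characteristics via simulations" concrete, the sketch below simulates a single-arm trial with interim looks. It is written in Python with a conjugate Beta-Binomial model rather than the R and JAGS setup the abstract mentions, and the design (look schedule, threshold rate, stopping rule, flat prior) is entirely hypothetical.

```python
import random


def prob_rate_exceeds(successes, failures, threshold, rng, draws=500):
    """Monte Carlo estimate of P(rate > threshold | data) under a Beta(1, 1) prior."""
    hits = sum(rng.betavariate(1 + successes, 1 + failures) > threshold
               for _ in range(draws))
    return hits / draws


def simulate_trial(true_rate, rng, looks=(20, 40, 60), threshold=0.3, stop_at=0.975):
    """Run one trial; stop early for efficacy when the posterior probability is high."""
    successes, enrolled = 0, 0
    for n in looks:
        successes += sum(rng.random() < true_rate for _ in range(n - enrolled))
        enrolled = n
        if prob_rate_exceeds(successes, enrolled - successes, threshold, rng) > stop_at:
            return True, enrolled
    return False, enrolled


def operating_characteristics(true_rate, n_trials=200, seed=1):
    """Estimate the probability of declaring efficacy and the mean sample size."""
    rng = random.Random(seed)
    results = [simulate_trial(true_rate, rng) for _ in range(n_trials)]
    declare_rate = sum(declared for declared, _ in results) / n_trials
    mean_n = sum(n for _, n in results) / n_trials
    return declare_rate, mean_n


false_positive_rate, mean_n_null = operating_characteristics(true_rate=0.3)
power, mean_n_alt = operating_characteristics(true_rate=0.5)
print(false_positive_rate, power, mean_n_null, mean_n_alt)
```

Running the simulation at the null rate estimates the false-positive rate of the design, and running it at a clinically meaningful rate estimates its power; generated code of this kind is what the validation and quality-control concerns in the abstract would apply to.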

2025-07-02T20:29:54Z 15 pages plus 9 page appendix Krishna Padmanabhan Danny Baker http://arxiv.org/abs/2503.10984v3 The Problem of the Priors, or Posteriors? 2025-06-30T03:48:43Z

The problem of the priors is well known: it concerns the challenge of identifying norms that govern one's prior credences. I argue that a key to addressing this problem lies in considering what I call the problem of the posteriors -- the challenge of identifying norms that directly govern one's posterior credences, which backward induce some norms on the priors via the diachronic requirement of conditionalization. This forward-looking approach can be summarized as: Think ahead, work backward. Although this idea can be traced to Freedman (1963), Carnap (1963), and Shimony (1970), I believe that it has not received enough attention. In this paper, I initiate a systematic defense of forward-looking Bayesianism, addressing potential objections from more traditional views (both subjectivist and objectivist). I also develop a specific approach to forward-looking Bayesianism -- one that values the convergence of posterior credences to the truth, and treats it as a fundamental rather than derived norm. This approach, called convergentist Bayesianism, is argued to be crucial for a Bayesian foundation of Ockham's razor in statistics and machine learning.

2025-03-14T01:06:34Z Hanti Lin http://arxiv.org/abs/2506.22182v1 Average-case complexity in statistical inference: A puzzle-driven research seminar 2025-06-27T12:45:44Z

These notes describe our experience with running a student seminar on average-case complexity in statistical inference using the jigsaw learning format at ETH Zurich in Fall 2024. The jigsaw learning technique is an active learning technique where students work in groups on independent parts of the task and then reassemble the groups to combine all the parts together. We implemented this technique for the proofs of various recent research developments, combined with a presentation by one of the students at the beginning of the session. We describe our experience and thoughts on applying such a format in a student research seminar, including, but not limited to, higher engagement, more accessible talks by the students, and increased student participation in discussions. In the Appendix, we include all the exercise sheets for the topic, which may be of independent interest for courses on statistical-to-computational gaps and average-case complexity.

2025-06-27T12:45:44Z Anastasia Kireeva Afonso S. Bandeira