https://arxiv.org/api/WOZ/sXPreOzVi+SnsPXmmRnHXmU 2026-06-10T15:36:06Z 1686 255 15 http://arxiv.org/abs/2506.20173v1 Valid Selection among Conformal Sets 2025-06-25T06:59:55Z

Conformal prediction offers a distribution-free framework for constructing prediction sets with coverage guarantees. In practice, multiple valid conformal prediction sets may be available, arising from different models or methodologies. However, selecting the most desirable set, such as the smallest, can invalidate the coverage guarantees. To address this challenge, we propose a stability-based approach that ensures coverage for the selected prediction set. We extend our results to the online conformal setting, propose several refinements in settings where additional structure is available, and demonstrate its effectiveness through experiments.

2025-06-25T06:59:55Z Mahmoud Hegazy Liviu Aolaritei Michael I. Jordan Aymeric Dieuleveut http://arxiv.org/abs/2307.16048v4 Structural restrictions in local causal discovery: identifying direct causes of a target variable 2025-06-21T17:16:22Z

We consider the problem of learning a set of direct causes of a target variable from an observational joint distribution. Learning directed acyclic graphs (DAGs) that represent the causal structure is a fundamental problem in science. Several results are known when the full DAG is identifiable from the distribution, such as assuming a nonlinear Gaussian data-generating process. Here, we are only interested in identifying the direct causes of one target variable (local causal structure), not the full DAG. This allows us to relax the identifiability assumptions and develop possibly faster and more robust algorithms. In contrast to the Invariance Causal Prediction framework, we only assume that we observe one environment without any interventions. We discuss different assumptions for the data-generating process of the target variable under which the set of direct causes is identifiable from the distribution. While doing so, we put essentially no assumptions on the variables other than the target variable. In addition to the novel identifiability results, we provide two practical algorithms for estimating the direct causes from a finite random sample and demonstrate their effectiveness on several benchmark and real datasets.

2023-07-29T18:31:35Z Published in Biometrika (2025). 34 pages, 4 figures Biometrika (2025) Juraj Bodik Valérie Chavez-Demoulin 10.1093/biomet/asaf042 http://arxiv.org/abs/2412.16402v2 The Landscape of College-level Data Visualization Courses, and the Benefits of Incorporating Statistical Thinking 2025-06-19T18:30:21Z

Data visualization is a core part of statistical practice and is ubiquitous in many fields. Although there are numerous books on data visualization, instructors in statistics and data science may be unsure how to teach data visualization, because it is such a broad discipline. To give guidance on teaching data visualization from a statistical perspective, we make two contributions. First, we conduct a survey of data visualization courses at top colleges and universities in the United States, in order to understand the landscape of data visualization courses. We find that most courses are not taught by statistics and data science departments and do not focus on statistical topics, especially those related to inference. Instead, most courses focus on visual storytelling, aesthetic design, dashboard design, and other topics specialized for other disciplines. Second, we outline three teaching principles for incorporating statistical inference in data visualization courses, and provide several examples that demonstrate how to follow these principles. The dataset from our survey allows others to explore the diversity of data visualization courses, and our teaching principles give guidance for encouraging statistical thinking when teaching data visualization.

2024-12-20T23:32:53Z Journal of Statistics and Data Science 2025 Zach Branson Monica Paz Parra Ronald Yurko 10.1080/26939169.2025.2537049 http://arxiv.org/abs/2506.15129v1 Data Verbalisation: What is Text Doing in a Data Visualisation? 2025-06-18T04:07:54Z

This article discusses the role that text elements play in a data visualisation. We argue that there is a need for a simple, coherent explanation of text elements similar to the understanding that already exists for non-text elements like bars, points, and lines. We explore examples of how text is used within a data visualisation and use existing knowledge and assessment techniques to evaluate when text is effective and when it is not. The result is a framework that aims to be easy to understand and easy to apply in order to understand the purpose and effectiveness of the text elements in any data visualisation.

2025-06-18T04:07:54Z 43 pages (including appendix), 20 figures Paul Murrell http://arxiv.org/abs/2501.10974v3 Sequential Change Detection for Learning in Piecewise Stationary Bandit Environments 2025-06-17T18:24:35Z

A finite-horizon variant of the quickest change detection problem is investigated, which is motivated by a change detection problem that arises in piecewise stationary bandits. The goal is to minimize the \emph{latency}, which is smallest threshold such that the probability that the detection delay exceeds the threshold is below a desired low level, while controlling the false alarm probability to a desired low level. When the pre- and post-change distributions are unknown, two tests are proposed as candidate solutions. These tests are shown to attain order optimality in terms of the horizon. Furthermore, the growth in their latencies with respect to the false alarm probability and late detection probability satisfies a property that is desirable in regret analysis for piecewise stationary bandits. Numerical results are provided to validate the theoretical performance results.

2025-01-19T07:27:24Z 15 pages, 2 figures. arXiv admin note: text overlap with arXiv:2501.01291 Yu-Han Huang Venugopal V. Veeravalli http://arxiv.org/abs/2506.14389v1 Ole E. Barndorff-Nielsen: Sand, Wind and Inference 2025-06-17T10:37:50Z

This paper reviews Ole Eiler Barndorff-Nielsen's research in the first decades of his career. The focus is on topics that he kept returning to throughout his scientific life, and on papers that he built on in later important contributions. First his early contributions to the foundations of statistical inference are reviewed with focus on conditional inference and exponential families, two topics in which he had a lifelong interest. The second half of the paper reviews his research on wind blown sand and hyperbolic distributions and processes, including his early contributions to modelling of turbulent wind fields. This research laid the foundations for his later work on financial econometrics and ambit processes.

2025-06-17T10:37:50Z Bernoulli 32, 2026, 49-67 Michael Sørensen 10.3150/25-BEJ1903 http://arxiv.org/abs/2506.13384v1 Delving Into the Psychology of Machines: Exploring the Structure of Self-Regulated Learning via LLM-Generated Survey Responses 2025-06-16T11:48:58Z

Large language models (LLMs) offer the potential to simulate human-like responses and behaviors, creating new opportunities for psychological science. In the context of self-regulated learning (SRL), if LLMs can reliably simulate survey responses at scale and speed, they could be used to test intervention scenarios, refine theoretical models, augment sparse datasets, and represent hard-to-reach populations. However, the validity of LLM-generated survey responses remains uncertain, with limited research focused on SRL and existing studies beyond SRL yielding mixed results. Therefore, in this study, we examined LLM-generated responses to the 44-item Motivated Strategies for Learning Questionnaire (MSLQ; Pintrich \& De Groot, 1990), a widely used instrument assessing students' learning strategies and academic motivation. Particularly, we used the LLMs GPT-4o, Claude 3.7 Sonnet, Gemini 2 Flash, LLaMA 3.1-8B, and Mistral Large. We analyzed item distributions, the psychological network of the theoretical SRL dimensions, and psychometric validity based on the latent factor structure. Our results suggest that Gemini 2 Flash was the most promising LLM, showing considerable sampling variability and producing underlying dimensions and theoretical relationships that align with prior theory and empirical findings. At the same time, we observed discrepancies and limitations, underscoring both the potential and current constraints of using LLMs for simulating psychological survey data and applying it in educational contexts.

2025-06-16T11:48:58Z Leonie V. D. E. Vogelsmeier Eduardo Oliveira Kamila Misiejuk Sonsoles López-Pernas Mohammed Saqr 10.1016/j.chb.2025.108769 http://arxiv.org/abs/2501.02126v2 A Mathematical Lens for Teaching Data Science 2025-06-11T19:12:37Z

Using the National Academies report, {\em Data Science for Undergraduates: Opportunities and Options}, we connect data science curricula to the more familiar pedagogy used by many mathematical scientists. We use their list of ``data acumen" components to ground a discussion, which hopes to connect data science curricula to the more familiar pedagogy used by many mathematical scientists.

2025-01-03T22:36:20Z Johanna Hardin http://arxiv.org/abs/2506.14817v1 Next-Generation Conflict Forecasting: Unleashing Predictive Patterns through Spatiotemporal Learning 2025-06-08T20:42:29Z

Forecasting violent conflict at high spatial and temporal resolution remains a central challenge for both researchers and policymakers. This study presents a novel neural network architecture for forecasting three distinct types of violence -- state-based, non-state, and one-sided -- at the subnational (priogrid-month) level, up to 36 months in advance. The model jointly performs classification and regression tasks, producing both probabilistic estimates and expected magnitudes of future events. It achieves state-of-the-art performance across all tasks and generates approximate predictive posterior distributions to quantify forecast uncertainty. The architecture is built on a Monte Carlo Dropout Long Short-Term Memory (LSTM) U-Net, integrating convolutional layers to capture spatial dependencies with recurrent structures to model temporal dynamics. Unlike many existing approaches, it requires no manual feature engineering and relies solely on historical conflict data. This design enables the model to autonomously learn complex spatiotemporal patterns underlying violent conflict. Beyond achieving state-of-the-art predictive performance, the model is also highly extensible: it can readily integrate additional data sources and jointly forecast auxiliary variables. These capabilities make it a promising tool for early warning systems, humanitarian response planning, and evidence-based peacebuilding initiatives.

2025-06-08T20:42:29Z 33 pages, 9 figures, 3 tables. Presented at workshops hosted by PRIO, AFK (German Association for Peace and Conflict Studies), CCEW (Bundeswehr University Munich), Uppsala University, SODAS (University of Copenhagen) and in briefings with UN agencies including UNIDIR, OCHA, and FAO Simon P. von der Maase http://arxiv.org/abs/2506.06840v1 A Statistical Framework for Model Selection in LSTM Networks 2025-06-07T15:44:27Z

Long Short-Term Memory (LSTM) neural network models have become the cornerstone for sequential data modeling in numerous applications, ranging from natural language processing to time series forecasting. Despite their success, the problem of model selection, including hyperparameter tuning, architecture specification, and regularization choice remains largely heuristic and computationally expensive. In this paper, we propose a unified statistical framework for systematic model selection in LSTM networks. Our framework extends classical model selection ideas, such as information criteria and shrinkage estimation, to sequential neural networks. We define penalized likelihoods adapted to temporal structures, propose a generalized threshold approach for hidden state dynamics, and provide efficient estimation strategies using variational Bayes and approximate marginal likelihood methods. Several biomedical data centric examples demonstrate the flexibility and improved performance of the proposed framework.

2025-06-07T15:44:27Z Fahad Mostafa http://arxiv.org/abs/2506.03680v1 Win Probabilities, Hand Sizes, and Game Duration Analysis in the Bhikar-Sawkar Card Game 2025-06-04T08:10:49Z

We present a Monte Carlo simulation study of the Bhikar-Sawkar card game, a non-deterministic game structurally similar to the classic Beggar-My-Neighbour, which is fully deterministic. Although both games share a common setup, key differences in their rules, particularly the reshuffling of cards after each won hand in Bhikar-Sawkar, introduce stochasticity and significantly increase the space of possible game evolutions. This inherent randomness raises a range of interesting statistical questions regarding the duration of the game, the hand-winner distributions, and the probability of winning the game for a given player. These questions are systematically investigated through large-scale simulations across multiple game configurations.

2025-06-04T08:10:49Z 13 pages, 5 figures Mihir Durve http://arxiv.org/abs/2506.01763v2 Modelling benthic animals in space and time using Bayesian Point Process with cross validation: the case of Holoturians 2025-06-03T16:39:43Z

Understanding the spatial distribution of Holothurians is an essential task for ecosystem monitoring and sustainable management, particularly in the Mediterranean habitats. However, species distribution modeling is often complicated by the presence-only nature of the data and heterogeneous sampling designs. This study develops a spatio-temporal framework based on Log-Gaussian Cox Processes to analyze Holothurians' positions collected across nine survey campaigns conducted from 2022 to 2024 near Giglio Island, Italy. The surveys combined high-resolution photogrammetry with diver-based visual censuses, leading to varying detection probabilities across habitats, especially within Posidonia oceanica meadows. We adopt a model with a shared spatial Gaussian process component to accommodate this complexity, accounting for habitat structure, environmental covariates, and temporal variability. Model estimation is performed using Integrated Nested Laplace Approximation. We evaluate the predictive performances of alternative model specifications through a novel k-fold cross-validation strategy for point processes, using the Continuous Ranked Probability Score. Our approach provides a flexible and computationally efficient framework for integrating heterogeneous presence-only data in marine ecology and comparing the predictive ability of alternative models.

2025-06-02T15:11:52Z Daniele Poggio Gian Mario Sangiovanni Gianluca Mastrantonio Giovanna Jona Lasinio Edoardo Casoli Stefano Moro Daniele Ventura http://arxiv.org/abs/2505.22371v2 Adaptive tail index estimation: minimal assumptions and non-asymptotic guarantees 2025-05-29T07:22:57Z

A notoriously difficult challenge in extreme value theory is the choice of the number $k\ll n$, where $n$ is the total sample size, of extreme data points to consider for inference of tail quantities. Existing theoretical guarantees for adaptive methods typically require second-order assumptions or von Mises assumptions that are difficult to verify and often come with tuning parameters that are challenging to calibrate. This paper revisits the problem of adaptive selection of $k$ for the Hill estimator. Our goal is not an `optimal' $k$ but one that is `good enough', in the sense that we strive for non-asymptotic guarantees that might be sub-optimal but are explicit and require minimal conditions. We propose a transparent adaptive rule that does not require preliminary calibration of constants, inspired by `adaptive validation' developed in high-dimensional statistics. A key feature of our approach is the consideration of a grid for $k$ of size $ \ll n $, which aligns with common practice among practitioners but has remained unexplored in theoretical analysis. Our rule only involves an explicit expression of a variance-type term; in particular, it does not require controlling or estimating a biasterm. Our theoretical analysis is valid for all heavy-tailed distributions, specifically for all regularly varying survival functions. Furthermore, when von Mises conditions hold, our method achieves `almost' minimax optimality with a rate of $\sqrt{\log \log n}~ n^{-|ρ|/(1+2|ρ|)}$ when the grid size is of order $\log n$, in contrast to the $ (\log \log (n)/n)^{|ρ|/(1+2|ρ|)} $ rate in existing work. Our simulations show that our approach performs particularly well for ill-behaved distributions.

2025-05-28T13:58:20Z Johannes Lederer Anne Sabourin Mahsa Taheri http://arxiv.org/abs/2505.16696v1 Sensitivity of ECG QRS Complexes to His-Purkinje Structure in Computational Heart Models 2025-05-22T14:00:05Z

Cardiac digital twins (CDT) are emerging as a potentially transformative tool in cardiology. A critical yet understudied determinant of CDT accuracy is the His-Purkinje system (HPS), which influences ventricular depolarization and shapes the QRS complex of the electrocardiogram (ECG). Here, we quantify how structural variations in the HPS alter QRS morphology and identify which parameters drive this variability. We generated HPS structures using a fractal-tree, rule-based algorithm, systematically varying nine model parameters and assessing their effects on ten QRS-related metrics. We conducted a Sobol sensitivity analysis to quantify direct and interaction-driven contributions of each parameter to observed variability. Our results suggest that most minor changes in HPS structure exert minimal influence on individual QRS features; however, certain parameter combinations can produce abnormal QRS morphologies. Wave durations and peak amplitudes of the QRS complex exhibit low sensitivity to individual HPS parameter variations; however, we found that specific parameter combinations can result in interactions that significantly alter these aspects of QRS morphology. We found that certain HPS structures can cause premature QRS formation, obscuring P-wave formation. QRS timing variability was primarily driven by interactions among branch and fascicle angles and branch repulsivity, though other parameters also showed notable interaction effects. In addition to interactions, individual variations in the number of branches in the HPS also affected QRS timing. While future models should account for these potential sources of variability, this study indicates that minor anatomical differences between a healthy patient's HPS and that of a generic model are unlikely to significantly impact model fidelity or clinical interpretation when both systems are physiologically normal.

2025-05-22T14:00:05Z 35 pages, 18 figures Preetam V. Tanikella Laryssa Abdala Karin Leiderman Annie Green Howard Boyce E. Griffith http://arxiv.org/abs/2505.14955v1 Bayesian Multivariate Approach to Subnational mortality graduation with Age-Varying Smoothness 2025-05-20T22:29:14Z

This work introduces a Bayesian smoothing approach for the joint graduation of mortality rates across multiple populations. In particular, dynamical linear models are used to induce smoothness across ages through structured dependence, analogously to how temporal correlation is accommodated in state-space time-indexed models. An essential issue in subnational mortality probabilistic modelling is the lack or sparseness of information for some subpopulations. For many countries, mortality data is severely limited, and approaches based on a single population model can result in high uncertainty in the adjusted mortality tables. Here, we recognize the interdependence within a group of mortality data and pursue the pooling of information across several curves that ideally share common characteristics, such as the influence of epidemics or major economic shifts. Our proposal considers multivariate Bayesian dynamical models with common parameters, allowing for borrowing of information across mortality tables and enabling tests of convergence across populations. We also employ discount factors, typical in DLMs, to regulate smoothness, with varying discounting across ages, ensuring less smoothness at younger ages and greater stability at adult ages. This setup implies a trade-off between stability and adaptability. The discount parameter controls the responsiveness of the fit at older ages to new data. The estimation is fully Bayesian, accommodating all uncertainties in modelling and prediction. To illustrate the effectiveness of our model, we analyse male and female mortality data from England and Wales between 2010 and 2012, obtained from the Office for National Statistics. In scenarios with simulated missing data, our approach showed strong performance and flexibility in pooling information from related populations with more complete data.

2025-05-20T22:29:14Z 27 pages, 13 figures Luiz F. V. Figueiredo Viviana G. R. Lobo Mariane B. Alves Thais C. O. Fonseca