https://arxiv.org/api/xElZ1BESEYYqHiKo0hjkkOoLem4 2026-03-22T18:49:01Z 1629 105 15 http://arxiv.org/abs/2511.04213v1 Can we trust LLMs as a tutor for our students? Evaluating the Quality of LLM-generated Feedback in Statistics Exams 2025-11-06T09:18:54Z One of the central challenges for instructors is offering meaningful individual feedback, especially in large courses. Faced with limited time and resources, educators are often forced to rely on generalized feedback, even when more personalized support would be pedagogically valuable. To overcome this limitation, one potential technical solution is to utilize large language models (LLMs). For an exploratory study using a new platform connected with LLMs, we conducted a LLM-corrected mock exam during the "Introduction to Statistics" lecture at the University of Munich (Germany). The online platform allows instructors to upload exercises along with the correct solutions. Students complete these exercises and receive overall feedback on their results, as well as individualized feedback generated by GPT-4 based on the correct answers provided by the lecturers. The resulting dataset comprised task-level information for all participating students, including individual responses and the corresponding LLM-generated feedback. Our systematic analysis revealed that approximately 7 \% of the 2,389 feedback instances contained errors, ranging from minor technical inaccuracies to conceptually misleading explanations. Further, using a combined feedback framework approach, we found that the feedback predominantly focused on explaining why an answer was correct or incorrect, with fewer instances providing deeper conceptual insights, learning strategies or self-regulatory advice. These findings highlight both the potential and the limitations of deploying LLMs as scalable feedback tools in higher education, emphasizing the need for careful quality monitoring and prompt design to maximize their pedagogical value. 2025-11-06T09:18:54Z Preprint Markus Herklotz Niklas Ippisch Anna-Carolina Haensch http://arxiv.org/abs/2511.03242v1 Topography, climate, land cover, and biodiversity: Explaining endemic richness and management implications on a Mediterranean island 2025-11-05T07:09:18Z Island endemism is shaped by complex interactions among environmental, ecological, and evolutionary factors, yet the relative contributions of topography, climate, and land cover remain incompletely quantified. We investigated the drivers of endemic plant richness across Crete, a Mediterranean biodiversity hotspot, using spatially explicit data on species distributions, topographic complexity, climatic variability, land cover, and soil characteristics. Artificial Neural Network models, a machine learning tool, were employed to assess the relative importance of these predictors and to identify hotspots of endemism. We found that total species richness, elevation range, and climatic variability were the strongest predictors of endemic richness, reflecting the role of biodiversity, topographic heterogeneity, and climatic gradients in generating diverse habitats and micro-refugia that promote speciation and buffer extinction risk. Endemic hotspots only partially overlapped with areas of high total species richness, indicating that total species richness was the optimal from the ones examined, yet an imperfect surrogate. These environmentally heterogeneous areas also provide critical ecosystem services, including soil stabilization, pollination, and cultural value, which are increasingly threatened by tourism, renewable energy development, land-use change, and climate impacts. Our findings underscore the importance of prioritizing mountainous and climatically variable regions in conservation planning, integrating ecosystem service considerations, and accounting for within-island spatial heterogeneity. By explicitly linking the environmental drivers of endemism to both biodiversity patterns and ecosystem function, this study provides a framework for evidence-based conservation planning in Crete and other Mediterranean islands with similar geological and biogeographic contexts. 2025-11-05T07:09:18Z Aristides Moustakas Ioannis N Vogiatzakis http://arxiv.org/abs/2511.02881v1 From Hume to Jaynes: Induction as the Logic of Plausible Reasoning 2025-11-04T09:02:52Z The problem of induction has persisted since Hume exposed the logical gap between repeated observation and universal inference. Traditional attempts to resolve it have oscillated between two extremes: the probabilistic optimism of Laplace and Jeffreys, who sought to quantify belief through probability, and the critical skepticism of Popper, who replaced confirmation with falsification. Both approaches, however, assume that induction must deliver certainty or its negation. In this paper, I argue that the problem of induction dissolves when recast in terms of logical coherence (understood as internal consistency of credences under updating) rather than truth. Following E. T. Jaynes, probability is interpreted not as frequency or decision rule but as the extension of deductive logic to incomplete information. Under this interpretation, Bayes's theorem is not an empirical statement but a consistency condition that constrains rational belief updating. Induction thus emerges as the special case of deductive reasoning applied to uncertain premises. Falsification appears as the limiting form of Bayesian updating when new data drive posterior plausibility toward zero, while the Bayes Factor quantifies the continuous spectrum of evidential strength. Through analytical examples, including Laplace's sunrise problem, Jeffreys's mixed prior, and confidence-based reformulations, I show that only the logic of plausible reasoning unifies these perspectives without contradiction. Induction, properly understood, is not the leap from past to future but the discipline of maintaining coherence between evidence, belief, and information. 2025-11-04T09:02:52Z Tommaso Costa http://arxiv.org/abs/2510.13389v2 Understanding and Using the Relative Importance Measures Based on Orthogonalization and Reallocation 2025-11-03T05:28:34Z A class of relative importance measures based on orthogonalization and reallocation, ORMs, has been found to effectively approximate the General Dominance index (GD). In particular, Johnson's Relative Weight (RW) has been deemed the most successful ORM in the literature. Nevertheless, the theoretical foundation of the ORMs remains unclear. To further understand the ORMs, we provide a generalized framework that breaks down the ORM into two functional steps: orthogonalization and reallocation. To assess the impact of each step on the performance of ORMs, we conduct extensive Monte Carlo simulations under various predictors' correlation structures and response variable distributions. Our findings reveal that Johnson's minimal transformation consistently outperforms other common orthogonalization methods. We also summarize the performance of reallocation methods under four scenarios of predictors' correlation structures in terms of the first principal component and the variance inflation factor (VIF). This analysis provides guidelines for selecting appropriate reallocation methods in different scenarios, illustrated with real-world dataset examples. Our research offers a deeper understanding of ORMs and provides valuable insights for practitioners seeking to accurately measure variable importance in various modeling contexts. 2025-10-15T10:28:09Z 20 pages, 10 figures Tien-En Chang Argon Chen http://arxiv.org/abs/2511.00982v1 The Neutrality Boundary Framework: Quantifying Statistical Robustness Geometrically 2025-11-02T15:50:21Z We introduce the Neutrality Boundary Framework (NBF), a set of geometric metrics for quantifying statistical robustness and fragility as the normalized distance from the neutrality boundary, the manifold where the effect equals zero. The neutrality boundary value nb in [0,1) provides a threshold-free, sample-size invariant measure of stability that complements traditional effect sizes and p-values. We derive the general form nb = |Delta - Delta_0| / (|Delta - Delta_0| + S), where S>0 is a scale parameter for normalization; we prove boundedness and monotonicity, and provide domain-specific implementations: Risk Quotient (binary outcomes), partial eta^2 (ANOVA), and Fisher z-based measures (correlation). Unlike threshold-dependent fragility indices, NBF quantifies robustness geometrically across arbitrary significance levels and statistical contexts. 2025-11-02T15:50:21Z 8 pages, no figures Thomas F. Heston http://arxiv.org/abs/2503.10710v3 How causal perspectives can inform neuroscience data analysis 2025-11-01T01:41:09Z Over the past two decades, considerable strides have been made in advancing neuroscientific techniques, yet challenges remain in attributing causality to observed associations. This review addresses a fundamental issue in observational neuroscience studies and advocates for incorporating causal inference frameworks into standard practice. We systematically introduce necessary definitions and concepts, emphasizing how causal assumptions underlie statistical analyses even when not explicitly stated. Through a running example on sleep quality and white matter integrity, we illustrate how persistent challenges, including confounding and selection biases, can be conceptualized and addressed using causal frameworks. We demonstrate practical approaches for making assumption violations transparent through hands-on examples: supplementary case studies using multi-site harmonization and head motion exclusion procedures provide step-by-step diagnostic techniques for checking covariate overlap and identifying selection bias through exclusion pattern analysis. We explore how these causal perspectives can inform both experimental design and analytical choices, particularly for observational studies where traditional randomization is infeasible. Together, we believe this framework offers concrete tools for strengthening causal interpretations and inspiring more robust approaches to problems in neuroscience. 2025-03-12T22:20:24Z Eric W. Bridgeford Brian S. Caffo Maya B. Mathur Russell A. Poldrack http://arxiv.org/abs/2504.20941v3 Density-Aware Noise Mechanisms for Differential Privacy on Riemannian Manifolds via Conformal Transformation 2025-10-31T17:16:58Z Differential Privacy (DP) enables privacy-preserving data analysis by adding calibrated noise. While recent works extend DP to curved manifolds such as diffusion-tensor MRI or social networks by adding geodesic noise, these assume uniform data distribution and are not always practical. Hence, these approaches may introduce biased noise and suboptimal privacy-utility tradeoffs for non-uniform data. To address these shortcomings, we develop a density-aware differential privacy mechanism based on conformal transformations over Riemannian manifolds, which calibrates perturbations according to local density while preserving intrinsic geometric structure. We construct the conformal factor based on local kernel density estimates and establish that it inherently adapts to variations in data density. Our mechanism achieves a local balance of sample density and redefines geodesic distances while faithfully preserving the intrinsic geometry of the underlying manifold. We demonstrate that, through conformal transformation, our mechanism satisfies epsilon-differential privacy on any complete Riemannian manifold and derives a closed-form expected geodesic error bound that is contingent solely on the maximal density ratio, independent of global curvature. Empirical results on synthetic and real-world datasets demonstrate that our mechanism substantially improves the privacy-utility tradeoff in heterogeneous manifold settings and remains on par with state-of-the-art approaches when data are uniformly distributed. 2025-04-29T17:05:55Z Submitted Peilin He Liou Tang M. Amin Rahimian James Joshi http://arxiv.org/abs/2409.14284v5 Survey Data Integration for Distribution Function Estimation 2025-10-30T15:34:00Z Estimates of finite population cumulativedistribution functions (CDFs) and quantiles are critical forpolicy-making, resource allocation, and public health planning. For instance, federal finance agencies may require accurate estimates of the proportion of individuals with income below the federal poverty line to determine funding eligibility, while health organizations may rely on precise quantile estimates of key health variables to guide local health interventions. Despite growing interest in survey data integration, research on the integration of probability and nonprobability samples toestimate CDFs and quantiles remains limited. In this study, we propose a novel residual-based CDF estimator that integrates information from a probability sample with data from potentially large nonprobability samples. Our approach leverages shared covariates observed in both datasets, while the response variable is available only in the nonprobability sample. Using a semiparametric approach, we train an outcome model on the nonprobability sample and incorporate model residuals with sampling weights from the probability sample to estimate the CDF of the target variable. Based on this CDF estimator, we define a quantile estimator and introduce linearization and bootstrap methods for variance estimation of both the CDF and quantile estimators. Under certain regularity conditions, we establish the asymptotic properties, including bias and variance, of the CDF estimator. Our empirical findings support the theoretical results and demonstrate the favorable performance of the proposed estimators relative to plug-in mass imputation estimators and the naïve estimators derived from the nonprobability sample only. A real data example is presented to illustrate the proposed estimators. 2024-09-22T01:09:19Z Jeremy Flood Sayed Mostafa http://arxiv.org/abs/2510.26177v1 Variable selection in spatial lag models using the focussed information criterion 2025-10-30T06:35:04Z Spatial regression models have a variety of applications in several fields ranging from economics to public health. Typically, it is of interest to select important exogenous predictors of the spatially autocorrelated response variable. In this paper, we propose variable selection in linear spatial lag models by means of the focussed information criterion (FIC). The FIC-based variable selection involves the minimization of the asymptotic risk in the estimation of a certain parametric focus function of interest under potential model misspecification. We systematically investigate the key asymptotics of the maximum likelihood estimators under the sequence of locally perturbed mutually contiguous probability models. Using these results, we obtain the expressions for the bias and the variance of the estimated focus leading to the desired FIC formula. We provide practically useful focus functions that account for various spatial characteristics such as mean response, variability in the estimation and spatial spillover effects. Furthermore, we develop an averaged version of the FIC that incorporates varying covariate levels while evaluating the models. The empirical performance of the proposed methodology is demonstrated through simulations and real data analysis. 2025-10-30T06:35:04Z 20 pages, 2 figures, 3 tables Sagar Pandhare Divya Kappara Siuli Mukhopadhyay http://arxiv.org/abs/2510.23830v1 Statistical estimation of $π$: varying choices over dimensions 2025-10-27T20:16:49Z This article studies statistical estimation of $π$ based on the fact that the ratio of the volumes of a $d$-dimensional hypersphere and a $d$-dimensional hypercube is a certain function of $π$, and the function depends on the dimension $d$. The estimation of $π$ is carried out for various choices of $d$ (strictly speaking, $d\in\{1, 2, \ldots, 20\}$) using the idea of Monte Carlo simulations. Various intriguing facts are observed, and the estimation of $π$ using infinite dimensional observations is outlined. Moreover, the R codes associated with relevant numerical studies are provided. 2025-10-27T20:16:49Z 8 pages, 4 figures. This is a preliminary draft. The manuscript will be updated further before formal communication Syon Bhattacharjee Subhra Sankar Dhar http://arxiv.org/abs/2505.20822v2 Larger cities, more commuters, more crime? The role of inter-city commuting in the scaling of urban crime 2025-10-27T11:09:03Z Cities attract a daily influx of non-resident commuters, reflecting their roles within wider urban networks -- not as isolated places. However, it remains unclear how this interconnectivity shapes the way crime scales with population, given that larger cities tend to receive more commuters and experience more crime. In this work, we investigate how inter-city commuting relates to the population-crime relationship. We find that larger cities receive proportionately more commuters, which in turn is associated with higher levels of burglary, drug possession, robbery, shoplifting, and theft. For example, each 1% increase in inbound commuters corresponds to a 0.32% rise in theft and 0.20% rise in burglary, holding population size constant. We demonstrate that models incorporating both population size and commuter inflows explain variation in these offenses better than population-only models. Our findings underscore the importance of considering how cities are connected -- not just their population size -- in disentangling the population-crime relationship. 2025-05-27T07:31:43Z 19 pages, 3 figures Simon Puttock Umberto Barros Diego Pinheiro Marcos Oliveira http://arxiv.org/abs/2510.20023v2 Change, dependence, and discovery: Celebrating the work of T.L. Lai 2025-10-26T10:39:20Z Tze Leung Lai made seminal contributions to sequential analysis, particularly in sequential hypothesis testing, changepoint detection and nonlinear renewal theory. His work established fundamental optimality results for the sequential probability ratio test and its extensions, and provided a general framework for testing composite hypotheses. In changepoint detection, he introduced new optimality criteria and computationally efficient procedures that remain influential. He applied these and related tools to problems in biostatistics. In this article, we review these key results in the broader context of sequential analysis. 2025-10-22T20:54:02Z Alexander G. Tartakovsky Jay Bartroff Cheng-Der Fuh Haipeng Xing http://arxiv.org/abs/2510.22550v1 Regularization method in the variable selection for logistic regression on BRFSS data 2025-10-26T06:23:27Z Stroke remains a leading cause of death and disability worldwide, yet effective prediction of stroke risk using large-scale population data remains challenging due to data imbalance and high-dimensional features. In this study, we develop and evaluate regularized logistic regression models for stroke prediction using data from the 2022 Behavioral Risk Factor Surveillance System (BRFSS), comprising 445132 U.S. adult respondents and 328 health-related variables. To address data imbalance, we apply several resampling techniques including oversampling, undersampling, class weighting, and the Synthetic Minority Oversampling Technique (SMOTE). We further employ Lasso, Elastic Net, and Group Lasso regularization methods to perform feature selection and dimensionality reduction. Model performance is assessed using ROC-AUC, sensitivity, and specificity metrics. Among all methods, the Lasso-based model achieved the highest predictive performance (AUC = 0.761), while the Group Lasso method identified a compact set of key predictors: Age, Heart Disease, Physical Health, and Dental Health. These findings demonstrate the potential of regularized regression techniques for interpretable and efficient prediction of stroke risk from large-scale behavioral health data. 2025-10-26T06:23:27Z Jinbo Niu http://arxiv.org/abs/2510.20738v1 Optimizing Feature Ordering in Radar Charts for Multi-Profile Comparison 2025-10-23T16:56:32Z Radar charts are widely used to visualize multivariate data and compare multiple profiles across features. However, the visual clarity of radar charts can be severely compromised when feature values alternate drastically in magnitude around the circle, causing areas to collapse, which misrepresents relative differences. In the present work we introduce a permutation optimization strategy that reorders features to minimize polygon ``spikiness'' across multiple profiles simultaneously. The method is combinatorial (exhaustive search) for moderate numbers of features and uses a lexicographic minimax criterion that first considers overall smoothness (mean jump) and then the largest single jump as a tie-breaker. This preserves more global information and produces visually balanced arrangements. We discuss complexity, practical bounds, and relations to existing approaches that either change the visualization (e.g., OrigamiPlot) or learn orderings (e.g., Versatile Ordering Network). An example with two profiles and $p=6$ features (before/after ordering) illustrates the qualitative improvement. Keywords: data visualization, radar charts, combinatorial optimization, minimax optimization, feature ordering 2025-10-23T16:56:32Z Albert Dorador http://arxiv.org/abs/2508.12982v3 Revisiting Functional Derivatives in Multi-object Tracking 2025-10-23T13:42:37Z Probability generating functionals (PGFLs) are efficient and powerful tools for tracking independent objects in clutter. It was shown that PGFLs could be used for the elegant derivation of practical multi-object tracking algorithms, e.g., the probability hypothesis density (PHD) filter. However, derivations using PGFLs use the so-called functional derivatives whose definitions usually appear too complicated or heuristic, involving Dirac delta ``functions''. This paper begins by comparing different definitions of functional derivatives and exploring their relationships and implications for practical applications. It then proposes a rigorous definition of the functional derivative, utilizing straightforward yet precise mathematics for clarity. Key properties of the functional derivative are revealed and discussed. 2025-08-18T14:58:50Z submitted to SIAM Journal on Control and Optimization Jan Krejčí Ondřej Straka Petr Girg Jiří Benedikt