https://arxiv.org/api/gwfaaWXzh+bDFD4PwlkhlwATTR4 2026-06-10T19:40:17Z 1686 315 15 http://arxiv.org/abs/2411.05391v4 Impossibility results for equating the Youden Index with average scoring rules and Tjur $R^2$-like metrics 2025-02-24T19:57:31Z

We consider the Youden index fas well as measures evaluating predicted probabilities for the maximum-likelihood estimate of a logistic regression model with predictor the classifier. We give impossibility results showing that the Youden index can not equal any average of a real scoring rule nor any metric averaging over binary outcomes (0s and 1s) for any continuous real-valued scoring rule. This shows the obstructions of such potential equivalences and highlights the distinct roles these metrics play in diagnostic assessment.

2024-11-08T08:05:44Z Linard Hoessly http://arxiv.org/abs/2502.13225v1 Entropy of spatial network with applications to non-extensive statistical mechanics 2025-02-18T19:04:40Z

A new method is proposed for analyzing complexity and studying the information in random geometric networks using Tsallis entropy tool. Tsallis entropy of the ensemble of random geometric networks is calculated based on the components of the random connection model on the point process which is obtained by connecting the points with a probability that depends on their relative positions (10.1016/j.indag.2022.05.002, 2022). According to information theory and conditional discussion, the bounds for Shannon and Tsallis entropies of the ensemble of this random graph are presented. Using this function and Lagrange's formula, the connection function that provides the maximum Tsallis entropy based on general constraints is obtained. Then, a simulation-based example is presented to clarify the application of the proposed method in the study of ad hoc wireless networks. By observing the obtained results, it can be stated that the wireless networks that adhere to the model studied here are almost maximally complex. Also, Tsallis conditional entropy maximizing function is compared with other connection functions using numerical calculations and the optimal value for the maximization of conditional entropies is obtained.

2025-02-18T19:04:40Z 17 pages O. K. Kazemi S. M. Taheri http://arxiv.org/abs/2502.13106v1 Score Matching Riemannian Diffusion Means 2025-02-18T18:18:50Z

Estimating means on Riemannian manifolds is generally computationally expensive because the Riemannian distance function is not known in closed-form for most manifolds. To overcome this, we show that Riemannian diffusion means can be efficiently estimated using score matching with the gradient of Brownian motion transition densities using the same principle as in Riemannian diffusion models. Empirically, we show that this is more efficient than Monte Carlo simulation while retaining accuracy and is also applicable to learned manifolds. Our method, furthermore, extends to computing the Fréchet mean and the logarithmic map for general Riemannian manifolds. We illustrate the applicability of the estimation of diffusion mean by efficiently extending Euclidean algorithms to general Riemannian manifolds with a Riemannian $k$-means algorithm and maximum likelihood Riemannian regression.

2025-02-18T18:18:50Z Frederik Möbius Rygaard Steen Markvorsen Søren Hauberg Stefan Sommer http://arxiv.org/abs/2502.12912v1 A Simplified and Numerically Stable Approach to the BG/NBD Churn Prediction model 2025-02-18T14:53:56Z

This study extends the BG/NBD churn probability model, addressing its limitations in industries where customer behaviour is often influenced by seasonal events and possibly high purchase counts. We propose a modified definition of churn, considering a customer to have churned if they make no purchases within M days. Our contribution is twofold: First, we simplify the general equation for the specific case of zero purchases within M days. Second, we derive an alternative expression using numerical techniques to mitigate numerical overflow or underflow issues. This approach provides a more practical and robust method for predicting customer churn in industries with irregular purchase patterns.

2025-02-18T14:53:56Z 4 pages, numerically stable BG/NBD Dylan Zammit Christopher Zerafa http://arxiv.org/abs/2502.11036v2 A Survey: Potential Dimensionality Reduction Methods 2025-02-18T03:24:39Z

Dimensionality reduction is a fundamental technique in machine learning and data analysis, enabling efficient representation and visualization of high-dimensional data. This paper explores five key methods: Principal Component Analysis (PCA), Kernel PCA (KPCA), Sparse Kernel PCA, t-Distributed Stochastic Neighbor Embedding (t-SNE), and Uniform Manifold Approximation and Projection (UMAP). PCA provides a linear approach to capturing variance, whereas KPCA and Sparse KPCA extend this concept to non-linear structures using kernel functions. Meanwhile, t-SNE and UMAP focus on preserving local relationships, making them effective for data visualization. Each method is examined in terms of its mathematical formulation, computational complexity, strengths, and limitations. The trade-offs between global structure preservation, computational efficiency, and interpretability are discussed to guide practitioners in selecting the appropriate technique based on their application needs.

2025-02-16T08:28:33Z Yuan-chin Ivan Chang http://arxiv.org/abs/2502.11645v1 Deviation Ratings: A General, Clone-Invariant Rating Method 2025-02-17T10:39:04Z

Many real-world multi-agent or multi-task evaluation scenarios can be naturally modelled as normal-form games due to inherent strategic (adversarial, cooperative, and mixed motive) interactions. These strategic interactions may be agentic (e.g. players trying to win), fundamental (e.g. cost vs quality), or complementary (e.g. niche finding and specialization). In such a formulation, it is the strategies (actions, policies, agents, models, tasks, prompts, etc.) that are rated. However, the rating problem is complicated by redundancy and complexity of N-player strategic interactions. Repeated or similar strategies can distort ratings for those that counter or complement them. Previous work proposed ``clone invariant'' ratings to handle such redundancies, but this was limited to two-player zero-sum (i.e. strictly competitive) interactions. This work introduces the first N-player general-sum clone invariant rating, called deviation ratings, based on coarse correlated equilibria. The rating is explored on several domains including LLMs evaluation.

2025-02-17T10:39:04Z Luke Marris Siqi Liu Ian Gemp Georgios Piliouras Marc Lanctot http://arxiv.org/abs/2310.08467v2 Teaching Resources for Embedding Ethics in Mathematics: Exercises, Projects, and Handouts 2025-02-11T11:47:04Z

The resources compiled in this document provide an approach to embed and teach Ethics in Mathematics at the undergraduate level. We provide mathematical exercises and homework problems that teach students ethical awareness and transferable skills, for many of the standard courses in the first and second years of a university degree in mathematics or related courses with significant mathematical content (e.g., physics, engineering, computer science, economics, etc). In addition to the exercises, this document also contains a list of projects, essay topics, and handouts for use as final projects and in seminars. This is a living document, and additional contributions are welcome.

2023-10-12T16:27:35Z 106 pages, 2 figures. This is the second version, and we intend to make revisions. Comments and feedback are welcome - please get in touch with us Maurice Chiodo Dennis Müller Rehan Shah http://arxiv.org/abs/2502.06628v1 Random Variables aren't Random 2025-02-10T16:26:01Z

This paper examines the foundational concept of random variables in probability theory and statistical inference, demonstrating that their mathematical definition requires no reference to randomization or hypothetical repeated sampling. We show how measure-theoretic probability provides a framework for modeling populations through distributions, leading to three key contributions. First, we establish that random variables, properly understood as measurable functions, can be fully characterized without appealing to infinite hypothetical samples. Second, we demonstrate how this perspective enables statistical inference through logical rather than probabilistic reasoning, extending the reductio ad absurdum argument from deductive to inductive inference. Third, we show how this framework naturally leads to information-based assessment of statistical procedures, replacing traditional inference metrics that emphasize bias and variance with information-based approaches that better describe the families of distributions used in parametric inference. This reformulation addresses long-standing debates in statistical inference while providing a more coherent theoretical foundation. Our approach offers an alternative to traditional frequentist inference that maintains mathematical rigor while avoiding the philosophical complications inherent in repeated sampling interpretations.

2025-02-10T16:26:01Z 17 pages, no figures Paul W. Vos http://arxiv.org/abs/2502.05336v1 Leveraging Order-Theoretic Tournament Graphs for Assessing Internal Consistency in Survey-Based Instruments Across Diverse Scenarios 2025-02-07T21:18:29Z

This paper introduces Monotone Delta, an order-theoretic measure designed to enhance the reliability assessment of survey-based instruments in human-machine interactions. Traditional reliability measures, such as Cronbach's Alpha and McDonald's Omega, often yield misleading estimates due to their sensitivity to redundancy, multidimensional constructs, and assumptions of normality and uncorrelated errors. These limitations can compromise decision-making in human-centric evaluations, where survey instruments inform adaptive interfaces, cognitive workload assessments, and human-AI trust models. Monotone Delta addresses these issues by quantifying internal consistency through the minimization of ordinal contradictions and alignment with a unidimensional latent order using weighted tournaments. Unlike traditional approaches, it operates without parametric or model-based assumptions. We conducted theoretical analyses and experimental evaluations on four challenging scenarios: tau-equivalence, redundancy, multidimensionality, and non-normal distributions, and proved that Monotone Delta provides more stable reliability assessments compared to existing methods. The Monotone Delta is a valuable alternative for evaluating questionnaire-based assessments in psychology, human factors, healthcare, and interactive system design, enabling organizations to optimize survey instruments, reduce costly redundancies, and enhance confidence in human-system interactions.

2025-02-07T21:18:29Z Muhammad Umair Danish Umair Rehman Katarina Grolinger http://arxiv.org/abs/2502.02927v1 Bayesian estimation of Unit-Weibull distribution based on dual generalized order statistics with application to the Cotton Production Data 2025-02-05T06:40:58Z

The Unit Weibull distribution with parameters $α$ and $β$ is considered to study in the context of dual generalized order statistics. For the analysis purpose, Bayes estimators based on symmetric and asymmetric loss functions are obtained. The methods which are utilized for Bayesian estimation are approximation and simulation tools such as Lindley, Tierney-Kadane and Markov chain Monte Carlo methods. The authors have considered squared error loss function as symmetric and LINEX and general entropy loss function as asymmetric loss functions. After presenting the mathematical results, a simulation study is conducted to exhibit the performances of various derived estimators. As this study is considered for the dual generalized order statistics that is unification of models based distinct ordered random variable such as order statistics, record values, etc. This provides flexibility in our results and in continuation of this, the cotton production data of USA is analyzed for both submodels of ordered random variables: order statistics and record values.

2025-02-05T06:40:58Z 19 Pages, 1 figure, 12 tables, preprint Qazi J. Azhad Abdul Nasir Khan Bhagwati Devi Jahangir Sabbir Khan Ayush Tripathi http://arxiv.org/abs/2501.17719v1 A Framework for Generating Realistic Synthetic Tabular Data in a Randomized Controlled Trial Setting 2025-01-29T15:42:33Z

Generation of realistic synthetic data has garnered considerable attention in recent years, particularly in the health research domain due to its utility in, for instance, sharing data while protecting patient privacy or determining optimal clinical trial design. While much work has been concentrated on synthetic image generation, generation of realistic and complex synthetic tabular data of the type most commonly encountered in classic epidemiological or clinical studies is still lacking, especially with regards to generating data for randomized controlled trials (RTCs). There is no consensus regarding the best way to generate synthetic tabular RCT data such that the underlying multivariate data distribution is preserved. Motivated by an RCT in the treatment of Human Immunodeficiency Virus, we empirically compared the ability of several strategies and two generation techniques (one machine learning, the other a more classical statistical method) to faithfully reproduce realistic data. Our results suggest that using a sequential generation approach with a R-vine copula model to generate baseline variables, followed by a simple random treatment allocation to mimic the RCT setting, and subsequent regression models for variables post-treatment allocation (such as the trial outcome) is the most effective way to generate synthetic tabular RCT data that capture important and realistic features of the real data.

2025-01-29T15:42:33Z Niki Z. Petrakos Erica E. M. Moodie Nicolas Savy http://arxiv.org/abs/2409.01631v3 Doppler Power Spectrum in Channels with von Mises-Fisher Distribution of Scatterers 2025-01-29T12:53:22Z

This paper presents an analytical analysis of the Doppler spectrum in von Mises-Fisher (vMF) scattering channels. A simple closed-form expression for the Doppler spectrum is derived and used to investigate the impact of the vMF scattering parameters, i.e., the mean direction and the degree of concentration of scatterers. The spectrum is observed to exhibit exponential behavior for mobile antenna motion parallel to the mean direction of scatterers, while conforming to a Gaussian-like shape for the perpendicular motion. The validity of the obtained results is verified by comparison against the results of Monte Carlo simulations, where an exact match is observed.

2024-09-03T05:59:58Z Accepted for publication in IEEE Communications Letters (submitted on 2024-07-26; revised and resubmitted on 2024-12-19, accepted 2025-01-27) Kenan Turbic Martin Kasparick Slawomir Stanczak 10.1109/LCOMM.2025.3534520 http://arxiv.org/abs/2405.20415v3 Differentially Private Boxplots 2025-01-27T21:14:50Z

Despite the potential of differentially private data visualization to harmonize data analysis and privacy, research in this area remains underdeveloped. Boxplots are a widely popular visualization used for summarizing a dataset and for comparison of multiple datasets. Consequentially, we introduce a differentially private boxplot. We evaluate its effectiveness for displaying location, scale, skewness and tails of a given empirical distribution. In our theoretical exposition, we show that the location and scale of the boxplot are estimated with optimal sample complexity, and the skewness and tails are estimated consistently, which is not always the case for a boxplot naively constructed from a single existing differentially private quantile algorithm. As a byproduct of this exposition, we introduce several new results concerning private quantile estimation. In simulations, we show that this boxplot performs similarly to a non-private boxplot, and it outperforms the naive boxplot. Additionally, we conduct a real data analysis of Airbnb listings, which shows that comparable analysis can be achieved through differentially private boxplot visualization.

2024-05-30T18:42:07Z Kelly Ramsay Jairo Diaz-Rodriguez http://arxiv.org/abs/2501.16008v1 Gaussian credible intervals in Bayesian nonparametric estimation of the unseen 2025-01-27T12:48:05Z

The unseen-species problem assumes $n\geq1$ samples from a population of individuals belonging to different species, possibly infinite, and calls for estimating the number $K_{n,m}$ of hitherto unseen species that would be observed if $m\geq1$ new samples were collected from the same population. This is a long-standing problem in statistics, which has gained renewed relevance in biological and physical sciences, particularly in settings with large values of $n$ and $m$. In this paper, we adopt a Bayesian nonparametric approach to the unseen-species problem under the Pitman-Yor prior, and propose a novel methodology to derive large $m$ asymptotic credible intervals for $K_{n,m}$, for any $n\geq1$. By leveraging a Gaussian central limit theorem for the posterior distribution of $K_{n,m}$, our method improves upon competitors in two key aspects: firstly, it enables the full parameterization of the Pitman-Yor prior, including the Dirichlet prior; secondly, it avoids the need of Monte Carlo sampling, enhancing computational efficiency. We validate the proposed method on synthetic and real data, demonstrating that it improves the empirical performance of competitors by significantly narrowing the gap between asymptotic and exact credible intervals for any $m\geq1$.

2025-01-27T12:48:05Z 63 pages, 5 figures Claudia Contardi Emanuele Dolera Stefano Favaro http://arxiv.org/abs/2407.05572v2 Reducing Total Trip Time and Vehicle Emission through Park-and-Ride -- methods and case-study 2025-01-22T18:22:23Z

This study addresses important issues of traffic congestion and vehicle emissions in urban areas by developing a comprehensive mathematical framework to evaluate Park-and-Ride (PnR) systems. The proposed approach integrates queueing theory and emissions modeling to simultaneously assess waiting times, travel times, and vehicle emissions under various PnR usage scenarios. The methodology employs a novel combination of Monte Carlo simulation and matrix geometric analytic methods to analyze a queueing network representing PnR facilities and road traffic. A case study of Tsukuba, Japan demonstrates the model's applicability, revealing potential reductions in social costs related to total trip time and emissions through optimized PnR policies. Specifically, the study found that implementing optimal bus frequency and capacity policies could reduce total social costs by up to 30\% compared to current conditions. This research contributes to the literature by providing a unified framework for evaluating PnR systems that considers both time and environmental costs, offering valuable insights for urban planners and policymakers seeking to improve transportation sustainability. The proposed model utilizes a single server queue with a deterministic service time and multiple arrival streams to represent traffic flow, incorporating both private cars and public buses. Emissions are calculated using the Methodologies for Estimating Air Pollutant Emissions from Transport (MEET) framework. The social cost of emissions and total trip time (SCETT) is introduced as a comprehensive metric for evaluating PnR system performance.

2024-07-08T03:04:59Z 54 pages, 34 figures, to be published in Journal of Cleaner Production Ayane Nakamura Fabiana Ferracina Naoki Sakata Takahiro Noguchi Hiroyasu Ando