Holistic Decision-Making in Stopping Problems: Emphasizing Psychological Aspects

2026-04-28T14:23:05Z

Our research is closely related to ontological studies in mathematics. It provides crucial insights into the nature of decisions and strategies characterized by Markov moments. In a stopping game, a holistic decision-maker would evaluate comprehensive information by assessing the probabilities of various outcomes and their associated payoffs. This involves understanding the current state, historical data, and potential future scenarios. Such a decision-maker must also consider strategic interactions by anticipating and accounting for the strategies of other players. They must be flexible in adapting their strategy as the game evolves and able to integrate uncertainty by incorporating risk preferences and tolerances. They would perform scenario analysis to evaluate the impact of different stopping times under varying conditions. The goal of this modeling and its implementation in psychological practice is to introduce a novel method for assessing the state of players, leveraging deviations from rational strategies as diagnostic indicators of their psychological and decision-making profiles. The details of other models will be subject to contributed papers. The article presents the theoretical basis for combining various factors when modeling decision-making processes. The original title is "Rationality, Deviation, and Diagnosis: A Holistic Approach to Stopping Games" and will be used when it is possible to describe and interpret the results of the experiments we write about in the last section of the paper.

Generating Synthetic Citation Networks with Communities

2026-04-28T13:03:21Z

Generating realistic synthetic citation, patent, or component dependency networks is essential for benchmarking community detection, graph visualisation, and network data mining algorithms. We present the first systematic comparison of generators of directed graphs that are nearly acyclic and have a ground-truth community structure. We evaluate 12 methods across 7 real citation networks and 26 metrics. We propose the practice of reversing directions of edges in static generators to break cycles and induce a citation-like flow, which significantly improves the performance of a degree-corrected Stochastic Block Model. Our novel methodological approach to evaluating community detection benchmarks distinguishes between endogenous and exogenous mesoscopic similarities, with the latter proving more important. This distinction reveals that high-parameter models suffer from overfitting: they memorise planted community statistics, which leads to their failure to produce realistic networks. Finally, we introduce the Citation Seeder (CS) algorithm, an iterative generator grounded in the Price-Pareto model of citation networks, with interpretable parameters and O(N+E) runtime. CS achieves competitive results against the best-performing baselines while using up to four orders of magnitude fewer parameters and providing a clean framework for explaining and predicting a network's future growth.
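The edge-reversal idea mentioned in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that integer node ids encode publication order (a larger id means a newer paper) and that every edge should point from the newer node to the older one; the function names are hypothetical, and the rule used in the paper may differ.

```python
from collections import defaultdict, deque


def orient_as_citations(edges):
    """Reverse every edge that points from an older node to a newer one.

    Assumption (not from the abstract): node ids encode publication order,
    so a larger id means a newer paper. Self-loops are dropped and
    duplicate edges are merged.
    """
    oriented = set()
    for u, v in edges:
        if u == v:
            continue
        oriented.add((max(u, v), min(u, v)))  # newer cites older
    return sorted(oriented)


def is_acyclic(edges):
    """Check for cycles with Kahn's topological-sort algorithm."""
    nodes = {n for edge in edges for n in edge}
    indegree = {n: 0 for n in nodes}
    successors = defaultdict(list)
    for u, v in edges:
        successors[u].append(v)
        indegree[v] += 1
    queue = deque(n for n in nodes if indegree[n] == 0)
    visited = 0
    while queue:
        u = queue.popleft()
        visited += 1
        for v in successors[u]:
            indegree[v] -= 1
            if indegree[v] == 0:
                queue.append(v)
    return visited == len(nodes)
```

Because every retained edge goes from a larger id to a smaller one, the output is acyclic by construction; `is_acyclic` is included only to check this on small inputs.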

The Price-Pareto growth model of networks with community structure

2026-04-28T12:56:24Z

We introduce a new analytical framework for modelling degree sequences in individual communities of real-world networks, e.g., citations to papers in different fields. Our work is inspired by a recent modification of Price's model, which assumes that citations are gained partly accidentally, and to some extent preferentially. Our work addresses the need to represent the heterogeneity of various scientific domains, as standard homogeneous models fail to capture the distinct growth ratios and citing cultures of different fields. Extending the model to networks with a community structure allows us to derive analytical formulae for, amongst others, citation counts in each cluster and their inequality as described by the Gini index. We also show that the citation count distribution in each community tends to a Pareto type II distribution. Thanks to the derived model parameter estimators, the new model can be fitted to real citation and similar networks.
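The mix of accidental and preferential citation described here can be sketched as a small simulation. This is a single-community toy, not the paper's model: the parameters `m` (citations made by each new paper) and `rho` (probability that a citation is preferential) are hypothetical names, and repeated citations from one paper to the same target are allowed for simplicity.

```python
import random


def simulate_citations(n_papers, m, rho, seed=0):
    """Grow a citation network and return the citation count of each paper.

    Each citation is preferential with probability rho (target drawn in
    proportion to citations already received) and accidental otherwise
    (target drawn uniformly from the existing papers).
    """
    rng = random.Random(seed)
    counts = [0]    # citation counts; the network starts with one paper
    received = []   # one entry per citation received, for preferential draws
    for new in range(1, n_papers):
        for _ in range(min(m, new)):
            if received and rng.random() < rho:
                target = rng.choice(received)   # preferential
            else:
                target = rng.randrange(new)     # accidental
            counts[target] += 1
            received.append(target)
        counts.append(0)
    return counts


def gini(values):
    """Gini index of non-negative values (0 means perfect equality)."""
    xs = sorted(values)
    n, total = len(xs), sum(xs)
    if total == 0:
        return 0.0
    weighted = sum((i + 1) * x for i, x in enumerate(xs))
    return 2 * weighted / (n * total) - (n + 1) / n
```

Calling `gini(simulate_citations(2000, 3, rho))` for several values of `rho` gives a rough feel for how the preferential share affects inequality in citation counts.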

Stop Using the Wilcoxon Test: Myth, Misconception and Misuse in IR Research

2026-04-28T08:08:36Z

In benchmarking of Information Retrieval systems, the Wilcoxon signed-rank test is often treated as a safer alternative to the t-test. This belief is fueled by textbooks and recommendations that portray Wilcoxon as the proper non-parametric alternative because metric scores are not normally distributed. We argue that this narrative is misleading and harmful. A careful review of Statistics textbooks reveals inconsistencies and omissions in how the assumptions underlying these tests are presented, fostering confusion that has propagated into IR research. As a result, Wilcoxon has been routinely misapplied for decades, creating a false sense of safety against a threat that was never there to begin with, while introducing another one so severe that it virtually guarantees the test will break down and mislead researchers. Through a combination of systematic literature review, analysis and empirical demonstrations with TREC data, we show how and why the Wilcoxon test easily loses control of its Type I error rate in IR settings. We conclude that the continued use of Wilcoxon in IR evaluation is unjustified and that abandoning it would improve the methodological soundness of our field.

On the use of satellite information to estimate agricultural carbon footprint in a small area framework

2026-04-28T07:57:09Z

The agricultural sector is undergoing rapid change due to climate pressures, demographic shifts, and uneven economic development, increasing the demand for reliable environmental indicators at fine spatial scales. However, limited data availability often constrains subregional analyses. This study develops a model-based framework for producing reliable small-area estimates for assessing the agricultural carbon footprint in the Po Valley (Northern Italy), a region characterized by intensive livestock farming and high environmental pressure. We integrate survey, census, and satellite-derived emission data into a unified framework and produce estimates at the level of Agrarian Subregions, defined as agriculturally homogeneous municipalities by the Italian National Institute of Statistics. Satellite-based ammonia emission data are incorporated as auxiliary covariates to improve precision and spatial coherence. A key methodological contribution is the treatment of spatial misalignment between gridded satellite data and administrative boundaries. This issue is addressed through a geostatistical upscaling procedure combined with a parametric bootstrap that propagates uncertainty from the covariate construction stage to the final small-area estimates. The results show that satellite-derived information substantially improves the accuracy and stability of carbon footprint estimates while reducing reliance on large, heterogeneous auxiliary datasets, illustrating the potential of Earth observation data in model-based environmental statistics.

An Algebraic Approach to Evolutionary Accumulation Models

2026-04-28T07:38:30Z

We present an algebraic approach to evolutionary accumulation modelling (EvAM). EvAM is concerned with learning and predicting the order in which evolutionary features accumulate over time. Our approach is complementary to the more common optimisation-based inference methods used in this field. Namely, we first use the natural underlying polynomial structure of the evolutionary process to define a semi-algebraic set of candidate parameters consistent with a given data set before maximising the likelihood function. We consider explicit examples and show that this approach is compatible with the solutions given by various statistical evolutionary accumulation models. Furthermore, we discuss the additional information of our algebraic model relative to these models.

Conflict Forecasting via Conformal Prediction for Markov Processes

2026-04-28T02:29:07Z

Whether or not a country is at war, or experiencing escalating or de-escalating levels of conflict, has massive ramifications for a country's national and foreign policy. Given a country's history of conflict, or lack thereof, future predictions about the war-status of a country are valuable information. In this paper, we present the use of conformal prediction on temporally-dependent data to obtain prediction sets of possible future conflict state-sequences. More specifically, we compare the results of conformal prediction to a likelihood-based prediction strategy when the data are assumed to come from a discrete-state Markov process. A point-prediction may not supply sufficient information because the penalty for a wrong prediction is extreme, and so we consider a machine learning alternative that gives valid uncertainty quantification and is robust to model misspecification. In the data analysis, we present real forecasts of conflict dynamics across multiple countries. Lastly, we comment on the possible limitations of existing approaches for applying conformal prediction to Markovian data, where the exchangeability assumption is violated.

A Bayesian Framework for Latent Compliance Modeling in Cluster Randomized Trials with One-Sided Noncompliance

2026-04-28T00:35:47Z

In pragmatic cluster randomized controlled trials (PCRCTs), healthcare providers are randomized while both providers and patients may deviate from the assigned intervention. In many PCRCTs, cluster-level implementation is measured using multiple continuous metrics, while individual compliance is recorded as a binary indicator. Standard complier average causal effect (CACE) estimands focus on individual-level compliance and do not account for heterogeneity in implementation across clusters. When intervention uptake is shaped by both provider- and patient-level processes, it is of scientific interest to characterize how effects vary across these sources of compliance. We propose a Bayesian framework for PCRCTs with one-sided binary noncompliance at the individual level and one-sided partial compliance at the cluster level. The method uses a latent mixture model to summarize heterogeneity in cluster-level implementation based on baseline characteristics and observed implementation measures, and links these latent implementation types to individual compliance and outcomes through a joint model. Because compliance is only observed in treated clusters, the model imputes unobserved compliance behavior for clusters and individuals assigned to control. The framework enables estimation of finite- and super-population intent-to-treat (ITT) and CACE estimands, both marginally and within latent implementation types. We apply the method to the METRIcAL trial, a pragmatic cluster randomized study evaluating a personalized music intervention for nursing home residents with dementia. The analysis illustrates how accounting for implementation heterogeneity and individual compliance can provide insights beyond standard ITT analyses. Keywords: Causal inference; Principal stratification; Complier average causal effect; Cluster randomized trials; Noncompliance; Bayesian methods; Latent variable models; Interference.

FightTracker: Real-time predictive analytics for Mixed Martial Arts bouts

2026-04-27T21:47:02Z

Mixed martial arts (MMA) has been one of the fastest-growing sports in recent years and has become a mainstream sport on the global stage. The growth of MMA has been driven by the Ultimate Fighting Championship (UFC), which is currently the largest MMA promotion organization in the world. However, data collection and statistical modeling in MMA are still in their infancy. We developed FightTracker, a data-driven solution that delivers real-time predictions for UFC fights. We first conducted regression analyses on the data provided by the UFC and MMA Decisions and built two predictive models of UFC fight outcomes. One model predicts the judges' majority score by round, while the other predicts whether the red fighter will win in 3-round fights that go beyond the second round (53% of all UFC fights). Both models use in-round fight statistics as explanatory variables and achieve 80% accuracy. We then designed an R Shiny app that delivers these two predictions in real time based on the ESPN live data. This information is valuable for fans, coaches, athletes, and especially bettors. Indeed, a live betting strategy based on FightTracker proved to generate large profits over an 8-week period against the bookmaker Unibet (90.17% ROI).

Coupled Supply and Demand Forecasting in Platform Accommodation Markets

2026-04-27T19:53:07Z

Tourism demand forecasting is methodologically mature, but it typically treats accommodation supply as fixed or exogenous. In platform-mediated short-term rentals, supply is elastic, decision-driven, and co-evolves with demand through pricing, information design, and interventions. I reframe the core issue as endogenous stock-out censoring: realized booked nights satisfy B_{k,t} <= min(D_{k,t}, S_{k,t}), so booking models that ignore supply learn a regime-specific ceiling and become fragile under policy changes and supply shocks. This narrative review synthesizes work from tourism forecasting, revenue management, two-sided market economics, and Bayesian time-series methods; develops a three-part coupling framework (behavioral, informational, intervention); and illustrates the identification failure with a toy simulation. I conclude with a focused research agenda for jointly forecasting supply, demand, and their compositions.
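The stock-out censoring argument can be made concrete with a minimal sketch. This is not the author's toy simulation. It assumes a single market, Gaussian demand with hypothetical parameters, and the simplification that bookings equal min(D, S) exactly rather than being bounded by it.

```python
import random
from statistics import mean


def simulate_bookings(demand_mean, supply, periods, seed=0):
    """Return (latent demand, observed bookings) for one market.

    Assumption: bookings equal min(demand, supply), the binding case of
    B <= min(D, S). The demand standard deviation of 10 is arbitrary.
    """
    rng = random.Random(seed)
    demand = [max(0.0, rng.gauss(demand_mean, 10.0)) for _ in range(periods)]
    booked = [min(d, supply) for d in demand]
    return demand, booked


# Training regime: supply (80) sits below typical demand (about 100),
# so the booking history is censored at the supply ceiling.
_, booked_train = simulate_bookings(100.0, supply=80.0, periods=500, seed=1)
naive_forecast = mean(booked_train)

# After supply expands to 120, the same demand process produces more bookings
# than a model trained on the censored history would predict.
_, booked_after = simulate_bookings(100.0, supply=120.0, periods=500, seed=2)

print(f"naive forecast from censored history: {naive_forecast:.1f}")
print(f"mean bookings after supply expands:   {mean(booked_after):.1f}")
```

The naive forecast cannot exceed the old supply level, which is the regime-specific ceiling the abstract refers to; a model that tracks supply alongside demand would not inherit that cap.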

A cautious use of auxiliary outcomes for decision-making in randomized clinical trials

2026-04-27T16:32:27Z

Clinical trials often collect data on multiple outcomes, such as overall survival (OS), progression-free survival (PFS), and response to treatment (RT). In most cases, however, study designs only use primary outcome data for interim and final decision-making. In several disease settings, clinically relevant outcomes, for example OS, become available years after patient enrollment. Moreover, the effects of experimental treatments on OS might be less pronounced than those on auxiliary outcomes such as RT. We develop a Bayesian decision-theoretic framework that uses both primary and auxiliary outcomes for interim and final decision-making. The framework allows investigators to control standard frequentist operating characteristics, such as the type I error rate, and can be used with auxiliary outcomes from emerging technologies, such as circulating tumor assays. False positive rates and other frequentist operating characteristics are rigorously controlled without any assumption about the concordance between primary and auxiliary outcomes. We discuss algorithms to implement this decision-theoretic approach and show that incorporating auxiliary information into interim and final decision-making can lead to relevant efficiency gains according to established and interpretable metrics.

Fisher Information and Dynamical Sampling I

2026-04-27T14:03:02Z

Information theory is a powerful framework to capture aspects of dynamical systems with multiple degrees of freedom. Mathematically, the dynamics can be represented as a continuous curve $\mathcal{C}$ on a suitable hyperplane in flat space and the Fisher information provides the norm of an infinitesimal displacement along this curve. In many applications, however, we do not have direct access to $\mathcal{C}$. Instead, we have to reconstruct the latter from a time-series of measurements (obtained as samples of size $n$), which are represented by an ordered set of points $\widehat{\mathcal{C}}$ on the same hyperplane. In this work, we calculate the bias of the Fisher information for large $n$, which provides a quantitative estimate of how accurately the dynamics of a system can be reconstructed from a given set of sampled data. Based on this result, we show that a clustering of the degrees of freedom reduces the bias and thus improves the accuracy with which the new system can be described with the same data. Inspired by a recent proposal for such a clustering, we provide a quantitative assessment of the loss of information, which allows one to estimate how much information about the dynamics of a system can reliably be extracted from a given set of data. We illustrate our findings in the case of a simple compartmental model. Although the latter is inspired by epidemiology, the results of this work are applicable to very general dynamical models with multiple degrees of freedom.

Large-Sample Bayesian Approximations for Privatized Data

2026-04-27T13:56:11Z

The increased use of differential privacy (DP) has allowed the sharing of large amounts of data while reducing the risk of disclosure of sensitive information at the individual level. However, the noise introduced by DP methods makes performing statistical inference more challenging. While various methods have been proposed to address different inferential tasks, they often require strong parametric assumptions and/or do not scale well with sample sizes (e.g. U.S. Census products). In response to these limitations, we propose an approximate Bayesian method to analyze privatized data products, which uses a two-step approach of imputing the confidential data and then sampling from the non-private posterior, and which is inspired by the method of Guha and Reiter (2025). We prove that this approximate sampler is asymptotically valid under mild assumptions. While this approach is motivated by Bayesian theory, we show through simulations that it provides conservative frequentist properties as well. We demonstrate the utility of our method by applying it in simulated settings as well as for an analysis on the drivers of homeownership via the 2022 American Community Survey.

Digital Divide: Evidence from the 2020 Canadian Internet Use Survey

2026-04-27T13:50:57Z

This paper studies inequality in digital participation across socioeconomic and demographic groups using the 2020 Canadian Internet Use Survey (CIUS). We combine survey-weighted logistic Lasso, an exact Shapley decomposition of age-education gaps, a sequential logit, and a bifactor item response theory (IRT) measure of digital literacy to identify who is excluded, why gaps persist, and where along the adoption path they arise. Education is the only determinant that remains significant at every rung of the digital ladder. Income inequality is most pronounced for virtual-wallet adoption; for online banking, employment and education together account for nearly half of the pro-rich concentration, indicating a broad socioeconomic gradient rather than a purely income-based divide. Persons with disabilities face the largest penalty at the digital-payments stage rather than at online banking, pointing to accessibility gaps in retail payment interfaces. Conditioning on digital literacy eliminates the education gradient at internet entry and reduces it by 61% at the online banking rung, but a substantial residual persists, pointing to behavioral and institutional frictions beyond measurable competence. The youngest cohort records the lowest information-seeking score despite high digital engagement, and security deficits are concentrated among landed immigrants and visible minorities.

Machine Learning for Network Attacks Classification and Statistical Evaluation of Adversarial Learning Methodologies for Synthetic Data Generation

2026-04-27T13:39:09Z

Supervised detection of network attacks has always been a critical part of network intrusion detection systems (NIDS). At a pivotal time for artificial intelligence (AI), with increasingly sophisticated attacks that use advanced techniques such as generative artificial intelligence (GenAI) and reinforcement learning, it has become vital for protecting personal data, which are scattered across the web. In this paper, we address two tasks on the first unified multi-modal NIDS dataset, which incorporates flow-level data, packet payload information and temporal contextual features from the reprocessed CIC-IDS-2017, CIC-IoT-2023, UNSW-NB15 and CIC-DDoS-2019 datasets in a shared feature space. In the first task, we use machine learning (ML) algorithms with stratified cross-validation to detect network attacks with stability and reliability. In the second task, we use adversarial learning algorithms to generate synthetic data, compare them with the real data, and evaluate their fidelity, utility and privacy using the SDV framework, f-divergences, distinguishability and non-parametric statistical tests. The findings provide stable ML models for intrusion detection and generative models with high fidelity and utility, obtained by combining the Synthetic Data Vault (SDV) framework and the TRTS and TSTR tests with non-parametric statistical tests and f-divergence measures.