http://arxiv.org/abs/2506.00666v2 Unbiased estimation in new Gini index extensions under gamma distributions, with application to real income data 2026-06-04T16:40:38Z

In this paper, we introduce two flexible extensions of the classical Gini index, referred to as the extended lower and upper Gini indices. The proposed measures are based on the differences between an observation and the minimum and maximum order statistics in samples of size $m\geqslant 2$ and reduce to the classical Gini coefficient when $m=2$. Unlike conventional Gini-type measures, they provide a position-oriented assessment of inequality relative to the lower and upper tails of the distribution. We establish the consistency and asymptotic normality of the proposed estimators under mild regularity conditions. For gamma-distributed populations, we derive exact expressions for their expectations and prove their unbiasedness, thereby extending previous results of [Deltas, G. 2003. The small-sample bias of the Gini coefficient: Results and implications for empirical research. Review of Economics and Statistics 85:226-234] and [Baydil, B., de la Peña, V. H., Zou, H., and Yao, H. 2025. Unbiased estimation of the Gini coefficient. Statistics & Probability Letters 222:110376]. The finite-sample performance of the estimators is investigated through Monte Carlo simulations, and an application to 2023 GDP per capita data from South American countries illustrates the practical usefulness of the proposed measures. The results show that the extended lower and upper Gini indices provide a richer and more informative characterization of inequality than traditional Gini-type measures.
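
The $m=2$ reduction mentioned above can be illustrated with a small sketch. It relies only on the elementary identity $E|X_1-X_2| = 2\,E[X_1-\min(X_1,X_2)]$, so the classical Gini coefficient can be written either through absolute pairwise differences or through the gap between an observation and the minimum of a pair. The function names and the pairwise normalization by $n(n-1)$ are assumptions of this sketch, not the estimators defined in the paper, and the general $m>2$ case is not attempted because the abstract does not give its normalization.

```python
from itertools import permutations

def gini_mean_difference(xs):
    """Classical sample Gini: sum over ordered pairs of |xi - xj|,
    divided by 2 * n * (n - 1) * mean (pairwise normalization, assumed here)."""
    n = len(xs)
    mean = sum(xs) / n
    total = sum(abs(a - b) for a, b in permutations(xs, 2))
    return total / (2 * n * (n - 1) * mean)

def gini_via_pair_minimum(xs):
    """Same quantity written with 'observation minus minimum of the pair',
    i.e. the m = 2 form of a lower-tail difference (hypothetical helper)."""
    n = len(xs)
    mean = sum(xs) / n
    total = sum(a - min(a, b) for a, b in permutations(xs, 2))
    return total / (n * (n - 1) * mean)

incomes = [1.0, 2.0, 3.0, 4.0]  # toy data, not from the paper
g_classic = gini_mean_difference(incomes)
g_lower = gini_via_pair_minimum(incomes)
print(g_classic, g_lower)  # the two forms agree
```

Both forms agree on any sample because $a-\min(a,b)=(a-b)^+$ and each unordered pair contributes $|a-b|$ exactly once to the second sum.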

2025-05-31T18:35:28Z 18 pages, 3 figures Roberto Vila Helton Saulo http://arxiv.org/abs/2512.05013v2 Detecting Perspective Shifts in Multi-agent Systems 2026-06-04T16:38:06Z

Generative models augmented with external tools and update mechanisms (or \textit{agents}) have demonstrated capabilities beyond intelligent prompting of base models. As agent use proliferates, dynamic multi-agent systems have naturally emerged. Recent work has investigated the theoretical and empirical properties of low-dimensional representations of agents based on query responses at a single time point. This paper introduces the Temporal Data Kernel Perspective Space (TDKPS), which jointly embeds agents across time, and proposes several novel hypothesis tests for detecting behavioral change at the agent- and group-level in black-box multi-agent systems. We characterize the empirical properties of our proposed tests, including their sensitivity to key hyperparameters, in simulations motivated by a multi-agent system of evolving digital personas. Finally, we demonstrate via natural experiment that our proposed tests detect changes that correlate sensitively, specifically, and significantly with a real exogenous event. As far as we are aware, TDKPS is the first principled framework for monitoring behavioral dynamics in black-box multi-agent systems -- a critical capability as generative agent deployment continues to scale.

2025-12-04T17:24:56Z Eric Bridgeford Hayden Helm http://arxiv.org/abs/2606.06368v1 Optimally taming biases in black-box models for efficient semiparametric estimation 2026-06-04T16:36:18Z

Modern semiparametric estimation often relies on flexible black-box machine learning methods to estimate nuisance functions, raising a fundamental question: how do nuisance estimation errors propagate into inference for low-dimensional target parameters? The dominant paradigm, exemplified by double machine learning (DML), yields error bounds in which nuisance estimation errors enter multiplicatively. While widely adopted, it remains unclear whether this multiplicative-rate dependence is optimal for black-box models. In this paper, we start by revisiting the partial linear model $Y = μ_0(X)+T\cdotβ_0+\varepsilon$ under a structure-agnostic setting, where the nuisance function $μ_0$ is estimated using a generic machine learning model, with approximation error $δ^a_μ$ and stochastic error $δ_μ^s$. We show that the standard DML rate is not optimal in the regime where the auxiliary function $\mathbb{E}[T|X=x]$ cannot be consistently estimated. We propose a new estimator for $β_0$ that achieves a sharper rate of $n^{-1/2}+δ^a_μ+(δ_μ^s)^2$ and establish a matching lower bound demonstrating its optimality. Our results reveal a new principle: the first-order stochastic error of nuisance estimation can be eliminated without imposing any additional assumptions. This also leads to a revised tuning strategy favoring under-smoothing, where $δ^a_μ\asymp(δ_μ^s)^2$, rather than the classical bias-variance trade-off $δ^a_μ\asymp δ_μ^s$. Under mild additional conditions, the estimator is asymptotically normal with minimal asymptotic variance. The proposed method extends to a broad class of semi-parametric linear functional estimation problems, including average treatment effect estimation. Our results imply that popular orthogonal score methods in semiparametric estimation with black-box nuisance learners can be substantially improved.
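
As a purely arithmetic illustration of the stated rate (the choice $δ_μ^s\asymp n^{-1/4}$ is an assumption made here for the example, not a condition from the paper), under the under-smoothed tuning $δ^a_μ\asymp(δ_μ^s)^2$ the bound collapses to the parametric rate:

```latex
% Rate from the abstract: n^{-1/2} + \delta^a_\mu + (\delta^s_\mu)^2
% Illustrative assumption: \delta^s_\mu \asymp n^{-1/4}, \delta^a_\mu \asymp (\delta^s_\mu)^2
\[
  n^{-1/2} + \delta^a_\mu + (\delta^s_\mu)^2
  \;\asymp\; n^{-1/2} + n^{-1/2} + \bigl(n^{-1/4}\bigr)^{2}
  \;\asymp\; n^{-1/2}.
\]
```

By contrast, any bound in which $δ_μ^s$ entered linearly would be of order $n^{-1/4}$ under the same assumption.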

2026-06-04T16:36:18Z 25 pages, 3 figures; comments welcome Yihong Gu Qishuo Yin Tianxi Cai Jianqing Fan http://arxiv.org/abs/2508.11861v2 A novel approach to generate distributions with applications to regression modeling 2026-06-04T16:32:12Z

We propose a novel approach for adding a parameter to a family of distributions to improve its adaptability. This approach yields a versatile class of distributions supported on the positive real line. An important advantage of the proposed family is that the additional parameter admits a clear interpretation in terms of tail behavior, providing a simple mechanism for modulating tail heaviness. We analyze its mathematical characteristics, such as critical points, modality, stochastic representation, identifiability, quantiles, moments, and truncated moments. We present two new regression models for positive continuous data based on submodels of the newly proposed family of distributions, in which the distribution of the response variable is reparameterized in terms of the median. The parameters are estimated by maximum likelihood, implemented through the gamlss package in R. The proposed regression models are applied to a real dataset, and their advantages over common alternative regression models are demonstrated through quantile residual analysis and information criteria.

2025-08-16T01:15:54Z 35 pages, 7 figures Subhankar Dutta Roberto Vila Terezinha K. A. Ribeiro http://arxiv.org/abs/2606.05120v2 Stochastic Sensitivity Analysis for Matched Observational Studies 2026-06-04T16:32:01Z

Sensitivity analysis asks how strong unmeasured confounding needs to be to explain away an observational study's conclusion. The conventional approach in matched studies conducts inference conditional upon the potential outcomes as well as both observed and unobserved confounders, and then finds the worst-case distribution for the conditional treatment assignments across all possible realizations of the unobserved confounder. The resulting worst-case allocation imagines strong, near perfect, correlations between the potential outcomes and hidden bias. We propose a stochastic sensitivity analysis that instead targets inference conditional upon potential outcomes and observed confounders while treating the hidden confounders as random with unknown conditional laws. Rather than finding the worst-case realizations for the hidden confounders, we instead determine the worst-case conditional law over a broad class of distributions. This preserves the adversarial spirit of sensitivity analysis while allowing for imperfect alignment between hidden bias and potential outcomes to a degree controlled by a scalar sensitivity parameter. We consider restrictions to both an interpretable class with no parametric assumptions and a Bernoulli class of conditional laws. Design sensitivity calculations and real-data demonstrations illustrate that allowing for even a small degree of stochasticity can materially increase reported robustness to hidden bias relative to the conventional approach.

2026-06-03T17:25:15Z Mengqi Lin Colin B. Fogarty Gongjun Xu http://arxiv.org/abs/2510.12663v7 The $α$--regression for compositional data: a unified framework for standard, temporal and spatial regression models including compositional predictors 2026-06-04T16:20:27Z

We revisit the $α$--regression framework for compositional data. We formulate $α$--regression as a non--linear least squares problem, study its asymptotic properties, and provide efficient estimation via the Levenberg--Marquardt algorithm. We then propose a permutation--based hypothesis testing procedure, derive marginal effects for interpretation, and provide a visual inspection of the effect of each predictor. We further discuss robustified versions, the inclusion of natural splines, and the incorporation of compositional predictors, which further facilitate the formulation of a simple time series model. The framework is extended to spatial settings through four models. (a) The $α$--spatially--lagged X regression model, which incorporates spatial spillover effects via spatially--lagged covariates, with decomposition into direct and indirect effects. (b) The $α$--spatial autoregressive model that allows for spatial autocorrelation. (c) The geographically--weighted $α$--regression, which allows coefficients to vary spatially for capturing local relationships. (d) The $α$--eigenvector spatial filtering that is computationally efficient and captures spatial dependence via the eigenvectors of the kernelized distance matrix. Applications to four real datasets illustrate that the models perform on par with or outperform existing models in the literature. The examples showcase that $α$--regression can outperform various competing regression models under different scenarios and its spatial extensions capture the dependence and improve the predictive performance. Overall, the examples provide evidence that the log--ratio methodology does not always lead to the optimal results.

2025-10-14T15:59:02Z Michail Tsagris Nader Alharbi Abdulaziz Alenazi Yannis Pantazis http://arxiv.org/abs/2606.06346v1 Unified formulas for conditional quantities and transportation functionals 2026-06-04T16:18:28Z

This paper develops a unified probabilistic framework based on distributional derivatives and Dirac delta representations for the analysis of conditional and transportation-related quantities. General identities are established for arbitrary random variables, encompassing absolutely continuous, discrete, and mixed distributions. The proposed approach yields unified formulas for conditional expectations, conditional distributions, hazard functions, and improper distributions, revealing a common localization mechanism underlying these classical concepts. The framework is further combined with copula methods to investigate transportation and dispersion functionals through dependence structures. Exploiting the extremal properties of the Fréchet--Hoeffding bounds together with expectation inequalities induced by $Δ$-antitonic functions, sharp bounds are derived for absolute difference moments under fixed marginals. These results lead to concise derivations of quantile representations for the Wasserstein distance and a corresponding upper transportation functional, as well as survival-function representations and bounds for generalized absolute difference moments. As a particular case, new representations are obtained for the bivariate Gini mean difference and the associated bivariate Gini index. Applications are given to Wasserstein-type functionals arising in the normal approximation of standardized counting distributions, including Poisson, Binomial, and Negative Binomial models, for which explicit quantile representations are derived. Overall, the results establish explicit links among conditional structures, dependence modeling, dispersion measures, normal approximation, and optimal transport, providing a unified perspective on several fundamental constructions in probability and mathematical statistics.

2026-06-04T16:18:28Z 23 pages, 1 figure Roberto Vila Eduardo Nakano Chang C. Y. Dorea http://arxiv.org/abs/2606.06332v1 Bentkus-type asymptotic e-values 2026-06-04T16:08:08Z

Asymptotic e-values are emerging as a powerful alternative to asymptotic p-values, particularly in post-hoc inference and multiple testing, where significance levels may be data-dependent. Existing asymptotic e-values, however, suffer from the ``missing factor,'' a scaling inefficiency resulting in overly conservative inference. Drawing on the framework of near-optimal concentration inequalities developed by Bentkus in the 2000s, we introduce Bentkus-type asymptotic e-values and prove that they successfully eliminate the missing factor. We also demonstrate both theoretically and empirically that Bentkus-type e-values consistently deliver sharper inference than existing alternatives, leading to tighter post-hoc confidence intervals and higher rejection rates in multiple testing procedures.

2026-06-04T16:08:08Z Diego Martinez-Taboada Ben Chugg Aaditya Ramdas http://arxiv.org/abs/2602.07165v3 PoissonRatioUQ: An R package for band ratio uncertainty quantification 2026-06-04T16:02:43Z

We introduce an R package for Bayesian modeling and uncertainty quantification for problems involving count ratios. The modeling relies on the assumption that the quantity of interest is the ratio of Poisson means rather than the ratio of counts. We provide several options for retrieving this quantity, for problems with and without spatial information. The package also includes some capability for uncertainty quantification in problems of the form $Z=(mT+z_0)^{p}$, where $Z$ is the intensity ratio and $T$ the quantity of interest.

2026-02-06T20:06:52Z Description of the R package in https://github.com/mfleduc/PoissonRatioUQ. New release available on Zenodo at https://doi.org/10.5281/zenodo.20492078 Matthew LeDuc Tomoko Matsuo http://arxiv.org/abs/2411.02123v2 Uncertainty quantification and multi-stage variable selection for personalized treatment regimes 2026-06-04T15:44:06Z

A dynamic treatment regime is a sequence of medical decisions that adapts to the evolving clinical status of a patient over time. To facilitate personalized care, it is crucial to assess the probability of each available treatment option being optimal for a specific patient, while also identifying the key prognostic factors that determine the optimal sequence of treatments. This task has become increasingly challenging due to the growing number of individual prognostic factors typically available. In response to these challenges, we propose a Bayesian model for optimizing dynamic treatment regimes that addresses the uncertainty in identifying optimal decision sequences and incorporates dimensionality reduction to manage high-dimensional individual covariates. The first task is achieved through a suitable augmentation of the model to handle counterfactual variables. For the second, we introduce a novel class of spike-and-slab priors for the multi-stage selection of significant factors, to favor the sharing of information across stages. The effectiveness of the proposed approach is demonstrated through extensive simulation studies and illustrated using clinical trial data on severe acute arterial hypertension.

2024-11-04T14:35:44Z Jiefeng Bi Matteo Borrotti Bernardo Nipoti http://arxiv.org/abs/2605.09726v2 On the Impossibility of Specification Testing of Interference Models Based on Exposure Mappings 2026-06-04T14:50:56Z

Researchers use interference models based on exposure mappings to facilitate estimation of causal effects in randomized experiments with interference. To test the veracity of such models, researchers can use specification tests that aim to detect departures from the stipulated model. However, existing tests suffer from poor power and are often unable to detect important model violations. The main result of this paper is that the specification testing problem for exposure mapping models is inherently difficult, and the poor power of existing tests is inescapable. In particular, the worst-case Type I and Type II error rates must sum to one for any specification test of such models, ruling out the existence of a uniformly consistent test. This is the worst-case overall error rate achieved by a naive test that discards all data and arbitrarily rejects the null at random; the testing problem is in this sense impossible. This negative result holds true for all exposure mappings, all sample sizes, for uniformly bounded outcomes, and for alternatives that are maximally separated from the null. While some tests can detect some types of departures from the null model, there will always be relevant departures from the null that are undetectable. Informative specification tests must therefore restrict the alternative model against which they seek to attain power, beyond the restrictions imposed by the exposure mappings alone. We illustrate this by providing a uniformly consistent test for differentiating no-interference from a network-linear-in-means model.
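
The benchmark of the naive test can be made explicit. Writing $\phi$ for a test that ignores the data and rejects with a fixed probability $\alpha\in[0,1]$ (the notation $\phi$, $H_0$, $H_1$ is introduced here only for illustration), its two error rates do not depend on the data-generating distribution:

```latex
% Naive randomized test: \phi \equiv \alpha, independent of the data.
\[
  \underbrace{\sup_{P \in H_0} \mathbb{E}_P[\phi]}_{\text{Type I}}
  \;+\;
  \underbrace{\sup_{Q \in H_1} \mathbb{E}_Q[1-\phi]}_{\text{Type II}}
  \;=\; \alpha + (1-\alpha) \;=\; 1 .
\]
```

The abstract's result states that no specification test based on the exposure mapping alone can improve on this worst-case sum.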

2026-05-10T19:50:11Z Chao Gao Christopher Harshaw Fredrik Sävje Yitan Wang http://arxiv.org/abs/2606.06233v1 Anchor PCA 2026-06-04T14:39:09Z

Principal component analysis (PCA) is one of the most widely used unsupervised dimension reduction techniques. We study PCA for data from multiple related domains. Since principal components generally differ across domains, one way to obtain a shared low-rank embedding is to perform PCA on the pooled data. However, this approach can focus on spurious directions that exhibit high variation in only a few domains. To find a robust embedding that still explains most variance in unseen but similar domains, we propose instead to focus on shared directions of variation. To this end, we introduce Anchor PCA which trades off overall explained variance with agreement between the shared and domain-specific low-rank embeddings. Anchor PCA amounts to PCA on a modified target matrix and thus can be solved efficiently. Moreover, we show that Anchor PCA recovers a maximal invariant subspace and admits a minimax reconstruction interpretation under bounded domain-specific covariance inflations. On simulated and real-world gas sensor data with temporal drift, we demonstrate, respectively, that Anchor PCA recovers the maximally invariant subspace and yields embeddings that explain more variance on unseen domains than the pooling baseline and a worst-case alternative. Taken together, these findings establish Anchor PCA as a promising approach to robust unsupervised dimension reduction from multi-domain data.

2026-06-04T14:39:09Z Benedikt Seiter Anya Fries Julius von Kügelgen Jonas Peters http://arxiv.org/abs/2603.17925v2 Multi-Armed Sequential Hypothesis Testing by Betting 2026-06-04T14:37:42Z

We consider a variant of sequential testing by betting where, at each time step, the statistician is presented with multiple data sources (arms) and obtains data by choosing one of the arms. We consider the composite global null hypothesis $\mathscr{P}$ that all arms are null in a certain sense (e.g. all dosages of a treatment are ineffective) and we are interested in rejecting $\mathscr{P}$ in favor of a composite alternative $\mathscr{Q}$ where at least one arm is non-null (e.g. there exists an effective treatment dosage). We posit an optimality desideratum that we describe informally as follows: even if several arms are non-null, we seek $e$-processes and sequential tests whose performance is as strong as that of procedures with oracle knowledge about which arm generates the most evidence against $\mathscr{P}$. Formally, we generalize notions of log-optimality and expected rejection time optimality to more than one arm, obtaining matching lower and upper bounds for both. A key technical device in this optimality analysis is a modified upper-confidence-bound-like algorithm for unobservable but sufficiently "estimable" rewards. In the design of this algorithm, we derive nonasymptotic concentration inequalities for optimal wealth growth rates in the sense of Kelly [1956]. These may be of independent interest.
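
For readers new to testing by betting, the following is a minimal single-arm sketch, not the multi-arm procedure of the paper. It assumes observations in $[0,1]$, a null under which the conditional mean is at most `m`, and a fixed bet `lam` in $[0, 1/m]$ so that the wealth stays nonnegative; the test rejects once wealth reaches $1/\alpha$ (Ville's inequality). The function name, the fixed bet, and the toy data are all assumptions of this sketch.

```python
def betting_stopping_time(xs, m, lam, alpha):
    """Run a fixed-bet wealth process W_t = prod(1 + lam * (x_i - m)).

    Returns (t, wealth) at the first time wealth >= 1/alpha,
    or (None, final wealth) if the threshold is never reached.
    Assumes each x in [0, 1] and 0 <= lam <= 1/m so every factor is >= 0.
    """
    wealth = 1.0
    for t, x in enumerate(xs, start=1):
        wealth *= 1.0 + lam * (x - m)  # payoff of betting on "mean exceeds m"
        if wealth >= 1.0 / alpha:
            return t, wealth
    return None, wealth

# Toy stream in which every observation equals 1.0, so wealth grows by 1.5x per step.
stop, final_wealth = betting_stopping_time([1.0] * 10, m=0.5, lam=1.0, alpha=0.05)
print(stop, final_wealth)
```

In the multi-arm setting of the abstract, the additional difficulty is choosing which arm to sample at each step, which this sketch does not address.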

2026-03-18T17:01:34Z Ricardo J. Sandoval Ian Waudby-Smith Michael I. Jordan http://arxiv.org/abs/2603.14169v2 Beyond Means: Topological Causal Effects under Persistent-Homology Ignorability 2026-06-04T13:55:30Z

Average treatment effects (ATE) and conditional average treatment effects (CATE) are foundational causal estimands, but they target changes in expected outcomes and can miss treatment-induced changes in the shape of outcome distributions. A canonical failure mode occurs when control outcomes are unimodal, treated outcomes become bimodal, and both distributions have the same mean. In such cases mean-based causal estimands are zero even though the geometry and topology of the outcome law change substantially. This paper develops a topological causal framework based on persistent homology. We formalize a persistent-homology ignorability condition, define topological analogues of CATE and ATE, and prove that these estimands are identifiable up to an explicit error bound under approximate topological ignorability. We also clarify a subtle but important point: a marginal persistence-diagram effect is not identified from conditional topological ignorability alone because persistent homology does not in general commute with mixtures over covariates. To preserve the original intuition while ensuring scientific correctness, we retain the marginal effect as a motivating quantity, but place the mathematically sound conditional estimands at the center of the theory. A synthetic experiment with mean-preserving topology change shows that mean-based causal estimands remain near zero while the proposed topological effect increases sharply and remains recoverable after adjustment for confounding.
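
The failure mode described above can be reproduced with a small simulation. This sketch does not use persistent homology; as a crude stand-in it compares the share of outcomes near the common mean, which is enough to show a shape change that the mean difference misses. The distributions, sample size, seed, and the helper `central_mass` are assumptions chosen for illustration.

```python
import random
from statistics import mean

def central_mass(ys, center, radius=0.5):
    """Fraction of outcomes within `radius` of `center` (hypothetical shape summary)."""
    return sum(1 for y in ys if abs(y - center) < radius) / len(ys)

rng = random.Random(0)  # fixed seed so the run is reproducible
n = 20000

# Control outcomes: unimodal, centered at 0.
control = [rng.gauss(0.0, 1.0) for _ in range(n)]
# Treated outcomes: symmetric two-component mixture at -2 and +2, so the mean stays near 0.
treated = [rng.gauss(2.0 if rng.random() < 0.5 else -2.0, 0.5) for _ in range(n)]

mean_effect = mean(treated) - mean(control)   # close to zero by construction
mass_control = central_mass(control, 0.0)     # substantial mass near the center
mass_treated = central_mass(treated, 0.0)     # almost no mass near the center
print(mean_effect, mass_control, mass_treated)
```

The mean-based contrast is near zero while the central mass differs sharply, which is the kind of distributional change the topological estimands are designed to capture.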

2026-03-15T01:03:32Z Amir Saki Usef Faghihi http://arxiv.org/abs/2510.06789v2 Model-free Rank Aggregation in the Presence of Rater Heterogeneity: A Maximum Score Approach 2026-06-04T11:58:03Z

This paper investigates the rank aggregation problem through the lens of multi-way comparison data derived from rater scores. Departing from traditional parametric frameworks, such as the Bradley-Terry and Plackett-Luce models, we propose a model-free method that accommodates highly heterogeneous preference distributions across raters and encompasses weak stochastic transitivity in pairwise comparisons as a special case. We establish the theoretical foundations of the proposed estimator by proving its consistency, demonstrating that the proportion of discordant pairs (Kendall tau) converges to zero in probability as the number of raters diverges. Furthermore, we derive upper and lower bounds for a performance metric based on Kendall's tau. In certain asymptotic regimes, these bounds coincide up to logarithmic factors, so the estimator is nearly minimax optimal. These results are obtained by analyzing the convergence behavior of a U-empirical process; the novel technical results developed for this analysis may be of independent theoretical interest. The practical utility of our method is validated through extensive simulations and applications to sports player rankings and survey preference aggregation.
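
The consistency criterion above, the proportion of discordant pairs, can be computed as follows. This is a generic sketch of that proportion for two rankings without ties; the function name and the toy rankings are assumptions, and it is not the paper's maximum score estimator.

```python
from itertools import combinations

def discordant_proportion(rank_a, rank_b):
    """Share of item pairs ordered differently by two rankings.

    rank_a and rank_b map each item to its rank (1 = best); no ties assumed.
    """
    pairs = list(combinations(list(rank_a), 2))
    discordant = sum(
        1
        for i, j in pairs
        if (rank_a[i] - rank_a[j]) * (rank_b[i] - rank_b[j]) < 0
    )
    return discordant / len(pairs)

truth = {"A": 1, "B": 2, "C": 3, "D": 4}      # hypothetical true ranking
estimate = {"A": 1, "B": 3, "C": 2, "D": 4}   # swaps B and C only
prop = discordant_proportion(truth, estimate)
print(prop)  # one discordant pair out of six
```

A value of 0 means the two rankings agree on every pair, and a value of 1 means one ranking is the exact reverse of the other.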

2025-10-08T09:18:17Z Haoran Zhang Yunxiao Chen