https://arxiv.org/api/zg5WW+K6JNghyJuzzPQYMlcw2JA 2026-04-06T10:00:03Z 34888 285 15 http://arxiv.org/abs/2603.24201v1 A Bayesian Dynamic Latent Space Model for Weighted Networks 2026-03-25T11:23:00Z A new dynamic latent space eigenmodel (LSM) is proposed for weighted temporal networks. The model accommodates integer-valued weights, excess of zeros, time-varying node positions (features), and time-varying network sparsity. The latent positions evolve according to a vector autoregressive process that accounts for lagged and contemporaneous dependence across nodes and features, a characteristic neglected in the LSM literature. A Bayesian approach is used to address two of the primary sources of inference intractability in dynamic LSMs: latent feature estimation and the choice of latent space dimension. We employ an efficient auxiliary-mixture sampler that performs data augmentation and supports conditionally conjugate prior distributions. A point-process representation of the network weights and the finite-dimensional distribution of the latent processes are used to derive a multi-move sampler in which each feature trajectory is drawn in a single block, without recursions. This sampling strategy is new to the network literature and can significantly reduce computational time while improving chain mixing. To avoid trans-dimensional samplers, a Laplace approximation of the partial marginal likelihood is used to design a partially collapsed Gibbs sampler. Overall, our procedure is general, as it can be easily adapted to static and dynamic settings, as well as to other discrete or continuous weight distributions. 2026-03-25T11:23:00Z Roberto Casarin Matteo Iacopini Antonio Peruzzi http://arxiv.org/abs/2501.06844v3 REML implementations of kernel-based genomic prediction models for genotype x environment x management interactions 2026-03-25T10:39:29Z High-throughput pheno-, geno-, and envirotyping allows characterization of plant genotypes and the trials they are evaluated in, producing different types of data. These different data modalities can be integrated into statistical or machine learning models for genomic prediction in several ways. One commonly used approach within the analysis of multi-environment trial data in plant breeding is to create linear or nonlinear kernels which are subsequently used in linear mixed models (LMMs) to model genotype by environment (G$\times$E) interactions. Current implementations of these kernel-based LMMs present a number of opportunities in terms of methodological extensions. Here we show how these models can be implemented in standard software, allowing direct restricted maximum likelihood (REML) estimation of all parameters. We also further extend the models by combining the kernels with unstructured covariance matrices for three-way interactions in genotype by environment by management (G$\times$E$\times$M) datasets, while simultaneously allowing for environment-specific genetic variances. We show how the models incorporating nonlinear kernels and heterogeneous variances maximize the amount of genetic variance captured by environmental covariables and perform best in prediction settings. We discuss the opportunities regarding models with multiple kernels or kernels obtained after environmental feature selection, as well as the similarities to models regressing phenotypes on latent and observed environmental covariables. Finally, we discuss the flexibility provided by our implementation in terms of modeling complex plant breeding datasets, allowing for straightforward integration of phenomics, enviromics, and genomics. 2025-01-12T15:30:11Z Killian A. C. Melsen Salvador Gezan Daniel J. Tolhurst Fred A. van Eeuwijk Carel F. W. Peeters http://arxiv.org/abs/2603.24122v1 Scoring Rules with Normalized Upper Order Statistics for Tail Inference 2026-03-25T09:33:47Z This paper proposes a scoring-rule-based method for ranking predictive distributions in the Fréchet domain that is able to distinguish between different tail indices. The approach is built on normalized order statistics and exploits proper scoring rules to compare tail limit distributions in a distributional framework, with direct relevance for insurance claim-severity tails. On the theoretical side, consistency and asymptotic normality for empirical tail scores based on normalized upper order statistics are obtained through residual estimation theory. Simulation results demonstrate that the scoring-rule-based approach is capable of discriminating between different tail behaviors in finite samples and that trends in the scaling have only a minor impact on stability. We further show that optimizing scoring rules (equivalently, minimizing the associated loss form) yields consistent tail-index estimators and that the classical Hill estimator arises as a special case. The performance of the proposed method is investigated and compared with the Hill estimator across a range of tail indices. Lastly, we analyze an automobile claim-severity data set to demonstrate how scoring rules can be used to rank predictive models based on tail predictions in actuarial settings. 2026-03-25T09:33:47Z 8 figures, 1 table Martin Bladt Christoffer Øhlenschlæger http://arxiv.org/abs/2603.24108v1 Aitchison Geometry on the Simplex for Uncertainty Quantification in Bayesian Hyperspectral Image Unmixing 2026-03-25T09:14:04Z Most algorithms for hyperspectral image unmixing produce point estimates of fractional abundances of the materials to be separated. However, in the absence of reliable ground truth, the ability to perform abundance uncertainty quantification (UQ) should be an important feature of algorithms, e.g. to evaluate how hard the unmixing problem is and how much the results should be trusted. The usual modeling assumptions in Bayesian models for unmixing rely heavily on the Euclidean geometry of the simplex and typically disregard spatial information. In addition, to our knowledge, abundance UQ is close to nonexistent. In this paper, we propose to leverage Aitchinson geometry from the compositional data analysis literature to provide practitioners with alternative tools for modeling prior abundance distributions. In particular we show how to design simplex-valued Gaussian Process priors using this geometry. Then we link Aitchinson geometry to constrained sampling algorithms in the literature, and propose UQ diagnostics that comply with the constraints on abundance vectors. We illustrate these concepts on real and simulated data. 2026-03-25T09:14:04Z Hector Blondel Lucas Drumetz Thierry Chonavel http://arxiv.org/abs/2405.17669v4 Bayesian Nonparametrics for Principal Stratification with Continuous Post-Treatment Variables 2026-03-25T09:07:39Z Principal stratification provides a causal inference framework for investigating treatment effects in the presence of a post-treatment variable. Principal strata play a key role in characterizing the treatment effect by identifying groups of units with the same or similar values for the potential post-treatment variable at all treatment levels. The literature has focused mainly on binary post-treatment variables. Few papers considered continuous post-treatment variables. In the presence of a continuous post-treatment, a challenge is how to identify and characterize meaningful coarsening of the latent principal strata that lead to interpretable principal causal effects. This paper introduces the Confounders-Aware SHared atoms BAyesian mixture (CASBAH), a novel approach for principal stratification with binary treatment and continuous post-treatment variables. CASBAH leverages Bayesian nonparametric priors with an innovative hierarchical structure for the potential post-treatment outcomes that overcomes some of the limitations of previous works. Specifically, the novel features of our method allow for (i) identifying coarsened principal strata through a data-adaptive approach and (ii) providing a comprehensive quantification of the uncertainty surrounding stratum membership. Through Monte Carlo simulations, we show that the proposed methodology performs better than existing methods in characterizing the principal strata and estimating principal effects of the treatment. Finally, CASBAH is applied to a case study in which we estimate the causal effects of US national air quality regulations on pollution levels and health outcomes. 2024-05-27T21:47:41Z Dafne Zorzetto Antonio Canale Fabrizia Mealli Francesca Dominici Falco J. Bargagli-Stoffi http://arxiv.org/abs/2603.24632v1 Estimation in moderately misspecified models 2026-03-25T08:25:20Z Suppose data are fitted to some parametric model but that the true model happens to be one with an additional parameter. When a parameter is to be estimated one can use likelihood estimation in the wider model or in the narrow model. Including the extra parameter in the model means less bias but larger sampling variability. Two basic questions are addressed in this article. (i) Just how much misspecification can the narrow model tolerate? In the context of a large-sample moderate misspecification framework we find a surprisingly simple, sharp, and general answer. There is effectively a `tolerance radius' around a given narrow model, inside of which narrow estimation is more precise than wide estimation for all estimands. This is computed in a selection of examples that also demonstrate the degree of robustness of important standard methods against moderate incorrectness of the model under which they are optimal. (ii) Are there other estimators that work well both under narrow and wide circumstances? We discuss several possibilities and propose some new procedures. All methods are compared in a broad large-sample performance study. 2026-03-25T08:25:20Z 31 pages, 1 figure. Statistical Research Report, Department of Mathematics, University of Oslo, from May 1993, but arXiv'd March 2026 Nils Lid Hjort http://arxiv.org/abs/2603.24041v1 Minimal Sufficient Representations for Self-interpretable Deep Neural Networks 2026-03-25T07:51:21Z Deep neural networks (DNNs) achieve remarkable predictive performance but remain difficult to interpret, largely due to overparameterization that obscures the minimal structure required for interpretation. Here we introduce DeepIn, a self-interpretable neural network framework that adaptively identifies and learns the minimal representation necessary for preserving the full expressive capacity of standard DNNs. We show that DeepIn can correctly identify the minimal representation dimension, select relevant variables, and recover the minimal sufficient network architecture for prediction. The resulting estimator achieves optimal non-asymptotic error rates that adapt to the learned minimal dimension, demonstrating that recovering minimal sufficient structure fundamentally improves generalization error. Building on these guarantees, we further develop hypothesis testing procedures for both selected variables and learned representations, bridging deep representation learning with formal statistical inference. Across biomedical and vision benchmarks, DeepIn improves both predictive accuracy and interpretability, reducing error by up to 30% on real-world datasets while automatically uncovering human-interpretable discriminative patterns. Our results suggest that interpretability and statistical rigor can be embedded directly into deep architectures without sacrificing performance. 2026-03-25T07:51:21Z Zhiyao Tan Liu Li Huazhen Lin http://arxiv.org/abs/2603.24025v1 i-IF-Learn: Iterative Feature Selection and Unsupervised Learning for High-Dimensional Complex Data 2026-03-25T07:35:38Z Unsupervised learning of high-dimensional data is challenging due to irrelevant or noisy features obscuring underlying structures. It's common that only a few features, called the influential features, meaningfully define the clusters. Recovering these influential features is helpful in data interpretation and clustering. We propose i-IF-Learn, an iterative unsupervised framework that jointly performs feature selection and clustering. Our core innovation is an adaptive feature selection statistic that effectively combines pseudo-label supervision with unsupervised signals, dynamically adjusting based on intermediate label reliability to mitigate error propagation common in iterative frameworks. Leveraging low-dimensional embeddings (PCA or Laplacian eigenmaps) followed by $k$-means, i-IF-Learn simultaneously outputs influential feature subset and clustering labels. Numerical experiments on gene microarray and single-cell RNA-seq datasets show that i-IF-Learn significantly surpasses classical and deep clustering baselines. Furthermore, using our selected influential features as preprocessing substantially enhances downstream deep models such as DeepCluster, UMAP, and VAE, highlighting the importance and effectiveness of targeted feature selection. 2026-03-25T07:35:38Z 28 pages, 5 figures, including appendix. Accepted at AISTATS Chen Ma Wanjie Wang Shuhao Fan http://arxiv.org/abs/2603.24015v1 STAMP: A shot-type-aware areal multilevel Poisson model for league-wide comparison of basketball shot charts 2026-03-25T07:20:01Z Shooting location is a core indicator of offensive style in invasion sports. Existing basketball shot-chart analyses often use spatial information for descriptive visualization, location-based efficiency modeling, or clustering players into shooting archetypes, yet few studies provide a unified framework for fair comparison of shot-type-specific tendencies. We propose the shot-type-aware areal multilevel Poisson (STAMP) model, which jointly models team-level field-goal attempts across predefined court regions, seasons, and shot types using a Poisson likelihood with a possession-based exposure offset. The hierarchical random-effects structure combines team, area, team-area, and team-side random effects with shot-type-specific random slopes for key shot categories. We fit the model using approximate Bayesian inference via the Integrated Nested Laplace Approximation (INLA), enabling efficient analysis of more than $3\times 10^{5}$ shots from two seasons of B.LEAGUE (the men's professional basketball league in Japan). The STAMP model achieves better out-of-sample predictive performance than simpler baselines, yielding interpretable relative-rate maps and left-right bias summaries. Case studies illustrate how the model reveals team-specific spatial tendencies for comparative analysis, and we discuss its limitations and potential extensions. 2026-03-25T07:20:01Z 25 pages Kazuhiro Yamada Keisuke Fujii http://arxiv.org/abs/2603.23963v1 An Exponential-Polynomial Divergence-based Robust Information Criterion for Linear Panel Data Models and Neural Networks 2026-03-25T05:49:59Z Model selection is a cornerstone of statistical inference, where information criteria are widely employed to balance model fit and complexity. However, classical likelihood-based criteria are often highly sensitive to contamination, outliers, and model misspecification. In this paper, we develop a robust alternative based on the Exponential-Polynomial Divergence, a flexible extension of existing divergence measures that enhances adaptability to diverse data irregularities. The proposed Exponential-Polynomial Divergence Information Criterion preserves the objective of approximating the discrepancy between the true model and candidate models while incorporating robustness against anomalous observations. Its theoretical properties are established, and robustness is examined through influence function analysis, demonstrating controlled sensitivity to extreme data points. For practical implementation, a data-driven tuning parameter selection strategy based on generalized score matching is employed, ensuring improved computational stability and efficiency. The effectiveness of the proposed method is demonstrated through extensive simulation studies under varying contamination levels, as well as real data applications involving linear mixed-effects panel data models and neural network-based prediction tasks. The results consistently show improved stability and reliability compared to classical likelihood and density power divergence-based information criteria. The proposed framework thus provides a practical and unified approach for model selection in complex and contaminated data settings. 2026-03-25T05:49:59Z 31 pages, 2 figures Udita Goswami Shuvashree Mondal http://arxiv.org/abs/2603.23959v1 Microergodicity implies orthogonality of Matérn fields on bounded domains in $\mathbb{R}^4$ 2026-03-25T05:41:09Z Matérn random fields are one of the most widely used classes of models in spatial statistics. The fixed-domain identifiability of covariance parameters for stationary Matérn Gaussian random fields exhibits a dimension-dependent phase transition. For known smoothness $ν$, Zhang \cite{Zhang2004} showed that when $d\le3$, two Matérn models with the same microergodic parameter $m=σ^2α^{2ν}$ induce equivalent Gaussian measures on bounded domains, while Anderes \cite{Anderes2010} proved that when $d>4$, the corresponding measures are mutually singular whenever the parameters differ. The critical case $d=4$ for stationary Matérn models has remained open. We resolve this case. Let $d=4$ and consider two stationary Matérn models on $\mathbb R^4$ with parameters $(σ_1,α_1)$ and $(σ_2,α_2)$ satisfying \[ σ_1^2α_1^{2ν}=σ_2^2α_2^{2ν}, \qquad α_1\neq α_2. \] We prove that the corresponding Gaussian measures on any bounded observation domain are mutually singular on every countable dense observation set, and on the associated path space of continuous functions. Our approach can be viewed as a spectral analogue of the higher-order increment method of Anderes \cite{Anderes2010}. Whereas Anderes isolates the second irregular covariance coefficient through renormalized quadratic variations in physical space, we detect the first nonvanishing high-frequency spectral mismatch via localized Fourier coefficients and use a normalized Whittle score to identify parameters. More broadly, the localized spectral probing framework used here for detecting subtle covariance differences in Gaussian random fields may be useful for studying identifiability and estimation in other spatial models. 2026-03-25T05:41:09Z Natesh S. Pillai http://arxiv.org/abs/2302.12728v5 Principles of Conditionality and Layering of Error Rates with Application to Platform Trials 2026-03-25T04:55:45Z There has been a misconception that only one type of error rate control is necessary in clinical trials, leading to debates over whether to prioritize Familywise Error Rate (FWER) or False Discovery Rate (FDR). This misconception has led to misleading statements about FWER control and proposals to shift towards FDR control, which could be manipulated by the industry. In reality, since the early 2000s, biopharmaceutical statistics have implicitly applied two layers of Type I error rate control. This aligns with Tukey's 1953 invention of Error Rate per Family (ERpF) for controlling error across studies, while FWER applies within each study. Our paper clarifies this layering, using Platform trials to demonstrate the verifiable conditions needed across studies for the FDA to fulfill its regulatory mission. We show that controlling FWER within a study at $5\%$ inherently controls ERpF across studies at 5-per-100, regardless of study correlations. This supports current regulatory practices that protect public health while fostering innovation. We also address concerns about ERpF stability in Platform trials, where shared controls introduce dependencies. By applying the Conditionality Principle and utilizing an innovative Shiny app, we explore how correlations impact ERpF variability, providing deeper insights for informed decision-making. Our findings, supported by principles like Layering of Error Rate Controls and the Conditionality Principle, are particularly relevant as Platform trials gain popularity for their efficiency in testing multiple treatments simultaneously. 2023-02-24T16:24:52Z Xinping Cui Emily Ouyang Yi Liu Jingjing Yan Schneider Hong Tian Bushi Wang Jason C. Hsu http://arxiv.org/abs/2603.23923v1 Elements of Conformal Prediction for Statisticians 2026-03-25T04:28:00Z Predictive inference is a fundamental task in statistics, traditionally addressed using parametric assumptions about the data distribution and detailed analyses of how models learn from data. In recent years, conformal prediction has emerged as a rapidly growing alternative framework that is particularly well suited to modern applications involving high-dimensional data and complex machine learning models. Its appeal stems from being both distribution-free -- relying mainly on symmetry assumptions such as exchangeability -- and model-agnostic, treating the learning algorithm as a black box. Even under such limited assumptions, conformal prediction provides exact finite-sample guarantees, though these are typically of a marginal nature that requires careful interpretation. This paper explains the core ideas of conformal prediction and reviews selected methods. Rather than offering an exhaustive survey, it aims to provide a clear conceptual entry point and a pedagogical overview of the field. 2026-03-25T04:28:00Z Matteo Sesia Stefano Favaro http://arxiv.org/abs/2603.23790v1 Root Finding and Metamodeling for Rapid and Robust Computer Model Calibration 2026-03-24T23:45:54Z We concern computer model calibration problem where the goal is to find the parameters that minimize the discrepancy between the multivariate real-world and computer model outputs. We propose to solve an approximation using signed residuals that enables a root finding approach and an accelerated search. We characterize the distance of the solutions to the approximation from the solutions of the original problem for the strongly-convex objective functions, showing that it depends on variability of the signed residuals across output dimensions, as wells as their variance and covariance. We develop a metamodel-based root finding framework under kriging and stochastic kriging that is augmented with a sequential search space reduction. We derive three new acquisition functions for finding roots of the approximate problem along with their derivatives usable by first-order solvers. Compared to kriging, stochastic kriging accounts for observational noise, promoting more robust solutions. We also analyze the case where a root may not exist. Our analysis of the asymptotic behavior in this context show that, since existence of roots in the approximation problem may not be known a priori, using new acquisition functions will not compromise the outcome. Numerical experiments on data-driven and physics-based examples demonstrate significant computational gains over standard calibration approaches. 2026-03-24T23:45:54Z Yongseok Jeon Sara Shashaani http://arxiv.org/abs/2506.11232v4 Regularized Estimation of the Loading Matrix in Factor Models for High-Dimensional Time Series 2026-03-24T23:10:28Z High-dimensional data analysis using traditional models suffers from overparameterization. Two types of techniques are commonly used to reduce the number of parameters - regularization and dimension reduction. In this project, we combine them by imposing a sparse factor structure and propose a regularized estimator to further reduce the number of parameters in factor models. A challenge limiting the widespread application of factor models is that factors are hard to interpret, as both factors and the loading matrix are unobserved. To address this, we introduce a penalty term when estimating the loading matrix for a sparse estimate. As a result, each factor only drives a smaller subset of time series that exhibit the strongest correlation, improving the factor interpretability. The theoretical properties of the proposed estimator are investigated. The simulation results are presented to confirm that our algorithm performs well. We apply our method to Hawaii tourism data. 2025-06-12T19:13:27Z Xialu Liu Xin Wang