https://arxiv.org/api/7tJaVMfPpN6boyMSlW3Xdcxy0ZI 2026-04-06T13:18:13Z 34888 315 15 http://arxiv.org/abs/2509.01540v8 Discrete Chi-Square Method can model and forecast complex time series, like El Nino data between 1870 and 2026 2026-03-24T14:35:56Z Forecasting El Nino is one of the greatest challenges of science. We show how intensive, large and accurate time series allow us to see through time. Our Discrete Chi-square Method (DCM) can detect arbitrary trend and signal(-s) combinations. It can forecast complex time series. The widely-used Discrete Fourier Transform (DFT) and other frequency-domain parametric time series analysis methods have many application limitations. None of those limitations constrains the DCM. Our simulated time series analyses ascertain the revolutionary Window Dimension Effect (WDE): "For any sample window $ΔT$, DCM inevitably detects the correct $p(t)$ trend and $h(t)$ signal(-s) when the sample size $n$ and/or data accuracy $σ$ increase." The simulations also expose the DFT's weaknesses and the DCM's efficiency. The DCM's backbone is the Gauß-Markov theorem that the Least Squares (LS) is the best unbiased estimator for linear regression models. DCM can not fail because this simple method is based on the computation of a massive number of linear model LS fits. The Fisher-test gives the signal significance estimates and identifies the best DCM model from all alternative tested DCM models. The analytical solution for the non-linear DCM model is an ill-posed problem. We present a computational well-posed solution. The best DCM model must be correct if it passes our Forecast-test.Our DCM is ideal for forecasting because its WDE spearhead is robust against short sample windows and complex time series. In our appendix, we show that the DCM can model and forecast El Nino data between 1870 and 2026. An immediate, independent and objective validity check of our analysis may save some money. 2025-09-01T15:16:11Z Submitted to Computational Statistics (Springer Verlag) Lauri Jetsu http://arxiv.org/abs/2509.08795v2 On the inclusion of non-concurrent controls in platform trials with an interim analysis 2026-03-24T14:00:42Z The analysis of platform trials can be enhanced by utilizing non-concurrent controls. Since including this data might also introduce bias in the treatment effect estimators if time trends are present, methods for incorporating non-concurrent controls adjusting for time have been proposed. However, so far their behavior has not been systematically investigated in platform trials that include interim analyses. To evaluate the impact of an interim analysis in trials utilizing non-concurrent controls, we consider a platform trial featuring two experimental arms and a shared control, with the second experimental arm entering later. We focus on a frequentist regression model that uses non-concurrent controls to estimate the treatment effect of the second arm and adjusts for time using a step function to account for temporal changes. We show that performing an interim analysis in Arm 1 may introduce bias in the point estimation of the effect in Arm 2, if the regression model is used without adjustment, and investigate how the marginal bias and bias conditional on the first arm continuing after the interim depend on different trial design parameters. Moreover, we propose a new estimator of the treatment effect in Arm 2, aiming to eliminate the bias introduced by both the interim analysis in Arm 1 and the time trends, and evaluate its performance in a simulation study. The newly proposed estimator is shown to substantially reduce the bias and type I error rate inflation while leading to power gains compared to an analysis using only concurrent controls. 2025-09-10T17:25:36Z Pavla Krotka Martin Posch Marta Bofill Roig http://arxiv.org/abs/2603.21917v2 The Cascade Identity: 2SLS as a Policy Parameter in Capacity-Constrained Settings 2026-03-24T13:55:54Z A growing literature shows that two-stage least squares (2SLS) with multiple treatments yields coefficients that are difficult to interpret under heterogeneous treatment effects and cross-effects in the first stage. We show that in capacity-constrained allocation systems, these cross-effects are not a nuisance but the source of a clean policy interpretation. When treatments are rationed and the instrument operates on the same margin as the policy of interest, the 2SLS coefficient $β_k$ equals the total societal effect of expanding treatment $k$ by one slot, including all cascading reallocations through the system. The mechanism is general: it applies whenever fixed supply constrains allocation, whether through ranked queues, waitlists, or market-clearing prices. This cascade identity $\mathbf{T} = \mathbfβ$ holds for any first-stage matrix, under arbitrary treatment effect heterogeneity, and requires only instrument relevance and that the instrument operates on the same margin as the policy. The result applies to university admissions, school choice, medical residency matching, public housing, and other rationed allocation settings. We provide an empirical application using lottery-based admission to Swedish university programs and charitable giving as the outcome. 2026-03-23T12:41:01Z 56 pages, 2 figures, 5 tables Niklas Bengtsson Per Engström http://arxiv.org/abs/2603.23205v1 Between Resolution Collapse and Variance Inflation: Weighted Conformal Anomaly Detection in Low-Data Regimes 2026-03-24T13:51:59Z Standard conformal anomaly detection provides marginal finite-sample guarantees under the assumption of exchangeability . However, real-world data often exhibit distribution shifts, necessitating a weighted conformal approach to adapt to local non-stationarity. We show that this adaptation induces a critical trade-off between the minimum attainable p-value and its stability. As importance weights localize to relevant calibration instances, the effective sample size decreases. This can render standard conformal p-values overly conservative for effective error control, while the smoothing technique used to mitigate this issue introduces conditional variance, potentially masking anomalies. We propose a continuous inference relaxation that resolves this dilemma by decoupling local adaptation from tail resolution via continuous weighted kernel density estimation. While relaxing finite-sample exactness to asymptotic validity, our method eliminates Monte Carlo variability and recovers the statistical power lost to discretization. Empirical evaluations confirm that our approach not only restores detection capabilities where discrete baselines yield zero discoveries, but outperforms standard methods in statistical power while maintaining valid marginal error control in practice. 2026-03-24T13:51:59Z 18 pages, 2 figures, 7 tables Oliver Hennhöfer Christine Preisach http://arxiv.org/abs/2312.15222v5 Is control of type I error rate needed in Bayesian clinical trial designs? 2026-03-24T12:23:14Z Practical employment of Bayesian trial designs is still rare. Even if accepted in principle, the regulators have commonly required that such designs be calibrated according to an upper bound for the frequentist type I error rate. This represents an internally inconsistent hybrid methodology, where important advantages from following the Bayesian principles are lost. In particular, all preplanned interim looks have an inflating multiplicity effect on type I error rate. To present an alternative approach, we consider the prototype case of a 2-arm superiority trial with dichotomous outcomes. The design is adaptive, using error control based on sequentially updated posterior probabilities, to conclude efficacy of the experimental treatment or futility of the trial. As gatekeepers for a proposed design, the regulators have the main responsibility in determining the parameters of the control of false positives, whereas the trial sponsors and investigators will have a natural role in specifying the criteria for stopping the trial due to futility. It is suggested that the traditional frequentist operating characteristics in the design, type I and type II error rates, be replaced, respectively, by Bayesian criteria called False Discovery Probability (FDP) and False Futility Probability (FFP), both terms corresponding directly to their probability interpretations. Importantly, the sequential error control during the data analysis based on posterior probabilities will satisfy these numerical criteria automatically, without need of preliminary computations before the trial is started. The method contains the option of applying a decision rule for terminating the trial early if the predicted costs from continuing would exceed the corresponding gains. 2023-12-23T11:05:19Z 31 pages, 2 figures Elja Arjas Dario Gasbarra http://arxiv.org/abs/2510.03131v3 Total robustness in Bayesian Nonlinear Regression 2026-03-24T11:06:18Z Modern regression analyses are often undermined by covariate measurement error, misspecification of the regression model, and misspecification of the measurement error distribution. We present, to the best of our knowledge, the first Bayesian nonparametric learning framework targeting total robustness to all three challenges in general nonlinear regression. Our framework places a joint Dirichlet process prior on the latent covariate--response distribution and updates it with posterior pseudo-samples of the latent covariates, so that inference is calibrated to the joint law. This yields estimators defined by minimizing the discrepancy between posterior realizations of the joint Dirichlet process and the model-implied joint distribution. We establish generalization bounds and provide a first proof of convergence and consistency of the resulting estimators under non-degenerate measurement error. A gradient-based implementation enables efficient computation; simulations and two real-data studies show improved stability to misspecification under increasing measurement error relative to recent Bayesian and frequentist alternatives. 2025-10-03T15:58:40Z 76 pages, 13 figures Mengqi Chen Charita Dellaporta Thomas B. Berrett Theodoros Damoulas http://arxiv.org/abs/2312.03538v4 Bayesian variable selection in sample selection models using spike-and-slab priors 2026-03-24T10:55:15Z Sample selection models are a widely used approach for correcting bias caused by data that are missing not at random. Their formulation requires specifying the variables that influence the outcome and those that drive the selection process. This specification is often based on expert knowledge, which can result in the inclusion of irrelevant variables or the omission of important ones. Moreover, to avoid inferential problems such as practical non-identifiability, practitioners frequently impose exclusion restrictions, that is, model specifications in which certain variables predict selection but have no effect on the outcome of interest. A recent proposal employs adaptive LASSO to select the variables that enter into the outcome and selection equations, but its performance depends on the so-called covariance assumption, which can be violated in small to moderate samples. To address these challenges, we propose two families of spike-and-slab priors to conduct Bayesian variable selection in sample selection models. These prior structures allow for constructing a Gibbs sampler with tractable conditionals, which is scalable to the dimensions of practical interest. We illustrate the performance of the proposed methodology through a simulation study and present a comparison against adaptive LASSO and stepwise selection. We also provide two applications using publicly available real data. 2023-12-06T15:01:47Z An implementation and code used to reproduce simulation studies and the real data applications can be found at https://github.com/adam-iqbal/selection-spike-slab Adam J. Iqbal Emmanuel O. Ogundimu F. Javier Rubio http://arxiv.org/abs/2511.04568v2 Riesz Regression As Direct Density Ratio Estimation 2026-03-24T10:29:23Z This study clarifies the relationship between Riesz regression [Chernozhukov et al., 2021] and density ratio estimation (DRE) in causal inference problems, such as average treatment effect estimation. We first show that the Riesz representer can be written as a signed density ratio and then demonstrate that the Riesz regression objective coincides with the least-squares importance fitting criterion [Kanamori et al., 2009]. Although Riesz regression applies to a broad class of representer estimation problems, this equivalence with DRE allows us to transfer existing DRE results, including convergence rate analyses, generalizations based on Bregman divergence minimization, and regularization techniques for flexible models such as neural networks. 2025-11-06T17:25:05Z Masahiro Kato http://arxiv.org/abs/2603.22990v1 A Top-Down Scale Approach for Multiscale Geographically and Temporally Weighted Regression 2026-03-24T09:32:52Z This paper proposes tds mgtwr, a multiscale geographically and temporally weighted regression (MGTWR) model with covariate-specific spatial and temporal scales. The approach combines a separable spatio-temporal kernel with a Top-Down Scale (TDS) calibration scheme, where spatial and temporal bandwidths are selected for each covariate through a coordinate-wise search over ordered grids guided by the corrected Akaike Information Criterion (AICc). By avoiding unconstrained multidimensional optimization, this strategy extends to the spatio-temporal setting the stabilizing properties of TDS calibration scheme Geniaux (2026). The multiscale backfitting procedure combines the Top-Down Scale calibration scheme with an adaptive, importance-driven update schedule that prioritizes covariates according to their current scale-normalized contribution to the fitted signal, thereby limiting the number of local recalibrations required and accelerating convergence while maintaining estimator fidelity. We also introduce a generic prediction method for MGWR and MGTWR based on kernel sharpening. Monte Carlo experiments show that modeling both space and time improves coefficient recovery and predictive accuracy relative to purely spatial multiscale models when temporal variation is present and sufficiently supported by the data. Gains increase with sample size and signal-to-noise ratio. Two empirical applications illustrate the method under contrasting regimes. For Beet Yellows severity, a plant epidemiology and pest management problem, multiscale spatial modeling is essential, while spatio-temporal extensions yield additional gains when temporal information is rich. In modeling house prices, MGTWR consistently outperforms spatial local and STVC models. In both cases, predictive performance rivals flexible machine-learning benchmarks while preserving interpretable spatio-temporal scales. 2026-03-24T09:32:52Z Preprint -- Submitted to Spatial Statistics Ghislain Geniaux INRAE César Martinez Samuel Soubeyrand http://arxiv.org/abs/2512.06428v3 Community detection in heterogeneous signed networks 2026-03-24T08:41:47Z Network data has attracted growing interest across scientific domains, prompting the development of various network models. Existing network analysis methods mainly focus on unsigned networks, whereas signed networks, consisting of both positive and negative edges, have been frequently encountered in practice but much less investigated. In this paper, we formally define strong and weak balance in signed networks, and propose a signed block $β$-model, which is capable of modeling strong- and weak-balanced signed networks simultaneously. We establish the identifiability of the proposed model by leveraging properties of bipartite graphs, and develop an efficient alternating updating algorithm to optimize the resulting log-likelihood function. More importantly, we establish the asymptotic consistencies of the proposed model in terms of both probability estimation and community detection. Its advantages are also demonstrated through extensive numerical experiments and the application to a real-world international relationship network. 2025-12-06T13:25:14Z Yuwen Wang Shiwen Ye Jingnan Zhang Junhui Wang http://arxiv.org/abs/2601.06807v2 Adversarially Perturbed Precision Matrix Estimation 2026-03-24T08:26:44Z Precision matrix estimation is a fundamental topic in multivariate statistics and modern machine learning. This paper proposes an adversarially perturbed precision matrix estimation framework, motivated by recent developments in adversarial training. The proposed framework is versatile for the precision matrix problem since, by adapting to different perturbation geometries, the proposed framework can not only recover the existing distributionally robust method but also achieve high-dimensional model selection consistency under the scale-adaptive incoherence condition, which can be viewed as a relaxation of the classic incoherence condition in the heteroscedastic settings. Additionally, the proposed perturbed precision matrix estimation framework is asymptotically equivalent to the regularized precision matrix estimation, and the asymptotic normality can be established accordingly, where the asymptotic bias introduced by perturbation is highlighted. Numerical experiments demonstrate the desirable practical performance of the proposed adversarially perturbed approach. 2026-01-11T08:40:56Z Yiling Xie http://arxiv.org/abs/2603.22914v1 Nonparametric regression with dependent censoring or competing risks 2026-03-24T08:02:47Z Single-index models or time-to-event models are frequently applied in empirical research. These models are non-identifiable in presence of unknown (dependent) censoring or competing risks and do not give informative results in empirical analysis unless rather strong, non-testable restrictions hold. Little is known, whether the known robustness properties of the single-index model carry over to models with dependent censoring or competing risks. This paper shows that the ratio of partial covariate effects on the margins is identifiable in nonparametric models with unknown dependent censoring or nonparametric competing risks models with nonparametric dependence structure, provided an exclusion restriction holds. Commonly used (semi)parametric models for the margin and independent censoring, such as Cox proportional hazards, accelerated failure time or proportional odds models, can be used to obtain relative covariate effects despite their misspecified censoring mechanism. Several nonparametric estimators for the general model are introduced and their numerical properties are studied. 2026-03-24T08:02:47Z 39 pages, 2 figures, for associated sample code, see https://github.com/ralfawilke/nonparreg Jia-Han Shih Simon M. S. Lo Ralf A. Wilke http://arxiv.org/abs/2603.22900v1 Off-Policy Evaluation and Learning for Survival Outcomes under Censoring 2026-03-24T07:50:38Z Optimizing survival outcomes, such as patient survival or customer retention, is a critical objective in data-driven decision-making. Off-Policy Evaluation~(OPE) provides a powerful framework for assessing such decision-making policies using logged data alone, without the need for costly or risky online experiments in high-stakes applications. However, typical estimators are not designed to handle right-censored survival outcomes, as they ignore unobserved survival times beyond the censoring time, leading to systematic underestimation of the true policy performance. To address this issue, we propose a novel framework for OPE and Off-Policy Learning~(OPL) tailored for survival outcomes under censoring. Specifically, we introduce IPCW-IPS and IPCW-DR, which employ the Inverse Probability of Censoring Weighting technique to explicitly deal with censoring bias. We theoretically establish that our estimators are unbiased and that IPCW-DR achieves double robustness, ensuring consistency if either the propensity score or the outcome model is correct. Furthermore, we extend this framework to constrained OPL to optimize policy value under budget constraints. We demonstrate the effectiveness of our proposed methods through simulation studies and illustrate their practical impacts using public real-world data for both evaluation and learning tasks. 2026-03-24T07:50:38Z Preprint Kohsuke Kubota Mitsuhiro Takahashi Yuta Saito http://arxiv.org/abs/2603.22845v1 DROP: Distributionally Robust Optimization for Multi-task Learning in Graphical Models 2026-03-24T06:33:55Z Gaussian Graphical Models (GGMs) are widely used to infer conditional dependence structures in high-dimensional data. However, standard precision matrix estimators are highly sensitive to data contamination, such as extreme outliers and heavy-tailed noise. In this paper, we propose DROP (Distributionally Robust Optimization), a robust estimation method formulated within a multi-task nodewise regression framework. The proposed estimator enforces structural sparsity while resisting the influence of corrupted observations. Theoretically, we establish error bounds for the DROP estimator under general contamination. Through extensive high-dimensional simulations, we demonstrate that DROP consistently controls the rate of false positive edges and outperforms conventional non-robust estimators when data deviate from standard Gaussian assumptions. Furthermore, in a functional MRI (fMRI) application, DROP maintains a stable graph structure and preserves network modularity even when subjected to severe data perturbations, whereas competing methods yield excessively dense networks. To facilitate reproducible research, the DROP R package will be made publicly available on GitHub. 2026-03-24T06:33:55Z Canruo Shen Xintong Ji Qiong Li Wenzhi Yang Xiaoping Shi http://arxiv.org/abs/2603.22838v1 Community Detection on Inhomogeneous Multilayer Networks with Extreme Sparsity 2026-03-24T06:21:48Z We study layer-specific community detection in an $L$-layer network $\{A^{(l)}\}_{l\in[L]}$ on a common set of $n$ nodes. Because modern networks are constructed from multi-modal data or with different contexts, the community labels $π^{(l)}\in[K]^n$ are layer-dependent and the degree heterogeneity parameters $θ_i^{(l)}$ vary widely across nodes and layers. The inhomogeneity and extreme sparsity raise a challenge for classical community detection methods. We propose a multilayer-assisted regularized spectral method (MARS-CD) to address this challenge. For layer $l$, MARS-CD first constructs $X^{(l)}$ from the remaining layers, so that the problem is transformed into a network-with-covariates clustering problem on $(A^{(l)}, X^{(l)})$. Then we recover $π^{(l)}$ by NAC in Hu and Wang (2024) that allows misalignment. The key component is to construct $X^{(l)}$, where we stack regularized embeddings. Building upon this, we establish the first theoretical guarantees for the quality of $X^{(l)}$ under multilayer networks with extreme sparsity. These further lead to weak and strong consistency for recovering $π^{(l)}$. We further develop an optional label alignment step to interpret the shared community structure across layers. Simulations demonstrate the superior performance of our MARS-CD method. Applying MARS-CD to international food trading networks provides an interpretable product-specific community structure. 2026-03-24T06:21:48Z 35 pages, 2 figures Tao Shen Wanjie Wang