https://arxiv.org/api/r1cF5VYpr9gmrsgtJEJwOmLDf+M 2026-06-19T05:40:19Z 23582 555 15 http://arxiv.org/abs/2605.08379v1 Transfer Learning for Dead Fuel Moisture Prediction Using Time-Warping Recurrent Neural Networks 2026-05-08T18:37:22Z

This paper proposes a time-warping transfer learning method, a technique for temporally rescaling the learned dynamics of a recurrent neural network (RNN) with a Long Short-Term Memory (LSTM) layer to enable task transfer across fuel moisture classes. Fuel moisture content (FMC) is divided into idealized classes based on characteristic lag time. Large quantities of real-time data are available for 10h fuels from sensors on weather stations, but observations of other fuel classes are sparse in space and time. We use transfer learning to adapt an RNN pretrained on 10h FMC to predict FMC for 1h, 100h, and 1000h fuels. We validate this method using data from a landmark field study conducted in Oklahoma that was used to calibrate the state-of-the-art Nelson fuel moisture model.

2026-05-08T18:37:22Z Preprint. Related to PhD thesis work that is also available for preprint at https://doi.org/10.48550/arXiv.2604.02474 Jonathon Hirschi Jan Mandel Adam Kochanski http://arxiv.org/abs/2605.08027v1 Randomization Tests for Distributions of Individual Treatment Effects via Combined Rank Statistics 2026-05-08T17:12:49Z

What proportion of treated units actually benefited from an experimental intervention? What is the median or the largest individual treatment effect? This paper develops methods for answering such questions about the distribution of individual causal effects in randomized experiments. Existing approaches require the analyst to select a rank-based test statistic before observing the data. A poor choice can substantially reduce power, while searching over multiple test statistics and adjusting for multiplicity using Bonferroni correction also incurs power loss. We propose inference procedures that adaptively combine multiple rank-based statistics while maintaining finite-sample validity. For stratified experiments, we further develop weighting schemes that effectively aggregate evidence across strata of heterogeneous sizes. The resulting combined test achieves power comparable to, or exceeding, that of the best individual test, without requiring prior knowledge of the optimal statistic. When applied to a randomized experiment evaluating a teacher training program, the combined test suggests that roughly half of treated teachers benefited, whereas a single rank-based test may indicate only a small minority. Thus, the choice of test determined whether the program appears broadly successful or narrowly effective.

2026-05-08T17:12:49Z David Kim Yongchang Su Jake Bowers Xinran Li http://arxiv.org/abs/2604.25826v2 General-Purpose Technology and Speculative Bubble Detection 2026-05-08T15:59:13Z

We show that the leading bubble test suffers severe size distortion when fundamentals incorporate general-purpose technology adoption. Embedding a hump-shaped technology shock in the Campbell-Shiller present-value model, we prove that the fundamental price becomes locally explosive during adoption, contaminating the test's limit distribution with a non-centrality parameter proportional to the shock's peak. We propose a fundamental-versus-speculative decomposition that projects prices onto observable technology proxies and applies the test to the residual. Empirically, the decomposition eliminates evidence of speculation in the 2020-2025 AI rally while confirming a speculative peak confined to December 1999-March 2000 in the dot-com episode.

2026-04-28T16:35:05Z Haiqiang Chen Li Chen Difang Huang Yuexin Li Zhengjun Zhang http://arxiv.org/abs/2605.07834v1 GenAI Powered Dynamic Causal Inference with Unstructured Data 2026-05-08T15:03:44Z

A growing number of scholars seek to estimate causal effects of unstructured data such as text, images, and video. However, existing methods typically treat each object as a single, static observation. We develop a statistical framework for dynamic causal inference with unstructured data by leveraging generative artificial intelligence (GenAI) models. Our approach enables researchers to estimate the causal effects of sequences of treatment features, including their positions within text and video. We first extract internal representations of unstructured objects from a GenAI model and then estimate a marginal structural model using a neural network architecture that jointly learns a deconfounder for each treatment feature in the sequence. Our semiparametric inference framework yields valid asymptotic confidence intervals. Simulation studies demonstrate that the proposed estimator recovers the target causal effects and that the confidence intervals achieve nominal coverage in finite samples. We further apply our method to a randomized experiment on the Hong Kong protests, showing that the effect of a treatment feature depends critically on its position within the text.

2026-05-08T15:03:44Z Kentaro Nakamura Kosuke Imai http://arxiv.org/abs/2311.08433v3 Clinical Characteristics and Laboratory Biomarkers in ICU-admitted Septic Patients with and without Bacteremia 2026-05-08T12:03:10Z

Few studies have investigated the diagnostic utilities of biomarkers for predicting bacteremia among septic patients admitted to intensive care units (ICU). Therefore, this study evaluated the prediction power of laboratory biomarkers to utilize those markers with high performance to optimize the predictive model for bacteremia. This retrospective cross-sectional study was conducted at the ICU department of Gyeongsang National University Changwon Hospital in 2019. Adult patients qualifying SEPSIS-3 (increase in sequential organ failure score greater than or equal to 2) criteria with at least two sets of blood culture were selected. Collected data was initially analyzed independently to identify the significant predictors, which was then used to build the multivariable logistic regression (MLR) model. A total of 218 patients with 48 cases of true bacteremia were analyzed in this research. Both CRP and PCT showed a substantial area under the curve (AUC) value for discriminating bacteremia among septic patients (0.757 and 0.845, respectively). To further enhance the predictive accuracy, we combined PCT, bilirubin, neutrophil lymphocyte ratio (NLR), platelets, lactic acid, erythrocyte sedimentation rate (ESR), and Glasgow Coma Scale (GCS) score to build the predictive model with an AUC of 0.907 (95% CI, 0.843 to 0.956). In addition, a high association between bacteremia and mortality rate was discovered through the survival analysis (0.004). While PCT is certainly a useful index for distinguishing patients with and without bacteremia by itself, our MLR model indicates that the accuracy of bacteremia prediction substantially improves by the combined use of PCT, bilirubin, NLR, platelets, lactic acid, ESR, and GCS score.

2023-11-14T06:44:26Z This research is not complete Sangwon Baek Seung Jun Lee http://arxiv.org/abs/2605.07421v1 There to care; not to kill: medical settings, statistics and wrongful convictions 2026-05-08T08:15:56Z

This paper discusses wrongful convictions in a medical setting, focusing on nurses. Common features are lack of strong direct evidence: the nurse was never seen doing anything wrong. There is no DNA evidence of tampering of apparatus or medications by the nurse. There is no CCTV footage showing suspicious actions. Analysis of medical records at the time led coroners to issue certificates of natural deaths, and most events were not, at the time, thought suspicious by hospital staff. There is no confession and the nurse consistently asserts they are completely innocent. There is no evidence of earlier psychopathic behaviour. Instead, private writings (e.g., in a diary) are interpreted by the prosecution as a confession; mundane behaviour is given a sinister interpretation. Motive remains speculation. The main evidence is statistical: a spike in deaths or collapses and a statistical association with a particular nurse. There is forensic evidence which suggests one or two patients might have been harmed by administration of medication much used in the hospital, and even legitimately used earlier in the care of the alleged victims. Police investigations are driven by the hospital consultants who were clinically responsible for the patients allegedly killed or harmed by the nurse.

2026-05-08T08:15:56Z Invited contribution to a volume on miscarriages of justice, in preparation Richard D. Gill http://arxiv.org/abs/2605.07409v1 The Proxy Presumption: From Semantic Embeddings to Valid Social Measures 2026-05-08T08:03:44Z

Natural Language Processing is rapidly evolving into a primary instrument for Computational Social Science, with researchers increasingly using embeddings to measure latent constructs such as novelty, creativity, and bias. However, this transition faces a fundamental validity challenge: the ''Proxy Presumption,'' or the reliance on geometric properties (e.g., cosine distance) as direct measures of social concepts. We argue that without explicit validation, unsupervised representations remain entangled mixtures of the target construct ($C$) and confounding attributes ($Z$) like topic, style, and authorship. To bridge the gap between semantic embeddings and valid social measures, we introduce the Construct Validity Protocol (CVP). Drawing on causal representation learning and psychometrics, the CVP offers a rigorous pipeline from conceptualization to quantitative verification. We further propose Counterfactual Neutralization, a novel method using LLMs to reduce confounding in embedding space. By providing a standardized Validity Suite -- including tests for discriminant, incremental, and predictive validity -- this work offers the community a toolkit to transform heuristic proxies into robust, scientifically defensible instruments.

2026-05-08T08:03:44Z ACL 2026 Baishi Li Ta Yu Kelvin J. L. Koa Ke-Wei Huang http://arxiv.org/abs/2605.07383v1 Combating Organized Platform Abuse: Amplifying Weak Risk Signals with Structural Information 2026-05-08T07:38:43Z

Large-scale online service platforms face severe challenges from organized platform abuse: multiple forms such as credit card fraud and promotion abuse continually emerge, characterized by large numbers of involved accounts, rapid outbreaks, and constantly shifting tactics. Existing mainstream approaches, whether heuristic rules limited in precision, supervised learning with insufficient generalization, or graph models that are engineering-heavy and dependent on seed users, have failed to address such threats effectively. This paper returns to first principles and, starting from the economic constraints of fraudulent behavior, proposes the Fraudster's Trilemma: organized attackers cannot simultaneously achieve scale, low cost, and dispersed cash-out. Building on this theory, we derive a robust structural invariant in organized fraud, namely centralized cash-out, and use a simple statistical method to turn low-precision individual weak signals into high-precision strong decisions. The method requires no labels, is nearly parameter-free, white-box interpretable, has linear complexity O(|E|), avoids cold-start issues, and its detection logic possesses the "open-hand" property: attackers cannot evade it even when fully informed. We validate the approach on two real fraud incidents in backtests. In the promotion abuse case, a single near-zero-cost weak signal (global Precision of only 16%) after structural amplification achieves Precision above 91% and Recall exceeding 99% (z=10.0); at a higher threshold (z=40.0), Precision reaches 93.7%. In the credit card fraud case, an infrastructure-layer weak signal (device spoofing) successfully detects payment-layer attacks without any business-logic linkage, revealing the framework's natural MO-agnostic property: it relies more on the structural invariant than on signal semantics.

2026-05-08T07:38:43Z 11 pages, 6 figures, 8 tables Meng He Jia Long Loh http://arxiv.org/abs/2605.07309v1 Variational PMB filter via coordinate descent Kullback-Leibler divergence minimisation 2026-05-08T06:18:37Z

This paper presents a new derivation of the variational Poisson multi-Bernoulli (V-PMB) filter for multi-target estimation proposed in [#Williams15]. The proposed derivation is based on considering an augmented space that includes the set of target states with their track indices and the global hypothesis variable. Then, we show that the V-PMB projection performs a coordinate descent Kullback-Leibler divergence (KLD) minimisation on this augmented space to fit the best possible PMB density to the Poisson multi-Bernoulli mixture (PMBM) posterior. We also show that this V-PMB projection keeps the probability hypothesis density of the posterior. The paper also includes a comparison with the PMBM filter and other PMB filter variants, including a track-oriented Murty-based implementation, a track-oriented loopy belief propagation implementation and a global nearest neighbour implementation, showing the benefits of the V-PMB filter compared to the other PMB filters when targets get in close proximity and then separate.

2026-05-08T06:18:37Z Accepted in Proceedings of the 29th International Conference on Information Fusion, 2026. Matlab code available at https://github.com/Agarciafernandez/MTT Ángel F. García-Fernández Yuxuan Xia http://arxiv.org/abs/2605.07300v1 A Beta-GAM Hidden Markov Model for Proportion Time Series 2026-05-08T06:11:16Z

We propose a hidden Markov model for univariate proportion time series taking values in (0,1), where regime switching captures latent structural changes and the emission distribution belongs to the Beta family. In each latent state, the Beta mean is linked to covariates through a generalized additive model (GAM) with spline-based smooth functions, while the Beta precision is state-specific, enabling flexible modeling of both nonlinear covariate effects and regime-dependent variability. Estimation is carried out via a penalized expectation--maximization algorithm, combining smoothing with numerical maximization of the penalized emission likelihood. To select the number of latent states and the smoothing penalty, we implement a grid search guided by standard information criteria (Akaike Information Criterion/Bayesian Information Criterion/Integrated Completed Likelihood) with a diagnostic filter that removes degenerate solutions characterized by explosive precision estimates. Uncertainty is quantified through a parametric bootstrap procedure for transition probabilities and state-dependent parameters. Simulation results demonstrate accurate recovery of transition dynamics, state precisions, and latent-state decoding. A motivating application to Russian age-specific mortality data (1960--2014, ages 0--40) illustrates how the proposed model summarizes smooth age patterns in female-to-total mortality ratios while identifying two persistent latent regimes that admit a substantive demographic interpretation in light of the country's well-documented mortality shocks that occurred over the second half of the twentieth century.

2026-05-08T06:11:16Z Andrea Nigri Han Lin Shang Marco Bonetti http://arxiv.org/abs/2605.07225v1 Spatiotemporal dynamics of wind-speed volatility 2026-05-08T04:23:40Z

Wind-speed processes exhibit substantial temporal variability and spatial dependence, yet volatility dynamics across monitoring networks remain relatively unexplored. This study investigates the spatiotemporal behaviour of wind-speed volatility using daily observations from 141 stations in Northern Italy over 2016--2021, with measurements at 10 m and 100 m enabling the analysis of spatial and vertical dependence. We adopt a parsimonious spatiotemporal volatility framework based on GARCH-type dynamics, in which conditional variance depends on past local shocks and spatially aggregated information from neighbouring stations. The approach combines a spatial mean specification with structured volatility models using distance-based and directionally informed weight matrices. Results show that properly modelling spatial dependence in the mean is essential for well-behaved residuals and reliable inference. Forecast performance is strongly driven by the mean specification: flexible structures perform better when residual spatial dependence remains, while parsimonious distance-based models yield robust out-of-sample forecasts once spatial interactions are captured. Persistence increases with height, and a multivariate extension reveals cross-height dependence.

2026-05-08T04:23:40Z Submitted to Environmetrics. 6 figures, 11 tables Ariane Nidelle Meli Chrisko Philipp Otto http://arxiv.org/abs/2605.08272v1 Quantifying Exposure Information Uncertainty in Regional Risk Assessment 2026-05-08T03:30:29Z

Exposure characterization in regional risk assessment aims to assign physical properties to the assets of interest so they can be associated with damage and loss functions. While this process has benefited from the growing availability of public infrastructure inventories, these datasets often lack the detailed attributes required for high-resolution risk assessment. Missing attributes are commonly inferred using predictive models or engineering-based rulesets. However, these imputations are inherently imperfect and can introduce bias and additional uncertainty in regional risk estimates. This study proposes a methodology to quantify the bias and uncertainty in regional risk assessment that arises from probabilistic exposure characterization. By integrating analytical and simulation-based approaches, the methodology decomposes the total uncertainty into contributions from incomplete exposure information as well as other sources, including hazard and damage characterization. This decomposition clarifies how bias and uncertainty associated with missing exposure information are generated and propagated through the risk assessment pipeline. The methodology is applied to both bridge-specific and regional risk assessments. A high-resolution bridge exposure inventory is developed using a data augmentation framework that combines publicly available information with machine learning and engineering-based imputation methods.

2026-05-08T03:30:29Z Chenhao Wu Henry Burton http://arxiv.org/abs/2605.07056v1 The University AI Didn't Replace -- Rethinking Universities in the AI Era 2026-05-08T00:07:55Z

Generative artificial intelligence (AI) is reshaping higher education, yet many universities remain in early stages of adoption where AI innovation occurs informally and without institutional recognition. This paper presents a framework describing four levels of AI adoption in universities and illustrates these dynamics through a case study of AI-enabled curriculum initiatives in several units. We contend that the key institutional challenge is moving from isolated innovation to strategic integration, where universities redesign learning around AI-supported reasoning and align policies, workload models, and recognition systems to support educational transformation.

2026-05-08T00:07:55Z 8 pages, 1 figure. Position paper on Generative AI and the transition from isolated educational innovation to institutionally supported adoption in higher education Karol P. Binkowski Andrew Hopkins http://arxiv.org/abs/2605.06989v1 Drawing Lines in Psychological Space: What K-means Clustering Reveals in Simulated and Real Psychometric Data 2026-05-07T22:10:05Z

K-means clustering is widely used in psychological and psychometric research to identify profiles, subgroups, and potential typologies, yet its classical formulation does not test whether such groups exist as latent psychological categories. Instead, K-means partitions multidimensional space into regions around centroids, favoring compact, approximately spherical clusters defined by geometric distance. In this paper, we examine this limitation through a sequence of controlled simulated datasets. We then extend the analysis to the SMARVUS dataset, a large international psychometric dataset comprising survey responses from university students across 35 countries, to evaluate whether similar geometric partitioning patterns emerge in empirical psychological data. By contrasting simulated and empirical data, this paper argues that K-means can produce stable and visually coherent clustering solutions even in continuous Gaussian latent spaces without true subgroup structure.

2026-05-07T22:10:05Z Methodological study on K-means clustering in psychometric data using simulated and empirical datasets Pedro Henrique Ramos Pinto Maria Jullyanna Ferreira Marques Luiz Carlos Serramo Lopez http://arxiv.org/abs/2509.02826v2 Ensemble Learning for Healthcare: A Comparative Analysis of Hybrid Voting and Ensemble Stacking in Obesity Risk Prediction 2026-05-07T20:21:00Z

Obesity is a critical global health issue driven by dietary, physiological, and environmental factors, and is strongly associated with chronic diseases such as diabetes, cardiovascular disorders, and cancer. Machine learning has emerged as a promising approach for early obesity risk prediction, yet a comparative evaluation of ensemble techniques -- particularly hybrid majority voting and ensemble stacking -- remains limited. This study aims to compare hybrid majority voting and ensemble stacking methods for obesity risk prediction, identifying which approach delivers higher accuracy and efficiency. The analysis seeks to highlight the complementary strengths of these ensemble techniques in guiding better predictive model selection for healthcare applications. Two datasets were utilized to evaluate three ensemble models: Majority Hard Voting, Weighted Hard Voting, and Stacking (with a Multi-Layer Perceptron as meta-classifier). A pool of nine Machine Learning (ML) algorithms, evaluated across a total of 50 hyperparameter configurations, was analyzed to identify the top three models to serve as base learners for the ensemble methods. Preprocessing steps involved dataset balancing, and outlier detection, and model performance was evaluated using Accuracy and F1-Score. On Dataset-1, weighted hard voting and stacking achieved nearly identical performance (Accuracy: 0.920304, F1: 0.920070), outperforming majority hard voting. On Dataset-2, stacking demonstrated superior results (Accuracy: 0.989837, F1: 0.989825) compared to majority hard voting (Accuracy: 0.981707, F1: 0.981675) and weighted hard voting, which showed the lowest performance. The findings confirm that ensemble stacking provides stronger predictive capability, particularly for complex data distributions, while hybrid majority voting remains a robust alternative.

2025-09-02T20:44:52Z There are some errors found Towhidul Islam Md Sumon Ali