https://arxiv.org/api/r0XawnMgX+jGICeofK6pDjp3S+s 2026-07-18T22:05:41Z 5620 15 15 http://arxiv.org/abs/2510.25743v4 Agentic Economic Modeling 2026-07-15T02:29:29Z

We introduce Agentic Economic Modeling (AEM), a framework that aligns synthetic LLM choices with small-sample human evidence for econometric inference. AEM first generates task-conditioned synthetic choices via LLMs, then learns a bias-correction mapping from task features and raw LLM choices to human-aligned choices, upon which standard econometric estimators perform inference to recover demand elasticities and treatment effects. We validate AEM in two experiments. In a large scale conjoint study, using only 10% of the original data to fit the correction model lowers the error of the demand-parameter estimates, while uncorrected LLM choices increase the errors. In a regional field experiment, a mixture model calibrated on 10% of geographic regions estimates a treatment effect of -65$\pm$10 bps on the hold-out regions, closely matching the full human experiment (-60$\pm$8 bps). These results demonstrate AEM's potential to improve RCT efficiency and represent a step toward LLM-based counterfactual generation.

2025-10-29T17:46:07Z Bohan Zhang Jiaxuan Li Ali Hortaçsu Xiaoyang Ye Victor Chernozhukov Angelo Ni Edward W Huang http://arxiv.org/abs/2607.11983v2 Removable Defects: The Economics and Limits of Deliberate Deficiency 2026-07-15T02:23:38Z

A specialist tolerates blind spots that a generalist does not. Usually this is treated as a cost to be minimized. We treat it as a design variable: a deficiency can be kept because it pays and removed on demand in the rare situation where it would be fatal, by routing to a compensation channel. We give three results. First, an advantage condition under which keeping the deficiency is a computable economic position; structurally it is the Ehrlich-Becker market-vs-self-insurance margin applied to a competence gap, with the detector as a Townsend costly-state-verification technology. Second, a two-sided characterization of removability. A coupling lemma shows that when the deficiency is a coarsening of perception, no switch can separate benefit from harm, yielding a converse (a confounded detector earns zero premium, and any within-defect policy insisting on positive premium is driven, under multiplicative dynamics, to negative long-run growth) and an achievability result (a detector outside the deficiency earns a positive premium). Together, over structured uncertainty classes with severity capped or miss rate O(1/L): a defect is profitably removable iff the detector-relevant distinction survives the restriction and the advantage condition holds; the premium is the support function of the class's ROC set at an economic price vector. Third, observation defects and capacity defects differ exactly on whether access to the deployment distribution rescues them; the gap decomposes as cross-leak plus a closure deficit, and per-task randomization buys back the latter, never the former. The detector can be learned from declared fatal categories at a training bill linear in loss severity (up to a log factor). The results synthesize Chow's reject option, Kelly growth under ruin, and selective prediction.

2026-07-13T13:04:15Z 30 pages Cheng Qian http://arxiv.org/abs/2607.13314v1 Tabular Foundation Models for Discrete Choice Estimation 2026-07-14T22:34:00Z

Tabular foundation models (TFMs) generate predictions on structured data via in-context learning, without task-specific estimation. We ask whether TFMs can be effectively applied to discrete choice, a central demand estimation framework in marketing and operations, and find that directly applying TFMs yields limited performance. The gap is structural: TFMs assume row-independent observations, whereas discrete choice is inherently set-valued and subject to persistent consumer preference heterogeneity. We propose a reformulation that encodes both choice-set dependence and individual heterogeneity within a row-based learning framework. Evaluated on a yogurt scanner panel, individual-level heterogeneity encoding is the dominant driver of predictive accuracy. The best reformulation outperforms hierarchical Bayesian estimation by 8\% in holdout log-likelihood and 3.6\% in hit rate, running 16 times faster, a practical advantage for large-scale demand estimation. The advantage is largest in the medium-data regime (10--40 purchase occasions per consumer), where parametric Bayesian shrinkage most distorts estimates for atypical consumers. Fine-tuning on population choice data provides additional gains for consumers with shallow purchase histories, where in-context learning has limited individual-specific signal to condition on. These results establish a principled approach for applying foundation models to consumer choice problems more broadly.

2026-07-14T22:34:00Z Liu Liu Dan Zhang http://arxiv.org/abs/2504.14354v3 Global identification of dynamic panel models with interactive effects 2026-07-14T18:49:46Z

We investigate the problem of global identification in dynamic panel models with interactive effects, in the large-N, fixed-T setting. While local identification, typically established via the Jacobian matrix, is well understood, global identification has remained a more elusive and challenging issue. It is commonly believed to be unachievable in this context. However, we demonstrate that the model is, in fact, globally identified for almost all configurations of the factors. Our analysis also covers models with additive fixed effects, including unit-root cases in which previous studies have reported non-identification from differenced moments. We show that, even in these settings, the level covariance structure delivers global identification.

2025-04-19T16:46:26Z Jushan Bai Pablo Mones http://arxiv.org/abs/2601.16274v2 A Nonlinear Target-Factor Model with Attention Mechanism for Mixed-Frequency Data 2026-07-14T13:13:35Z

We propose the Mixed-Panels-Transformer Encoder (MPTE), a framework for estimating factor models in panels with mixed frequencies and nonlinear signals. Classical factor models rely on linear signal extraction and homogeneous sampling frequencies, limiting their use when variables arrive at different frequencies. MPTE instead uses Transformer-style attention to construct context-aware signals, replacing fixed linear combinations with adaptive reweighting. We extend principal component analysis to accommodate general temporal and cross-sectional attention operators, so the model learns to aggregate information across frequencies without manual alignment. Under linear activations, we establish consistency and asymptotic normality of factor and loading estimators, show that the framework nests classical factor models as a special case, and obtain efficiency gains through transfer learning across auxiliary panels. A Transformer architecture handles the nonlinear case, which we assess through simulations and an empirical application. In simulations, MPTE outperforms linear benchmarks under nonlinear designs. On 13 quarterly U.S. macroeconomic targets drawn from 48 monthly and quarterly FRED series, it remains competitive with established benchmarks. By averaging learned attention across variables and time, we recover target-specific variable importance and lag relevance, and ablations quantify the contribution of each model component.

2026-01-22T19:11:48Z Alessio Brini Ekaterina Seregina http://arxiv.org/abs/2607.12629v1 Bivariate Isotonic Regression by Dynamic Programming 2026-07-14T11:07:17Z

This article extends the dynamic programming framework introduced by (Rote, 2019) from the univariate to the bivariate isotonic problem, using an anti-diagonal traversal procedure. The proposed algorithm is applied to the well-known baseball data set that describes the association of salary with a collection of player properties, including the number of runs batted and hits. The new algorithm is relevant in the sense that dynamic programming has a wide range of applications in economics, such as the savings problem, economic growth, job search, business cycles, oligopoly equilibrium, recursive contracts, and forecasting.

2026-07-14T11:07:17Z Pedro Afonso Fernandes http://arxiv.org/abs/2607.12622v1 Orthogonal Integrated Conditional Moment Tests for Treatment Effect Heterogeneity 2026-07-14T10:59:54Z

We propose a nonparametric integrated conditional moment (ICM) test for treatment effect heterogeneity across subpopulations defined by a given covariate subvector. Under unconfoundedness, the null is recast as a conditional moment restriction based on a Neyman-orthogonal score, which reduces the first-order sensitivity of the empirical process to nuisance parameter estimation. The test statistics are constructed as continuous functionals of a marked empirical process. We establish a uniform feasible-to-oracle approximation and derive the asymptotic properties of these test statistics under the null and fixed alternatives. We further show that the test has nontrivial power against local alternatives converging to the null at the $n^{-1/2}$ rate, and develop an easy-to-implement multiplier bootstrap for feasible inference. We also develop extensions to tests of parametric CATE specifications and to settings with endogenous treatment and a binary instrument. Finally, we apply the proposed testing approach to study whether the effect of maternal smoking during pregnancy on infant birth weight varies with maternal age.

2026-07-14T10:59:54Z 102 pages, including an online appendix; 2 figures and 12 tables Haokun Lu Xiaojun Song http://arxiv.org/abs/2607.12568v1 Interpreting (and testing) factor loadings 2026-07-14T09:39:14Z

Dynamic Factor Models (DFMs) are popular to reduce dimensionality being customary in the empirical analysis of large systems of macroeconomic and/or financial variables. In this context, the common underlying factors and their loadings are often extracted using Principal Components (PC), which are consistent and asymptotically normal under very general conditions. Consequently, inference on the factor loadings, which is crucial for the correct interpretation of the underlying factors, is often based on their asymptotic distribution with the limit covariance matrix of the loadings consistently estimated using HAC estimators. In this paper, we analyse the performance of the finite sample asymptotic approximation when constructing confidence intervals and testing about estimated PC loadings. We show that this approximation is seriously affected when the cross-sectional dimension is not large enough. We propose using HAR inference and a subsampling procedure to correct the MSE of the loadings to take into account the uncertainty associated with the estimation of the covariance matrix and of the factors, respectively. The relevance of the results is illustrated in an empirical analysis of economic convergence among the US states.

2026-07-14T09:39:14Z A. Montañés E. Ruiz http://arxiv.org/abs/2607.12299v1 Q-SCM: A Quantum-Sequential Choice Model for Driver Mental State Evolution 2026-07-14T03:16:48Z

We propose a Quantum-Sequential Choice Model (Q-SCM) for modelling driver mental state evolution in interactive traffic environments. The proposed framework retains the classical latent class choice structure, but replaces the conventional class membership formulation with a quantum cognitive state model. A unique feature of this model is that the quantum component is confined to the class membership layer, while the action choice layer remains a classical RUM. The driver's latent state is represented as a two-state quantum system on the Bloch sphere including neutral and defensive states. Perceptual cues, including separation distance, closing time-to-collision (CTTC), and lane deviation induce sequential unitary rotations governed by Pauli matrices. This formulation allows the model to capture memory, phase effects, cue order dependence, and transitions between behavioural regimes that depend on prior cue history. To ensure well-behaved state evolution, we introduce three control mechanisms: a monotonicity constraint that prevents pendulum-like overshoot, a geodesic safeguard mechanism that ensures convergence toward the defensive state under sustained threat exposure, and a relaxation step that allows recovery toward the neutral baseline when the threat weakens. The model is estimated using 85,754 observations from 9,610 drivers extracted from naturalistic trajectories. The empirical results show that defensive state formation is not governed only by the instantaneous values of traffic cues, but also by the accumulated cue history and the order in which cues are processed.

2026-07-14T03:16:48Z Rulla Al-Haideri Bilal Farooq Karim Ismail http://arxiv.org/abs/2605.18138v2 Bayesian State-Space Modeling and Model-Based Counterfactual Analysis of Dynamic Income Distributions from Grouped Data 2026-07-14T00:42:06Z

Grouped income data contain only limited information about the evolution of income distributions over time. This paper develops a Bayesian state-space model for the generalized beta distribution of the second kind (GB2) to estimate dynamic income distributions using repeated grouped income data. By borrowing information across adjacent periods through the latent GB2 parameters, the proposed framework improves estimation precision relative to independent cross-sectional estimation. Building on the estimated latent-state dynamics, we further construct a model-based counterfactual framework that quantifies the contribution of demographic covariates while preserving the estimated evolution of the remaining model components. Using Japanese household income data from 1969--2007, we find that population aging and declining household size affect different parts of the income distribution through distinct channels, with population aging becoming an increasingly important driver of income inequality after around 2000. More generally, the proposed framework provides a unified Bayesian approach to dynamic distributional analysis and model-based counterfactual inference using repeated grouped income data.

2026-05-18T09:46:21Z Kazuhiko Kakamu http://arxiv.org/abs/2607.12219v1 Partial Identification with Multiple Nonlinear Measurements of a Latent Regressor 2026-07-13T23:33:54Z

We study linear regression when the regressor is latent and observed only through multiple noisy measurements, each a smooth but possibly nonlinear function of the latent variable. The problem is acute in the measurement of occupational exposure to artificial intelligence, where competing scores yield downstream estimates that differ by a factor of eleven. A regression on any single measurement recovers a source-specific coefficient rather than the structural one. We fix the latent scale by requiring the consensus measurement function to be linear and bound the remaining curvature heterogeneity across sources relative to slope. Under this bound, the structural coefficient lies in a closed-form interval centered at a symmetric cross-source estimator. The interval is invariant to unknown source loadings, and its half-width is second order in the curvature bound and sharp to the same order. With at least four measurements, the bound is estimable from the joint distribution of the sources through a split-instrument auxiliary regression, and Imbens-Manski confidence intervals with the Stoye critical value attain uniform coverage over the curvature class, including at the point-identified boundary. The application matches six exposure measures to an American Community Survey panel of 8.88 million person-year observations for 2015 to 2024. The post-2022 employment coefficient changes sign between the language-model measures and the Webb patent-text measure, and an ex ante factor-analytic rule separates the Webb measure as a distinct construct. The five retained sources yield a loading-invariant consensus coefficient of -0.239, with a partial-identification half-width of 1.23 percent of the point estimate, or 1.88 percent at the one-sided 95 percent upper bound on the curvature. We read the application as measurement reconciliation rather than as a causal estimate of AI displacement.

2026-07-13T23:33:54Z Burhan Ogut Michelle Yin http://arxiv.org/abs/2405.07860v5 Order-Explicit Linearization of High-Dimensional $U$-Statistics 2026-07-13T19:52:34Z

We give an order-explicit large deviation bound for the difference between a high-dimensional $U$-statistic and its Hájek projection. In particular, we show that any $U$-statistic of order $b$ on $n$ observations, with a $d$-dimensional kernel whose coordinates have $ψ_1$-Orlicz norm at most $φ$, has a maximum deviation from its Hájek projection of order $O_p(φb n^{-1}\log^2(dn))$. The proof relies on the development of novel order-explicit moment inequalities for higher-order Hoeffding components. We show that this rate is unimprovable, up to the polynomial factor on the logarithmic term. As corollaries, we obtain new Bernstein-type concentration and Gaussian approximation results for high-dimensional $U$-statistics. We apply these results to establish the consistency of a set of resampling-based simultaneous confidence intervals built around a class of nonparametric regression estimators constructed with subsampled kernels. This class encompasses several forms of random forest regression, including Generalized Random Forests.

2024-05-13T15:46:11Z David M. Ritzwoller Vasilis Syrgkanis http://arxiv.org/abs/2607.11694v1 Calibrated Horizon-Weighted Local Projection Designs for Markov Switchbacks 2026-07-13T15:27:48Z

We study temporal assignment design for Markov switchback experiments when the reported object is a dynamic local-projection target. We develop a calibrated selector that chooses the feasible persistence minimizing the covariance, HAC, residual-bootstrap, or realized-schedule risk of the estimator and reporting object specified before the experiment. A balanced homoskedastic Markov benchmark yields a closed form because the lagged-assignment information matrix is AR(1)-Toeplitz with a tridiagonal inverse. The benchmark maps local-projection reporting weights into persistence recommendations within a prespecified first-order Markov class. Field recommendations replace the benchmark covariance with residualized, serially dependent, pilot-calibrated, or randomization-based risk. A semi-synthetic Low Carbon London evaluation uses observed half-hourly baseline dynamics and known injected responses to assess design risk. It evaluates the covariance calculations under realistic load autocovariance and identifies when calibrated covariance selection should replace the homoskedastic Markov formula. Near-boundary designs use randomization-first inference when many-spell normal approximations are unsupported.

2026-07-13T15:27:48Z Makoto Nakakita Teruo Nakatsuma http://arxiv.org/abs/2405.16547v2 Estimating Dyadic Treatment Effects with Unknown Confounders 2026-07-13T12:17:42Z

This paper proposes estimation and inference methods for assessing treatment effects with dyadic data. Under the assumption that the treatments follow an exchangeable distribution, our approach allows for the presence of any unobserved confounding factors that potentially cause endogeneity of treatment choice without requiring additional information other than the treatments and outcomes. Building on the literature of graphon estimation in network data analysis, we propose a neighbourhood kernel smoothing method for estimating dyadic average treatment effects, and derive the rate of convergence of the proposed estimator under certain regularity conditions. We also develop conformal inference methods for predicting outcomes conditional on treatment status. We apply our methods to international trade data to assess the impact of free trade agreements on bilateral trade flows.

2024-05-26T12:32:14Z Tadao Hoshino Takahide Yanagi http://arxiv.org/abs/2312.01162v4 High-dimensional inference on jumps in nonparametric time series regression models 2026-07-13T10:47:38Z

We study simultaneous inference on jumps in the conditional mean functions of a high-dimensional collection of heterogeneous nonparametric time series, where the number of series may exceed the sample size and the data may exhibit strong cross-sectional dependence. The jump depends on one specific covariate, and we allow the regression function to vary with additional latent variables. We propose two uniform tests: one for the existence of jumps and one for their homogeneity across series. We derive a simple closed-form approximation to the covariance structure of the jump estimators and establish a high-dimensional Gaussian approximation showing that, owing to the localized construction of the statistics, the maximum of the studentized jumps is approximated by the maximum of independent Gaussians. The cross-sectional dependence is thus asymptotically negligible for critical values, even under strong (e.g., factor) dependence, and the approximation requires estimating only the variance for each series. For pronounced cross-sectional dependence, a dependence-aware refinement restores the off-diagonal covariances, improving finite-sample size and power. Simulations show accurate size and reasonable power under both cross-sectional and serial dependence, and two empirical applications reveal significant non-smooth effects.

2023-12-02T15:52:24Z Likai Chen Georg Keilbar Liangjun Su Weining Wang