https://arxiv.org/api/FTP53nunaQO4aB+EQ8cCcYBYRnU 2026-03-20T12:25:05Z 5183 30 15 http://arxiv.org/abs/2603.13823v1 Enhancing the Accuracy of Regional Input-Output Table Estimation: A Deep Learning Approach 2026-03-14T08:12:29Z Non-survey methods have been developed and applied for estimating regional input-output tables. However, there is an ongoing debate about the assumptions necessary for these methods and their accuracy. To address these issues, this study presents a deep learning method for estimating regional input-output tables. First, the quantitative economic data for regions is augmented by linear combinations. Then, deep learning is performed on each item in the input-output table, treating these items as target variables. Finally, regional input-output tables are estimated through matrix balancing to the predicted values from the trained model. The estimation accuracy of this method is verified using the 2015 input-output table for Japan as a benchmark. Compared to matrix balancing under the ideal assumption of known row and column sums, our method generally demonstrates higher estimation accuracy. Thus, this method is anticipated to provide a foundation for deriving more precise estimates of regional input-output tables. 2026-03-14T08:12:29Z 34 pages, 10 figures, 12 tables Shogo Fukui http://arxiv.org/abs/2603.13766v1 Estimating Earth's Temperature Response with Transformed and Augmented OLS 2026-03-14T05:33:16Z The long-term relationship between radiative forcing and surface temperature is imperative for predicting the impacts of climate change. This study employs multicointegration to characterize this relationship and uses Transformed and Augmented Ordinary Least Squares (TAOLS) to estimate the model. The main goal is to estimate the Equilibrium Climate Sensitivity (ECS), defined as the global mean surface air temperature increase following a doubling of atmospheric carbon dioxide. 
Our results show that the ECS lies between $2.12^{\circ}$C and $2.49^{\circ}$C, which is lower than the existing maximum likelihood estimate of $2.8^{\circ}$C. TAOLS offers a more robust and accessible tool for climate research, providing novel insights for ongoing debates about Earth's warming trajectory. 2026-03-14T05:33:16Z 13 pages, 6 figures Justin Sun http://arxiv.org/abs/2603.13505v1 Testing the Exclusion Restriction in IV Models Using Non-Gaussianity: A LiNGAM-Based Approach 2026-03-13T18:35:39Z Instrumental variable (IV) methods rely critically on the exclusion restriction, which is untestable in exactly-identified models under standard assumptions. We propose a framework combining IV analysis with the LiNGAM method to test this restriction by exploiting non-Gaussianity in the data. Under non-Gaussian structural errors, the exclusion violation parameter is point-identified without additional instruments. Five complementary tests (bootstrap percentile, asymptotic normal, permutation, likelihood ratio, and independence-based) are introduced to assess the restriction under varying data conditions. Monte Carlo simulations and an empirical application to the Card (1995) dataset demonstrate controlled Type I error rates and reasonable power against economically relevant violations. 2026-03-13T18:35:39Z Fernando Delbianco http://arxiv.org/abs/2508.00263v3 Robust Econometrics for Growth-at-Risk 2026-03-13T17:32:35Z The Growth-at-Risk (GaR) framework has garnered attention in recent econometric literature, yet current approaches implicitly assume a constant Pareto exponent. We introduce novel and robust econometrics to estimate the tails of GaR based on a rigorous theoretical framework and establish validity and effectiveness. Simulations demonstrate consistent outperformance relative to existing alternatives in terms of predictive accuracy. 
We perform a long-term GaR analysis that provides accurate and insightful predictions, effectively capturing financial anomalies better than current methods. 2025-08-01T02:10:16Z Tobias Adrian Yuya Sasaki Yulong Wang http://arxiv.org/abs/2507.14389v2 Spatiotemporal Autoregressive Models for Areal Compositional Data 2026-03-13T13:54:35Z Compositional data, such as regional shares of economic sectors or property transactions, are central to understanding structural change in economic systems across space and time. This paper introduces a spatiotemporal multivariate autoregressive model tailored for panel data with composition-valued responses at each areal unit and time point. The proposed framework enables the joint modelling of temporal dynamics and spatial dependence under compositional constraints, and is estimated via a quasi-maximum likelihood approach. We build on recent theoretical advances to establish the identifiability and asymptotic properties of the estimator as both the number of regions and the number of time points grow. The utility and flexibility of the model are demonstrated through two applications: analysing property transaction compositions in an intra-city housing market (Berlin), and regional sectoral compositions in Spain's economy. These case studies highlight how the proposed framework captures key features of spatiotemporal economic processes that are often missed by conventional methods. 2025-07-18T22:31:18Z Matthias Eckardt Philipp Otto http://arxiv.org/abs/2603.12630v1 The Economics of AI Supply Chain Regulation 2026-03-13T04:03:55Z The rise of foundation models has driven the emergence of AI supply chains, where upstream foundation model providers offer fine-tuning and inference services to downstream firms developing domain-specific applications. Downstream firms pay providers to use their computing infrastructure to fine-tune models with proprietary data, creating a co-creation dynamic that enhances model quality. 
Amid concerns that foundation model providers and downstream firms may capture excessive consumer surplus, along with increasing regulatory measures, this study employs a game-theoretic model involving a provider and two competing downstream firms to analyze how policy interventions affect consumer surplus in the AI supply chain. Our analysis shows that policies promoting price competition in downstream markets (i.e., pro-price-competitive policies) boost consumer surplus only when compute or data preprocessing costs are high, while compute subsidies are effective only when these costs are low, suggesting these policies complement each other. In contrast, policies promoting quality competition in downstream markets (i.e., pro-quality-competitive policies) always improve consumer surplus. We also find that under pro-price-competitive policies or compute subsidies, both the provider and downstream firms can achieve higher profits along with greater consumer surplus, creating a win-win-win outcome. However, pro-quality-competitive policies increase the provider's profits while reducing those of downstream firms. Finally, as compute costs decline, pro-price-competitive policies may lose their effectiveness, whereas compute subsidies may shift from ineffective to effective. These findings offer insights for policymakers seeking to foster AI supply chains that are economically efficient and socially beneficial. 
2026-03-13T04:03:55Z An earlier version of this paper, titled "The Economics of Fine-Tuning for Large-Scale AI Models," was presented at WISE 2023, where it won the Best Student Paper Award Sihan Qian Amit Mehra Dengpan Liu http://arxiv.org/abs/2512.24096v2 Evaluating Counterfactual Policies Using Instruments 2026-03-13T02:15:27Z We study settings in which a researcher has an instrumental variable (IV) and seeks to evaluate the effects of a counterfactual policy that alters treatment assignment, such as a directive encouraging randomly assigned judges to release more defendants. We develop a general and computationally tractable framework for computing sharp bounds on the effects of such policies. Our approach does not require the often tenuous IV monotonicity assumption. Moreover, for an important class of policy exercises, we show that IV monotonicity -- while crucial for a causal interpretation of two-stage least squares -- does not tighten the bounds on the counterfactual policy impact. We analyze the identifying power of alternative restrictions, including the policy invariance assumption used in the marginal treatment effect literature, and develop a relaxation of this assumption. We illustrate our framework using applications to quasi-random assignment of bail judges in New York City and prosecutors in Massachusetts. 2025-12-30T09:12:56Z 68 pages, including all appendices Michal Kolesár José Luis Montiel Olea Jonathan Roth http://arxiv.org/abs/2603.12536v1 Heterogeneous Elasticities, Aggregation, and Retransformation Bias 2026-03-13T00:34:54Z Economists often interpret estimates from linear regressions with log dependent variables as elasticities. However, the coefficients from log-log regressions estimate the elasticity of the geometric mean of $y_i|x_i$, not the arithmetic mean. The unbounded difference between the two is known as retransformation bias and can take either sign. 
We develop a specification-robust debiased estimator of the average arithmetic elasticity and re-estimate 50 results from top 5 papers published in 2020. We find that 19 are significantly different, with the median absolute difference being 65% of the OLS elasticity estimate. Furthermore, we show standard instrumental variables assumptions with log dependent variables do not identify the elasticity. We specify a control function approach and re-estimate papers that use 2SLS with log dependent variables. We find that 13 of 19 results from top 5 papers are significantly different between the two approaches. Retransformation bias arises as a result of heterogeneous responses. The geometric mean elasticity corresponds to the average response. Arithmetic and geometric means are elements of the power mean family. We show power mean elasticities are sufficient statistics for a common class of decision problems. 2026-03-13T00:34:54Z Ellen Munroe Alexander Newton Meet Shah http://arxiv.org/abs/2303.07287v3 Tight Non-asymptotic Inference via Sub-Gaussian Intrinsic Moment Norm 2026-03-13T00:13:19Z In non-asymptotic learning, variance-type parameters of sub-Gaussian distributions are of paramount importance. However, directly estimating these parameters using the empirical moment generating function (MGF) is infeasible. To address this, we suggest using the sub-Gaussian intrinsic moment norm [Buldygin and Kozachenko (2000), Theorem 1.3] achieved by maximizing a sequence of normalized moments. Significantly, the suggested norm can not only reconstruct the exponential moment bounds of MGFs but also provide tighter sub-Gaussian concentration inequalities. In practice, we provide an intuitive method for assessing whether data with a finite sample size is sub-Gaussian, utilizing the sub-Gaussian plot. The intrinsic moment norm can be robustly estimated via a simple plug-in approach. 
Our theoretical findings are also applicable to reinforcement learning, including the multi-armed bandit scenario. 2023-03-13T17:03:19Z This manuscript has been withdrawn by the authors as it is not yet ready for public release. Further improvements and revisions are required before a final version can be considered for distribution Huiming Zhang Haoyu Wei Guang Cheng http://arxiv.org/abs/2603.12374v1 The Privacy-Utility Trade-Off of Location Tracking in Ad Personalization 2026-03-12T18:52:18Z Firms collect vast amounts of behavioral and geographical data on individuals. While behavioral data captures an individual's digital footprint, geographical data reflects their physical footprint. Given the significant privacy risks associated with combining these data sources, it is crucial to understand their respective value and whether they act as complements or substitutes in achieving firms' business objectives. In this paper, we combine economic theory, machine learning, and causal inference to quantify the value of geographical data, the extent to which behavioral data can substitute for it, and the mechanisms through which it benefits firms. Using data from a leading in-app advertising platform in a large Asian country, we document that geographical data is most valuable in the early cold-start stage, when behavioral histories are limited. In this stage, geographical data complements behavioral data, improving targeting performance by almost 20%. As users accumulate richer behavioral histories, however, the role of geographical data shifts: it becomes largely substitutable, as behavioral data alone captures the relevant heterogeneity. These results highlight a central privacy-utility trade-off in ad personalization and inform managerial decisions about when location tracking creates value. 2026-03-12T18:52:18Z 57 pages, 11 figures. 
Digital advertising, causal inference, and machine learning Mohammad Mosaffa Omid Rafieian http://arxiv.org/abs/2510.07204v2 Beyond the Oracle Property: Adaptive LASSO in Cointegrating Regressions with Local-to-Unity Regressors 2026-03-12T16:17:31Z This paper derives new asymptotic results for the adaptive LASSO estimator in cointegrating regressions, allowing for uncertainty about whether the regressors are exact unit root processes. We study model selection probabilities, estimator consistency, and limiting distributions under standard and moving-parameter asymptotics. We further derive uniform convergence rates and the fastest local-to-zero rates detectable by the estimator under conservative and consistent tuning. For consistent tuning, we construct confidence regions that are easy to implement, uniformly valid over the parameter space, and achieve sure asymptotic coverage without requiring knowledge or estimation of local-to-unity or long-run covariance parameters. Simulation results reveal that the finite-sample distribution of the adaptive LASSO estimator can deviate substantially from the oracle property, whereas moving-parameter asymptotics provide much more accurate approximations. Consequently, in addition to being infeasible in applications due to their dependence on non-estimable nuisance parameters, oracle-based confidence regions are often too small to achieve adequate coverage in empirically relevant scenarios with small but non-zero coefficients. In contrast, the proposed confidence regions are always feasible and deliver reliable coverage across the parameter space. An empirical application to predicting the U.S. unemployment rate illustrates their practical usefulness for quantifying uncertainty around adaptive LASSO estimates. 
2025-10-08T16:38:30Z Karsten Reichold Ulrike Schneider http://arxiv.org/abs/2406.08880v4 Jackknife inference with two-way clustering 2026-03-12T10:24:50Z For linear regression models with cross-section or panel data, it is natural to assume that the disturbances are clustered in two dimensions. However, the finite-sample properties of two-way cluster-robust tests and confidence intervals are often poor. We discuss several ways to improve inference with two-way clustering. Two of these are existing methods for avoiding, or at least ameliorating, the problem of undefined standard errors when a cluster-robust variance matrix estimator (CRVE) is not positive definite. One is a new method that always avoids the problem. More importantly, we propose a family of new two-way CRVEs based on the cluster jackknife and prove that they yield valid inferences asymptotically. Simulations for models with two-way fixed effects suggest that, in many cases, the cluster-jackknife CRVE combined with our new method yields surprisingly accurate inferences. We provide a software package, twowayjack for Stata, that implements our recommended variance estimator. 2024-06-13T07:31:46Z James G. MacKinnon Morten Ørregaard Nielsen Matthew D. Webb http://arxiv.org/abs/2310.07151v4 Identification and Estimation of a Semiparametric Logit Model using Network Data 2026-03-12T04:01:43Z This paper studies identification and estimation in semiparametric logit models when social networks are endogenous. In many applications, unobserved individual traits shape both the outcome of interest and the formation of social ties, so standard logit specifications, including those augmented with common network controls, can be biased. I show how network data can be used to address this endogeneity without imposing a parametric structure on the link formation process. 
Although the outcome equation is semiparametric in this social component and the network formation process is left unspecified, the logistic distribution assumption is crucial for identification. I show that slope parameters are point identified by pairwise comparisons of agents who share identical network formation behavior. I propose feasible estimators based on matching agents using network similarity measures and establish their consistency and asymptotic normality. Monte Carlo simulations demonstrate good finite-sample performance, and an empirical application to microfinance adoption demonstrates that accounting for endogenous network formation materially affects estimated covariate effects. 2023-10-11T02:54:31Z Brice Romuald Gueyap Kounga http://arxiv.org/abs/2603.11497v1 Variance Estimation with Dependence and Heterogeneous Means 2026-03-12T03:34:57Z This paper considers the problem of estimating the variance of a sum of a triangular array of random vectors with heterogeneous means. When random vectors exhibit two-way cluster dependence or weak dependence, standard variance estimators designed under homogeneous means can underestimate the true variance, which results in subsequent tests being oversized. To restore validity, this paper proposes a simple conservative variance estimator robust to heterogeneous means and shows its asymptotic validity. 2026-03-12T03:34:57Z Luther Yap http://arxiv.org/abs/2603.11457v1 Bayesian Modular Inference for Copula Models with Potentially Misspecified Marginals 2026-03-12T02:33:15Z Copula models of multivariate data are popular because they allow separate specification of marginal distributions and the copula function. These components can be treated as inter-related modules in a modified Bayesian inference approach called "cutting feedback" that is robust to their misspecification. 
Recent work uses a two-module approach, where all $d$ marginals form a single module, to robustify inference for the marginals against copula function misspecification, or vice versa. However, marginals can exhibit differing levels of misspecification, making it attractive to assign each its own module with an individual influence parameter controlling its contribution to a joint semi-modular inference (SMI) posterior. This generalizes existing two-module SMI methods, which interpolate between cut and conventional posteriors using a single influence parameter. We develop a novel copula SMI method and select the influence parameters using Bayesian optimization. It provides an efficient continuous relaxation of the discrete optimization problem over $2^d$ cut/uncut configurations. We establish theoretical properties of the resulting semi-modular posterior and demonstrate the approach on simulated and real data. The real data application uses a skew-normal copula model of asymmetric dependence between equity volatility and bond yields, where robustifying copula estimation against marginal misspecification is strongly motivated. 2026-03-12T02:33:15Z Lucas Kock David T. Frazier Michael Stanley Smith David J. Nott