Showing 1 - 10 of 12
As I document using evidence from a journal data repository that I manage, the datasets used in empirical work are getting larger. When we use very large datasets, it can be dangerous to rely on standard methods for statistical inference. In addition, we need to worry about computational issues....
Persistent link: https://www.econbiz.de/10013254704
Cluster-robust inference is widely used in modern empirical work in economics and many other disciplines. The key unit of observation is the cluster. We propose measures of "high-leverage" clusters and "influential" clusters for linear regression models. The measures of leverage and partial...
Persistent link: https://www.econbiz.de/10013254705
Methods for cluster-robust inference are routinely used in economics and many other disciplines. However, it is only recently that theoretical foundations for the use of these methods in many empirically relevant situations have been developed. In this paper, we use these theoretical results to...
Persistent link: https://www.econbiz.de/10012670892
Efficient computational algorithms for bootstrapping linear regression models with clustered data are discussed. For ordinary least squares (OLS) regression, a new algorithm is provided for the pairs cluster bootstrap, along with two algorithms for the wild cluster bootstrap. One of these is a...
Persistent link: https://www.econbiz.de/10012670900
We provide new and computationally attractive methods, based on jackknifing by cluster, to obtain cluster-robust variance matrix estimators (CRVEs) for linear regres- sion models estimated by least squares. These estimators have previously been com- putationally infeasible except for small...
Persistent link: https://www.econbiz.de/10014451087
Inference using difference-in-differences with clustered data requires care. Previous research has shown that, when there are few treated clusters, t-tests based on cluster-robust variance estimators (CRVEs) severely overreject, and different variants of the wild cluster bootstrap can either...
Persistent link: https://www.econbiz.de/10012431053
In many fields of economics, and also in other disciplines, it is hard to justify the assumption that the random error terms in regression models are uncorrelated. It seems more plausible to assume that they are correlated within clusters, such as geographical areas or time periods, but...
Persistent link: https://www.econbiz.de/10012431056
We study two cluster-robust variance estimators (CRVEs) for regression models with clustering in two dimensions and give conditions under which t-statistics based on each of them yield asymptotically valid inferences. In particular, one of the CRVEs requires stronger assumptions about the nature...
Persistent link: https://www.econbiz.de/10012431058
We discuss when and how to deal with possibly clustered errors in linear regression models. Specifically, we discuss situations in which a regression model may plausibly be treated as having error terms that are arbitrarily correlated within known clusters but uncorrelated across them. The...
Persistent link: https://www.econbiz.de/10012431064
Reliable inference with clustered data has received a great deal of attention in recent years. The overwhelming majority of this research assumes that the cluster structure is known. This assumption is very strong, because there are often several possible ways in which a dataset could be...
Persistent link: https://www.econbiz.de/10012431071