Showing 1 - 10 of 22
While linking records across large administrative datasets ["big data"] has the potential to revolutionize empirical social science research, many administrative data files do not have common identifiers and are thus not designed to be linked to others. To address this problem, researchers have...
Persistent link: https://www.econbiz.de/10014250118
A substantial experimental literature suggests that a personal solicitation is an effective way to induce people to make charitable donations. We examine whether this result generalizes to a non-experimental setting. Specifically, we estimate the effect of a marginal personal solicitation using...
Persistent link: https://www.econbiz.de/10012463612
In economics, alphabetical name order is the baseline norm for joint publications. A growing literature suggests, however, that alphabetical order confers uneven benefits on the first author. This paper introduces and studies certified random order, which involves randomization of names that is...
Persistent link: https://www.econbiz.de/10012456077
The last 40 years have seen huge innovations in computing technology and data availability. Data derived from millions of administrative records or by using (as we do) new methods of data generation such as text mining are now common. New data often requires new methods, which in turn can...
Persistent link: https://www.econbiz.de/10012479239
We study a model where firms accumulate data as a valuable intangible asset. Data accumulation affects firms' dynamics. It increases the skewness of the firm size distribution as large firms generate more data and invest more in active experimentation. On the other hand, small data- savvy firms...
Persistent link: https://www.econbiz.de/10012479471
Policymakers can take actions to prevent local conflict before it begins, if such violence can be accurately predicted. We examine the two countries with the richest available sub-national data: Colombia and Indonesia. We assemble two decades of fine-grained violence data by type, alongside...
Persistent link: https://www.econbiz.de/10012479929
A key challenge for research on many questions in the social sciences is that it is difficult to link historical records in a way that allows investigators to observe people at different points in their life or across generations. In this paper, we develop a new approach that relies on millions...
Persistent link: https://www.econbiz.de/10012480171
We document the degree of price dispersion and the similarities as well as differences in pricing and promotion strategies across stores in the U.S. retail (grocery) industry. Our analysis is based on "big data" that allow us to draw general conclusions based on the prices for close to 50,000...
Persistent link: https://www.econbiz.de/10012480251
Text data is ultra-high dimensional, which makes machine learning techniques indispensable for textual analysis. Text is often selected--journalists, speechwriters, and others craft messages to target their audiences' limited attention. We develop an economically motivated high dimensional...
Persistent link: https://www.econbiz.de/10012480461
Modern investors face a high-dimensional prediction problem: thousands of observable variables are potentially relevant for forecasting. We reassess the conventional wisdom on market efficiency in light of this fact. In our model economy, which resembles a typical machine learning setting, N...
Persistent link: https://www.econbiz.de/10012480530