Similar Search Results

No Ground Truth? : No Problem : Improving Administrative Data Linking Using Active Learning and a Little Bit of Guile

Tahamont, Sarah; Jelveh, Zubin; McNeill, Melissa; Yan, Shi - National Bureau of Economic Research - 2023

While linking records across large administrative datasets ["big data"] has the potential to revolutionize empirical social science research, many administrative data files do not have common identifiers and are thus not designed to be linked to others. To address this problem, researchers have...

Persistent link: https://www.econbiz.de/10014250118

The ABCs of Charitable Solicitation

Meer, Jonathan - 2009

A substantial experimental literature suggests that a personal solicitation is an effective way to induce people to make charitable donations. We examine whether this result generalizes to a non-experimental setting. Specifically, we estimate the effect of a marginal personal solicitation using...

Persistent link: https://www.econbiz.de/10012463612

Certified Random : A New Order for Co-Authorship

Ray, Debraj - 2016

In economics, alphabetical name order is the baseline norm for joint publications. A growing literature suggests, however, that alphabetical order confers uneven benefits on the first author. This paper introduces and studies certified random order, which involves randomization of names that is...

Persistent link: https://www.econbiz.de/10012456077

Technology and Big Data Are Changing Economics : Mining Text to Track Methods

Currie, Janet - 2020

The last 40 years have seen huge innovations in computing technology and data availability. Data derived from millions of administrative records or by using (as we do) new methods of data generation such as text mining are now common. New data often requires new methods, which in turn can...

Persistent link: https://www.econbiz.de/10012479239

Big Data and Firm Dynamics

Farboodi, Maryam - 2019

We study a model where firms accumulate data as a valuable intangible asset. Data accumulation affects firms' dynamics. It increases the skewness of the firm size distribution as large firms generate more data and invest more in active experimentation. On the other hand, small data- savvy firms...

Persistent link: https://www.econbiz.de/10012479471

The Promise and Pitfalls of Conflict Prediction : Evidence from Colombia and Indonesia

Bazzi, Samuel - 2019

Policymakers can take actions to prevent local conflict before it begins, if such violence can be accurately predicted. We examine the two countries with the richest available sub-national data: Colombia and Indonesia. We assemble two decades of fine-grained violence data by type, alongside...

Persistent link: https://www.econbiz.de/10012479929

Combining Family History and Machine Learning to Link Historical Records

Price, Joseph - 2019

A key challenge for research on many questions in the social sciences is that it is difficult to link historical records in a way that allows investigators to observe people at different points in their life or across generations. In this paper, we develop a new approach that relies on millions...

Persistent link: https://www.econbiz.de/10012480171

Prices and Promotions in U.S. Retail Markets : Evidence from Big Data

Hitsch, Günter J. - 2019

We document the degree of price dispersion and the similarities as well as differences in pricing and promotion strategies across stores in the U.S. retail (grocery) industry. Our analysis is based on "big data" that allow us to draw general conclusions based on the prices for close to 50,000...

Persistent link: https://www.econbiz.de/10012480251

Text Selection

Kelly, Bryan T. - 2019

Text data is ultra-high dimensional, which makes machine learning techniques indispensable for textual analysis. Text is often selected--journalists, speechwriters, and others craft messages to target their audiences' limited attention. We develop an economically motivated high dimensional...

Persistent link: https://www.econbiz.de/10012480461

Market Efficiency in the Age of Big Data

Martin, Ian - 2019

Modern investors face a high-dimensional prediction problem: thousands of observable variables are potentially relevant for forecasting. We reassess the conventional wisdom on market efficiency in light of this fact. In our model economy, which resembles a typical machine learning setting, N...

Persistent link: https://www.econbiz.de/10012480530