Showing 211 - 220 of 10,690
Outlier detection in high-dimensional datasets poses new challenges that have not been investigated in the literature. In this paper, we present an integrated methodology for the identification of outliers which is suitable for datasets with higher number of variables than observations. Our...
Persistent link: https://www.econbiz.de/10011881086
Survey error is known to be pervasive and to bias even simple, but important estimates of means, rates, and totals, such as poverty statistics and the unemployment rate. To summarize and analyze the extent, sources, and consequences of survey error, we define empirical counterparts of key...
Persistent link: https://www.econbiz.de/10011979179
We describe methods of combining administrative and survey data to improve the measurement of income. We begin by decomposing the total survey error in the mean of survey reports of dollars received from a government transfer program. We decompose this error into three parts, generalized...
Persistent link: https://www.econbiz.de/10011997525
The study of the innovative output of organizations often relies on a count of patents filed at one single office of reference such as the European Patent Office (EPO). Yet, not all organizations file their patents at the EPO, raising the specter of a selection bias. Using novel datasets of the...
Persistent link: https://www.econbiz.de/10010858821
We derive the asymptotic bias from misclassification of the dependent variable in binary choice models. Measurement error is necessarily non-classical in this case, which leads to bias in linear and non-linear models even if only the dependent variable is mismeasured. A Monte Carlo study and an...
Persistent link: https://www.econbiz.de/10010859473
The aim of this paper is to make imputations of earnings to observations with missing earnings in the Encuesta Nacional de Ocupaciones y Empleo (ENOE). We present imputations by two methods and also correction of estimations by reweighting observations with reported earnings. Then, we analyze...
Persistent link: https://www.econbiz.de/10010934475
The study of the innovative output of firms often relies on a count of patents filed at one single office of reference such as the European Patent Office (EPO). Yet, not all firms file their patents at the EPO, raising the specter of a selection bias. Using a novel dataset of the whole...
Persistent link: https://www.econbiz.de/10010957647
When producing anonymised microdata for research, national statistics institutes (NSIs) identify a number of 'risk scenarios' of how intruders might seek to attack a confidential dataset. This paper argues that the strategy used to identify confidentiality protection measures can be seriously...
Persistent link: https://www.econbiz.de/10011261260
It is increasingly common in empirical research to merge data sets containing different units of observation. When the units are not nested, a crosswalk specifying how the units from one data source are allocated to the units of the other is needed. Unfortunately, most crosswalks are ad hoc, a...
Persistent link: https://www.econbiz.de/10014582295
The examination of the causal impact of health insurance coverage on healthcare utilisation is a critical endeavour in both academic research and policy formulation. However, this endeavour faces challenges, notably the endogenous selection into coverage and prevalent misreporting of coverage...
Persistent link: https://www.econbiz.de/10014556439