Remerge: Regression-based record linkage with an application to PATSTAT
Record linkage algorithms typically find matches by comparing records on the fields they share. However, PATSTAT shares very little information with company databases. We introduce REMERGE: a flexible, open-source algorithm that allows PATSTAT, the worldwide patent database, to be linked with company databases without limiting the comparisons to the shared fields. The results of this matching application can be used to improve research into the economics of innovation, and the algorithm could be adapted to similar problems. We describe the algorithm and report its coverage by country and by sector, its performance measures, and suggestions for future research. We also show results from an additional application of REMERGE to the European Commission's Tenders Electronic Daily database.
Year of publication: | 2014 |
---|---|
Authors: | Peruzzi, Michele ; Zachmann, Georg ; Veugelers, Reinhilde |
Publisher: | Brussels : Bruegel |
freely available
Series: | Bruegel Working Paper ; 2014/10iii |
---|---|
Type of publication: | Book / Working Paper |
Type of publication (narrower categories): | Working Paper |
Language: | English |
Other identifiers: | 798194707 [GVK] hdl:10419/126717 [Handle] |
Persistent link: https://www.econbiz.de/10011420987
Similar items by person
- Remerge : regression-based record linkage with an application to PATSTAT
  Peruzzi, Michele, (2014)
- Remerge: regression-based record linkage with an application to PATSTAT
  Peruzzi, Michele, (2014)
- When and how to support renewables? Letting the data speak
  Zachmann, Georg, (2014)