Huberty, Mark; Serwaah, Amma; Zachmann, Georg - Bruegel - 2014
The inventors in PATSTAT are often duplicates: the same person or company may be split into multiple entries in PATSTAT, each associated to different patents. In this paper, we address this problem with an algorithm that efficiently de-duplicates the data. It needs minimal manual input and works...