We use textual analysis of high-dimensional data from patent documents to create new indicators of technological innovation. We identify significant patents based on textual similarity of a given patent to previous and subsequent work: these patents are distinct from previous work but are...