4.7 Article

Can't see the forest for the leaves: Similarity and distance measures for hierarchical taxonomies with a patent classification example

Journal

RESEARCH POLICY
Volume 42, Issue 4, Pages 855-873

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.respol.2013.01.006

Keywords

Technological space; Patent data; Jaffe's distance measure; Classification; Taxonomy; Hierarchy; Similarity; Distance; Overlap; Knowledge flows; Knowledge recombination

Categories

Ask authors/readers for more resources

Current measures of technological distance or similarity typically ignore a great deal of the information contained within the classification taxonomies upon which they are based. In this paper, I introduce two modifications which enable various common management research methods to fully make use of hierarchical classification data. Although the methods presented have broad applications, an extensive example exploring various measures of technological similarity based on the USPTO patent classification system is included. In addition to the general benefit of allowing the use of taxonomical data and thus more accurately reflecting the theoretical complexity of any underlying phenomenon, this methodology offers a number of other benefits specific to the patent context. These include the ability to model technological space within fields, since it correctly uses USPTO subclass level data, as well as the ability to accurately analyze similarity at the patent-to-patent dyadic level, since it calculates conceptual overlaps at the lowest level of the classification taxonomy. I explore the performance of these methods in two research contexts: (1) a patent-to-patent level sample within a single technological field and (2) an organization-to-organization level sample across industries. The results show that taxonomical methods generate more meaningful distributions of similarity scores within both samples and that similarity scores calculated via taxonomical methods have a more consistent relationship with citation likelihood and number of citations. Suggestions are provided for variants of this methodology for various technological and industrial classification systems and implications are drawn for future research in a wide range of domains. (C) 2013 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available