SQL query to increase data accuracy and completeness in PATSTAT
Abstract
PATSTAT is the worldwide patent statistical database created and maintained by the European Patent Office. Many methods and techniques have been developed to increase its accuracy and completeness. This paper contributes to this body of research. It proposes an allocation procedure which reduces by 44% the number of empty entries concerning the residence country of patentees, and, at the same time, it increases by 22% the accuracy of country code allocation. The procedure consists of a replicable SQL query to be run in PATSTAT. An application of this procedure illustrates that patent analyses based on raw data underestimate the role of China and Japan in the area of climate change mitigation technologies.
- Publication:
-
World Patent Information
- Pub Date:
- June 2019
- DOI:
- 10.1016/j.wpi.2019.02.001
- Bibcode:
- 2019WPatI..57....1P
- Keywords:
-
- PATSTAT;
- Data accuracy;
- Data completeness;
- Patent data;
- Data cleaning;
- Climate change mitigation technology