The data are freely available below in two compressed (".zip") formats: SAS transport
(.tpt) files and ASCII comma-separated values (.csv) files. The program read_tpt.sas
can be used to convert the .tpt files to native SAS data sets. Lines in the ASCII CSV
files are terminated by the newline character "\n". All values are separated by commas,
and character values are additionally enclosed in double quotes. The compression ratio
for the compressed files is about 75%. The ".zip" files can be uncompressed with WinZip
or PKUNZIP. To check your ability to uncompress these files, download the small file
compress.zip. The SAS ".tpt" files can be converted to other formats using software such
as Stat/Transfer or DBMS/Copy, and can be read directly by Stata using the fdause
command.
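As an illustration of the CSV layout described above (comma-separated values, character values in double quotes, lines ending in "\n"), the following minimal Python sketch reads one CSV member straight out of a ".zip" archive using only the standard library. The archive, the member name sample.csv, and the column names id and name are invented for the example and are not taken from the actual data files.

```python
import csv
import io
import zipfile


def read_zipped_csv(zip_source, member):
    """Return the rows of one CSV member of a zip archive as a list of dicts.

    zip_source may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(zip_source) as archive:
        with archive.open(member) as raw:
            # The files are described as ASCII; newline="" lets csv handle "\n".
            text = io.TextIOWrapper(raw, encoding="ascii", newline="")
            return list(csv.DictReader(text))


# Build a tiny stand-in archive in memory (hypothetical file and columns).
sample = 'id,name\n1,"ACME, INC."\n2,"WIDGET CO"\n'
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
    archive.writestr("sample.csv", sample)
buffer.seek(0)

rows = read_zipped_csv(buffer, "sample.csv")
print(rows[0]["name"])  # the quoted value keeps its embedded comma
```

Because character values are quoted, a comma inside a value does not split the field. All values come back as strings, so numeric columns would need explicit conversion.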
To download files in Internet Explorer, right click on them and select "Save Target
As...".
Internal users can access the data at /home/data/patents
You will need a major database, statistical program, or programming language to use
these files. Most of the datasets are too large to load completely into MS Excel 2000,
which has a maximum of 65,536 observations, though Access can be used to read the
ASCII datafile. View variable descriptions and observations per file in the
"Documentation" column of the table below.
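To check whether a given file would exceed the 65,536-observation limit mentioned above without loading it into memory, one option is to stream through it and count rows. The sketch below is only an illustration: it assumes the first line of the file is a header row, which should be verified against the documentation for each file, and the generated data stands in for a real download.

```python
import csv
import io

EXCEL_2000_MAX_ROWS = 65_536  # limit quoted in the text above


def count_observations(lines):
    """Count data rows in a CSV stream, excluding an assumed header row."""
    reader = csv.reader(lines)
    next(reader, None)  # skip the header line (assumption: one is present)
    return sum(1 for _ in reader)


def fits_in_excel_2000(n_observations):
    """True if the number of observations is within the quoted limit."""
    return n_observations <= EXCEL_2000_MAX_ROWS


# Stand-in data: a one-column file with 70,000 observations.
stream = io.StringIO("id\n" + "".join(f"{i}\n" for i in range(70_000)))
n = count_observations(stream)
print(n, fits_in_excel_2000(n))
```

For a real file, an open text file handle can be passed in place of the in-memory stream.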
U.S. patent information can also be downloaded or purchased from the United States
Patent and Trademark Office, which also has a U.S. to IPC concordance.
To search patents, try Google -> more -> patents or http://www.freepatentsonline.com
For international patent databases, check FIZ Karlsruhe, the British Library (Derwent is
one patent copy service that delivers patents from the British Library), the German
Patent and Trade Mark Office, Espacenet, Micropat, the French Intellectual Property
Institute, the IciMarques database, or the EP-CESPRI database, which follows the lines
of the NBER dataset but covers European Patent Office data.
Many of the sources above were obtained from InfoToday. Derwent has a searchable
patent glossary and a link to a text patent glossary made by The Minerals, Metals &
Materials Society. For principles and sources for patent searching, see the Free Pint
articles by Ron Kamenicki and Stephen Adams.
More recent data can be obtained from the U.S. Patent Office's ftp site.
Updates and changes.
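                                                                    Data -- Pkzipped
Description                                     Documentation       SAS .tpt               ASCII CSV
Overview                                        overview.txt        --
Citations data                                  Cite75_99.txt
Patent data                                     pat63_99.txt        pat63_99.zip (90Mb)    apat63_99.zip (56Mb)
Assignee names                                  coname.txt          coname.zip (2Mb)       aconame.zip (2Mb)
Match to CUSIP numbers                          match.txt           match.zip (130Kb)      amatch.zip (98Kb)
Individual inventor records                     inventor.txt        inventor.zip (98Mb)    ainventor.zip (82Mb)
Class codes                                     classes.txt         --
Classes matched to technological categories     class_match.txt
Technological category and subcategory labels   subcategory.txt     --                     subcategory.csv
SAS program to convert .tpt files to native
SAS data sets                                   --                  read_tpt.sas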
U.S. Patent Classification (USPC) System and the Standard Industrial Code (SIC)
System
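Dataset statistics
Nodes                               3774768
Edges                               16518948
Nodes in largest WCC                3764117 (0.997)
Edges in largest WCC                16511741 (1.000)
Nodes in largest SCC                1 (0.000)
Edges in largest SCC                0 (0.000)
Average clustering coefficient      0.0757
Number of triangles                 7515023
Fraction of closed triangles        0.02343
Diameter (longest shortest path)    22
90-percentile effective diameter    9.4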
Source (citation)
Files
File                  Description
cit-Patents.txt.gz    NBER Patents
The citation graph includes all citations made by patents granted between 1975 and 1999,
totaling 16,522,438 citations. For the patents dataset there are 1,803,511 nodes for
which we have no information about their citations (we only have the in-links).
The data was originally released by NBER.
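The nodes for which only in-links are known are the patents that are cited by some patent in the file but never appear as a citing patent themselves. The following Python sketch shows one way to find them. It assumes the edge list is plain text with one "citing cited" pair of integer IDs per line, separated by whitespace, and with comment lines starting with "#"; that layout is an assumption to check against the actual file, and the tiny edge list here is made up.

```python
def cited_only_nodes(lines):
    """Return the set of node IDs that appear only as citation targets.

    Each non-comment line is expected to hold two integers:
    the citing node followed by the cited node (assumed layout).
    """
    citing, cited = set(), set()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        src, dst = map(int, line.split())
        citing.add(src)
        cited.add(dst)
    return cited - citing


# Made-up example: patent 3 cites 1 and 2, patent 4 cites 3.
sample_lines = [
    "# FromNodeId ToNodeId",
    "3 1",
    "3 2",
    "4 3",
]
print(sorted(cited_only_nodes(sample_lines)))
```

For the compressed file, the lines could be streamed with gzip.open(path, "rt") from the standard library instead of being held in a list.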