trademark-sqlite

Parses USPTO trademark xml files into a sqlite database. Currently only tested with a set of files that appear to contain 2015 and earlier data.

Getting the data

The data is at https://data.uspto.gov/data3/trademark/dailyxml/applications/ In the data/ directory of this repo, get_data_from_uspto.php does what the filename says. It uses wget. It adds up to about 5GB.

The script only downloads the initial set of 53 files that were released on 12/31/2015, which I think (USPTO documentation is sparse) is all marks filed before that date. It does not download the daily files after that.

Initialize the database

Use the create_tmdb.sql file to initialize a sqlite database. The database name by default is ./tmdb.sqlite3 A command to initialize the database would be sqlite3 tmdb.sqlite3 < create_tmdb.sql

The default path to the database is /home/joe/trademark-sqlite. Unless your name is also joe, you will need to set the directory in parse_trademarks.php where it says "change your path".

Loading the database

Once you have the files, load the database.

parse_trademarks.php parses a single file, parses out a few fields (mark name, filing date, serial and registration number) and inserts into the database.

process_files.php runs a loop that calls the function in parse_trademarks.php that does the heavy lifting

TO-DO items

(1) Additional verification to ensure all 2015-and-earlier trademarks are included in this data.

(2) Add more fields, especially goods and services description and live/dead indicator.

(3) Parse daily xml files. Figure out how USPTO deals with updates to existing records.

If you are interested in contributing email me at joe at morris dot cloud. Eventually I hope to build an open-source trademark monitor (i.e., check periodically for similar marks to one's own existing marks).

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
LICENSE		LICENSE
README.md		README.md
create_tmdb.sql		create_tmdb.sql
demo_screen.png		demo_screen.png
parse_trademarks.php		parse_trademarks.php
process_files.php		process_files.php

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

trademark-sqlite

Getting the data

Initialize the database

Loading the database

TO-DO items

About

Releases

Packages

Contributors 3

Languages

License

xenotropic/trademark-sqlite

Folders and files

Latest commit

History

Repository files navigation

trademark-sqlite

Getting the data

Initialize the database

Loading the database

TO-DO items

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages