The National Agriculture Library and the Internet Archive have digitized a collection of 18,390 seed catalogs that were selected from the Henry G. Gilbert Nursery and Seed Trade Catalog Collection, which contains over 200,000 American and foreign catalogs dating from the 1700s.
This GitHub repository is a scratch space for the Digital Curation and Innovation Center (DCIC) at the University of Maryland to work with the data.
Before you run anything here make sure you've installed any Python dependencies:
pip install -r requirements.txt
- fetch.py - download the metadata for all the items in the collection and store it in a pairtree directory named
items
.