Skip to content

Perl scripts for merging datasets contributed to Jo's HMRC Tax Exempt Art project

Notifications You must be signed in to change notification settings

milh0use/HMRC-Data

This branch is 10 commits behind mentionthewar/HMRC-Data:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

6b1b142 · Feb 23, 2015

History

21 Commits
Oct 24, 2014
Oct 24, 2014
Oct 25, 2014
Feb 23, 2015
Feb 23, 2015
Feb 16, 2015
Feb 16, 2015
Feb 23, 2015
Jan 4, 2015

Repository files navigation

HMRC database of tax exempt art

HMRC-data*.tsv is the most recent file, containing all artworks, post some rudimentary data cleaning and a first attempt to extract artist names from the full text descriptions.

HMRC objects by county (also tab separated) contains limited location data for the objects. This data is stored separately by HMRC. Only the descripton field is common to both tables.

te-art.txt (hash separated) is the original scraped data from the HMRC website

The database contains just over 33,000 works of art in total.

About

Perl scripts for merging datasets contributed to Jo's HMRC Tax Exempt Art project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Perl 100.0%