Various tools to make working with the Getty TGN linked open data dumps less painful.
Because nothing says "This is why we can't have nice things" like a single file containing 17GB worth of RDF triples...
Basically the goal is to create a set of tools for parsing the data in streams, or for generating derivative representations, which means never having to deal with the hassle of loading this stuff into a triple store or trying to wrap your head around SPARQL queries.
If nothing else it might be useful for generating simple CSV files which can be combined into more useful GeoJSON files. We'll see.
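For example, here's a minimal sketch of what that CSV-to-GeoJSON step might look like. None of this is implemented yet; the `coordinates.csv` file and its column names (`tgn_id`, `latitude`, `longitude`) are assumptions for the sake of illustration, not anything the library produces.

```python
import csv
import json

# Hypothetical input: a CSV of TGN IDs and coordinates derived from the dumps.
# The filename and column names here are made up for this example.
features = []

with open("coordinates.csv") as fh:
    reader = csv.DictReader(fh)

    for row in reader:
        features.append({
            "type": "Feature",
            "properties": {"tgn:id": row["tgn_id"]},
            "geometry": {
                "type": "Point",
                # GeoJSON wants coordinates as [longitude, latitude]
                "coordinates": [float(row["longitude"]), float(row["latitude"])],
            },
        })

collection = {"type": "FeatureCollection", "features": features}

with open("coordinates.geojson", "w") as fh:
    json.dump(collection, fh)
```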
```python
import tgn

path = "TGNOut_Coordinates.nt"
nt = tgn.nt(file=path)

for s, p, o in nt.parse():
    # do something with each statement
    print(s, p, o)
```
```python
url = "http://vocab.getty.edu/tgn/1000095.nt"
nt = tgn.nt(url=url)

for s, p, o in nt.parse():
    # do something with each statement
    print(s, p, o)
```
The `parse` method parses and then yields each line in your `*.nt` file. It still returns a triple (containing a subject, predicate and object, in that order) but each part has been explicitly cast as a string. Predicates are simplified by default, according to the following rules (there's a code sketch of this after the list):
- The predicate is replaced with the basename of its URI
- If the resultant predicate contains an anchor (for example `#type`) then the predicate is replaced with the value following the hash mark
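Concretely, here's a minimal sketch of what those two rules amount to. This is an illustration of the logic, not the library's actual internals, and `simplify_predicate` is a hypothetical name.

```python
import os


def simplify_predicate(uri):
    # rule 1: replace the predicate with the basename of its URI
    pred = os.path.basename(uri)

    # rule 2: if the result contains an anchor (for example "#type")
    # keep only the value following the hash mark
    if "#" in pred:
        pred = pred.split("#", 1)[1]

    return pred


# for example:
# http://www.w3.org/1999/02/22-rdf-syntax-ns#type -> "type"
# http://purl.org/dc/terms/identifier -> "identifier"
```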
It is assumed that at some point this will yield unexpected results or hilarity, so you can disable simplified predicates in the constructor. Like this:
```python
path = "TGNOut_Coordinates.nt"
nt = tgn.nt(file=path, simplify_predicates=False)
```
What you do afterwards is up to you but at least now you're just dealing with line-based streams containing strings.
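For example, a stream of parsed triples can be flattened into a simple CSV file with nothing but the standard library. The output filename and column headers here are made up for the example.

```python
import csv
import tgn

path = "TGNOut_Coordinates.nt"
nt = tgn.nt(file=path)

# flatten the stream of triples into a plain CSV file;
# "triples.csv" and the column names are just for this example
with open("triples.csv", "w", newline="") as fh:
    writer = csv.writer(fh)
    writer.writerow(["subject", "predicate", "object"])

    for s, p, o in nt.parse():
        writer.writerow([s, p, o])
```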
As for what's still broken or missing: a lot, probably.