-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Start planning on how to structure the CSV to account for complex objects #56
Comments
It would be useful to not have to have a row for each page in a newspaper issue, for example. In that case, all page files could be grouped into a directory and their order expressed in their filenames. However, in that case, we'd need some sort of way to define a minimal set of field values such as title and identifier; in other words, if these values are not in the CSV file, how do we derive them when populating the child nodes? |
The method you first describe is what I had in mind (and what I was thinking of when I posted issue #18). As to the "use a directory" method: that is one of the ways CONTENTdm deals with compound objects: each compound object has a directory with the parent record's identifier. Each file in that directory corresponds to a child object with the only metadata being the file name (sans extension). E.g.
|
@seth-shaw-unlv yes, I think we can support both "with metadata" and "directory" methods. |
@seth-shaw-unlv I've got a working implementation of the first method described above in the "issue-56" branch. How it works is described in the "Creating paged content" section of that branch's README. Can you take a look to see if it's clear? |
@mjordan, your description looks good to me; although I may not be the best test-subject. |
Thanks for looking. The required fields in the CSV are pretty standard, so they shouldn't pose any problems for the spreadsheet editor. No sweat on the testing, I included an integration test that passes. I think I'll merge so that branch doesn't get stale. |
As of 2e94036, Workbench support both of the methods for creating paged content described above. Closing for now, we can reopen this one or new issues as needed. |
The paged content sprint for Islandora 8 is first half of September. Workbench should be ready to ingest paged and compound objects soon after the sprint is done. The sprint will determine how the relationship between the parent and its children (and the ordering of children) will be instantiated, but likely, given discussion so far, it will be via one or more node fields on each child pointing to the parent.
Assuming that the parent<->child relationship will be expressed in node fields, Workbench can handle populating those fields if it can translate that relationship from the structure of the metadata CSV file. For example, a structure like this for non-complex images:
could be expanded to something like this for compound objects, where the
parent
field contains theid
of the object's parent:An important ability will be to include parent and child-level metadata in the same CSV file, so they can be ingested during the same task.
The text was updated successfully, but these errors were encountered: