Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Move to Islandora Kit's output CSV is completely under our control #2

Closed
mjordan opened this issue Apr 18, 2018 · 3 comments
Closed

Comments

@mjordan
Copy link

mjordan commented Apr 18, 2018

@seth-shaw-unlv I'm sorry if I didn't clarify how MIK might be used in conjuction with Migrate Plus. I see you have provided a patch for Migrate Plus because

The Move to Islandora Kit sample metadata, however, combines [subject fields of different types] into a single column. This requires us to perform a single lookup across multiple content types, something the existing migrate_plus module doesn't support.

What I didn't explain in Islandora/documentation#452 was that the sample CSV you were looking at is input to MIK, whereas what we want to do here is to use MIK to output a CSV that we can then hand off to Migrate Plus.

The structure and content of sample CSV that I have provided in Islandora/documentation#452 (comment) and that my work in MarcusBarnes/mik#463 is completely under our control. Some hacking I did late last week added functionality to MIK to generate a CSV (and accompanying images) from an existing Islandora 7.x collection via an OAI harvest. I am happy to additional work on this MIK "toolchain" required to avoid patching anything.

The initial work I did used the metadata from DC XML harvested via OAI. We could harvest MODS, parse it, and lay values down in a CSV files as well. I started with DC since I didn't want to get tangled up in MODS->CSV mappings before I demonstrated to myself that MIK could be made to output a CSV file. @dannylamb and I have discussed doing things like spitting out some Xpath expressions from Form Builder configurations to use as ways of mapping MODS structures to CSV columns.

@seth-shaw-unlv
Copy link
Owner

seth-shaw-unlv commented Apr 18, 2018

Well, at least my updated example matches what other inputs MIK users are used to seeing.

I should note that the Migrate API actually works quite well with an XML or JSON feed by matching the destination fields to the appropriate source records' XPath (e.g. my overly-simplified MADS import), so we could do a migrate that pulls straight from either the DC XML or the MODS to populate the nodes, no CSV intermediary required. @Natkeeran demonstrates this approach on this ticket.

@seth-shaw-unlv
Copy link
Owner

@mjordan I think source CSV files that combine subjects and agents are going to be a common example, so even if split people subjects and topic subjects apart again in this example, I should leave a note about the issue with having them in the same field and the patch to make it work.

@mjordan
Copy link
Author

mjordan commented Apr 18, 2018

Totally agree that if we can skip the CSV we're ahead. It would be easy to modify a couple of MIK classes to fetch the MODS instead.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants