Finish ERA data migration prep work. #3151
Create a rake task that packages all items in a collection. Scholaris expects imports to work in units of collections: for each collection in ERA we will manually set up a collection in Scholaris, then use that collection ID to import all the items that belong to it.
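As a rough illustration of the per-collection packaging step, here is a minimal Python sketch (not the rake task itself). It assumes the usual DSpace Simple Archive Format layout, where each item gets its own directory holding a `dublin_core.xml` and a `contents` file listing its files. The `package_collection` function and the `title`/`files` keys are hypothetical; the real ERA models and metadata mapping will differ.

```python
from pathlib import Path
import shutil
from xml.sax.saxutils import escape


def package_collection(items, out_dir):
    """Write one SAF item directory per item under out_dir.

    `items` is a hypothetical list of dicts with keys "title" (str) and
    "files" (list of paths); it stands in for the real ERA records.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for index, item in enumerate(items):
        item_dir = out / f"item_{index:04d}"
        item_dir.mkdir()
        # Minimal Dublin Core record; a real export would map many more fields.
        (item_dir / "dublin_core.xml").write_text(
            '<?xml version="1.0" encoding="UTF-8"?>\n'
            "<dublin_core>\n"
            f'  <dcvalue element="title" qualifier="none">{escape(item["title"])}</dcvalue>\n'
            "</dublin_core>\n",
            encoding="utf-8",
        )
        names = []
        for src in map(Path, item["files"]):
            # Files are copied flat into the item directory, so two files
            # with the same name in one item would overwrite each other.
            shutil.copy2(src, item_dir / src.name)
            names.append(src.name)
        # The contents file lists one filename per line.
        (item_dir / "contents").write_text("\n".join(names) + "\n", encoding="utf-8")
    return out
```

The sketch does nothing about filename collisions; it only shows the directory shape that one collection export would produce.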
These are the four largest collections. I requested ~600 GB be added to era-app-prd-4 to accommodate the thesis collection export. We can start small and use
Metadata preparation specs for SAF package creation:
When looking into adding the file names to the SAF, I noticed a potential problem if different Items/Theses in a collection have files with the same name. Files need to be in the same folder when creating the SAF, so moving them there can overwrite a file that shares a name with another. I generated the report duplicate_filenames_report.txt to help analyse potential file name conflicts. Please let me know if a different format would simplify this work.
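For reference, the kind of check behind such a report can be sketched in a few lines of Python. This is only an illustration, not the code that produced duplicate_filenames_report.txt; the `find_duplicate_filenames` name and the `(item_id, filename)` input shape are assumptions.

```python
from collections import defaultdict


def find_duplicate_filenames(files):
    """Return {filename: sorted item ids} for names used by more than one item.

    `files` is an iterable of (item_id, filename) pairs for one collection
    (a hypothetical input shape).
    """
    owners = defaultdict(set)
    for item_id, filename in files:
        owners[filename].add(item_id)
    return {name: sorted(ids) for name, ids in owners.items() if len(ids) > 1}


sample = [
    ("item-1", "thesis.pdf"),
    ("item-2", "thesis.pdf"),
    ("item-2", "appendix.pdf"),
]
print(find_duplicate_filenames(sample))
```

With the sample data, only `thesis.pdf` is reported, since it belongs to both `item-1` and `item-2`. Names are compared exactly, so a case-insensitive filesystem would need the names normalised first.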
This is the schema definition for the users model. This can be used to create the mapping to export the users:
Related to #3662
To generate SAFs we could define migration order criteria that isolate groups of resources by resource type (items, then theses), visibility, and other indicators arising from pending queries, such as pending mapping issues or SAF requirements (e.g. the duplicate filenames issue). Possible criteria for selection. I included what I've found via metadata mapping, but there may be other criteria I am not aware of @pgwillia @lagoan
@pgwillia suggested we use a spreadsheet to track collection SAFs (from our Nov 26 discussion with Scholaris).
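One way to picture the ordering idea is as a sort key. The Python sketch below is only an illustration: the field names, the visibility values and their order, and the choice to push resources with pending issues to the end are all assumptions, not agreed criteria. Only "items, then theses" comes from the proposal above.

```python
# Items are migrated before theses, as proposed above.
TYPE_ORDER = {"item": 0, "thesis": 1}
# Hypothetical visibility values and order; adjust to the real ERA values.
VISIBILITY_ORDER = {"public": 0, "embargo": 1, "private": 2}


def migration_key(resource):
    """Sort key: resources without pending issues first, then type, then visibility."""
    return (
        resource["has_pending_issue"],  # False sorts before True
        TYPE_ORDER[resource["type"]],
        VISIBILITY_ORDER[resource["visibility"]],
    )


def migration_order(resources):
    """Return the resources in the order they would be packaged."""
    return sorted(resources, key=migration_key)
```

Each distinct key value then corresponds to one group that could be tracked as a row in the spreadsheet.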
@lagoan @pgwillia when it is time, would it be possible to generate the first SAF package for the following test collection? It includes two embargoed test resources. I can include other resources if useful. The current items are meant to test all available metadata that can be created from the UI forms as well as file order. Hopefully we can test thumbnails as well at some point.
Certainly @anayram, I can create that SAF package tomorrow.
Finish ERA data migration prep work.
Started https://gist.github.com/pgwillia/eed7dd858e17a9a67f9d90cb1d703adb