-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
☄️ Go Live Checklist for Adventist Pre-Processing #343
Labels
needs discussion
has open questions or need for discussion
Comments
jeremyf
referenced
this issue
in scientist-softserv/space_stone-serverless
Jul 19, 2023
Based on the findings of the following issue, we are wanting to split pages into JPGs. - https://github.com/scientist-softserv/adventist-dl/issues/379 The desire for JPGs for split images is demonstrated in: scientist-softserv/adventist-dl@6dd0d77 Related to: - scientist-softserv/derivative_rodeo#56 - https://github.com/scientist-softserv/adventist-dl/issues/500
jeremyf
referenced
this issue
in scientist-softserv/adventist-dl
Jul 19, 2023
Based on the findings of the following issue, we are want to continue to split pages into JPGs. - https://github.com/scientist-softserv/adventist-dl/issues/379 The desire for JPGs for split images is demonstrated in: - 6dd0d77 **Note:** This changes is preliminary work for when we switch over to using the DerivativeRodeo to split PDFs. The change will not impact how the original IiifPrint PDF splitting behaves. (There is work to switch from the original splitting mechanism to the [DerivativeRodeoSplitter][1], but that is still a work in progress). Related to: - scientist-softserv/derivative_rodeo#56 - https://github.com/scientist-softserv/adventist-dl/issues/500 - scientist-softserv/space_stone-serverless#36 [1]:https://github.com/scientist-softserv/iiif_print/blob/3938eb0e2f2aa239d6ce52d2d4c597d1146f9651/lib/iiif_print/split_pdfs/derivative_rodeo_splitter.rb
24 tasks
jeremyf
referenced
this issue
in scientist-softserv/adventist-dl
Jul 27, 2023
This commit reverts using the Derivative Rodeo for PDF splitting and derivative generation. It is also breadcrumbs for how to restore those functions. We revert from the Derivative Rodeo to the already established IIIF Print pluggable derivatives derived from the Newspaper works gem. The reason to revert is that this branch includes several changes that went into local testing of the DerivativeRodeo; and I want to capture those wins and merge in an already long-running branch, to reduce the chance of further branch drift. For reference, local testing of the DerivativeRodeo has worked both with and without having SpaceStone data for both PDF splitting and generating derivatives (e.g. thumbnails, word coordinates, alto files, and plain text). However, I had only done localized testing and I believe more testing is warranted; namely how does the full text search work. To consider is how we will: - Test on staging with the Rodeo but not have it in play for Production But that is an exercise for the person undoing this commit :) Related to: - scientist-softserv/derivative_rodeo#56 - https://github.com/scientist-softserv/adventist-dl/issues/500
jeremyf
referenced
this issue
in scientist-softserv/adventist-dl
Jul 27, 2023
This commit reverts using the Derivative Rodeo for PDF splitting and derivative generation. It is also breadcrumbs for how to restore those functions. We revert from the Derivative Rodeo to the already established IIIF Print pluggable derivatives derived from the Newspaper works gem. The reason to revert is that this branch includes several changes that went into local testing of the DerivativeRodeo; and I want to capture those wins and merge in an already long-running branch, to reduce the chance of further branch drift. For reference, local testing of the DerivativeRodeo has worked both with and without having SpaceStone data for both PDF splitting and generating derivatives (e.g. thumbnails, word coordinates, alto files, and plain text). However, I had only done localized testing and I believe more testing is warranted; namely how does the full text search work. To consider is how we will: - Test on staging with the Rodeo but not have it in play for Production But that is an exercise for the person undoing this commit :) Related to: - scientist-softserv/derivative_rodeo#56 - https://github.com/scientist-softserv/adventist-dl/issues/500
jeremyf
referenced
this issue
Oct 4, 2023
Without this commit, there's nothing in Hyrax/Hyku/IIIFPrint that will extract plain text from a plain text file. Related to: - scientist-softserv/adventist-dl@1d3e1a9 - scientist-softserv/derivative_rodeo#56 - https://github.com/scientist-softserv/adventist-dl/issues/500 - https://github.com/scientist-softserv/adventist-dl/issues/538
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The following items are required as part of starting the pre-processing of Adventist's objects (Note this is a living list, we may read it and need to add more):
main
of space_stone-serverless) #547The text was updated successfully, but these errors were encountered: