Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

☄️ Go Live Checklist for Adventist Pre-Processing #343

Closed
10 of 14 tasks
Tracked by #56
jeremyf opened this issue Jul 19, 2023 · 0 comments
Closed
10 of 14 tasks
Tracked by #56

☄️ Go Live Checklist for Adventist Pre-Processing #343

jeremyf opened this issue Jul 19, 2023 · 0 comments
Labels
needs discussion has open questions or need for discussion

Comments

@jeremyf
Copy link
Contributor

jeremyf commented Jul 19, 2023

The following items are required as part of starting the pre-processing of Adventist's objects (Note this is a living list, we may read it and need to add more):

jeremyf referenced this issue in scientist-softserv/space_stone-serverless Jul 19, 2023
jeremyf referenced this issue in scientist-softserv/adventist-dl Jul 19, 2023
Based on the findings of the following issue, we are want to continue to
split pages into JPGs.

- https://github.com/scientist-softserv/adventist-dl/issues/379

The desire for JPGs for split images is demonstrated in:

- 6dd0d77

**Note:** This changes is preliminary work for when we switch over to
using the DerivativeRodeo to split PDFs.  The change will not impact how
the original IiifPrint PDF splitting behaves.  (There is work to switch
from the original splitting mechanism to the [DerivativeRodeoSplitter][1],
but that is still a work in progress).

Related to:

- scientist-softserv/derivative_rodeo#56
- https://github.com/scientist-softserv/adventist-dl/issues/500
- scientist-softserv/space_stone-serverless#36

[1]:https://github.com/scientist-softserv/iiif_print/blob/3938eb0e2f2aa239d6ce52d2d4c597d1146f9651/lib/iiif_print/split_pdfs/derivative_rodeo_splitter.rb
jeremyf referenced this issue in scientist-softserv/adventist-dl Jul 27, 2023
This commit reverts using the Derivative Rodeo for PDF splitting and
derivative generation.  It is also breadcrumbs for how to restore those
functions.  We revert from the Derivative Rodeo to the already
established IIIF Print pluggable derivatives derived from the Newspaper
works gem.

The reason to revert is that this branch includes several changes that
went into local testing of the DerivativeRodeo; and I want to capture
those wins and merge in an already long-running branch, to reduce the
chance of further branch drift.

For reference, local testing of the DerivativeRodeo has worked both with
and without having SpaceStone data for both PDF splitting and generating
derivatives (e.g.  thumbnails, word coordinates, alto files, and plain
text).  However, I had only done localized testing and I believe more
testing is warranted; namely how does the full text search work.

To consider is how we will:

- Test on staging with the Rodeo but not have it in play for Production

But that is an exercise for the person undoing this commit :)

Related to:

- scientist-softserv/derivative_rodeo#56
- https://github.com/scientist-softserv/adventist-dl/issues/500
jeremyf referenced this issue in scientist-softserv/adventist-dl Jul 27, 2023
This commit reverts using the Derivative Rodeo for PDF splitting and
derivative generation.  It is also breadcrumbs for how to restore those
functions.  We revert from the Derivative Rodeo to the already
established IIIF Print pluggable derivatives derived from the Newspaper
works gem.

The reason to revert is that this branch includes several changes that
went into local testing of the DerivativeRodeo; and I want to capture
those wins and merge in an already long-running branch, to reduce the
chance of further branch drift.

For reference, local testing of the DerivativeRodeo has worked both with
and without having SpaceStone data for both PDF splitting and generating
derivatives (e.g.  thumbnails, word coordinates, alto files, and plain
text).  However, I had only done localized testing and I believe more
testing is warranted; namely how does the full text search work.

To consider is how we will:

- Test on staging with the Rodeo but not have it in play for Production

But that is an exercise for the person undoing this commit :)

Related to:

- scientist-softserv/derivative_rodeo#56
- https://github.com/scientist-softserv/adventist-dl/issues/500
@jillpe jillpe added the needs discussion has open questions or need for discussion label Aug 31, 2023
@jillpe jillpe closed this as completed Nov 27, 2023
@kirkkwang kirkkwang transferred this issue from scientist-softserv/adventist-dl May 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
needs discussion has open questions or need for discussion
Projects
Archived in project
Development

No branches or pull requests

2 participants