🎁 Adding PDF Split Page Checks #36

Merged
jeremyf merged 3 commits into main from adding-pdf-split-finder
May 30, 2023

Conversation

@jeremyf jeremyf commented May 25, 2023

🎁 Adding PDF Split Page Checks

c62729f

Prior to this commit, if we'd already pre-processed a PDF split, we
would re-process that split anyway, as there was no check for existing
pages.

With this commit, we check for those pre-processed pages.
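
As a rough illustration of that check, here is a minimal Ruby sketch. It is not the code from this PR: the method name `existing_pages_or_split`, the `page-*.tiff` naming convention, and the block that performs the split are all hypothetical.

```ruby
# Hypothetical helper: return already-split pages when they exist,
# otherwise yield to the caller-supplied block that performs the split.
def existing_pages_or_split(pages_dir)
  # Assumed naming convention for split pages; the real one may differ.
  pattern = File.join(pages_dir, "page-*.tiff")

  existing = Dir.glob(pattern).sort
  return existing unless existing.empty?

  # No pre-processed pages were found, so run the (expensive) split once.
  yield(pages_dir)
  Dir.glob(pattern).sort
end
```

Calling the helper a second time with the same directory returns the pages found on disk and never invokes the block, which is the behavior the commit message describes.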

One critical point from our conversation: a single work might have
multiple PDFs uploaded. Therefore, each PDF's pages must be written to
a different "sub-directory". I'm noting this here so we can account for
it in a test audit of some kind.
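
One way to picture the sub-directory requirement is the minimal Ruby sketch below. The `pages_directory_for` name and the `<work id>/<PDF basename>/pages` layout are assumptions made for illustration, not the layout this PR uses.

```ruby
# Hypothetical helper: derive a per-PDF directory so that two PDFs on the
# same work never write their split pages to the same place.
def pages_directory_for(work_id, pdf_filename)
  # Use the PDF's basename (without extension) as the distinguishing segment.
  pdf_key = File.basename(pdf_filename, File.extname(pdf_filename))
  File.join(work_id.to_s, pdf_key, "pages")
end

pages_directory_for("work-123", "minutes.pdf")  # => "work-123/minutes/pages"
pages_directory_for("work-123", "appendix.pdf") # => "work-123/appendix/pages"
```

Under this assumed layout, a check for pre-existing pages of `minutes.pdf` cannot be satisfied by pages that were split from `appendix.pdf`.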

Co-authored-by: Rob Kaufman [email protected]
Co-authored-by: Kirk Wang [email protected]

☑️ Verifying pdf splitter finds pre-existing files

a45e57f

Updating a bit of documentation and reworking the filename to account
for a work having multiple PDFs.

☑️ Refining globbed_tail_locations for S3

9ca9fe8

Prior to this commit, we didn't have a spec for the S3 behavior. We now
have a test for an S3 Faux Bucket.
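
For readers unfamiliar with the approach, a faux bucket is presumably an in-memory stand-in that lets a spec exercise S3-style lookups without network access. The Ruby sketch below is illustrative only: `FauxBucket`, `put`, and `keys_matching` are hypothetical names, and the sketch does not reproduce the signature or behavior of `globbed_tail_locations`.

```ruby
# Hypothetical in-memory stand-in for an S3 bucket, for use in specs.
class FauxBucket
  def initialize
    @objects = {}
  end

  # Store a body under a key, as an upload would.
  def put(key, body = "")
    @objects[key] = body
  end

  # S3 has no real directories, so "globbing" is modeled here as a prefix
  # filter followed by a pattern match on the remainder of the key.
  def keys_matching(prefix:, tail:)
    @objects.keys
            .select { |key| key.start_with?(prefix) && key.delete_prefix(prefix).match?(tail) }
            .sort
  end
end

bucket = FauxBucket.new
bucket.put("work-123/minutes/pages/page-1.tiff")
bucket.put("work-123/minutes/pages/page-2.tiff")
bucket.put("work-123/appendix/pages/page-1.tiff")

bucket.keys_matching(prefix: "work-123/minutes/pages/", tail: /\Apage-\d+\.tiff\z/)
# => ["work-123/minutes/pages/page-1.tiff", "work-123/minutes/pages/page-2.tiff"]
```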


@jeremyf jeremyf requested a review from orangewolf May 25, 2023 20:53
jeremyf and others added 2 commits May 30, 2023 13:03
Related to:

- https://github.com/scientist-softserv/adventist-dl/issues/330
- scientist-softserv/iiif_print#220
@jeremyf jeremyf merged commit 098c741 into main May 30, 2023
@jeremyf jeremyf deleted the adding-pdf-split-finder branch May 30, 2023 18:55

2 participants