Add --copy-pages option to copy pages.jsonl/extraPages.jsonl directly to WACZ #43
Adds a `--copy-pages` option that copies the pages.jsonl and extraPages.jsonl files into the WACZ as-is, without parsing or modifying them.
Also adds tests with new pages fixtures.
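For illustration, copying page files into a WACZ without parsing could look roughly like the sketch below. This is not the py-wacz implementation: the helper name `copy_page_files` and its parameters are hypothetical, and it assumes that a WACZ is an ordinary ZIP file with page lists stored under `pages/`.

```python
import zipfile
from pathlib import Path

# Page files that are copied verbatim, with no JSON parsing or validation.
PAGE_FILES = ("pages.jsonl", "extraPages.jsonl")


def copy_page_files(pages_dir, wacz_path):
    """Hypothetical helper: add existing page files to a WACZ byte-for-byte.

    Returns the list of archive member names that were written.
    """
    copied = []
    # Mode "a" appends to an existing ZIP, or creates one if it is missing.
    with zipfile.ZipFile(wacz_path, "a") as wacz:
        for name in PAGE_FILES:
            src = Path(pages_dir) / name
            if src.is_file():
                # Assumed layout: page lists live under pages/ in the archive.
                arcname = f"pages/{name}"
                wacz.write(src, arcname=arcname)
                copied.append(arcname)
    return copied
```

Because the files are never decoded, anything in them (extra fields, unusual ordering) reaches the archive unchanged, which is the point of a copy-only mode.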
Hm, I do wonder if we should switch the logic and make this the default, while the previous behavior should be something like
Co-authored-by: Ilya Kreymer <[email protected]>
I think it might be safer from a breaking-changes perspective to keep it as-is? I see your point though; if you have a strong preference, I can make the change.
Nope, no strong preference, can keep as is!
Looks good! Crawler tests pass with this branch!
…new py-wacz (#535)
Cherry-picked from the use-js-wacz branch, now implementing separate writing of pages.jsonl / extraPages.jsonl to be used with py-wacz and new `--copy-page-files` flag.
Dependent on py-wacz 0.5.0 (via webrecorder/py-wacz#43)
Co-authored-by: Tessa Walsh <[email protected]>
Fixes #42
Also bumps version to 0.5.0