Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: changing the dependency from python-pypdf2 to python-mupdf #78

Open
m040601 opened this issue Aug 16, 2020 · 10 comments
Open

Comments

@m040601
Copy link

m040601 commented Aug 16, 2020

Considering that python-pydf2, https://github.com/mstamy2/PyPDF2 or even python-pydf3 , https://github.com/mstamy2/PyPDF3 by the same author havent seen any update in the last 2 years.

Would a change to python-mupdf , https://github.com/pymupdf/PyMuPDF
, be of any interest for stapler ?

Additional info:

Is PyPDF2 dead? Is there an alive fork? · Issue #571 · mstamy2/PyPDF2
py-pdf/pypdf#571

The last commit was from 2018, there are 87 open PRs and 263 open issues. It seems as if the project is dead. Is there an alive fork?

python - Maintained alternatives to PyPDF2 - Stack Overflow
https://stackoverflow.com/questions/63199763/maintained-alternatives-to-pypdf2

...not only PDF but also XPS, OpenXPS, CBZ, CBR, FB2, and EPUB formats, .... is hosted on GitHub.....also are registered on PyPI....by far the fastest in all aspects ...Its performance stats are also very promising....

Of special note in python-mupdf is also:

Interesting PDF manipulation and generation functions have been added over time, including metadata and bookmark maintenance,....

See also my notes on the importance of the "metadata" editing thing, #39

@corollari
Copy link
Collaborator

Another possible alternative would be PyPDF4, which received it's last update about 2 months ago. See also issue #48

@MartinThoma
Copy link

Although it has a lot of stars, this part of the README makes me doubt that the project is properly managed:

While PyPDF4 will continue to be available at no charge, I have strong plans for better ongoing support to start in August 2018.

Homepage (available soon): http://claird.github.io/PyPDF4/.

The link gives a 404.

@MartinThoma
Copy link

Also, look at the commit history:

  • Sep 17, 2019
  • May 2, 2020
  • Jun 21, 2020
  • Jun 22, 2020: Many commits directly to master with messages like "Minor simplification.", "Simplification.", "Discard unneeded comment."

This looks very much like a side project with very little attention.

@tjquinn1
Copy link

PyPDF4 doesn't have documentation of any sort which should raise a red flag.

@corollari
Copy link
Collaborator

Seems like PyPDF4 has several flaws but, on the other hand, PyMuPDF only provides binding for another library, meaning that in some systems usage of stapler will require compiling that library from source (generally not a good user experience). Does anyone know any other alternatives or has any input on that?

@Frenzie
Copy link
Contributor

Frenzie commented Aug 20, 2020

I like the MuPDF codebase quite a bit (and I'm sure these Python bindings must be fine) but it seems like it might be a bit (or a lot) overkill? See https://ghostscript.com/~robin/mupdf_explored.pdf for some of the things you can do with it. As a user I don't really care as long as it works. :-)

@MartinThoma
Copy link

MartinThoma commented Aug 20, 2020

Overview over MuPDF:

Overview over PyMuPDF:

@odiebojangles
Copy link

PyMuPDF doesn't allow for working with files in bitstream. That's a killer for a lot of projects right there.

@captn3m0
Copy link

captn3m0 commented Jul 4, 2021

https://github.com/sfneal/PyPDF3 seems to be better maintained than PyPDF4. 3 Patch releases this year: https://github.com/sfneal/PyPDF3/blob/master/CHANGELOG. No docs though.

I migrated https://github.com/captn3m0/pystitcher from PyPDF2->PyPDF3 and it was seamless.

@MartinThoma
Copy link

PyPDF2 is now maintained again. I'm the maintainer :-)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants