Replies: 1 comment
I would use a vision-language model for this one: you can render each XHTML page to a PNG and then ingest the pages as images.
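For the rendering step, something like the following might work. It is only a sketch: it assumes a Chromium binary is on your PATH under the name `chromium` (on some systems it is `chromium-browser` or `google-chrome`), and the function names and the default window size are made up for illustration. The `--headless`, `--screenshot`, and `--window-size` flags are standard Chromium switches, but as far as I know the screenshot only covers the window area, so long pages may need a taller window.

```python
import subprocess
from pathlib import Path


def screenshot_command(xhtml_path, png_path, browser="chromium", width=1280, height=1800):
    """Build a headless-Chromium command that renders one page to a PNG.

    `browser`, `width`, and `height` are illustrative defaults (assumptions).
    """
    return [
        browser,
        "--headless",
        f"--screenshot={Path(png_path).resolve()}",
        f"--window-size={width},{height}",
        # Chromium expects a URL, so turn the local path into a file:// URI.
        Path(xhtml_path).resolve().as_uri(),
    ]


def render_to_png(xhtml_path, png_path, **kwargs):
    """Run the command; raises CalledProcessError if the browser exits with an error."""
    subprocess.run(screenshot_command(xhtml_path, png_path, **kwargs), check=True)
```

Calling `render_to_png("page.xhtml", "page.png")` would then produce one image per page, which can be passed to whatever vision-language model you use.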
Hey team, does anybody know a good utility for converting XHTML to PDF?
I tested many libraries (pandoc, wkhtmltopdf, WeasyPrint, and others), but none gave a good conversion. The only good XHTML-to-PDF conversion I have found is printing to PDF in Chromium, which is a very slow operation.
example XHTML doc
example HTML doc
The ultimate goal is XHTML -> Markdown. However, none of the major conversion libraries supports XHTML.
cc @emrgnt-cmplxty for visibility
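Since XHTML is well-formed XML, one possible direction for the XHTML -> Markdown goal is to skip the PDF step and walk the document with a plain XML parser. Below is a rough sketch using only Python's standard library. The function names are hypothetical, it handles only headings, paragraphs, flat lists, bold, italic, and links, and it has not been tried on the example documents. Nested lists and tables would need more work, and named entities such as `&nbsp;` make `ElementTree` raise a `ParseError` unless they are replaced first.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "{http://www.w3.org/1999/xhtml}"


def _local(tag):
    """Strip the XHTML namespace prefix from a tag name."""
    return tag[len(XHTML_NS):] if tag.startswith(XHTML_NS) else tag


def _inline(el):
    """Flatten an element's text, keeping bold, italic, and links."""
    parts = [el.text or ""]
    for child in el:
        inner = _inline(child)
        name = _local(child.tag)
        if name in ("strong", "b"):
            inner = f"**{inner}**"
        elif name in ("em", "i"):
            inner = f"*{inner}*"
        elif name == "a" and child.get("href"):
            inner = f"[{inner}]({child.get('href')})"
        parts.append(inner)
        parts.append(child.tail or "")
    # Collapse runs of whitespace left over from source indentation.
    return " ".join("".join(parts).split())


def xhtml_to_markdown(source):
    """Convert a small subset of XHTML to Markdown (illustrative only)."""
    root = ET.fromstring(source)
    blocks = []
    for el in root.iter():  # document order
        name = _local(el.tag)
        if name in ("h1", "h2", "h3", "h4", "h5", "h6"):
            blocks.append("#" * int(name[1]) + " " + _inline(el))
        elif name == "p":
            blocks.append(_inline(el))
        elif name in ("ul", "ol"):
            items = [c for c in el if _local(c.tag) == "li"]
            lines = [
                (f"{i}. " if name == "ol" else "- ") + _inline(li)
                for i, li in enumerate(items, 1)
            ]
            blocks.append("\n".join(lines))
    return "\n\n".join(blocks)
```

This trades layout fidelity for speed: no browser is launched, but anything that depends on CSS rendering is lost, so it may only suit documents whose structure is already carried by the markup.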