From 35b04a14b5e4798cee022d5c770cbf856dcd9d79 Mon Sep 17 00:00:00 2001 From: Jeff Muizelaar Date: Fri, 10 Jan 2025 11:37:25 -0500 Subject: [PATCH] Add some more See also --- README.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index cecd5e5..23f1d02 100644 --- a/README.md +++ b/README.md @@ -14,10 +14,11 @@ assert!(out.contains("This is a small demonstration")); ## See also - https://github.com/elacin/PDFExtract/ -- https://github.com/euske/pdfminer +- https://github.com/euske/pdfminer / https://github.com/pdfminer/pdfminer.six - https://github.com/CrossRef/pdfextract - https://github.com/VikParuchuri/marker - https://github.com/kermitt2/pdfalto used by [grobid](https://github.com/kermitt2/grobid/) +- https://github.com/opendatalab/MinerU (uses PyMuPDF and pdfminer.six) ### Not PDF specific - https://github.com/Layout-Parser/layout-parser