Python library and web service, built on Poppler's pdftotext utility and Tesseract OCR, for extracting text from PDF documents
ocr tesseract text-extraction tesseract-ocr pdf-to-text poppler optical-character-recognition pdf-reader pdftotext pdf2text pdf-splitting poppleract py-poppleract
Updated Oct 18, 2024 - Python