Skip to content

Go application to automatically run OCR on PDF files

License

Notifications You must be signed in to change notification settings

marvint24/go-ocr-pdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This Go application running in a Docker container automatically runs OCR on the PDF files in a mounted folder.

  1. Download this repository
  2. Modify docker-compose.yml
  3. (Optional) Add languages in the languages folder
  4. Start the container with docker compose up

Filename format

DoOcr-deu+eng-XXXX.pdf
1⮥------2⮥-----3⮥

  1. DoOcr Part that triggers the script
  2. deu+eng Languages for the ocr (Must be added during build)
  3. XXXX User specified part

Example:
DoOcr-deu+eng-blablup.pdf



Credits to OCRmyPDF and tesseract.

About

Go application to automatically run OCR on PDF files

Resources

License

Stars

Watchers

Forks

Packages

No packages published