Specify language for text extraction #314

tagliala · 2024-09-23T14:27:22Z

Close #302

This change ensure that Tika can specify the language configuration for its internal Tesseract OCR Parser. Close #302

tagliala force-pushed the bugfix/302-ocr-other-than-english branch 7 times, most recently from f8473fb to be54481 Compare September 27, 2024 07:26

Add support for specifying OCR language in Tika

8a82b10

This change ensure that Tika can specify the language configuration for its internal Tesseract OCR Parser. Close #302

tagliala force-pushed the bugfix/302-ocr-other-than-english branch from be54481 to 8a82b10 Compare September 27, 2024 07:37

tagliala merged commit 2fb6bfa into master Sep 27, 2024
3 checks passed

tagliala deleted the bugfix/302-ocr-other-than-english branch September 27, 2024 07:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specify language for text extraction #314

Specify language for text extraction #314

tagliala commented Sep 23, 2024

Specify language for text extraction #314

Specify language for text extraction #314

Conversation

tagliala commented Sep 23, 2024