diff --git a/TesseractOCR/README.md b/TesseractOCR/README.md index 3af60b8..75c2edc 100644 --- a/TesseractOCR/README.md +++ b/TesseractOCR/README.md @@ -4,7 +4,7 @@ What is TesseractOCR ========= -It is a .NET wrapper for Tesseract 5.3.0 that is originally copied from Charles Weld (https://github.com/charlesw/tesseract) and modified for my own needs +It is a .NET wrapper for Tesseract 5.3.2 that is originally copied from Charles Weld (https://github.com/charlesw/tesseract) and modified for my own needs How to use ============ @@ -14,7 +14,7 @@ You can get them at https://github.com/tesseract-ocr/tessdata or https://github. ## Microsoft Visual C++ runtimes -The DLL's Tesseract51.dll (and exe) and leptonica-1.82.0.dll are compiled with Visual Studio 2022 you need these C++ runtimes for it on your computer +The DLL's Tesseract53.dll (and exe) and leptonica-1.83.1.dll are compiled with Visual Studio 2022 you need these C++ runtimes for it on your computer - X86: https://aka.ms/vs/17/release/vc_redist.x86.exe - X64: https://aka.ms/vs/17/release/vc_redist.x64.exe diff --git a/TesseractOCR/TesseractOCR.csproj b/TesseractOCR/TesseractOCR.csproj index b1b92d6..9014e1a 100644 --- a/TesseractOCR/TesseractOCR.csproj +++ b/TesseractOCR/TesseractOCR.csproj @@ -4,13 +4,13 @@ TesseractOCR Charles Weld & Kees van Spelde Tesseract - Tesseract 5.3.1 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 + Tesseract 5.3.2 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository. Copyright 2012-2021 Charles Weld - Copyright 2021-2023 Kees van Spelde - - Updated to Tesseract 5.3.1 -- Added a method to read multi page tiffs from memory -- Added a method to blend images + - Updated to Tesseract 5.3.2 +- Updated to Leptonica 1.83.1 +- Updated nuget packages https://github.com/Sicos1977/TesseractOCR https://github.com/Sicos1977/TesseractOCR Tesseract OCR text readable PDF @@ -21,7 +21,7 @@ which support the legacy engine, for example those from the tessdata repository. 5.3.3.0 5.3.3.0 False - 5.3.3 + 5.3.4 ocr.png true bin\$(Configuration)\$(TargetFramework)\$(AssemblyName).xml