Skip to content

Releases: OCR-D/ocrd_all

v2021-07-21

21 Jul 17:10
@kba kba
Compare
Choose a tag to compare

cor-asv-ann e919020..1b8c120

Release: v0.1.12

  • 📦 0.1.12
  • wrapper.evaluate: allow aggregating histograms, add parameter
  • scripts.compare: allow aggregating histograms
  • lib.seq2seq / scripts.eval: allow aggregating histograms
  • lib.alignment: allow aggregating histograms
  • wrapper.evaluate: rename reported metrics and measure WER, too; expose gt_level parameter
  • scripts.compare: rename reported metrics and measure WER, too
  • lib.alignment: apply equivalences when aggregating confusion, too
  • lib.alignment: more level 1/2 equivalences/normalizations
  • lib.alignment: remove commented experimental code for other aligners
  • 📦 0.1.11
  • update README
  • lib.alignment: no gaps for confusion counts

core d2c6216f..271dcdf6

Release: v2.25.1

  • Merge pull request #701 from OCR-D/apidocs-standalone
  • 📦 v2.25.1
  • Merge remote-tracking branch 'origin/gds-id'
  • 📦 v2.25.0
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/ocrd_page_etree'
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/fix-recrop' ...
Read more

v2021-05-21

21 May 09:54
@kba kba
Compare
Choose a tag to compare

Hotfix (v2021-05-20 was an incomplete merge)

v2021-05-20

20 May 18:53
@kba kba
Compare
Choose a tag to compare

ocrd_anybaseocr 6fe756a..e930c93

Release: v1.6.0

  • 📦 v1.6.0
  • ragequit debugging block-segmentation, disable it
  • replace keras.saving with the up-to-date location
  • logging doesn't like standalone % signs
  • keras.engine.Layer is now tensorflow.keras.layers.Layer
  • fix keras imports
  • 📦 v1.5.0
  • Merge branch 'fix-tiseg'
  • 📦 v1.4.1
  • hotfix: re-introduce ocrd-fork-ocropy requirement
  • 📦 v1.4.0
  • cropping: edge detection: make position priority peak at 30% of margin and decrease slower inwards
  • cropping: allow steeper angles for badly skewed/keystoned images
  • cropping: apply detected ruler to edge detector result, too
  • cropping: fix for grayscale input
  • cropping: improve bombast/outdated comment header
  • cropping: reduce default left/right margin zones
  • cropping: use more adequate LSD parameters to make line reconstruction via line segment clustering easier
  • cropping: crop in last derived image (including binarized) and use transparency and bg-filling
  • cropping: add docstring
  • cropping: allow plotting debug images to files
  • cropping: fix 0e00b3e7 (absolute vs relative position)
  • cropping: edge detection: include wider range of line segments
  • cropping: edge detection: make position priority peak at 10% of margin
  • cropping: fix 370005b3 (polygon ordering)
  • cropping: make everything DPI-relative (assuming 300 DPI as natural scale)
  • cropping: use intersections from edge detection as polygon instead of bbox, and pad it inwardly
  • cropping: ignore margin-only columns before merging
  • cropping: edge detection: make consistency criterion more robust
  • cropping: edge detection: increase length priority over position
  • cropping: edge detection: fix gap criterion
  • cropping: prefer line-based detector if successful on all 4 sides
  • cropping: more logging
  • cropping: lower fg density text boxes, merge nearby columns before removing small ones
  • cropping: ruler detection does not binarization (but mask instead of clip it)
  • 📦 1.4.0
  • cropping: ignore margin-only text areas
  • cropping: rewrite edge detection based algorithm…
  • cropping: rename and describe parameters in a sane way
  • cropping: use existing parameters for same things everywhere
  • cropping: for detection based on textbox estimation, use all contours recursively (not just outer) and use exact contour area (without drawing to intermediate image)
  • cropping: paste improved pil2array, remove legacy ocrolib dependency
  • cropping: PIL/ocrolib uses RGB (not BGR) channels
  • cropping: algorithm needs raw (not binarized) images

dinglehopper bd32433..0b9af3a

  • Merge pull request #58 from kba/unorderedgroupindexed
  • 🚧 dinglehopper: Read PAGE UnorderedGroup in XML order

eynollah 26283c6..80795c9

Release: v0.0.5

cor-asv-ann 6c831fa..e919020

Release: v0.1.10

  • 📦 0.1.10
  • forgot compare script

sbb_binarization 4a5544d..8dd0506

Release: v0.0.8

  • 📦 v0.0.8
  • resolving error for inputs whcih have smaller scale than model patch
  • Merge pull request #28 from qurator-spk/fix-unbound-var

v2021-05-04

04 May 17:17
@kba kba
Compare
Choose a tag to compare

eynollah 12f5b29..26283c6

Release: v0.0.2

  • 📦 v0.0.2
  • resolving issue qurator-spk/eynollah#38
  • adding the binarization model and option to binarize input document for the cases like dark, stronly bright and other ones
  • strong erosion, more modification
  • Merge pull request #36 from mikegerber/fix/ocrd-core-req
  • Update eynollah.py
  • Merge branch 'main' of https://github.com/qurator-spk/eynollah into main
  • fixing again the error raised because of erosion

sbb_textline_detector 95d8f70..f1d2777

  • Merge pull request #57 from mikegerber/fix/setup-namespace

v2021-04-25

25 Apr 10:49
@kba kba
Compare
Choose a tag to compare

v2021-03-15

16 Mar 17:47
@kba kba
Compare
Choose a tag to compare

core 1df3f45..1aabf82

Release: v2.23.2

tesseract fe614c60..c676d5bc

Release: 5.0.0-alpha-20201231

  • STRING is no longer required for Visual Studio
  • Remove last usage of STRING
  • Remove unused parts of class STRING
  • Replace remaining STRING by std::string in unittest
  • Replace remaining STRING by std::string in src/training
  • Replace all but one remaining STRING by std::string in src/ccstruct
  • Replace remaining STRING by std::string in src/dict
  • Replace remaining STRING by std::string in src/viewer and src/wordrec
  • Replace remaining STRING by std::string in src/classify
  • Replace remaining STRING by std::string in src/ccmain
  • Remove unused file.
  • Replace typedef structs with structs.
  • snprintf will add '\0' at the end itself.
  • [training] Modernize.
  • [unittest] Use more smart ptrs, more std::make_unique instead of .reset(new T()).
  • Merge pull request #3327 from Shreeshrii/patch-1
  • [capi] Restore some of wrongly removed apis.
  • Misc.
  • Merge pull request #3325 from stweil/string
  • Merge pull request #3324 from stweil/nt_symclip
  • Remove unused include statements for strngs.h
  • [readme] Require C++17 for building.
  • Add missing export header.
  • [clang-format] Format generated protobuf source.
  • [clang-format] Format unit tests.
  • Include missing config_auto.h
  • Add missing forward decl.
  • [clang-format] Execute clang format on include and src dirs.
  • [clang-format] Update config.
  • Revert "Format public API files with 'clang-format-11 -i include/tesseract/*.h'"
  • Format public API files with 'clang-format-11 -i include/tesseract/*.h'
  • Fix tatweel_test for C++-20
  • Replace GenericVector by std::vector in class ImageData
  • Merge pull request #3323 from Shreeshrii/ci
  • autobuild: Require C++17
  • Fix build (#3322)
  • Fix linux build.
  • Pass empty strings into Tesseract::init_tesseract().
  • Pass init strings as std::string instead of const char * internally. This does not affect public APIs.
  • Remove STRING::size() method.
  • Remove unused STRING::strdup().
  • Remove unused STRING::unsigned_size().
  • Replace STRING::truncate_at() with resize().
  • More std::string replacements.
  • [cmake] Require C++17.
  • Clean implementation for (de-)serialization of a vector
  • Update .travis.yml
  • Fix serialization for vector of RecodedCharID
  • Travis: build linux matrix (#3320)
  • Merge pull request #3318 from Shreeshrii/travis
  • Add heuristic to fix swap (partially fixes issue #2586)
  • Replace GenericVector by std::vector
  • Fix two compiler warnings for serialis.h

v2020-03-08

08 Mar 14:30
@kba kba
Compare
Choose a tag to compare
  • Do not set VOLUME in Dockerfile, #239
  • Do not bake in tessdata path for tesseract, #240

core 0ca5aab..a32857d

Release: v2.23.1

  • 📦 v2.23.1
  • Merge pull request #679 from OCR-D/logging-test-fix
  • Merge pull request #680 from OCR-D/resmgr-tess-config
  • Merge pull request #677 from OCR-D/python-versions

ocrd_tesserocr fd17386..b755b26

Release: v0.12.0

  • 📦 v0.12.0
  • 📝 changelog
  • Merge pull request #172 from OCR-D/logging-function
  • Merge pull request #170 from bertsky/overwrite_text_skip

ocrd_pc_segmentation 7298475..413f5d3

Release: v0.1.3

  • pip install -U pip-tools for compat with pip 20.3+, fix #18

ocrd_wrap 9dfbdb3..5a501df

Release: v0.1.7

  • 📦 0.1.7
  • skimage-denoise: change option protect from bool to pt
  • 📦 0.1.6
  • skimage_denoise: make maxsize in pt instead of pt²
  • skimage_denoise: add option protect
  • 📦 0.1.5
  • skimage.remove_small_objects expects boolean array
  • update link labels

tesseract 51a214a..fe614c6

Release: 5.0.0-alpha-20201231

  • Enable less FP exceptions for clang compiler when running tesseract
  • GenericVector -> std::vector.
  • Allow to serialize std vectors with classes from TFile. Implementation from GenericVector.
  • Fix memory leak in DocumentCache
  • Avoid fp division by 0 (fix issue #3314)
  • Merge pull request #3313 from stweil/learning_rate

v2020-02-27

27 Feb 16:58
@kba kba
Compare
Choose a tag to compare

core fe119e4..0ca5aab

Release: v2.23.0

  • 📦 v2.23.0
  • 📝 changelog
  • Merge branch 'calamari-models'
  • 📝 changelog
  • Merge branch 'merge-mets'
  • generate_page: raise ValueError if no numeric part in start/end
  • document .. range operator
  • fix ocrdmets test
  • generate_range: include end
  • implement value range with .. operator for pageId
  • 📦 v2.22.4
  • 📝 changelog
  • Merge pull request #671 from OCR-D/resmgr-cwd-again
  • 📝 changelog
  • Merge pull request #665 from OCR-D/gds-suppress-stdout
  • Merge pull request #664 from OCR-D/fix-docstrings
  • Merge pull request #666 from OCR-D/validator-log-debug
  • Merge pull request #669 from OCR-D/fix-indentation

dinglehopper 8cd8314..bd32433

  • 🚧 dinglehopper: Try out Drone CI
  • 🚧 Replace Travis with CircleCI
  • Merge pull request #50 from b2m/fix-table-extraction

ocrd_calamari f00df75..da98713

Release: v1.0.2

  • 📦 v1.0.2
  • 🐛 Fix word and glyph coordinates
  • ✔️ Explicitly check for None
  • Merge pull request #10 from bertsky/image-features

ocrd_cis 1abf38b..426a33d

Release: v0.1.5

  • segment: avoid adding existing regions to RO group if they are not immediate children

ocrd_fileformat df95fd7..86763d1

Release: v0.2.1

  • 📦 v0.2.1
  • Merge pull request #28 from OCR-D/fix-page-schema

ocrd_pagetopdf 6047bd4..7eb2e4e

Release: v1.0.0

  • Install default-jre-headless instead of openjdk-8-jre-headless

ocrd_segment 744261f..8214d35

Release: v0.1.10

  • 📦 0.1.10
  • extract_page: draw reading-order arrows
  • extract-pages: add 'feature_filter' param
  • extract-regions: apply 'feature_filter'
  • 📦 0.1.9 (extract*: expose param feature_filter)
  • raise pylint karma
  • 📦 0.1.8
  • replace-page: fix getLogger context
  • extract-pages: multi-level mask output via plot_segmasks, optional alpha compositing via plot_overlay
  • from-masks: pick default colordict from parameter, omit initial # sign from colorspecs
  • extract-pages: expose colordict as parameter (with PageViewer scheme as default)
  • extract_words, extract_glyphs: new

opencv-python bbaa777..fd4e604

Release: 48

tesseract 5ca4e006..1ab6b0fb

Release: 5.0.0-alpha-20201231

  • Merge pull request #3311 from stweil/master
  • Don't compile OpenCL code when OpenCL is disabled
  • Format code
  • Remove blanks at line endings
  • Replace GenericVector by std::vector in class WERD_CHOICE
  • Update configure.ac
  • Merge pull request #3309 from stweil/std
  • Merge pull request #3162 from xloem/android
  • Merge pull request #3305 from Shreeshrii/vcpkg-cmake-win
  • Merge pull request #3302 from Shreeshrii/PR1
  • Merge pull request #3300 from Shreeshrii/autocmake
  • Handle file list without terminating LF (fix issue #3298)
  • Update test submodule
  • Update include statements for external header file allheaders.h
  • Fix default value for --enable-doc
  • Merge pull request #3297 from Shreeshrii/autotools
  • Fix default value for --with-archive and --with-curl
  • Merge pull request #3292 from stweil/master
  • Remove Python training scripts which were moved to tesstrain
  • Change LATIN_FONTS to use replacement fonts from TeX Gyre collection
  • Merge pull request #3276 from Shreeshrii/travis
  • Merge pull request #3279 from Shreeshrii/master
  • Merge pull request #3275 from Shreeshrii/ci
  • cmake: Use pkg_config to find required libraries
  • cmake: Don't try to set compile flags for NEON support for ARM64
  • cmake: Activate NEON support for ARM64
  • Merge pull request #3270 from Shatur95/fix-set-and-check
  • Merge pull request #3268 from Shreeshrii/ci
  • Fix path for test-suite.log with flat unittest build in GitHub actions
  • Fix build for flat unittest (CPPFLAGS for abseil and gtest)
  • Merge pull request #3265 from Shreeshrii/master
  • Merge pull request #3264 from stweil/unittest
  • Replace GenericVector by std::vector for class SimpleStats
  • Fix Memory leak in ligature_table_test
  • Merge pull request #3259 from Shreeshrii/ci
  • Merge pull request #3257 from tesseract-ocr/ci
  • Replace STRING by std::string for function read_unlv_file
  • Remove unneeded include statement for genericvector.h
  • Remove unneeded include statement for strngs.h
  • Wordrec: Modernize code
  • Merge pull request #3256 from stweil/mastertrainer_test
  • lstmtrainer: Modernize code
  • lstmtrainer: Remove unused local variable
  • Replace GenericVector by std::vector in LoadFileLinesToStrings
  • Reduce number of INFO messages from lstm_test (#3250)
  • Merge pull request #3252 from Shreeshrii/legacy
  • Remove unused conditional OSX
  • Merge pull request #3251 from stweil/master
  • Merge pull request #3249 from Shreeshrii/ciupdate
  • Merge pull request #3247 from Shreeshrii/ci
  • Merge pull request #3244 from Shreeshrii/ci
  • Update dailycmake.yml
  • Merge pull request #3243 from Shreeshrii/ci
  • Merge pull request #3242 from Shreeshrii/ci
  • Merge pull request #3241 from Shreeshrii/ci
  • Merge pull request #3239 from Shreeshrii/legacy
  • Add more information shown by combine_tessdata -l
  • Add new option -l for combine_tessdata to list the network string
  • Replace STRING by std::string for LSTMRecognizer::network_str_
  • Replace STRING by char* in LSTMRecognizer
  • Merge pull request #3236 from stweil/max_iterations
  • Merge pull request #3235 from Shreeshrii/ci
  • Add TESS_API to instantiation of template functions
  • Use explicit int32_t for serialized data type
  • Replace GenericVector by std::vector
  • Remove unnecessary subtest with missing input image
  • Remove ANDROID_BUILD macro
  • Fix memory allocation in TFile::DeSerialize(std::vector& data)
  • Merge pull request #3231 from Shreeshrii/ci
  • automake: Fix flat build for doc
  • Don't use local CP_RESULT_STRUCT variable to initialize elements of std::vector
  • Fix serialization of std::vector (fix issue #3220)
  • Replace resize(0) by clear() for std::vector
  • Remove --psm 6 for lstm.train in tesstrain.py
  • No --psm 6 for lstm.train
  • automake: Flat build for doc
  • Fix NEON also tesseractmain.cpp
  • Fix use of NEON on ARMv8
  • Revert "Try to fix aarch64 build on macos."
  • Try to fix cmake build.
  • Try to fix aarch64 build on macos.
  • Fix stringrenderer_test
  • Fix ligature_table_test
  • Fix pango_font_info_test
  • Add assertion for IntCastRounded
  • Update sw.yml
  • Delete TESSDATA_BEST_DIR macro
  • Replace snprintf by strncpy (fix compiler warning)
  • Fix some format strings
  • Add include path for leptonica for fuzzer build
  • Merge pull request #3217 from 1480c1/unicode
  • More std::vector.
  • Misc.
  • Move unicodes into files where they are used.
  • Misc.
  • Use std::bitset<16> instead of custom BITS16.
  • More std::vector.
  • Update sw.yml
  • Update sw.yml
  • Update sw.yml
  • Fix brew check for GitHub actions
  • Update sw.yml
  • Update sw.yml
  • Update sw.yml
  • Remove remaining parts of emalloc (fix autotools build)
  • Fix include statement for allheaders.h
  • Sort entries in Makefile.am
  • Remove remaining parts of tessopt (fix autotools build)
  • [test] Init FC early.
  • Remove emalloc. Replace it with malloc. To be replaced with new later.
  • Update sw.yml
  • Update sw.yml
  • ...
Read more

v2021-02-03

03 Feb 09:16
@kba kba
Compare
Choose a tag to compare

ocrd_fileformat ae52a2e..df95fd7

Release: v0.2.0

  • changelog
  • 📦 v0.2.0
  • Merge branch 'handle-false-positive-page-alto'
  • update ocr-fileformat

sbb_binarization ea69494..4a5544d

Release: v0.0.7

  • 📦 v0.0.7
  • Merge pull request #25 from OCR-D/resolve-resources
  • Merge pull request #23 from mikegerber/fix/readme-ocrd

sbb_textline_detector 4c498fc..95d8f70

  • Merge pull request #55 from qurator-spk/resolve-resources
  • Merge pull request #54 from qurator-spk/tf-1.15
  • Merge pull request #48 from bertsky/fix-coords

v2021-01-30

30 Jan 14:36
@kba kba
Compare
Choose a tag to compare
  • make check ensures executables are runnable and produce --help output
  • Use ocrd resmgr instead of wget for make install-models*

core 135acb67..fe119e4d

Release: v2.22.3

  • 📦 v2.22.3
  • 🐛 resmgr: equivalent cwd bug for list_resource_candidated :/
  • 🐛 resmgr: --location cwd should still include "ocrd-resources"
  • Processor.init: remember old cwd before chdir
  • Proceessor.resolve_resource: log/exit 1 instead of raise
  • Proceessor.resolve_resource: No more downloading on-demand
  • update assets
  • 📦 v2.22.2
  • resource list: disable < 1.0 calamari models, rename resource
  • 📦 v2.22.1
  • 🐛 mkdir -p $HOME/.config/ocrd
  • 📦 v2.22.0
  • update specs
  • Merge pull request #559 from OCR-D/resolve-files
  • update assets and spec to 3.11.0
  • 📝 changelog
  • Merge branch 'warn-unless-pageid'
  • Merge pull request #660 from OCR-D/add-guess-mimetype
  • Merge pull request #655 from OCR-D/rename-group

Submodule ocrd_anybaseocr contains untracked content
Submodule ocrd_anybaseocr contains modified content

ocrd_anybaseocr de27915..e63f555

Release: v1.3.0

  • 📦 v1.3.0
  • Merge pull request #78 from OCR-D/processor-resources
  • 📦 v1.2.0

ocrd_calamari 1c7fcda..f00df75

Release: v1.0.1

  • Merge pull request #55 from OCR-D/ocrd-tool-version
  • 📦 v1.0.1
  • Merge pull request #53 from OCR-D/resolve-resources
  • 📝 README: Use new-style OCR-D parameter syntax and new-style mixed output file groups
  • Merge pull request #49 from OCR-D/fix-48
  • Merge pull request #52 from OCR-D/checkpoint_dir
  • Merge pull request #50 from OCR-D/add-calamari-version

ocrd_cis 925d744..1abf38b

Release: v0.1.5

  • re/segment: before spreading lines, assign diacritics to seeds below
  • segment: ensure run-time hierarchy refs when modifying reading order elements
  • segment: fix typo
  • ocrd-cis-align: fix invalid TextEquiv/dataType use
  • ocrd-cis-align: fix log level override
  • ocrd-cis-align: fix getLogger context

ocrd_im6convert 11bfea3..105697f

Release: v0.0.5

  • 📦 v0.0.5
  • Merge pull request #16 from OCR-D/sharedir

ocrd_tesserocr 89b73ff..fd17386

Release: v0.11.0

  • 📦 v0.11.0
  • Merge pull request #166 from OCR-D/processor-resource

ocrd_typegroups_classifier 54c8a78..1ed0cb1

Release: v0.3.0

  • 📦 v0.3.0
  • Merge pull request #8 from OCR-D/resolve-resources