A tool for computing error rates for different algorithms:
It can calculate Character Error Rate (CER), Word Error Rate (WER), Bag of Tokens (BOG) and some more metrics. Type
java -cp <this-jar> eu.transkribus.errorrate.HtrError --help
for more information concerning evaluating an HTR result if the files are PAGE-XML-files. For raw UTF-8 encoded textfiles use
java -cp <this-jar> eu.transkribus.errorrate.HtrErrorTxt
or
java -cp <this-jar> eu.transkribus.errorrate.HtrErrorTxtLeip
To calculate measures for KWS
java -cp <this-jar> eu.transkribus.errorrate.KwsError
can be used. Use --help to see the configuration opportunities
To calculate measures for image alignment
java -cp <this-jar> eu.transkribus.errorrate.Text2ImageError
can be used. Use --help to see the configuration opportunities
- Java >= version 8
- Maven
- All further dependencies are gathered via Maven
git clone https://github.com/Transkribus/TranskribusErrorRate
cd TranskribusErrorRate
mvn install