From 61765e3bb1da3d580bc72f48b34634cf8c79ea45 Mon Sep 17 00:00:00 2001
From: gramirez-prompsit <32385845+gramirez-prompsit@users.noreply.github.com>
Date: Fri, 11 Nov 2022 16:39:16 +0100
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index c02302c..36e74e9 100644
--- a/README.md
+++ b/README.md
@@ -37,7 +37,7 @@ ${COLLECTIONS[$collection]}/
 en-index, $lang-doc, $en-doc, $lang-translated-doc
 The number refers to the $batch in en/$shard/$batch it is aligned with.
 
- fixed.gz : All aligned document pairs concatenated, and
+ fixed.gz : All aligned sentence pairs concatenated, and
 processed with bifixer. Produced in step 07.
 hardruled.gz : bicleaner-hardrules.py scores of fixed.gz, with 0 for lines
 that should be ignored further down the pipeline.