Skip to content

Commit

Permalink
Add artifacts for other NER test datasets
Browse files Browse the repository at this point in the history
  • Loading branch information
ljvmiranda921 committed Jan 18, 2025
1 parent 3db3842 commit 9826f4e
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 0 deletions.
3 changes: 3 additions & 0 deletions models/v0.2.0/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -113,6 +113,9 @@ in the project directory.
| `assets/tl_newscrawl-ud-test.conllu` | URL | Test dataset for NewsCrawl |
| `assets/tl_trg-ud-test.conllu` | URL | Test dataset for TRG |
| `assets/tl_ugnayan-ud-test.conllu` | URL | Test dataset for Ugnayan |
| `assets/uner_trg.iob2` | URL | Test dataset for Universal NER TRG |
| `assets/uner_ugnayan.iob2` | URL | Test dataset for Universal NER Ugnayan |
| `assets/tfnerd.txt` | URL | Test dataset for TF-NERD |
| `assets/fasttext.tl.gz` | URL | Tagalog fastText vectors provided from the fastText website (trained from CommonCrawl and Wikipedia). |
| `assets/floret` | Git | Floret repository for training floret and fastText models. |

Expand Down
9 changes: 9 additions & 0 deletions models/v0.2.0/project.yml
Original file line number Diff line number Diff line change
Expand Up @@ -100,6 +100,15 @@ assets:
- dest: assets/tl_ugnayan-ud-test.conllu
description: "Test dataset for Ugnayan"
url: https://raw.githubusercontent.com/UniversalDependencies/UD_Tagalog-Ugnayan/refs/heads/master/tl_ugnayan-ud-test.conllu
- dest: assets/uner_trg.iob2
description: "Test dataset for Universal NER TRG"
url: https://raw.githubusercontent.com/UniversalNER/UNER_Tagalog-TRG/refs/heads/master/tl_trg-ud-test.iob2
- dest: assets/uner_ugnayan.iob2
description: "Test dataset for Universal NER Ugnayan"
url: https://raw.githubusercontent.com/UniversalNER/UNER_Tagalog-Ugnayan/refs/heads/master/tl_ugnayan-ud-test.iob2
- dest: assets/tfnerd.txt
description: "Test dataset for TF-NERD"
url: https://huggingface.co/datasets/rkramos/tfnerd/raw/main/txt/test.txt
- dest: "assets/fasttext.tl.gz"
description: "Tagalog fastText vectors provided from the fastText website (trained from CommonCrawl and Wikipedia)."
url: "https://dl.fbaipublicfiles.com/fasttext/vectors-crawl/cc.tl.300.vec.gz"
Expand Down

0 comments on commit 9826f4e

Please sign in to comment.