Brief Notes for ViBRANT Corpus

Introduction

The corpus is intended to meet the need for gold-standard data to support the development and evaluation of natural language processing tools for biodiversity literature. A particular feature of the corpus is that it includes both clean (re-keyed) and dirty (OCR) versions of the same text.

Availability

The corpus can be downloaded from ViBRANT's git repository as an anonymous user with the following command:
$ git clone https://git.scratchpads.eu/git/vibrantcorpus.git

Licence

As with all content produced by the ViBRANT project, the corpus is released under the Creative Commons CC0 licence.

Acknowledgements

This corpus was developed as part of the ViBRANT project.
ViBRANT was funded by the European Union 7th Framework Programme within the Research Infrastructures group.
Contract no. RI-261532. Period: Dec. 2010 to Nov. 2013.
Coordinator: Dr Vince Smith.
E-mail: [email protected]

Thanks also to Anna Weitzman and Chris Lyal of the INOTAXA project for making their project's re-keyed texts of the Biologia Centrali-Americana available for our research.

Thanks to Pensoft, and especially Lyubomir Penev, for developing a publishing process that makes articles available in a machine-readable format, and for their passionate commitment to open data.

