Update

The VetBERT model has been converted to PyTorch and moved to HuggingFace. The pretrained model without finetuing is located at: https://huggingface.co/havocy28/VetBERT

The VetBERT model finetuned on the disease syndrome classification task is located at: https://huggingface.co/havocy28/VetBERTDx

VetBERT

Brian Hur, [email protected]

VetBERT is a BERT based contextualized language model pretrained on over 15 million veterinary clinical notes and can be trained to perform a variety of tasks such as the disease indicated in a veterinary clinical record.

The classifier model implements VetBERT as described in the paper and presentation from the BioNLP workshop @ ACL 2020 which can be used to classify the disease syndrome in a veterinary clinical note.

Instructions

To run, install the requirements

Download the zipped VETBERT model here

Download the zipped trained classifier here

unzip the folders contained in the files in the same file that the scripts are being ran.

ensure you have python 3.6 or higher running.


pip install requirements.txt

to perform test classification run:


python vetbert_classify_demo.py ./input/clinical_notes.xls

If test successful, you should see the output results and there should be a file in the folder:

./output/predicted_outputs.xls

To classify your own notes, follow the format in ./input/clinical_notes.xls and save using Excel 97-2003 format. You need to supply a dummy label if you do not have the labels and are note testing the model. The labels that can be used are listed in labels.txt.

The following paper should be cited if you use any of these resources:


@inproceedings{hur2020domain,
  title={Domain Adaptation and Instance Selection for Disease Syndrome Classification over Veterinary Clinical Notes},
  author={Hur, Brian and Baldwin, Timothy and Verspoor, Karin and Hardefeldt, Laura and Gilkerson, James},
  booktitle={Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing},
  pages={156--166},
  year={2020}
}

Please comment or message me if you have any questions or run into any issues.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.ipynb_checkpoints		.ipynb_checkpoints
input		input
output		output
.gitignore		.gitignore
README.md		README.md
classes.npy		classes.npy
labels.txt		labels.txt
requirements.txt		requirements.txt
vetbert_classify_demo.py		vetbert_classify_demo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Update

VetBERT

Instructions

About

Releases

Packages

Contributors 2

Languages

havocy28/VetBERT

Folders and files

Latest commit

History

Repository files navigation

Update

VetBERT

Instructions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages