Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
pavanchhatpar authored Jul 3, 2020
1 parent 3eca6a6 commit b7bb581
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,7 @@ Fine tuning ELECTRA to break sentences into two parts when the discourse marker
## Results
- With little-to-no effort on hyperparameter tuning, fine tuning the model for 2 epochs gets to a `test accuracy of 91.8%` while `training accuracy is 95.4%`
- Check the [notebook](https://github.com/pavanchhatpar/sentence-splitter/blob/master/Sentence_splitting_model.ipynb) or [![Try it on Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/pavanchhatpar/sentence-splitter/blob/master/Sentence_splitting_model.ipynb) for more details
![Sample prediction](sentence-splitter.png)

## TODOs
- Make the pretrained model available probably via Hugging Face's own model upload facility
Expand All @@ -32,3 +33,4 @@ Fine tuning ELECTRA to break sentences into two parts when the discourse marker
- [ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators](https://arxiv.org/abs/2003.10555)
- `Hugging Face 🤗 Transformers` library for the pretrained ELECTRA model and their extremely lucid API
- `Google Colaboratory` for providing the free GPU environment
- [Mining Discourse Markers for Unsupervised Sentence Representation Learning](https://arxiv.org/abs/1903.11850) for providing the dataset

0 comments on commit b7bb581

Please sign in to comment.