Skip to content

Commit

Permalink
Add content content/Data/wikitext.md
Browse files Browse the repository at this point in the history
  • Loading branch information
saddlerto committed May 12, 2024
1 parent ef8864d commit fdd8b26
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions content/Data/wikitext.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
---
{"publish":true,"path":"Data/wikitext.md","permalink":"/data/wikitext/"}
---

The WikiText language modeling dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia. The dataset is available under the Creative Commons Attribution-ShareAlike License.

0 comments on commit fdd8b26

Please sign in to comment.