Throne2Vec

Training a word2vec model on a data-set containing the entire Game of Thrones book collection

This notebook is based on assignment 5 of the Udacity Deep-Learning course.

Besides the data-set, what is new here:

Text Pre-Processing
Finding word analogies using the learned embedding
More detailed comments
Optimizations

This is a Jupyter notebook so explanations are included as markdowns in the notebook. Feel free to play around with it and share comments if you have any.

The GOT corpus file is not included in this repository due to book copyrights considerations, sorry about that. However, you can create your own data-set with whichever book (or text in general) you'd like. Just make sure it is in a .zip file with one or more .txt files in it.

Dependencies:

Tensorflow (version > 1.0)

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.gitattributes		.gitattributes
1M.png		1M.png
License.txt		License.txt
README.md		README.md
embed1M.pickle		embed1M.pickle
embed500k.pickle		embed500k.pickle
throne2vec.ipynb		throne2vec.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Throne2Vec

Training a word2vec model on a data-set containing the entire Game of Thrones book collection

Dependencies:

About

Releases

Packages

Languages

License

eyalzk/throne2vec

Folders and files

Latest commit

History

Repository files navigation

Throne2Vec

Training a word2vec model on a data-set containing the entire Game of Thrones book collection

Dependencies:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages