nanoGPT: Decoder only Transformer model for text generation

This repository contains the code for the nanoGPT model, a lightweight version of the GPT model. The model is trained on the all Williams Shakespeare's works and can be used for generate text in the style of Shakespeare.

This is decoder only transformer model, which means it only contains the decoder part of the transformer model. All the code is has been implemented in PyTorch from scratch.

Usage

To use the model, you first have to download the pre-trained model from the following link:

Download nanoGPT model

After downloading the model, you can generate text using the following command:

python generate_text.py --model_path models/nanoGPT.pth --tokens 100

NOTE:

bigram_v1.py is the first version of the bigram model.
bigram_v2.py is the second version of the bigram model with single head self-attention
bigram_v3.py is the third version of the bigram model with multi-head self-attention and residual connections.
nanoGPT.py is the final version of the model with multi-head self-attention, residual connections, and some regularization techniques

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
bigram_v1.py		bigram_v1.py
bigram_v2.py		bigram_v2.py
bigram_v3.py		bigram_v3.py
generate_text.py		generate_text.py
input.txt		input.txt
nanoGPT.py		nanoGPT.py
playground.ipynb		playground.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nanoGPT: Decoder only Transformer model for text generation

Usage

About

Releases

Packages

Languages

Preet-Sojitra/nanoGPT

Folders and files

Latest commit

History

Repository files navigation

nanoGPT: Decoder only Transformer model for text generation

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages