This project is an attempt to democratize BERT by reducing its number of trainable parameters, in turn making it faster to pre-train and fine-tune. Since the cosine-style similarity in BERT's attention (computed as the dot product between the Query and Key matrices, which then weights the Values) is prone to the convex-hull problem, we plan to replace cosine with other similarity metrics and check how each one behaves at reduced model dimensions.
We are benchmarking the model's performance on the following distance/similarity measures (a code sketch of each follows the list):
- Cosine
- Euclidean
- Gaussian softmax
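To make the candidates concrete, here is a minimal sketch of the three score functions as they might appear inside an attention layer. The function and the Gaussian bandwidth choice are illustrative assumptions; the project's actual implementations live in `src/modelling`.

```python
import torch

def attention_scores(q, k, metric="cosine"):
    """Toy sketch of the benchmarked score functions (illustrative only)."""
    if metric == "cosine":
        # scaled dot product -- the standard BERT similarity referred to
        # as cosine above
        return (q @ k.transpose(-2, -1)) / q.shape[-1] ** 0.5
    if metric == "euclidean":
        # negative squared Euclidean distance: nearer keys score higher
        return -torch.cdist(q, k, p=2) ** 2
    if metric == "gaussian":
        # Gaussian kernel log-scores; a softmax over these yields a
        # "Gaussian softmax" attention distribution
        sigma = q.shape[-1] ** 0.5  # bandwidth choice is an assumption
        return -torch.cdist(q, k, p=2) ** 2 / (2 * sigma ** 2)
    raise ValueError(f"unknown metric: {metric}")
```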
Due to compute constraints, we are currently validating our hypothesis on just 1% of the BookCorpus data. We intend to increase the training data in subsequent iterations.
- Clone the repo
```sh
git clone https://github.com/gaushh/optimized-bert.git
```
- Create and activate a virtual environment (e.g. `python -m venv .venv && source .venv/bin/activate`)
You can now set everything up either with the single shell script or by following the step-by-step instructions.
Run the shell script to set everything up with the default configs and start pre-training BERT:
```sh
sh setup.sh
```
Alternatively, here are the step-by-step instructions to set up the repo and, in turn, understand the process.
- Install the required packages
```sh
pip install -r requirements.txt
```
- Log in to Weights & Biases
```sh
wandb login --relogin 8c46e02a8d52f960fb349e009c5b6773c25b6957
```
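Once you are logged in, the training scripts can stream metrics to your W&B dashboard. A minimal sketch of the logging calls, assuming the scripts use the standard `wandb` Python API (the project and metric names here are hypothetical):

```python
import wandb

# Hypothetical run setup -- the actual project/run names are configured
# inside the training scripts, not here.
wandb.init(project="optimized-bert", name="cosine-baseline")
wandb.log({"train/loss": 2.31, "step": 100})
wandb.finish()
```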
- Write the config file
```sh
cd helper
python write_config.py
cd ..
```
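For orientation, the config presumably collects the knobs discussed above: the similarity metric, the reduced model dimensions, and the data fraction. The snippet below is a hypothetical illustration of such a file; the actual keys and values are defined in `helper/write_config.py`.

```python
import json

# Hypothetical config contents -- key names and values are illustrative,
# not copied from helper/write_config.py.
config = {
    "similarity": "cosine",    # cosine | euclidean | gaussian
    "hidden_size": 256,        # reduced model dimension under test
    "dataset_fraction": 0.01,  # 1% of BookCorpus, per the note above
}

with open("config.json", "w") as f:
    json.dump(config, f, indent=2)
```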
- Prepare the dataset
```sh
cd src/data
python dataset.py
cd ../..
```
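If the script uses the Hugging Face `datasets` library (an assumption on our part), the 1% BookCorpus subset mentioned above can be expressed as a split slice; the authoritative logic lives in `src/data/dataset.py`.

```python
from datasets import load_dataset

# Load only the first 1% of BookCorpus, mirroring the compute-constrained
# setup described above.
dataset = load_dataset("bookcorpus", split="train[:1%]")
print(dataset)  # number of rows and column names
```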
- Train the tokenizer
```sh
cd src/modelling
python train_tokenizer.py
```
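BERT-style models conventionally use a WordPiece tokenizer. Below is a minimal sketch with the Hugging Face `tokenizers` library, assuming that is what the script does; the corpus path and vocabulary size are placeholders.

```python
from tokenizers import BertWordPieceTokenizer

# Train a WordPiece vocabulary on the prepared corpus; the file name and
# vocab_size are placeholders, not the project's actual settings.
tokenizer = BertWordPieceTokenizer(lowercase=True)
tokenizer.train(files=["corpus.txt"], vocab_size=30522)
tokenizer.save_model(".")  # writes vocab.txt for the next steps
```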
- Perform post-processing (run from `src/modelling`, where the previous step left you)
```sh
python preparation.py
```
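Post-processing plausibly means converting raw text into fixed-length model inputs; the sketch below illustrates that idea under that assumption, using `transformers` (the real steps are in `preparation.py`).

```python
from transformers import BertTokenizerFast

# Encode text into fixed-length inputs for masked-LM pre-training.
# vocab.txt comes from the tokenizer step; max_length is illustrative.
tokenizer = BertTokenizerFast(vocab_file="vocab.txt", do_lower_case=True)
batch = tokenizer(
    ["an example sentence from the corpus"],
    truncation=True,
    padding="max_length",
    max_length=128,
)
print(batch["input_ids"][0][:10])
```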
- Start model training
```sh
python train_bert.py
```
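To see the reduced-parameter goal in numbers, a shrunken BERT can be instantiated with Hugging Face `transformers` as below; the sizes are illustrative, and `train_bert.py` holds the project's actual configuration, training loop, and alternative similarity metrics.

```python
from transformers import BertConfig, BertForMaskedLM

# A deliberately small BERT to illustrate the reduced-parameter goal;
# these sizes are illustrative, not the project's actual settings.
config = BertConfig(hidden_size=256, num_hidden_layers=4, num_attention_heads=4)
model = BertForMaskedLM(config)
print(f"{model.num_parameters(only_trainable=True):,} trainable parameters")
```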
Distributed under the MIT License. See `LICENSE.txt` for more information.