🏘️ Airbert: In-domain Pretraining for Vision-and-Language Navigation 🏘️

MIT license · arXiv · 1st on the R2R leaderboard · ICCV 2021 · project website

This repository hosts the model checkpoints trained in our experiments.

⌨️ Downloading from the command line

We store our models on Google Drive, which provides 15 GB of storage for free.

You can use the great gdown script to download the models:

```bash
pip install gdown
gdown [link to Google Drive]
```

We also provide a Makefile to help you:

```bash
# Download everything
make all
# Download a specific model
make airbert-r2rRSA
# Get all commands
make help
```
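For reference, a download target in such a Makefile might look like the sketch below. The target name matches a model listed further down, but `FILE_ID` and the output filename are placeholders, not the real values (those live in the repository's actual Makefile):

```makefile
# Hypothetical sketch of a gdown-based download target.
# FILE_ID is a placeholder for the real Google Drive file ID.
airbert-r2rRSA:
	gdown "https://drive.google.com/uc?id=FILE_ID" -O airbert-r2rRSA.bin

# "make all" can then simply depend on every model target.
all: airbert-r2rRSA
```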

🏘️ Model pretrained on the BNB dataset

| Model | Description |
| --- | --- |
| airbert | Airbert model pretrained on the BNB dataset |

👽 External models used in our scripts

| Model | Description |
| --- | --- |
| vilbert | ViLBERT model pretrained on Conceptual Captions |
| vlnbert | VLN-BERT: ViLBERT fine-tuned on R2R |

🤖 Finetuned models in discriminative setting

| Model | Description |
| --- | --- |
| airbert-r2rRS | Airbert fine-tuned on R2R with the shuffling loss |
| airbert-r2rRSA | Airbert fine-tuned on R2R with the shuffling loss + speaker data |

🤖 Finetuned models in generative setting

| Model | Description |
| --- | --- |
| REVERIE | Recurrent VLN-BERT for remote referring expression, with pretrained Airbert as the backbone |
| R2R | Recurrent VLN-BERT for vision-and-language navigation, with pretrained Airbert as the backbone |