Unofficial PyTorch implementation of Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization [Huang+, ICCV2017]
The original Lua implementation from the author can be found here.
This implementation uses Nvidia DALI and AMP to accelerate the training process, with WandB employed for monitoring.
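At the core of the method is the AdaIN layer, which aligns the channel-wise mean and standard deviation of the content features to those of the style features. A minimal PyTorch sketch of the operation from the paper (the repository's actual implementation may differ):

```python
import torch

def adain(content, style, eps=1e-5):
    # Channel-wise statistics over the spatial dimensions of (N, C, H, W) features
    c_mean = content.mean(dim=(2, 3), keepdim=True)
    c_std = content.std(dim=(2, 3), keepdim=True) + eps
    s_mean = style.mean(dim=(2, 3), keepdim=True)
    s_std = style.std(dim=(2, 3), keepdim=True) + eps
    # Normalize the content features, then re-scale/shift with the style statistics
    return s_std * (content - c_mean) / c_std + s_mean
```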
- Clone this repository
git clone https://github.com/jackdaw213/Artistic-style-transfer
cd Artistic-style-transfer
- Install Conda and create an environment
conda create -n artistic_style_transfer python=3.12
- Install all dependencies from requirements.txt
conda activate artistic_style_transfer
pip install nvidia-pyindex
pip install -r requirements.txt
This should prepare the Conda environment for both training and testing (a pretrained model is available below).
- Download the COCO dataset for content images and the WikiArt dataset for style images. Extract the files and organize them into the data folder, with subfolders train_content, val_content, train_style, and val_style (see the layout command below).
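Assuming the default directory names, the expected layout can be created with:
mkdir -p data/train_content data/val_content data/train_style data/val_style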
- Preprocess the dataset
The WikiArt dataset contains corrupted JPEG images (files that end prematurely) and images with up to 105x the pixel count of a 4K image. This step should remove MOST of the corrupted images and resize any images with pixel counts higher than 3840 * 2160.
python preprocess.py
preprocess.py [-h] [--train_style TRAIN_STYLE_FOLDER] [--val_style VAL_STYLE_FOLDER]
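For illustration, here is a rough sketch of what such a preprocessing pass does, using Pillow; preprocess.py itself may handle more cases:

```python
from pathlib import Path
from PIL import Image

MAX_PIXELS = 3840 * 2160  # resize anything with more pixels than a 4K image

def clean_folder(folder):
    for path in Path(folder).glob("*.jpg"):
        try:
            img = Image.open(path)
            img.load()  # a full decode raises OSError on truncated files
        except OSError:
            path.unlink()  # drop the corrupted image
            continue
        w, h = img.size
        if w * h > MAX_PIXELS:
            scale = (MAX_PIXELS / (w * h)) ** 0.5
            img.resize((int(w * scale), int(h * scale)), Image.LANCZOS).save(path)
        img.close()
```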
- Train the model.
python train.py --enable_dali --enable_amp --enable_wandb
train.py [-h] [--epochs EPOCHS] [--batch_size BATCH_SIZE] [--num_workers NUM_WORKERS] [--train_dir_content TRAIN_DIR_CONTENT] [--val_dir_content VAL_DIR_CONTENT] [--train_dir_style TRAIN_DIR_STYLE] [--val_dir_style VAL_DIR_STYLE] [--optimizer OPTIMIZER] [--learning_rate LEARNING_RATE] [--momentum MOMENTUM] [--resume_id RESUME_ID] [--checkpoint_freq CHECKPOINT_FREQ] [--amp_dtype AMP_DTYPE] [--enable_dali] [--enable_amp] [--enable_wandb]
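For reference, a minimal sketch of what a DALI image pipeline looks like; the batch size, sizes, and ops below are an illustrative guess, not the repository's exact pipeline:

```python
from nvidia.dali import pipeline_def, fn, types

@pipeline_def(batch_size=8, num_threads=4, device_id=0)
def image_pipeline(data_dir):
    jpegs, _ = fn.readers.file(file_root=data_dir, random_shuffle=True)
    images = fn.decoders.image(jpegs, device="mixed")  # JPEG decode on the GPU
    images = fn.resize(images, resize_shorter=512)
    images = fn.crop_mirror_normalize(images, crop=(256, 256),
                                      dtype=types.FLOAT,
                                      output_layout="CHW")
    return images
```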
The model was trained on an RTX 3080 10G for 10 epochs.
| Training setup | Batch size | GPU memory usage | Training time |
| --- | --- | --- | --- |
| DALI | 4 | 6GB | 3.8 hours |
| DALI + AMP | 8 | 6.5GB | 2.2 hours |
| DataLoader | 8 | 9GB | 4.4 hours |
| DataLoader + AMP | 8 | 4GB | 2.4 hours |

WARNING: Nvidia DALI only supports Nvidia GPUs. BFloat16 is supported only on RTX 3000/Ampere GPUs and above, while GPU Direct Storage (GDS) is supported only on server-class GPUs. Using Float16 might cause NaN loss during training, whereas BFloat16 does not.
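As a rough sketch of how AMP is typically wired into a training loop (model, loader, and optimizer are placeholder names; the repository's loop may differ):

```python
import torch

dtype = torch.bfloat16 if torch.cuda.is_bf16_supported() else torch.float16
# GradScaler is only needed for float16; when disabled it passes values through
scaler = torch.cuda.amp.GradScaler(enabled=(dtype == torch.float16))

for content, style in loader:
    optimizer.zero_grad()
    with torch.autocast(device_type="cuda", dtype=dtype):
        loss = model(content.cuda(), style.cuda())
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
```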
- Download the pretrained model here and put it in the model folder
- Generate the output image using the command below.
python test.py -c content_image_path -s style_image_path
test.py [-h] [--content CONTENT] [--style STYLE] [--model MODEL_PATH]
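Under the hood, inference follows the paper: encode both images, apply AdaIN, and decode. A sketch with assumed module names (encoder is VGG-19 up to relu4_1, decoder mirrors it; adain is the function sketched above):

```python
import torch

with torch.no_grad():
    content_feat = encoder(content)  # features of the content image
    style_feat = encoder(style)      # features of the style image
    t = adain(content_feat, style_feat)
    # alpha in [0, 1] trades off content preservation vs. stylization strength
    t = alpha * t + (1 - alpha) * content_feat
    output = decoder(t)
```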