How to perform multi-stage multi-resolution training? #83

zzk88862 · 2024-11-27T14:09:08Z

Very great work！I will perform multi-stage multi-resolution SFT based on cogvideox-fun.
some problems need your help

1、Parameter Settings for Multi-Resolution Training, such as learning rate、epoch and so on for diff stage:
How should I set the parameters for training at the 512 resolution stage? s
How should I set the parameters for the 768 resolution stage?
How should I set the parameters for the 1024 resolution stage?

2、Training Data for Each Stage:
Is it acceptable to use the same training data for each stage, or should the data differ for each resolution stage?

tks

bubbliiiing · 2024-11-28T06:36:25Z

Like these:

512 stage.

export MODEL_NAME="models/Diffusion_Transformer/CogVideoX-Fun-2b-InP"
export DATASET_NAME="datasets/internal_datasets/"
export DATASET_META_NAME="datasets/internal_datasets/metadata.json"
export NCCL_IB_DISABLE=1
export NCCL_P2P_DISABLE=1
NCCL_DEBUG=INFO

accelerate launch --mixed_precision="bf16" scripts/train.py \
  --pretrained_model_name_or_path=$MODEL_NAME \
  --train_data_dir=$DATASET_NAME \
  --train_data_meta=$DATASET_META_NAME \
  --image_sample_size=1024 \
  --video_sample_size=256 \
  --token_sample_size=512 \
  --video_sample_stride=3 \
  --video_sample_n_frames=49 \
  --train_batch_size=1 \
  --video_repeat=1 \
  --gradient_accumulation_steps=1 \
  --dataloader_num_workers=8 \
  --num_train_epochs=100 \
  --checkpointing_steps=50 \
  --learning_rate=2e-05 \
  --lr_scheduler="constant_with_warmup" \
  --lr_warmup_steps=100 \
  --seed=42 \
  --output_dir="output_dir" \
  --gradient_checkpointing \
  --mixed_precision="bf16" \
  --adam_weight_decay=3e-2 \
  --adam_epsilon=1e-10 \
  --vae_mini_batch=1 \
  --max_grad_norm=0.05 \
  --random_hw_adapt \
  --training_with_video_token_length \
  --enable_bucket \
  --train_mode="inpaint" \
  --trainable_modules "."

768 stage.

export MODEL_NAME="models/Diffusion_Transformer/CogVideoX-Fun-2b-InP"
export DATASET_NAME="datasets/internal_datasets/"
export DATASET_META_NAME="datasets/internal_datasets/metadata.json"
export NCCL_IB_DISABLE=1
export NCCL_P2P_DISABLE=1
NCCL_DEBUG=INFO

accelerate launch --mixed_precision="bf16" scripts/train.py \
  --pretrained_model_name_or_path=$MODEL_NAME \
  --train_data_dir=$DATASET_NAME \
  --train_data_meta=$DATASET_META_NAME \
  --image_sample_size=1024 \
  --video_sample_size=256 \
  --token_sample_size=768 \
  --video_sample_stride=3 \
  --video_sample_n_frames=49 \
  --train_batch_size=1 \
  --video_repeat=1 \
  --gradient_accumulation_steps=1 \
  --dataloader_num_workers=8 \
  --num_train_epochs=100 \
  --checkpointing_steps=50 \
  --learning_rate=2e-05 \
  --lr_scheduler="constant_with_warmup" \
  --lr_warmup_steps=100 \
  --seed=42 \
  --output_dir="output_dir" \
  --gradient_checkpointing \
  --mixed_precision="bf16" \
  --adam_weight_decay=3e-2 \
  --adam_epsilon=1e-10 \
  --vae_mini_batch=1 \
  --max_grad_norm=0.05 \
  --random_hw_adapt \
  --training_with_video_token_length \
  --enable_bucket \
  --train_mode="inpaint" \
  --trainable_modules "."

bubbliiiing · 2024-11-28T06:36:59Z

If the resolution of the video is greater than 1024, it can be used in three stages.

zzk88862 · 2024-12-09T08:09:28Z

hi，thanks for your reply，
Two very critical questions: If I have 300,000 training data that includes videos of different resolutions,
1、 how many epochs should be trained at each stage, and 2、does the training data need to be completely different for each stage?

tks
looking forward for your reply

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to perform multi-stage multi-resolution training? #83

How to perform multi-stage multi-resolution training? #83

zzk88862 commented Nov 27, 2024 •

edited

Loading

bubbliiiing commented Nov 28, 2024

bubbliiiing commented Nov 28, 2024

zzk88862 commented Dec 9, 2024 •

edited

Loading

How to perform multi-stage multi-resolution training? #83

How to perform multi-stage multi-resolution training? #83

Comments

zzk88862 commented Nov 27, 2024 • edited Loading

bubbliiiing commented Nov 28, 2024

bubbliiiing commented Nov 28, 2024

zzk88862 commented Dec 9, 2024 • edited Loading

zzk88862 commented Nov 27, 2024 •

edited

Loading

zzk88862 commented Dec 9, 2024 •

edited

Loading