最新代码的MiniCPM-V-2_6训练报错 #6655

ML-GCN · 2025-01-15T08:22:40Z

Reminder

I have read the above rules and searched the existing issues.

System Info

root@ecs-50958108:/workspace/train/LLaMA-Factory# llamafactory-cli env
[2025-01-15 08:20:32,880] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)

llamafactory version: 0.9.2.dev0
Platform: Linux-5.15.0-86-generic-x86_64-with-glibc2.35
Python version: 3.10.12
PyTorch version: 2.4.0a0+07cecf4168.nv24.05 (GPU)
Transformers version: 4.46.1
Datasets version: 3.1.0
Accelerate version: 1.0.1
PEFT version: 0.12.0
TRL version: 0.9.6
GPU type: NVIDIA A100-PCIE-40GB
DeepSpeed version: 0.16.2

Reproduction

sh文件

model

model_name_or_path: /dataNfs/pre-trained/MiniCPM-V-2_6
trust_remote_code: true

method

stage: sft
do_train: true
finetuning_type: lora
lora_target: all

dataset

dataset_dir: data
dataset: mllm_demo # video: mllm_video_demo
template: minicpm_v
cutoff_len: 32000
max_samples: 10000
overwrite_cache: true
preprocessing_num_workers: 16
image_resolution: 1003520

output

output_dir: /dataNfs/checkpoint/visual/MiniCPM-V-2_6/test
logging_steps: 500
save_steps: 2000
plot_loss: true
overwrite_output_dir: true

train

per_device_train_batch_size: 1
gradient_accumulation_steps: 1
learning_rate: 1.0e-4
num_train_epochs: 3.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: true
ddp_timeout: 180000000

eval

#val_size: 0.1
#per_device_eval_batch_size: 1
#eval_strategy: steps
#eval_steps: 500

报错

使用的是给定的mllm_demo数据是否数据格式不兼容还是什么原因望解答谢谢

Others

No response

The text was updated successfully, but these errors were encountered:

hiyouga · 2025-01-15T08:27:33Z

手动更新模型文件 https://huggingface.co/openbmb/MiniCPM-V-2_6/blob/main/modeling_minicpmv.py

ML-GCN · 2025-01-15T08:40:18Z

手动更新模型文件 https://huggingface.co/openbmb/MiniCPM-V-2_6/blob/main/modeling_minicpmv.py

更新后可以了再请教一下

出现这三个警告是否有影响因为我使用qwen2vl时没有出现过这种警告

hiyouga · 2025-01-15T09:32:40Z

没有

ML-GCN added bug pending labels Jan 15, 2025

hiyouga closed this as completed Jan 15, 2025

hiyouga added solved and removed bug pending labels Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

最新代码的MiniCPM-V-2_6训练报错 #6655

最新代码的MiniCPM-V-2_6训练报错 #6655

ML-GCN commented Jan 15, 2025

hiyouga commented Jan 15, 2025

ML-GCN commented Jan 15, 2025 •

edited

Loading

hiyouga commented Jan 15, 2025

最新代码的MiniCPM-V-2_6训练报错 #6655

最新代码的MiniCPM-V-2_6训练报错 #6655

Comments

ML-GCN commented Jan 15, 2025

Reminder

System Info

Reproduction

model

method

dataset

output

train

eval

Others

hiyouga commented Jan 15, 2025

ML-GCN commented Jan 15, 2025 • edited Loading

hiyouga commented Jan 15, 2025

ML-GCN commented Jan 15, 2025 •

edited

Loading