Checklist
I have checked that my issue isn't already filed (see open issues)
I have included my system information
Any code provided is minimal, complete, and reproducible (more on MREs)
Any code provided is properly formatted in code blocks (no screenshots, more on code blocks)
Any traceback provided is complete
I found the reason why `pixel_values` disappears.
The script should be run with the parameter `--remove_unused_columns false`; otherwise `pixel_values` is removed from the dataset before it reaches the data collator.
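To illustrate the effect of that flag, here is a minimal, self-contained sketch of allow-list column filtering. It is not the TRL or transformers source code: `ALLOWED_COLUMNS` and `filter_columns` are hypothetical names, and the allow-list contents are an assumption chosen only to mirror the symptom reported here (`image_sizes` survives, `pixel_values` does not). When configuring training from Python rather than the command line, the corresponding field is `remove_unused_columns` on `transformers.TrainingArguments`, which `DPOConfig` inherits.

```python
# Simplified illustration only (NOT the TRL / transformers implementation).
# ALLOWED_COLUMNS is a hypothetical allow-list picked to reproduce the symptom
# described in this issue: `image_sizes` is kept, `pixel_values` is dropped.
ALLOWED_COLUMNS = {
    "prompt_input_ids",
    "chosen_input_ids",
    "rejected_input_ids",
    "image_sizes",
}


def filter_columns(example: dict, remove_unused_columns: bool = True) -> dict:
    """Return the columns that would reach the data collator."""
    if not remove_unused_columns:
        # Equivalent in spirit to passing `--remove_unused_columns false`:
        # every column is passed through untouched.
        return dict(example)
    # Otherwise, keep only the columns on the allow-list.
    return {key: value for key, value in example.items() if key in ALLOWED_COLUMNS}


example = {
    "prompt_input_ids": [1, 2, 3],
    "chosen_input_ids": [4, 5],
    "rejected_input_ids": [6, 7],
    "pixel_values": [[0.1, 0.2], [0.3, 0.4]],
    "image_sizes": [[336, 336]],
}

with_removal = filter_columns(example)
without_removal = filter_columns(example, remove_unused_columns=False)

print("pixel_values" in with_removal)
print("pixel_values" in without_removal)
```

With filtering enabled, any column missing from the allow-list is silently dropped, which is why the collator never sees it; disabling the filtering passes every dataset column through.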
System Info
trl env
Information
Tasks
examples folder
Reproduction
The model is `llava-v1.6-vicuna-7b-hf`, and the dataset is `RLAIF-V-Dataset`.
In `dpo_trainer.py`, `self.train_dataset` includes `pixel_values`. However, when the program reaches `DataCollatorForPreference`, `pixel_values` disappears. As a result, during subsequent training the model does not receive `pixel_values`, although it does receive `image_sizes`, which is very strange.
Expected behavior
I would like to know why `pixel_values` is present in the dataset itself but disappears during data collation.