
Missing Multimodal Pretraining step #32

Open
shubhamgarg21 opened this issue Mar 12, 2024 · 1 comment

Comments

@shubhamgarg21

Hi,

For the paper https://arxiv.org/pdf/2310.01218.pdf, the following is mentioned in the pretraining section:

For efficiency, we first train SEED-LLaMA using LoRA [32] tuning and together optimize the
parameters of the embedding layer and decoder head layer due to the added visual codes. We then
merge the parameters of LoRA onto the LLM backbone and fine-tune all parameters except for
the embedding layer.

But in the provided training steps, the part about fine-tuning all parameters except the embedding layer is missing.
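
For reference, a minimal sketch of what that missing second stage might look like, assuming a standard Transformers + PEFT LoRA setup (this is not the official SEED-LLaMA script; the model and checkpoint paths below are placeholders):

```python
# Sketch only: merge the stage-1 LoRA weights into the LLM backbone, then
# fine-tune all parameters except the (expanded) input embedding layer.
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Stage-1 output: base LLM with visual codes added to the vocabulary,
# plus a trained LoRA adapter (paths are hypothetical).
base = AutoModelForCausalLM.from_pretrained(
    "path/to/llama-with-visual-codes", torch_dtype=torch.bfloat16
)
model = PeftModel.from_pretrained(base, "path/to/stage1-lora-checkpoint")

# Merge the LoRA weights into the backbone and drop the adapter wrappers.
model = model.merge_and_unload()

# Stage 2: make every parameter trainable, then freeze the input embeddings.
for param in model.parameters():
    param.requires_grad = True
for param in model.get_input_embeddings().parameters():
    param.requires_grad = False

# `model` can now be handed to a full-parameter fine-tuning loop or Trainer.
```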

zheedong added a commit to KU-AGI/SEED that referenced this issue Mar 13, 2024
@Cerf-Volant425

Same question; could you please add the corresponding script for the fine-tuning part?
Thanks in advance.
