RuntimeError: Error(s) in loading state_dict for EmbeddingPipe: size mismatch for word_embeddings.weight #645
Comments
I am getting the same problem when trying to train a 1.3B model.
What's the solution? And why was this closed?
@djaym7 Thanks for saying something. I don't recall closing this and have reopened it.
@FayZ676 The URL you're linking to does not contain the weights for a 1.3B model; it contains the weights for a 20B model. That's why you're getting a size mismatch: it's quite simply the wrong size. I suspect this is unrelated to the problems the others are having.

@leclem So that change allows you to finetune the 20B model? Can you post a WandB link showing it training so I can check that the loss etc. are as expected?
I have the same issue when trying to train. I downloaded the slim weights and, using ./configs/20B.yml, running "python3 ./deepy.py train.py ./configs/20B.yml" gives this error: RuntimeError: Error(s) in loading state_dict for EmbeddingPipe:
I suspect this error has to do with model parallelism. @shaunstoltz How many GPUs were you loading the model onto, and what was the model-parallelism setting?
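The shapes in the error are consistent with this suspicion. As a minimal sketch, assuming the 20B tokenizer's 50,277-token vocabulary and a Megatron-style padding rule with the default make-vocab-size-divisible-by of 128 (both assumptions, not stated in this thread), padding the vocabulary to a multiple of 128 × model-parallel-size reproduces both reported sizes exactly:

```python
# Sketch of Megatron-style vocab padding and partitioning; illustrative,
# not the actual GPT-NeoX implementation. Assumed inputs: a 50,277-token
# vocabulary and make-vocab-size-divisible-by = 128.

def padded_vocab_size(vocab_size: int, divisible_by: int, mp_size: int) -> int:
    """Pad the vocabulary until it divides evenly across model-parallel ranks."""
    multiple = divisible_by * mp_size
    while vocab_size % multiple != 0:
        vocab_size += 1
    return vocab_size

# Checkpoint saved with model parallelism 2: each rank stores half the vocab.
assert padded_vocab_size(50277, 128, mp_size=2) // 2 == 25216  # checkpoint shape

# A model built with model parallelism 1 expects the full padded vocab instead.
assert padded_vocab_size(50277, 128, mp_size=1) == 50304       # current model shape
```

If that arithmetic is right, the mismatch means a model-parallel-2 checkpoint (which the 20B slim weights appear to be) is being loaded into a model built with model parallelism 1, and the fix is to keep the model-parallel setting the checkpoint was saved with.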
Does anyone have a solution for this problem?
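If the model-parallelism diagnosis above holds, one workaround sketch is to concatenate the two embedding shards back into a single table before loading at model parallelism 1. The file names and state-dict keys below are illustrative guesses, not the documented checkpoint layout; inspect the actual checkpoint directory first:

```python
# Hypothetical sketch: merging two model-parallel embedding shards into one
# full embedding table. File names and dict keys are assumptions; check the
# actual checkpoint files for the real layout.
import torch

shard0 = torch.load("layer_00-model_00-model_states.pt", map_location="cpu")
shard1 = torch.load("layer_00-model_01-model_states.pt", map_location="cpu")

w0 = shard0["word_embeddings.weight"]  # e.g. torch.Size([25216, 6144])
w1 = shard1["word_embeddings.weight"]  # e.g. torch.Size([25216, 6144])

# Vocab-parallel embeddings are split along the vocabulary dimension (dim 0).
merged = torch.cat([w0, w1], dim=0)    # torch.Size([50432, 6144])

# MP=2 pads the vocab to 50432 while MP=1 pads to 50304; the trailing rows
# are unused padding (real tokens end at 50277), so they can be dropped.
merged = merged[:50304, :]
```

Every model-parallel tensor in the checkpoint, not just the embedding, would need the same treatment, with the concatenation axis depending on the layer type, so running with the checkpoint's original model-parallel setting is the far simpler path.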
Original report

Describe the bug
RuntimeError: Error(s) in loading state_dict for EmbeddingPipe: size mismatch for word_embeddings.weight: copying a param with shape torch.Size([25216, 6144]) from checkpoint, the shape in current model is torch.Size([50304, 6144]).
To Reproduce
Config: ./configs/20B.yml (HFTokenizer is used)
Command: ./deepy.py generate.py ./configs/20B.yml -i prompt.txt -o sample_outputs.txt