You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, thank you for the amazing work and code.
So I'm trying to adapt the code to Gemma 2 9B it model, after changing the prompts to chat template required and running the code it gives following error
attn_weights = attn_weights + causal_mask
RuntimeError: The size of tensor a (7538) must match the size of tensor b (46) at non-singleton dimension 3
seems like past_key_values and current inputs is creating this problem, apparently setting usePrompt as True the generation works, but then just using cache with questions on hotpot dataset setting usePrompt as False dosent work
Does this imply that need to make changes in the generate function for some other issue?
The text was updated successfully, but these errors were encountered:
Could you provide the full script so we could reproduce the error?
My first guess is the KV Cache in Gemma may have different dimension representations, but would need experiment for confirmation.
Hello, thank you for the amazing work and code.
So I'm trying to adapt the code to Gemma 2 9B it model, after changing the prompts to chat template required and running the code it gives following error
attn_weights = attn_weights + causal_mask
RuntimeError: The size of tensor a (7538) must match the size of tensor b (46) at non-singleton dimension 3
seems like past_key_values and current inputs is creating this problem, apparently setting usePrompt as True the generation works, but then just using cache with questions on hotpot dataset setting usePrompt as False dosent work
Does this imply that need to make changes in the generate function for some other issue?
The text was updated successfully, but these errors were encountered: